I am deeply indebted to Dr. Michael Berry, my major advisor, for his kind guidance and support. I also thank Dr. Susan Dumais, director of the Information Sciences Research Group at Bellcore, for her technical advice. In addition, she graciously allowed us
Traditional lexical (or Boolean) retrieval techniques, while sometimes valuable to experts trained to search collections from a specific discipline, often return too much information to the user. Other times, because the terms used in the query differ from the terms used in the document, valuable information is never found in the document collection.
Latent Semantic Indexing (LSI) [DDF90], a vector-space approach to conceptual information retrieval, is useful in situations where traditional lexical information retrieval approaches fail. LSI estimates the semantic content of the documents in a collection and uses that estimate to rank the documents in order of decreasing relevance to a user’s query. Since the search is based on the concepts contained in the documents rather than the document’s constituent terms, LSI can retrieve documents related to a user’s query even when the query and the documents do not share any common terms. Also, since LSI ranks the documents according to their relevance to the user’s query, the system helps the user decide which information may be more specific to the user’s interests.
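The retrieval behavior described above, in which a document can match a query without sharing any of its terms, can be illustrated with a small sketch. It is not the implementation presented in this work: the toy term-document matrix, the function name `lsi_rank`, and the choice of NumPy's dense SVD routine are all assumptions made for illustration, and the query is folded into the reduced space using the standard truncated-SVD formulation of LSI.

```python
import numpy as np

# Toy term-document matrix (rows: terms, columns: documents).
# Illustrative data only; not taken from the thesis.
terms = ["car", "automobile", "engine", "flower", "petal"]
A = np.array([
    [1, 0, 0],  # car
    [0, 1, 0],  # automobile
    [1, 1, 0],  # engine
    [0, 0, 1],  # flower
    [0, 0, 1],  # petal
], dtype=float)


def lsi_rank(A, q, k):
    """Cosine similarity between query q and each document in a rank-k LSI space.

    Hypothetical helper: A is terms x documents, q is a term-count vector.
    """
    # Truncated SVD: keep only the k largest singular triplets.
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    U_k, s_k, V_k = U[:, :k], s[:k], Vt[:k, :].T
    # Fold the query into the k-dimensional latent space.
    q_hat = (q @ U_k) / s_k
    # Compare the folded query with each document's latent coordinates.
    numerators = V_k @ q_hat
    denominators = np.linalg.norm(V_k, axis=1) * np.linalg.norm(q_hat)
    return numerators / denominators


query = np.array([1, 0, 0, 0, 0], dtype=float)  # the single term "car"
scores = lsi_rank(A, query, k=2)
for doc in np.argsort(-scores):
    print(f"document {doc}: similarity {scores[doc]:.3f}")
```

On this toy collection, the query "car" scores the document containing only "automobile" and "engine" as highly as the document that contains "car" itself, because both documents co-occur with "engine" and collapse onto the same latent direction. The "flower"/"petal" document scores near zero. A dense SVD is used here only for brevity; large collections would call for sparse methods.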
Although LSI is capable of achieving significant retrieval performance gains over standard lexical retrieval techniques (see [Dum91]), the complexity of the LSI model often causes its execution efficiency to lag far behind the execution efficiency of the simpler, Boolean models, especially on large data sets. By carefully examining the LSI model and noting the various optimizations that can be applied to its underlying implementation, though, both the retrieval benefits of the LSI model and an execution efficiency near that of the Boolean retrieval techniques can be attained. Here, an efficient, extensible, maintainable, and portable implementation of the LSI model is presented, and a simple user interface, created with the new implementation of the LSI model, is described. Using both the new implementation of the LSI model and its corresponding user interface, users can quickly search large data sets without understanding any details of the LSI model or implementation.