欢迎来到天天文库
浏览记录
ID:51780902
大小:31.90 KB
页数:18页
时间:2020-03-15
《简单搜索引擎设计和Java源代码.docx》由会员上传分享,免费在线阅读,更多相关内容在工程资料-天天文库。
1、Asimplesearchenginewiththefollowingfeatures:Includedinthepackage:sourcecode:SimpleSearchEngine.javaSimpleSearchEngineImpl.javaSimpleSearchEngineTest.javareadme:thisfilestopWords:thestopwordfilesearchFiles/:adirectorythatcontainsabunchoftestfilesUsage:1.SimpleSearchEngineTes
2、t.javacanbemodifiedtoaddmoredocumentsandaddnewqueries.1.compilethecode2.Torun:java-cp.SimpleSearchEngineTestFeatures:1.buildinvertedindexfortermsindocumentsandstoreinanindexfile.Theindexwillbeupdatedasmoredocumentsareadded.Andtheindexisloadedintomemoryduringstartup2.examine
3、stopwords3.simplequerybysplittingthequerystringintowordsandreturningthelistofthenamesofdocumentswithoneormorewordsinthem4.simplerankingofthesearchresultbasedonthenumberofsearchwordsinthedocumentsPreparation:1.adocumentfolderwhereallthedocumentsresides,assumingsearchFiles/in
4、thetest.2.thepathoftheindexfile.AnindexfilehastheinvertedindexoftermmappedtoalistofdocIds.Thisindexwillbeupdatedandthefilewillbeupdatedasdocumentsareadded.3.thepathofadocumentnameindexfile.ThisfilehasthedocIdtodocNamemapping.Thisfilewillbeupdatedasdocumentsareadded.4.astopw
5、ordfilewiththestopwords.Anexampleisgiven.SimpleSearchEngine.javaimportjava.util.List;/***Asimplesearchengine***/publicinterfaceSimpleSearchEngine{/**simplequerybysplittingthequeryintosearchtermsandlookinguptheindex,*rankingresultsbythenumberofsearchtermsappearinginadocument
6、**@returnlistofdocumentnames***/publicListquery(StringqueryStr);/***addadocumentandupdatetheindex**@paramdocNamedocumentname*/publicvoidaddDoc(StringdocName);}SimpleSearchEngineImpl.javaimportjava.io.BufferedReader;importjava.io.File;importjava.io.FileReader;importj
7、ava.io.FileWriter;importjava.io.IOException;importjava.util.Comparator;importjava.util.HashMap;importjava.util.HashSet;importjava.util.Iterator;importjava.util.List;importjava.util.Map;importjava.util.Set;importjava.util.TreeMap;importjava.util.TreeSet;/***Asimplesearchengi
8、newiththefollowingfeatures:**1.buildinvertedindexfortermsindocumentsandstoreinanin
此文档下载收益归作者所有