shhrohan / wiki-search-engine
Search engine built on a 40 GB Wikipedia data dump. Answers any query in less than 1 second. It processes every section of each document and stores each token according to where it occurs in the document, such as internal links, outgoing links, title, etc. The project covers end-to-end processing: tokenization of the data dump, index creation, query processing, and display of the final results.
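The idea of storing each token by the section it occurs in can be illustrated with a small fielded inverted index. This is a minimal sketch and not the project's actual code: the field names in `FIELDS`, the weights in `FIELD_WEIGHTS`, and the functions `tokenize`, `build_index`, and `search` are all hypothetical, chosen only to show the technique on in-memory data rather than a 40 GB dump.

```python
import re
from collections import defaultdict

# Hypothetical field names; the description mentions internal links,
# outgoing links, and title as examples of document sections.
FIELDS = ("title", "body", "internal_links", "out_links")

# Assumed weights for ranking; a match in the title counts more than one in the body.
FIELD_WEIGHTS = {"title": 5.0, "body": 1.0, "internal_links": 2.0, "out_links": 0.5}

TOKEN_RE = re.compile(r"[a-z0-9]+")


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return TOKEN_RE.findall(text.lower())


def build_index(docs):
    """Build a mapping token -> doc_id -> field -> occurrence count."""
    index = defaultdict(lambda: defaultdict(lambda: defaultdict(int)))
    for doc_id, fields in docs.items():
        for field in FIELDS:
            for token in tokenize(fields.get(field, "")):
                index[token][doc_id][field] += 1
    return index


def search(index, query, top_k=10):
    """Score documents by weighted per-field counts of the query tokens."""
    scores = defaultdict(float)
    for token in tokenize(query):
        # .get avoids creating empty entries for unknown tokens.
        for doc_id, field_counts in index.get(token, {}).items():
            for field, count in field_counts.items():
                scores[doc_id] += FIELD_WEIGHTS[field] * count
    # Highest score first; ties are broken by document id.
    return sorted(scores, key=lambda d: (-scores[d], d))[:top_k]


# Tiny made-up corpus standing in for parsed dump pages.
docs = {
    1: {
        "title": "Inverted index",
        "body": "An inverted index maps tokens to documents.",
        "internal_links": "Search engine",
    },
    2: {
        "title": "Search engine",
        "body": "A search engine uses an index to answer queries.",
        "out_links": "example org",
    },
}

index = build_index(docs)
print(search(index, "index"))
print(search(index, "search engine"))
```

Keeping per-field counts in the postings lets the query step weight a title match differently from a body or link match without re-reading the documents. An index for a dump of this size would most likely be written to disk in sorted blocks and merged rather than held in a single dictionary, but that is an assumption about scale and not something the description states.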