Skip to content

A high-performance Wikipedia search engine built from scratch in C++. Features a custom Inverted Index for O(1) lookups, TF-IDF & PageRank for relevance scoring, and a Trie-based autocomplete system.

Notifications You must be signed in to change notification settings

arnavlul/WikiEngine

Repository files navigation

WikiEngine

WikiSearch Engine A robust, high-efficiency search engine implementation capable of indexing and querying Wikipedia dumps. Built entirely in C++ without external search libraries, this project demonstrates advanced data structure engineering.

Core Architecture:

  • Indexing: Custom Inverted Index handling millions of tokens.

  • Ranking: Hybrid ranking algorithm combining TF-IDF (content relevance) and PageRank (link authority).

  • Autocomplete: Optimized Trie (Prefix Tree) structure providing real-time query suggestions.

  • Performance: Optimized for low-latency queries and memory-efficient storage.

About

A high-performance Wikipedia search engine built from scratch in C++. Features a custom Inverted Index for O(1) lookups, TF-IDF & PageRank for relevance scoring, and a Trie-based autocomplete system.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published