You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
mgladkova edited this page Nov 10, 2015
·
3 revisions
MathWebSearch is a complete system capable of crawling, indexing and searching mathematical data. The components are implemented using POSIX-compliant C/C++ and a few third party libraries.
The main structure of the system is presented below:
The crawler system (crawler) indexes MathML-rich websites and produces MWS Harvests, based on the Content-enabled m:math nodes it finds. The MWS Harvests are fed into the core which parses them and updates two indexes
a fast substitution-based tree for the Mathematical structure
BTree database for the additional information (like URIs+XPaths).