Xapian Search Engine Library

Fast, scalable and flexible search

Technologies
linux, c++, golang, unicode, swig
Topics
machine learning, search, information retrieval, linguistics, indexing
Fast, scalable and flexible search

Xapian is a Search Engine Library which aims to be fast, scalable, and flexible. It's used by many organizations and software projects around the world, including Debian, Gnome, KDE, One Laptop per Child, and Ubuntu.

It supports ranking by Language Modelling, TF-IDF, probabilistic schemes, and Divergence from Randomness, plus a rich set of boolean query operators, and re-ranking using Machine Learning. The core library is written in C++, with bindings to allow use from many other languages.

2020 Program

Successful Projects

Contributor
Dipanshu Garg
Mentor
Ayush Tomar, Tanmay Sachan
Organization
Xapian Search Engine Library
NEW WEIGHTING SCHEMES
Xapian currently supports Tf-Idf weighting scheme. It has some normalisations (described by SMART) already implemented. More normalisations can be...
Contributor
Parth Kapadia
Mentor
James Aylett, Bruno Baruffaldi
Organization
Xapian Search Engine Library
Text-Extraction Libraries
Project: Text-Extraction Libraries Currently, Omega has support for various file formats such as .htm, .html, .pdf, .csv etc. This project will focus...