Fast, scalable and flexible search

Xapian is a search engine library which aims to be fast, scalable, and flexible. It's used by many organizations and software projects around the world, including Debian, GNOME, KDE, One Laptop per Child, and Ubuntu.

It supports ranking by Language Modelling, TF-IDF, probabilistic schemes, and Divergence from Randomness, plus a rich set of boolean query operators, and re-ranking using Machine Learning. The core library is written in C++, with bindings to allow use from many other languages.

View ideas list

Technologies

  • c++
  • unicode
  • swig
  • linux
  • golang

IRC Channel
Mailing list

Xapian Search Engine Library 2020 Projects

  • Dipanshu Garg
    NEW WEIGHTING SCHEMES
    Xapian currently supports the TF-IDF weighting scheme. Some normalisations (described by SMART) are already implemented. More normalisations can be...
  • Parth Kapadia
    Text-Extraction Libraries
    Currently, Omega supports various file formats such as .htm, .html, .pdf, and .csv. This project will focus...