Fast, scalable and flexible search

Xapian is a search engine library that aims to be fast, scalable, and flexible. It's used by many organizations around the world, including Debian, Gmane, One Laptop per Child, and Ubuntu. It supports ranking by TF-IDF, probabilistic schemes, and Divergence from Randomness, plus a rich set of boolean query operators. The core library is written in C++, with bindings that allow use from many other languages.


  • c++
  • unicode
  • swig
  • android
  • golang


  • Data and Databases
  • search
  • information retrieval
  • machine learning
  • linguistics
  • integration

Xapian Search Engine Library 2016 Projects

  • Richhiey
    Clustering of Search Results
    The current clustering functionality in Xapian uses hierarchical clustering, and though it provides fairly good results for low...
  • ayshtmr
    Learning to Rank Stabilisation
    The project aims to provide an end-to-end system for Learning to Rank (Letor) functionality in Xapian, which could be included in the next stable...
  • Amanda Jayanetti
    R bindings for Xapian
    R is one of the best available programming languages for computational statistics, data visualization and data science. The goal of this project is...
  • ng420
    Support for HHVM
    HHVM is an open-source virtual machine designed for executing programs written in Hack and PHP. Rather than directly interpret or compile PHP code...
  • Vivek Pal
    Weighting Schemes
    Primarily, the idea is to improve existing weighting schemes in Xapian. There are other weighting schemes which can potentially be employed but they...