Global and Unified Access to Knowledge.

The DBpedia project is aiming to extract structured content from the information created in various Wikimedia projects. This structured information resembles an open knowledge graph (KG) which is available for everyone on the Web. A knowledge graph is a special kind of database which stores knowledge in a machine-readable form and provides a means for information to be collected, organised, shared, searched and utilised. Google uses a similar approach to create those knowledge cards during search. DBpedia currently describes 38.3 million "things" of 685 different "types" in 125 languages, with over 3 billion "facts". It is interlinked to many other databases (e.g., Wikidata, New York Times, CIA World Factbook). The knowledge in DBpedia is exposed through a set of technologies called Linked Data. Started in 2006, DBpedia is one of the first (if not THE first) open knowledge graph on the Web. DBpedia provides tools that allow you to create, maintain, improve, integrate and use KGs to build applications. For example, BBC has created the World Cup 2010 website by interconnecting textual content and facts from their knowledge base. Data provided by DBpedia was greatly involved in creating this knowledge graph. More recently, IBM's Watson used DBpedia data to win the Jeopardy challenge. Several other large, medium and small companies use data from DBpedia everyday. DBpedia data is served as Linked Data, which is revolutionizing the way applications interact with the Web. One can navigate this Web of facts with standard Web browsers, automated crawlers or pose complex queries with SQL-like query languages (such as SPARQL). Have you thought of asking the Web about all cities with low criminality, warm weather and open jobs? That is the kind of query we are talking about. We are regularly growing our community through GSoC and can deliver more and more opportunities to you.

lightbulb_outline View ideas list


  • python
  • scala
  • java
  • javascript
  • rdf


  • Data and Databases
  • knowledge graph
  • natural language processing
  • semantic web
  • data science
  • linked data
comment IRC Channel
mail_outline Contact email

DBpedia 2020 Projects

  • Marco Sobrevilla
    A Multilingual Neural RDF Verbalizer
    This project aims to overcome two problems in Neural RDF Verbalizer. The first is related to the ability of the neural models to deal with unseen...
  • Zheyuan BAI
    A Neural QA model for DBpedia: Compositionality
    In order to make DBpedia and its humongous linked data available to a larger user base in their natural languages (only English for the moment), a...
  • Kirill Yankov
    Combine DBpedia/Databus with IPFS
    The project assumes implementaion of integration of Databus with IPFS. Databus ( is a service which allows people (or...
  • karankharecha
    Dashboard for Language/National Knowledge Graphs
    DBpedia holds a huge amount of data, and hence it is important to know the statistics of it. It is basically a knowledge graph. Hence, a dashboard...
  • Lahiru Hinguruduwa
    DBpedia Neural Multilingual QA
    By using the existing NSpM framework, we can translate a question asked in English language to a SPARQL query. The goal of this project is to extend...
  • Mykola Medynskyi
    Extending Extraction Framework with Citations, Commons and Lexeme Extractors
    DBpedia is a crowd-sourced community effort to extract structured content from the various Wikimedia projects which is publicly available for...
  • Niloy Purkait
    RDF-to-Text using Generative Adverserial Networks
    We envisage the usage of GANs for this problem statement will result in fluent and adequate verbalization of the input RDF triples from DBpedia,...