Global and Unified Access to Knowledge.

Technologies
python, javascript, java, scala, rdf
Topics
data science, natural language processing, semantic web, linked data, knowledge graph
Global and Unified Access to Knowledge.

The DBpedia project is aiming to extract structured content from the information created in various Wikimedia projects. This structured information resembles an open knowledge graph (KG) which is available for everyone on the Web. A knowledge graph is a special kind of database which stores knowledge in a machine-readable form and provides a means for information to be collected, organised, shared, searched and utilised. Google uses a similar approach to create those knowledge cards during search. DBpedia currently describes 38.3 million "things" of 685 different "types" in 125 languages, with over 3 billion "facts". It is interlinked to many other databases (e.g., Wikidata, New York Times, CIA World Factbook). The knowledge in DBpedia is exposed through a set of technologies called Linked Data. Started in 2006, DBpedia is one of the first (if not THE first) open knowledge graph on the Web. DBpedia provides tools that allow you to create, maintain, improve, integrate and use KGs to build applications. For example, BBC has created the World Cup 2010 website by interconnecting textual content and facts from their knowledge base. Data provided by DBpedia was greatly involved in creating this knowledge graph. More recently, IBM's Watson used DBpedia data to win the Jeopardy challenge. Several other large, medium and small companies use data from DBpedia everyday. DBpedia data is served as Linked Data, which is revolutionizing the way applications interact with the Web. One can navigate this Web of facts with standard Web browsers, automated crawlers or pose complex queries with SQL-like query languages (such as SPARQL). Have you thought of asking the Web about all cities with low criminality, warm weather and open jobs? That is the kind of query we are talking about. We are regularly growing our community through GSoC and can deliver more and more opportunities to you.

2020 Program

Successful Projects

Contributor
Zheyuan BAI
Mentor
Tommaso Soru, Anand Panchbhai
Organization
DBpedia
A Neural QA model for DBpedia: Compositionality
In order to make DBpedia and its humongous linked data available to a larger user base in their natural languages (only English for the moment), a...
Contributor
Niloy Purkait
Mentor
Diego Moussallem, Thiago Castro Ferreira, Mariana Dias da Silva
Organization
DBpedia
RDF-to-Text using Generative Adverserial Networks
We envisage the usage of GANs for this problem statement will result in fluent and adequate verbalization of the input RDF triples from DBpedia,...
Contributor
Lahiru Hinguruduwa
Mentor
Edgard Marx, RENATO FABBRI, Akshay Jagatap
Organization
DBpedia
DBpedia Neural Multilingual QA
By using the existing NSpM framework, we can translate a question asked in English language to a SPARQL query. The goal of this project is to extend...
Contributor
karankharecha
Mentor
Jan Forberg, Luca Virgili, Sebastian Hellmann
Organization
DBpedia
Dashboard for Language/National Knowledge Graphs
DBpedia holds a huge amount of data, and hence it is important to know the statistics of it. It is basically a knowledge graph. Hence, a dashboard...
Contributor
Kirill Yankov
Mentor
Amandeep Srivastava, Sebastian Hellmann
Organization
DBpedia
Combine DBpedia/Databus with IPFS
The project assumes implementaion of integration of Databus with IPFS. Databus (https://databus.dbpedia.org) is a service which allows people (or...
Contributor
Marco Sobrevilla
Mentor
Stuart Chan, Diego Moussallem, Thiago Castro Ferreira
Organization
DBpedia
A Multilingual Neural RDF Verbalizer
This project aims to overcome two problems in Neural RDF Verbalizer. The first is related to the ability of the neural models to deal with unseen...
Contributor
Mykola Medynskyi
Mentor
Beyza Yaman, Julio Hernandez, Sebastian Hellmann
Organization
DBpedia
Extending Extraction Framework with Citations, Commons and Lexeme Extractors
DBpedia is a crowd-sourced community effort to extract structured content from the various Wikimedia projects which is publicly available for...