Contributor
kellnerd

BookBrainz: Import Other Open Databases


Mentors
Monkey
Organization
MetaBrainz Foundation Inc
Technologies
postgresql, node.js, typescript, MARC 21
Topics
web, open data, Database Import
BookBrainz still has a relatively small community and contains less entities than other comparable databases. Therefore we want to provide a way to import available collections of library records into the database while still ensuring that they meet BookBrainz' high data quality standards. From a previous GSoC project, the database schema already contains additional tables set up for that purpose, where the imports will await a user's approval before becoming a fully accepted entity in the database. The project will require processing very large data dumps (e.g. MARC records or JSON files) in a robust way and transforming entities from one database schema to the BookBrainz schema. Additionally the whole process should be repeatable without creating duplicate entries.