BookBrainz: Import Other Open Databases
- Mentors
- Monkey
- Organization
- MetaBrainz Foundation Inc
- Technologies
- postgresql, node.js, typescript, MARC 21
- Topics
- web, open data, Database Import
BookBrainz still has a relatively small community and contains less entities than other comparable databases. Therefore we want to provide a way to import available collections of library records into the database while still ensuring that they meet BookBrainz' high data quality standards.
From a previous GSoC project, the database schema already contains additional tables set up for that purpose, where the imports will await a user's approval before becoming a fully accepted entity in the database.
The project will require processing very large data dumps (e.g. MARC records or JSON files) in a robust way and transforming entities from one database schema to the BookBrainz schema.
Additionally the whole process should be repeatable without creating duplicate entries.