Cuneiform Digital Library Initiative

International digital library of artifacts inscribed with cuneiform writing

Technologies
python, mariadb, rdf, php, nltk
Topics
natural language processing, semantic web, machine translation, information retrieval, linguistics

The mission of the Cuneiform Digital Library Initiative (CDLI) is to collect, preserve and make available images, text and metadata of all artifacts inscribed with the cuneiform script. It is the sole project with this mission, and we estimate that our 334,000 catalogue entries cover some two-thirds of all sources in collections around the world. Our data are publicly available at https://cdli.ucla.edu. Our audience consists primarily of scholars and students, with a growing number of informal learners.

At the heart of CDLI is a group of developers, language scientists, machine learning engineers, and cuneiform specialists who develop software infrastructure to process and analyze curated data. To this end, we are actively developing two projects: Framework Update (https://cdli.ucla.edu/?q=news/cdli-core-update) and Machine Translation and Automated Analysis of Cuneiform Languages (https://cdli-gh.github.io/mtaac/). As part of these projects, we are building a natural language processing platform that enables specialists in ancient languages to carry out automated annotation and translation of Sumerian texts, and thereby supports data-driven study of the languages, culture, history, economy and politics of ancient Near Eastern civilizations. Within this platform, we focus on data standardization using Linked Open Data to foster best practices in data exchange and integration with other digital humanities and computational philology projects.

2018 Program

Successful Projects

Contributor
Bakhtiyar Syed
Mentor
Niko Schenk, Ilya Khait
Organization
Cuneiform Digital Library Initiative
Towards building the first Semantic Role Labeling system for Sumerian
Semantic role labeling (SRL) is a Natural Language Processing task that helps detect the semantic arguments of the predicate(s) of a...
Contributor
Sameer Sengar
Mentor
Jayanth, Saurabh Trikande, Emilie Page-Perron
Organization
Cuneiform Digital Library Initiative
Granular Temporal Data Management
The Cuneiform Digital Library Initiative (CDLI) aims to collect, maintain and make available images, text, and metadata of all artifacts inscribed...