Cuneiform Digital Library Initiative (CDLI)

International digital library of artifacts inscribed with cuneiform writing

Technologies
python, mysql, java, php, html/css
Topics
computer vision, data processing pipeline, nlp, lod, data-collection

The mission of the Cuneiform Digital Library Initiative (CDLI) is to collect, preserve and make available images, text and metadata of all artifacts inscribed with the cuneiform script. It is the sole project with this mission, and we estimate that our 334,000 catalogue entries cover some two-thirds of all sources in collections around the world. Our data are publicly available at https://cdli.ucla.edu. Our audience consists primarily of scholars and students, with a growing number of informal learners.

At the heart of CDLI is a group of developers, language scientists, machine learning engineers, and cuneiform specialists who develop software infrastructure to process and analyze curated data. To this end, we are actively developing two projects: Framework Update (https://cdli.ucla.edu/?q=news/cdli-core-update) and Machine Translation and Automated Analysis of Cuneiform Languages (https://cdli-gh.github.io/mtaac/). As part of these projects, we are building a natural language processing platform that enables specialists in ancient languages to annotate and translate Sumerian texts automatically, supporting data-driven study of the languages, culture, history, economy and politics of the ancient Near Eastern civilizations. Within this platform, we focus on data standardization using Linked Open Data to foster best practices in data exchange and integration with other digital humanities and computational philology projects.

2019 Program

Successful Projects

Contributor
Sagar Sagar
Mentor
Ilya Khait, Max Ionov
Organization
Cuneiform Digital Library Initiative (CDLI)
Multiple Layer Annotations Querying
Currently, no tool that can be integrated into a website is able to query across multiple layers of linguistic annotations...
Contributor
Amaan Iqbal
Mentor
Shraddha Manchekar, rune.r, Émilie Pagé-Perron
Organization
Cuneiform Digital Library Initiative (CDLI)
CDLI - Search Results Visualizations
CDLI has rich geographical and temporal data at its disposal. Currently, this information is not fully utilized. Although the data schema is being...
Contributor
rillian
Mentor
Willis Monroe
Organization
Cuneiform Digital Library Initiative (CDLI)
TEI Export for the CDLI Corpus
I am writing an export tool for the CDLI dataset so it can be used with the Scaife viewer. The tool will need to convert the native ATF markup used...
Contributor
Ravneet Punia
Mentor
Niko Schenk
Organization
Cuneiform Digital Library Initiative (CDLI)
Neural Machine Translation for Sumerian and English
The project aims to build a machine translation model that can translate Sumerian (a language used around 2000 BC) into English using Neural...
Contributor
Vishal Thamizharasan
Mentor
Jayanth
Organization
Cuneiform Digital Library Initiative (CDLI)
Computer vision challenge for the cuneiform script
The current display system used at CDLI requires that a user reads a text to absorb visual and text information simultaneously, and to interpret the...