Contributor
Chaitanya Gambali

Dictionary Induction from Parallel Corpora


Mentors
Kevin Brubeck Unhammer
Organization
Apertium
Technologies
python, linux, xml, git, bash
Topics
automation, machine translation, scripting
The aim is to construct bidirectional dictionaries for a language pair, given a pair of parallel corpora - i.e., the same content in two different languages using a single script that does the job for the user in a single step, unlike the current method that requires multiple steps to do the job. The final deliverable is a Python script that does the job in one step.