Dictionary Induction from Parallel Corpora
- Mentors
- Kevin Brubeck Unhammer
- Organization
- Apertium
- Technologies
- python, linux, xml, git, bash
- Topics
- automation, machine translation, scripting
The aim is to construct bidirectional dictionaries for a language pair, given a pair of parallel corpora - i.e., the same content in two different languages using a single script that does the job for the user in a single step, unlike the current method that requires multiple steps to do the job. The final deliverable is a Python script that does the job in one step.