Context-dependent multilingual lexical lookup for under-resourced languages

Lian, Tze Lim and Enya, Kong Tang and Lay-Ki, Soon and Tek, Yong Lim and Ranaivo-Malançon, Bali (2013) Context-dependent multilingual lexical lookup for under-resourced languages. In: 51st Annual Meeting of the Association for Computational Linguistics, ACL 2013, 4 August 2013 through 9 August 2013, Sofia; Bulgaria.

Context-dependent multilingual lexical lookup for under-resourced languages (abstrak).pdf

Download (80kB) | Preview


Current approaches for word sense disambiguation and translation selection typically require lexical resources or large bilingual corpora with rich information fields and annotations, which are often infeasible for under-resourced languages. We extract translation context knowledge from a bilingual comparable corpora of a richer-resourced language pair, and inject it into a multilingual lexicon. The multilingual lexicon can then be used to perform context-dependent lexical lookup on texts of any language, including under-resourced ones. Evaluations on a prototype lookup tool, trained on a English-Malay bilingual Wikipedia corpus, show a precision score of 0.65 (baseline 0.55) and mean reciprocal rank score of 0.81 (baseline 0.771). Based on the early encouraging results, the context-dependent lexical lookup tool may be developed further into an intelligent reading aid, to help users grasp the gist of a second or foreign language text.

Item Type: Proceeding (Paper)
Uncontrolled Keywords: Computational linguistics, unimas, university, universiti, Borneo, Malaysia, Sarawak, Kuching, Samarahan, ipta, education, research, Universiti Malaysia Sarawak
Subjects: T Technology > T Technology (General)
Divisions: Academic Faculties, Institutes and Centres > Faculty of Computer Science and Information Technology
Depositing User: Saman
Date Deposited: 07 Jun 2017 01:02
Last Modified: 07 Jun 2017 01:02

Actions (For repository members only: login required)

View Item View Item