Abstract
We propose a novel metric ATEC for automatic MT evaluation based on explicit assessment of word choice and word order in an MT output in comparison to its reference translation(s), the two most fundamental factors in the construction of meaning for a sentence. The former is assessed by matching word forms at various linguistic levels, including surface form, stem, sound and sense, and further by weighing the informativeness of each word. The latter is quantified in term of the discordance of word position and word sequence between a translation candidate and its reference. In the evaluations using the MetricsMATR08 data set and the LDC MTC2 and MTC4 corpora, ATEC demonstrates an impressive positive correlation to human judgments at the segment level, highly comparable to the few state-of-the-art evaluation metrics.
Original language | English |
---|---|
Pages (from-to) | 141-155 |
Number of pages | 15 |
Journal | Machine Translation |
Volume | 23 |
Issue number | 2-3 |
DOIs | |
Publication status | Published - Sept 2009 |
Keywords
- ATEC
- Evaluation metrics
- MT evaluation
- Word choice
- Word order