Study Reveals Tradeoff in AI Literary Translations
A new study published on arXiv.org analyzing literary translations has found a consistent tradeoff between fluency and faithfulness in both human and machine-generated translations, with implications for AI language systems. Researchers examined 130,486 translated paragraphs from 106 novels in 16 source languages, comparing outputs from human translators, Google Translate, and TranslateGemma.
The research revealed negative correlations between fluency (naturalness of target-language expression) and faithfulness (semantic preservation of source material) that varied significantly across translation models. While large language models often produce fluent translations, the study suggests this fluency does not always correlate with accurate preservation of literary meaning.
“This challenges the assumption that fluent output equates to high-quality literary translation,” the study published on arXiv.org notes. The findings involve Google Translate, a major U.S. tech company’s system, and could influence development of AI translation tools used across American industries including publishing, education, and international business.
The work adds to growing academic scrutiny of AI translation systems as they become more prevalent in creative and professional contexts. Researchers emphasize the need for evaluation metrics that balance both fluency and semantic accuracy in literary works.