Comparison of Machine Translation Services in the Biomedical Context | IEEE Conference Publication | IEEE Xplore

Comparison of Machine Translation Services in the Biomedical Context


Abstract:

Patients and healthcare providers often face communication barriers due to language differences, which can impede healthcare quality. Moreover, with digital medicine and health applications, health data in different languages is becoming increasingly available and connected. Machine translation services may be suitable for bridging these language barriers. This paper aims to assess the translation quality of different machine translation services in a biomedical context. We compare the European Commission’s eTranslation service, DeepL, Google Translate, and the Watson Language Translator, using an automated, extensible, and open-source pipeline. Parallel biomedical corpora are identified and used to translate German, French, and Spanish sentences to English. We apply the evaluation metrics BLEU, ROUGE, and BLEURT, and additionally rank translations in a human validation. Both the evaluation metrics and the human validation indicate a higher translation quality for DeepL and Google Translate than for the Watson Language Translator and eTranslation. Human raters also identified more errors in eTranslation and Watson Language Translator translations. The BLEURT metric reaches the highest agreement with the human validation. However, the effect sizes of the differences in evaluation metrics are small, and the raters agreed that the translations were generally reasonable. While the translations are understandable overall, in practice it should always be clear to users that translations were produced automatically and can be erroneous. Moreover, when choosing a translation service for a medical service, features other than translation quality, such as speed and data security, should be considered. In future versions, the proposed pipeline could be extended to include updated versions of translation services, large language models, further languages, and additional evaluation metrics.
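To illustrate the kind of reference-based scoring the abstract refers to, the following is a minimal sketch in Python, not the authors' pipeline. It assumes whitespace tokenization and implements only two simplified building blocks: the clipped n-gram precision that underlies BLEU (without brevity penalty or multi-order averaging) and a ROUGE-1 F1 score. The function names, the service labels, and the example sentences are hypothetical; BLEURT is a learned metric and is not covered here.

```python
from collections import Counter


def ngrams(tokens, n):
    """Count all n-grams of order n in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def clipped_precision(candidate, reference, n=1):
    """Clipped n-gram precision, the core ingredient of BLEU (simplified)."""
    cand = ngrams(candidate.lower().split(), n)
    ref = ngrams(reference.lower().split(), n)
    total = sum(cand.values())
    if total == 0:
        return 0.0
    # Each candidate n-gram is credited at most as often as it occurs in the reference.
    overlap = sum(min(count, ref[gram]) for gram, count in cand.items())
    return overlap / total


def rouge1_f1(candidate, reference):
    """ROUGE-1 F1: harmonic mean of unigram precision and recall."""
    cand = Counter(candidate.lower().split())
    ref = Counter(reference.lower().split())
    overlap = sum((cand & ref).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)


def rank_services(outputs, reference, metric):
    """Order service names by metric score against one reference, best first."""
    return sorted(outputs, key=lambda name: metric(outputs[name], reference), reverse=True)


if __name__ == "__main__":
    # Hypothetical reference translation and service outputs, for illustration only.
    reference = "the patient has fever"
    outputs = {
        "service_a": "the patient has fever",
        "service_b": "patient fever",
    }
    for name in rank_services(outputs, reference, rouge1_f1):
        print(name, round(rouge1_f1(outputs[name], reference), 3))
```

In a real evaluation, scores would be aggregated over a whole parallel corpus rather than a single sentence, and established metric implementations would be preferable to this simplified version.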
Date of Conference: 03-06 December 2024
Date Added to IEEE Xplore: 10 January 2025
Conference Location: Lisbon, Portugal


I. Introduction

Doctor-patient communication is a central factor for successful healthcare delivery [1], [2]. However, especially in today’s globalized world, language barriers between healthcare providers and patients can significantly impact the quality of healthcare, affecting both access to services and the accuracy of diagnoses and treatments [3]–[5]. In addition, medical data is becoming increasingly available and connected in digital medicine and health applications, supported by projects such as Smart4Health and HEALTH-X dataLOFT. At the same time, the quality of machine translation services has improved tremendously in recent years [6]. In this work, we aim to assess the translation quality of machine translation services in the biomedical context.
