Metamorphic Testing of Machine Translation Models using Back Translation (DeepTest 2023)

Who

Wentao Gao, Jiayuan He, Van-Thuan Pham

Track

DeepTest 2023

Time Zone

The program is currently displayed in (GMT+10:00) Hobart.

Use conference time zone: (GMT+10:00) HobartSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Mon 15 May 2023 13:45 - 14:05 at Meeting Room 209 - Session 2

Abstract

Machine translation software has been widely adopted in recent years. The recent advance in deep learning research has massively improved the accuracy and fluency of the translated output. However, incorrect translations may still occur, which cause misunderstandings, and even more detrimental consequences when applying these systems for crucial applications, such as translating legal and medical documents. This calls for methods that can test the correctness of machine translation software efficiently and effectively. In this paper, we propose a method, which uses back-translation as a reference for machine translation testing, minimizing the knowledge and use of the NLP tools in the target language, so that the same workflow can be applied to test systems translating English to multiple languages. We build a metamorphic testing method using our proposed concept called contextual referentially transparent input (CRTI). A CRTI is a piece of text that should have a similar meaning under a certain context in any given language. Our method detects inconsistency between a CRTI in the original sentence and the back-translation to report translation errors. To evaluate our method, we translate 200 sentences using Google Translate. Our method reports 57 suspicious issues with a precision of 74% in Chinese translation and 22 suspicious issues with a precision of 82% in Vietnamese translation.

Wentao Gao

University of Melbourne

Australia

Jiayuan He

RMIT University

Australia

Van-Thuan Pham

Monash University

Australia

Time Zone

The program is currently displayed in (GMT+10:00) Hobart.

Use conference time zone: (GMT+10:00) HobartSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Mon 15 May
Displayed time zone: Hobart change

13:45 - 15:15	Session 2DeepTest at Meeting Room 209

13:45 20m Talk		Metamorphic Testing of Machine Translation Models using Back Translation DeepTest Wentao Gao University of Melbourne, Jiayuan He RMIT University, Van-Thuan Pham Monash University
14:05 20m Talk		A Method of Identifying Causes of Prediction Errors to Accelerate MLOps DeepTest Keita Sakuma NEC Corporation, Ryuta Matsuno NEC Corporation, Yoshio Kameda NEC Corporation
14:25 20m Talk		DeepSHAP Summary for Adversarial Example Detection DeepTest Yi-Ching Lin National Chengchi University, Fang Yu National Chengchi University
14:45 20m Talk		DeepPatch: A Patching-Based Method for Repairing Deep Neural Networks DeepTest Hao Bu Peking University, Meng Sun Peking University