Write a Blog >>
ISSTA 2021
Sun 11 - Sat 17 July 2021 Online
co-located with ECOOP and ISSTA 2021

With the tremendous advancement of recurrent neural networks(RNN), dialogue systems have achieved significant development. Many RNN-driven dialogue systems, such as Siri, Google Home, and Alexa, have been deployed to assist various tasks. However, accompanying this outstanding performance, RNN-driven dialogue systems, which are essentially a kind of software, could also produce erroneous behaviors and result in massive losses. Meanwhile, the complexity and intractability of RNN models that power the dialogue systems make their testing challenging.
In this paper, we design and implement DialTest, the first RNN-driven dialogue system testing tool. DialTest employs a series of transformation operators to make realistic changes on seed data while preserving their oracle information properly. To improve the efficiency of detecting faults, DialTest further adopts Gini impurity to guide the test generation process. We conduct extensive experiments to validate DialTest. We first experiment it on two fundamental tasks, i.e., intent detection and slot filling, of natural language understanding. The experiment results show that DialTest can effectively detect hundreds of erroneous behaviors for different RNN-driven natural language understanding (NLU) modules of dialogue systems and improve their accuracy via retraining with the generated data. Further, we conduct a case study on an industrial dialogue system to investigate the performance of DialTest under the real usage scenario. The study shows DialTest can detect errors and improve the robustness of RNN-driven dialogue systems effectively.

Fri 16 Jul

Displayed time zone: Brussels, Copenhagen, Madrid, Paris change

02:00 - 03:20
Session 13 (time band 2) Testing Deep Learning Systems 4Technical Papers at ISSTA 1
Chair(s): Shiqing Ma Rutgers University
02:00
20m
Talk
Efficient White-Box Fairness Testing through Gradient Search
Technical Papers
Lingfeng Zhang East China Normal University, Yueling Zhang Singapore Management University, Min Zhang East China Normal University
DOI Media Attached
02:20
20m
Talk
DialTest: Automated Testing for Recurrent-Neural-Network-Driven Dialogue Systems
Technical Papers
Zixi Liu Nanjing University, Yang Feng Nanjing University, Zhenyu Chen Nanjing University
DOI
02:40
20m
Talk
AdvDoor: Adversarial Backdoor Attack of Deep Learning System
Technical Papers
Quan Zhang Tsinghua University, Yifeng Ding Tsinghua University, Yongqiang Tian Tianjin University, Jianmin Guo Tsinghua University, Min Yuan WeBank, Yu Jiang Tsinghua University
DOI
03:00
20m
Talk
ModelDiff: Testing-Based DNN Similarity Comparison for Model Reuse Detection
Technical Papers
Yuanchun Li Microsoft Research, Ziqi Zhang Peking University, Bingyan Liu Peking University, Ziyue Yang Microsoft Research, Yunxin Liu Tsinghua University
DOI

Sat 17 Jul

Displayed time zone: Brussels, Copenhagen, Madrid, Paris change

08:00 - 09:20
Session 26 (time band 3) Testing Deep Learning Systems 5Technical Papers at ISSTA 2
Chair(s): Junjie Chen Tianjin University
08:00
20m
Talk
Efficient White-Box Fairness Testing through Gradient Search
Technical Papers
Lingfeng Zhang East China Normal University, Yueling Zhang Singapore Management University, Min Zhang East China Normal University
DOI Media Attached
08:20
20m
Talk
DialTest: Automated Testing for Recurrent-Neural-Network-Driven Dialogue Systems
Technical Papers
Zixi Liu Nanjing University, Yang Feng Nanjing University, Zhenyu Chen Nanjing University
DOI
08:40
20m
Talk
AdvDoor: Adversarial Backdoor Attack of Deep Learning System
Technical Papers
Quan Zhang Tsinghua University, Yifeng Ding Tsinghua University, Yongqiang Tian Tianjin University, Jianmin Guo Tsinghua University, Min Yuan WeBank, Yu Jiang Tsinghua University
DOI
09:00
20m
Talk
ModelDiff: Testing-Based DNN Similarity Comparison for Model Reuse Detection
Technical Papers
Yuanchun Li Microsoft Research, Ziqi Zhang Peking University, Bingyan Liu Peking University, Ziyue Yang Microsoft Research, Yunxin Liu Tsinghua University
DOI