Poster: Comprehensive Comparisons of Embedding Approaches for Cryptographic API Completion
Fri 27 May 2022 13:30 - 15:00 at Ballroom Gallery - Posters 3
In this paper, we conduct a measurement study to comprehensively compare the accuracy of Cryptographic API completion tasks trained with multiple API embedding options. Embedding is the process of automatically learning to represent program elements as low-dimensional vectors. Our measurement aims to uncover the impacts of applying program analysis, token-level embedding, and sequence-level embedding on the Cryptographic API completion accuracies. Our findings show that program analysis is necessary even under advanced embedding. The results show 36.10% accuracy improvement on average when program analysis preprocessing is applied to transfer byte code sequences into API dependence paths. The best accuracy (93.52%) is achieved on API dependence paths with embedding techniques. On the contrary, the pure data-driven approach without program analysis only achieves a low accuracy (around 57.60%), even after the powerful sequence-level embedding is applied. Although sequence-level embedding shows slight accuracy advantages (0.55% on average) over token-level embedding in our basic data split setting, it is not recommended considering its expensive training cost. A more obvious accuracy improvement (5.10%) from sequence-level embedding is observed under the cross-project learning scenario when task data is insufficient. Hence, we recommend applying sequence-level embedding for cross-project learning with limited task-specific data.
Thu 12 MayDisplayed time zone: Eastern Time (US & Canada) change
11:00 - 12:00 | |||
11:00 5mPoster | Enabling End-Users to Implement Larger Block-Based Programs Posters Nico Ritschel The University of British Columbia, Felipe Fronchetti Virginia Commonwealth University, Reid Holmes University of British Columbia, Ronald Garcia University of British Columbia, David C. Shepherd Virginia Commonwealth University | ||
11:05 5mPoster | Mutation Testing of Quantum Programs written in QISKit Posters Daniel Fortunato INESC-ID, University of Porto, José Campos University of Lisbon, Portugal, Rui Abreu Faculty of Engineering, University of Porto, Portugal | ||
11:10 5mPoster | Poster: Comprehensive Comparisons of Embedding Approaches for Cryptographic API Completion Posters Ya Xiao Virginia Tech, Salman Ahmed Virginia Polytechnic Institute and State University, Xinyang Ge Microsoft Research, Bimal Viswanath Virginia Tech, Na Meng Virginia Tech, Daphne Yao Virginia Tech | ||
11:15 5mPoster | Improving Responsiveness of Android Activity Navigation via Genetic Improvement Posters | ||
11:20 5mPoster | A Quick Repair Facility for Debugging Posters | ||
11:25 5mPoster | Flexible Model-Driven Runtime Monitoring Support for Cyber-Physical Systems Posters Marco Stadler Johannes Kepler University Linz, Michael Vierhauser Johannes Kepler University Linz, Antonio Garmendia Johannes Kepler University Linz, Manuel Wimmer JKU Linz, Jane Cleland-Huang University of Notre Dame Pre-print |