Diplomatist: What Do Cross-language Dependencies Reflect Software Ecosystem Health?
In large-scale software development, multilingual projects, those involving multiple interacting programming languages, have become increasingly common in both industry and the open-source community. Research indicates that cross-language dependencies in these projects can increase the likelihood of risks, such as functionality defects and security vulnerabilities. While most existing studies focus on cross-language dependencies between host languages and specific guest languages (e.g., C/C++), interactions between host languages and a broader range of guest languages, as well as the broader impact of such dependencies on software ecosystems, remain underexplored.
To address the above limitations, in this paper, we develop a technique, \textsc{Diplomatist}, to identify and analyze cross-language dependencies between host languages, such as Java, and guest languages, including JavaScript, Python, Ruby, PHP, and C/C++. \textsc{Diplomatist} automatically analyzes \textit{cross-language invocation APIs} and constructs a large-scale knowledge repository to standardize code features for identifying library versions across various guest languages, enabling host languages to trace the guest language libraries they invoke. Evaluation shows that \textsc{Diplomatist} achieved an average precision of 88.9% and a recall of 91.5% on a high-quality benchmark, indicating its high accuracy in detecting cross-language dependencies. Using \textsc{Diplomatist}, we identified 435,258 Java libraries that indirectly or transitively depend on libraries from other ecosystems. \textsc{Diplomatist} provides a list of cross-language pivotal libraries that contribute to preserving the long-term health and sustainability of software ecosystems. Moreover, we conduct a case study to examine the impact of the risks introduced due to cross-language dependencies on programming language ecosystems, by analyzing a full-picture of the cross-language dependency graph. Our findings show that fragile projects or libraries can propagate security issues across ecosystems via these dependencies, impacting 13,739 downstream projects in the \textit{Maven} ecosystem. We utilized \textsc{Diplomatist} to provide remediation suggestions to relevant project developers. Issue reports of some subjects have been confirmed by developers.
Mon 17 NovDisplayed time zone: Seoul change
11:00 - 12:20 | Efficiency & Fairness 1Research Papers at Grand Hall 4 Chair(s): Lwin Khin Shar Singapore Management University | ||
11:00 10mTalk | AutoFid: Adaptive and Noise-Aware Fidelity Measurement for Quantum Programs via Circuit Graph Analysis Research Papers | ||
11:10 10mTalk | HybridSIMD: A Super C++ SIMD Library with Integrated Auto-tuning Capabilities Research Papers Haolin Pan Institute of Software, Chinese Academy of Sciences;School of Intelligent Science and Technology, HIAS, UCAS, Hangzhou;University of Chinese Academy of Sciences, Xulin Zhou Institute of Software, Chinese Academy of Sciences; University of Chinese Academy of Sciences, Mingjie Xing Institute of Software, Chinese Academy of Sciences, Yanjun Wu Institute of Software, Chinese Academy of Sciences | ||
11:20 10mTalk | PEACE: Towards Efficient Project-Level Performance Optimization via Hybrid Code Editing Research Papers Xiaoxue Ren Zhejiang University, Jun Wan Zhejiang University, Yun Peng The Chinese University of Hong Kong, Zhongxin Liu Zhejiang University, Ming Liang Ant Group, Dajun Chen Ant Group, Wei Jiang Ant Group, Yong Li Ant Group Pre-print | ||
11:30 10mTalk | CoTune: Co-evolutionary Configuration Tuning Research Papers Gangda Xiong University of Electronic Science and Technology of China, Tao Chen University of Birmingham Pre-print | ||
11:40 10mTalk | It's Not Easy Being Green: On the Energy Efficiency of Programming Languages Research Papers Nicolas van Kempen University of Massachusetts Amherst, USA, Hyuk-Je Kwon University of Massachusetts Amherst, Dung Nguyen University of Massachusetts Amherst, Emery D. Berger University of Massachusetts Amherst and Amazon Web Services | ||
11:50 10mTalk | When Faster Isn't Greener: The Hidden Costs of LLM-Based Code Optimization Research Papers Tristan Coignion Université de Lille - Inria, Clément Quinton Université de Lille, Romain Rouvoy University Lille 1 and INRIA | ||
12:00 10mTalk | United We Stand: Towards End-to-End Log-based Fault Diagnosis via Interactive Multi-Task Learning Research Papers Minghua He Peking University, Chiming Duan Peking University, Pei Xiao Peking University, Tong Jia Institute for Artificial Intelligence, Peking University, Beijing, China, Siyu Yu The Chinese University of Hong Kong, Shenzhen (CUHK-Shenzhen), Lingzhe Zhang Peking University, China, Weijie Hong Peking university, Jing Han ZTE Corporation, Yifan Wu Peking University, Ying Li School of Software and Microelectronics, Peking University, Beijing, China, Gang Huang Peking University | ||
12:10 10mTalk | Diplomatist: What Do Cross-language Dependencies Reflect Software Ecosystem Health? Research Papers Fanyi Meng Shenyang University of Technology, Ying Wang Northeastern University, Chun Yong Chong Monash University Malaysia, Hai Yu Northeastern University, China, Zhiliang Zhu Northeastern University, China | ||