Model-Driven Engineering practitioners have to deal with the construction of modelling environments by devising meta-models, grammars, editors, etc. One of the goals of the application of Machine Learning to MDE is to use ML algorithms to assist the MDE expert in these tasks. These algorithms cannot directly receive raw models or meta-models as input, but they typically have to be transformed into a numeric representation, i.e., a vector. In this context, a common approach is to use pre-trained Word Embeddings, which define mapping functions that associate words to semantic vectors. However, current word embeddings are trained with general texts and lack the technical words which typically arise in the modelling domain. To tackle this issue, we have collected a corpus of modelling texts from well-known modelling venues, and we have trained two types of word embedding models. The resulting embeddings (named WordE4MDE) are specialised to address ML tasks in the MDE domain. We have performed an extensive evaluation using the Ecore models of the ModelSet dataset and two state-of-the-art word embeddings (GloVe and Word2Vec) as baselines. We show that WordE4MDE outperforms these two baselines in three meta-modelling tasks, namely meta-model classification, meta-model clustering, and meta-model concept recommendation. WordE4MDE embeddings are available at https://github.com/models-lab/worde4mde and can be loaded using standard Python libraries for their use in ML pipelines.
Thu 5 OctDisplayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change
15:30 - 17:00 | |||
15:30 22mTalk | Word Embeddings for Model-Driven Engineering Technical Track José Antonio Hernández López Linkoping University, Carlos Durá , Jesús Sánchez Cuadrado Universidad de Murcia Pre-print Media Attached | ||
15:52 22mTalk | Automated Domain Modeling with Large Language Models: A Comparative Study Technical Track Kua Chen , Yujing Yang , Boqi Chen McGill University, José Antonio Hernández López Linkoping University, Gunter Mussbacher McGill University, Daniel Varro Linköping University / McGill University | ||
16:15 22mTalk | SkeMo: Sketch Modeling for Real-Time Model Component Generation Technical Track | ||
16:37 22mTalk | Toward a Symbiotic Approach Leveraging Generative AI for Model-Driven Engineering Technical Track Vinay Kulkarni Tata Consultancy Services Research, Sreedhar Reddy , Souvik Barat Tata Consultancy Services Research, Jaya Dutta |