A Lightweight Approach for the Automated Classification and Clustering of Metamodels

The growing adoption of model-driven engineering has raised the need for techniques and tools that support the reuse of modeling artifacts. To this end, academia and industry have proposed several model repositories whose search facilities help modelers identify reusable artifacts that fit the problem at hand. Despite a long-standing effort to find effective ways to search and retrieve modeling artifacts, satisfactory solutions are still missing. This paper investigates the use of the general-purpose indexing and search features provided by Apache Lucene to support the classification and clustering of metamodel repositories. In particular, we show that Apache Lucene yields accurate results in cases where the mandatory requirements of otherwise more appropriate techniques, such as hierarchical clustering or neural networks, cannot be met.