Minería Basada en Clasificación de Componentes Reusables sobre Líneas de Productos de Software (Classification-based Mining of Reusable Components on Software Product Lines)

Maximiliano Arias (maximiliano.arias@fi.uncoma.edu.ar)1, Alan DeRenzis (alanderenzis@fi.uncoma.edu.ar)1, Agustina Buccella (agustina.buccella@fi.uncoma.edu.ar)1, Andres Flores (andres.flores@fi.uncoma.edu.ar)1, Alejandra Cechich (alejandra.cechich@fi.uncoma.edu.ar)2

1 GIISCo, Universidad Nacional del Comahue and CONICET
2 GIISCo and Universidad Nacional del Comahue

This paper appears in: Revista IEEE América Latina

Publication Date: Feb. 2016
Volume: 14, Issue: 2
ISSN: 1548-0992

Software Product Lines and Component-based systems can be combined to maximize reuse in a predictable and opportunistic manner. When a product line is built for a certain subdomain within a more generic domain, future needs from a closely related subdomain may be fulfilled by mining the line's internal components to build a new product line. In this work, we present an approach that classifies internal and external (third-party) reusable components into a repository by applying a K-Nearest Neighbors strategy, as support for building new product lines. Natural language techniques and the WordNet lexical database are also used to process information from software components. We validate the approach with an experiment based on a dataset of external third-party components and reusable components from a product line that we built in the geographic subdomain of marine ecology.

Index Terms:
Software Product Lines, Software Components, Software Reuse, K-Nearest Neighbors   
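
The classification step named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the features or the distance measure, so the sketch assumes plain word-overlap (Jaccard) similarity in place of the WordNet-based processing, and the repository entries, category names, and function names are all hypothetical.

```python
from collections import Counter

def tokenize(text):
    """Lowercase the text and return its set of word tokens."""
    return {w.strip(".,;:()").lower() for w in text.split() if w.strip(".,;:()")}

def jaccard(a, b):
    """Jaccard similarity between two token sets (1.0 means identical)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def knn_classify(description, labeled, k=3):
    """Return the majority category among the k most similar labeled components."""
    query = tokenize(description)
    ranked = sorted(
        labeled,
        key=lambda item: jaccard(query, tokenize(item[0])),
        reverse=True,
    )
    votes = Counter(category for _, category in ranked[:k])
    return votes.most_common(1)[0][0]

# Hypothetical repository entries: (component description, category).
# Assumption: these categories are illustrative, not taken from the paper.
REPOSITORY = [
    ("render map layers and zoom controls", "map-visualization"),
    ("display map tiles with pan and zoom", "map-visualization"),
    ("draw layers on an interactive map", "map-visualization"),
    ("store species census records in a database", "data-management"),
    ("query and update census records", "data-management"),
    ("export database records to files", "data-management"),
]

predicted = knn_classify("zoom and pan over map layers", REPOSITORY, k=3)
print(predicted)
```

A new component description is assigned the category that wins the vote among its k nearest neighbors in the repository. A lexical database such as WordNet could replace the exact word match with a synonym-aware similarity, which is the kind of refinement the abstract alludes to.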

