Multi-objective optimization based overlapping community detection in software ecosystem
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
A software ecosystem can be described as a complex network, where there are many software projects and stakeholders. In this network, a node may belong to multiple communities, forming its overlapping community structure. For a software ecosystem network, the overlapping community detection is beneficial to understanding the interaction behaviors between individuals and promoting the healthy development of this system. However, existing methods of overlapping community detection have low accuracy and are difficult to obtain the significant community structure in a network. In view of this, we propose a method of the overlapping community detection based on multi-objective optimization in a software ecosystem. In the proposed method, a multi-objective optimization model for a software ecosystem is first formulated with the maximization of the extended kernel k -means (EKKM) and the extended ratio cut (ERC). Then, a hybrid individual representation that combines character string representation with binary representation based on overlapping communities is developed, and the corresponding crossover and mutation operators are designed to be integrated into the NSGA-II multi-objective optimization framework, which is beneficial to enhancing the population evolution. Six networks in a software ecosystem are built using data collected in GitHub. Based on them, a series of experimental results show the superiority of the proposed method.