Multi-objective optimization based overlapping community detection in software ecosystem

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

A software ecosystem can be described as a complex network, where there are many software projects and stakeholders. In this network, a node may belong to multiple communities, forming its overlapping community structure. For a software ecosystem network, the overlapping community detection is beneficial to understanding the interaction behaviors between individuals and promoting the healthy development of this system. However, existing methods of overlapping community detection have low accuracy and are difficult to obtain the significant community structure in a network. In view of this, we propose a method of the overlapping community detection based on multi-objective optimization in a software ecosystem. In the proposed method, a multi-objective optimization model for a software ecosystem is first formulated with the maximization of the extended kernel k -means (EKKM) and the extended ratio cut (ERC). Then, a hybrid individual representation that combines character string representation with binary representation based on overlapping communities is developed, and the corresponding crossover and mutation operators are designed to be integrated into the NSGA-II multi-objective optimization framework, which is beneficial to enhancing the population evolution. Six networks in a software ecosystem are built using data collected in GitHub. Based on them, a series of experimental results show the superiority of the proposed method.

Article activity feed