-
Similarity Means: A Study on Stability and Symmetry
Authors:
Gonzalo Travieso,
Luciano da F. Costa
Abstract:
The arithmetic mean plays a central role in science and technology, being directly related to the concepts of statistical expectation and centrality. Yet, it is highly susceptible to the presence of outliers or biased interference in the original dataset to which it is applied. Described recently, the concept of similarity means has been preliminarily found to have marked robustness to those same effects, especially when adopting the Jaccard similarity index. The present work is aimed at investigating further the properties of similarity means, especially regarding their range, translation and scaling properties, sensitivity and robustness to outliers. Several interesting contributions are reported, including an effective algorithm for obtaining the similarity mean, the analytic and experimental identification of a number of properties, as well as the confirmation of the potential stability of the similarity mean to the presence of outliers. The present work also describes an application case-example in which the Jaccard similarity is successfully employed to study cycles of sunspots, with interesting results.
Submitted 21 November, 2023;
originally announced November 2023.
-
Community detection in networks using self-avoiding random walks
Authors:
Guilherme de Guzzi Bagnato,
José Ricardo Furlan Ronqui,
Gonzalo Travieso
Abstract:
Different kinds of random walks have proven to be useful in the study of structural properties of complex networks. Among them, the restricted dynamics of self-avoiding random walks (SAW), which visit each vertex at most once in the same walk, has been successfully used in network exploration. The detection of communities of strongly connected vertices in networks remains an open problem, despite its importance, due to the high computational complexity of the associated optimization problem and the lack of a unique formal definition of communities. In this work, we propose a SAW-based method to extract the community distribution of a network and show that it achieves high modularity scores, especially for real-world networks. We combine SAW with principal component analysis to define the dissimilarity measure to be used for agglomerative hierarchical clustering. To evaluate the performance of this method we compare it with four popular methods for community detection (Girvan-Newman, Fastgreedy, Walktrap and Infomap) using two types of synthetic networks and six well-known real-world cases.
Submitted 21 January, 2018; v1 submitted 28 July, 2016;
originally announced July 2016.
-
A complex network approach to cloud computing
Authors:
Gonzalo Travieso,
Carlos Antonio Ruggiero,
Odemir Martinez Bruno,
Luciano da Fontoura Costa
Abstract:
Cloud computing has become an important means to speed up computing. One problem heavily influencing the performance of such systems is the choice of nodes as servers responsible for executing the users' tasks. In this article we report how complex networks can be used to model such a problem. More specifically, we investigate the processing performance of cloud systems underlain by Erdos-Renyi (ER) and Barabasi-Albert (BA) topologies containing two servers. Cloud networks involving two communities not necessarily of the same size are also considered in our analysis. The performance of each configuration is quantified in terms of two indices: the cost of communication between the user and the nearest server, and the balance of the distribution of tasks between the two servers. Regarding the latter index, the ER topology provides better performance than the BA case for smaller average degrees and the opposite behavior for larger average degrees. With respect to the cost, smaller values are found in the BA topology irrespective of the average degree. In addition, we also verified that it is easier to find good servers in ER than in BA networks. Surprisingly, balance and cost are not greatly affected by the presence of communities. However, for a well-defined community network, we found that it is important to assign each server to a different community so as to achieve better performance.
Submitted 10 April, 2015;
originally announced April 2015.
-
Analyzing complex networks through correlations in centrality measurements
Authors:
José Ricardo Furlan Ronqui,
Gonzalo Travieso
Abstract:
Many real-world systems can be expressed as complex networks of interconnected nodes. It is frequently important to be able to quantify the relative importance of the various nodes in the network, a task accomplished by defining some centrality measures, with different centrality definitions stressing different aspects of the network. It is interesting to know to what extent these different centrality definitions are related for different networks. In this work, we study the correlation between pairs of a set of centrality measures for different real-world networks and two network models. We show that the centralities are in general correlated, but with stronger correlations for network models than for real networks. We also show that the strength of the correlation of each pair of centralities varies from network to network. Taking this fact into account, we propose the use of a centrality correlation profile, consisting of the values of the correlation coefficients between all pairs of centralities of interest, as a way to characterize networks. Using the yeast protein interaction network as an example, we also show that the centrality correlation profile can be used to assess the adequacy of a network model as a representation of a given real network.
Submitted 4 March, 2015; v1 submitted 29 May, 2014;
originally announced May 2014.
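The centrality correlation profile described in this abstract can be illustrated with a minimal sketch. It assumes the centrality values have already been computed, one list per measure with nodes in the same order, and it assumes the Pearson coefficient, since the abstract does not say which correlation coefficient is used. The function names `pearson` and `correlation_profile` are hypothetical and not taken from the paper.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (assumed choice of coefficient)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (norm_x * norm_y)


def correlation_profile(centralities):
    """Map each unordered pair of centrality names to the correlation of their values.

    `centralities` is a dict: name -> list of per-node values (hypothetical layout).
    """
    names = sorted(centralities)
    return {
        (a, b): pearson(centralities[a], centralities[b])
        for a, b in combinations(names, 2)
    }


# Toy values for four nodes; purely illustrative, not data from the paper.
example = {
    "degree": [1, 2, 3, 4],
    "other": [2, 4, 6, 8],
    "inverse": [4, 3, 2, 1],
}
profile = correlation_profile(example)
```

The resulting dictionary has one entry per pair of measures, so two networks can be compared by comparing their profiles entry by entry.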
-
Predicting Efficiency in master-slave grid computing systems
Authors:
Gonzalo Travieso,
Carlos A. Ruggiero,
Odemir M. Bruno,
Luciano da F. Costa
Abstract:
This work reports a quantitative analysis for predicting the efficiency of distributed computing running in three models of complex networks: Barabási-Albert, Erdős-Rényi and Watts-Strogatz. A master/slave computing model is simulated. A node is selected as master and distributes tasks among the other nodes (the clients). Topological measurements associated with the master node (e.g. its degree or betweenness centrality) are extracted and considered as predictors of the total execution time. It is found that the closeness centrality provides the best alternative. The effect of network size was also investigated.
Submitted 12 July, 2012;
originally announced July 2012.
-
Evaluating links through spectral decomposition
Authors:
Gonzalo Travieso,
Luciano da Fontoura Costa
Abstract:
Spectral decomposition has rarely been used to investigate complex networks. In this work we apply this concept in order to define two types of link-directed attacks while quantifying their respective effects on the topology. Several other types of more traditional attacks are also adopted and compared. These attacks had substantially diverse effects, depending on each specific network (models and real-world structures). It is also shown that the spectral-based attacks have a particular effect on the transitivity of the networks.
Submitted 22 September, 2011;
originally announced September 2011.
-
Can the evolution of music be analyzed in a quantitative manner?
Authors:
Vilson Vieira,
Renato Fabbri,
Gonzalo Travieso,
Luciano da Fontoura Costa
Abstract:
We propose a methodology to study music development by applying multivariate statistics to composers' characteristics. Seven representative composers were considered in terms of eight main musical features. Grades were assigned to each characteristic and their correlations were analyzed. A bootstrap method was applied to simulate hundreds of artificial composers influenced by the seven representatives chosen. Afterwards we quantify non-numeric relations like dialectics, opposition and innovation. Composers' differences in style and technique were represented as geometrical distances in the feature space, making it possible to quantify, for example, how much Bach and Stockhausen differ from other composers or how much Beethoven influenced Brahms. In addition, we compared the results with a prior investigation on philosophy. Opposition, strong in philosophy, was not remarkable in music. Supporting an observation already considered by music theorists, strong influences were identified between composers by the quantification of dialectics, implying inheritance and suggesting a stronger master-disciple evolution when compared to the philosophy analysis.
Submitted 4 March, 2012; v1 submitted 21 September, 2011;
originally announced September 2011.
-
On the Efficiency of Data Representation on the Modeling and Characterization of Complex Networks
Authors:
Carlos A. Ruggiero,
Odemir M. Bruno,
Gonzalo Travieso,
Luciano da Fontoura Costa
Abstract:
Specific choices about how to represent complex networks can have a substantial effect on the execution time required for the respective construction and analysis of those structures. In this work we report a comparison of the effects of representing complex networks statically as matrices or dynamically as sparse structures. Three theoretical models of complex networks are considered: two types of Erdos-Renyi as well as the Barabasi-Albert model. We investigated the effect of the different representations with respect to the construction and measurement of several topological properties (i.e. degree, clustering coefficient, shortest path length, and betweenness centrality). We found that different forms of representation generally have a substantial effect on the execution time, with the sparse representation frequently resulting in remarkably superior performance.
Submitted 21 May, 2010;
originally announced May 2010.
-
Sensitivity of complex networks measurements
Authors:
P. R. Villas Boas,
F. A. Rodrigues,
G. Travieso,
L. da F. Costa
Abstract:
Complex networks obtained from real-world systems are often characterized by incompleteness and noise, consequences of limited sampling as well as artifacts in the acquisition process. Because the characterization, analysis and modeling of complex systems underlain by complex networks are critically affected by the quality of the respective initial structures, it becomes imperative to devise methodologies for identifying and quantifying the effect of such sampling problems on the characterization of complex networks. Given that several measurements need to be applied in order to achieve a comprehensive characterization of complex networks, it is important to investigate the effect of incompleteness and noise on such quantifications. In this article we report such a study, involving 8 different measurements applied to 6 different complex network models. We evaluate the sensitivity of the measurements to perturbations in the topology of the network considering the relative entropy. Three particularly important types of progressive perturbations to the network are considered: edge suppression, addition and rewiring. The conclusions have important practical consequences including the fact that scale-free structures are more robust to perturbations. The measurements allowing the best balance of stability (smaller sensitivity to perturbations) and discriminability (separation between different network topologies) were also identified.
Submitted 23 June, 2008; v1 submitted 7 April, 2008;
originally announced April 2008.
-
Analyzing and Modeling Real-World Phenomena with Complex Networks: A Survey of Applications
Authors:
Luciano da F. Costa,
Osvaldo N. Oliveira Jr.,
Gonzalo Travieso,
Francisco A. Rodrigues,
Paulino R. Villas Boas,
Lucas Antiqueira,
Matheus P. Viana,
Luis E. C. da Rocha
Abstract:
The success of new scientific areas can be assessed by their potential for contributing to new theoretical approaches and to applications to real-world problems. Complex networks have fared extremely well in both of these aspects, with their sound theoretical basis developed over the years and with a variety of applications. In this survey, we analyze the applications of complex networks to real-world problems and data, with emphasis on representation, analysis and modeling, after an introduction to the main concepts and models. A diversity of phenomena are surveyed, which may be classified into no fewer than 22 areas, providing a clear indication of the impact of the field of complex networks.
Submitted 16 September, 2008; v1 submitted 20 November, 2007;
originally announced November 2007.
-
On the effects of geographical constraints on task execution in complex networks
Authors:
Andre Franceschi de Angelis,
Gonzalo Travieso,
Carlos Antonio Ruggiero,
Luciano da Fontoura Costa
Abstract:
In the present work we investigate the effects of spatial constraints on the efficiency of task execution in systems underlain by geographical complex networks where the probability of connection decreases with the distance between the nodes. The investigation considers several configurations of the parameters defining the network connectivity, and the Barabasi-Albert network model is also considered for comparisons. The results show that the effect of connectivity is significant only for shorter tasks, that the locality of connections implied by the spatial constraints reduces efficiency, and that the addition of edges can improve the efficiency of the execution, although with increasing locality of the connections the improvement is small.
Submitted 8 November, 2007;
originally announced November 2007.
-
Border trees of complex networks
Authors:
Paulino R. Villas Boas,
Francisco A. Rodrigues,
Gonzalo Travieso,
Luciano da F. Costa
Abstract:
The comprehensive characterization of the structure of complex networks is essential to understand the dynamical processes which guide their evolution. The discovery of the scale-free distribution and the small-world property of real networks was fundamental to stimulate more realistic models and to understand some dynamical processes such as network growth. However, properties related to the network borders (nodes with degree equal to one), one of its most fragile parts, remain little investigated and understood. The border nodes may be involved in the evolution of structures such as geographical networks. Here we analyze complex networks by looking for border trees, which are defined as the subgraphs without cycles connected to the remainder of the network (containing cycles) and terminating in border nodes. In addition to describing an algorithm for identification of such tree subgraphs, we also consider a series of their measurements, including their number of vertices, number of leaves, and depth. We investigate the properties of border trees for several theoretical models as well as real-world networks.
Submitted 22 June, 2007;
originally announced June 2007.
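One way to picture the border-tree definition in this abstract is repeated pruning of degree-one nodes: whatever is stripped away before only cycle-bearing structure remains belongs to the border trees. The sketch below takes that approach as an assumption; it is not the identification algorithm described in the paper, and the function name `border_tree_nodes` and the adjacency-dict input are hypothetical.

```python
from collections import deque


def border_tree_nodes(adjacency):
    """Return the nodes removed by repeatedly pruning nodes of degree one.

    `adjacency` maps each node to a list of its neighbours (undirected graph).
    Assumed stand-in for the paper's algorithm, which the abstract does not detail.
    """
    degree = {node: len(neighbours) for node, neighbours in adjacency.items()}
    removed = set()
    queue = deque(node for node, d in degree.items() if d == 1)
    while queue:
        node = queue.popleft()
        # Skip nodes whose degree changed while they were waiting in the queue.
        if node in removed or degree[node] != 1:
            continue
        removed.add(node)
        for neighbour in adjacency[node]:
            if neighbour not in removed:
                degree[neighbour] -= 1
                if degree[neighbour] == 1:
                    queue.append(neighbour)
    return removed


# Toy graph: a triangle (0, 1, 2) with a small tree hanging from node 2.
toy_graph = {
    0: [1, 2],
    1: [0, 2],
    2: [0, 1, 3],
    3: [2, 4, 5],
    4: [3],
    5: [3],
}
tree_nodes = border_tree_nodes(toy_graph)
```

In the toy graph the nodes 3, 4 and 5 form the hanging tree, while node 2 is kept because it still lies on the triangle. Measurements such as number of leaves or depth would be computed on the removed subgraph.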
-
Seeking the best Internet Model
Authors:
F. A. Rodrigues,
P. R. Villas Boas,
G. Travieso,
L. da F. Costa
Abstract:
The models of the Internet reported in the literature are mainly aimed at reproducing the scale-free structure, the high clustering coefficient and the small-world effects found in the real Internet, while other important properties (e.g. related to centrality and hierarchical measurements) are not considered. For a better characterization and modeling of such a network, a larger number of topological properties must be considered. In this work, we present a sound multivariate statistical approach, including feature spaces and multivariate statistical analysis (especially canonical projections), in order to characterize several Internet models while considering a larger set of relevant measurements. We apply this methodology to determine which of nine complex network models are most compatible with the real Internet data (on the autonomous systems level), considering a set of 21 network measurements. We conclude that none of the considered models can reproduce the Internet topology with high accuracy.
Submitted 17 August, 2007; v1 submitted 21 June, 2007;
originally announced June 2007.
-
Effective networks for real-time distributed processing
Authors:
Gonzalo Travieso,
Luciano da Fontoura Costa
Abstract:
The problem of real-time processing is one of the most challenging current issues in computer science. Because of the large amount of data to be treated in a limited period of time, parallel and distributed systems are required, whose performance depends on a series of factors including the interconnectivity of the processing elements, the application model and the communication protocol. Given their flexibility for representing and modeling natural and human-made systems (such as the Internet and WWW), complex networks have become a primary choice in many research areas. The current work presents how the concepts and methods of complex networks can be used to develop realistic models and simulations of distributed real-time systems while taking into account two representative interconnection models: uniformly random and scale-free (Barabasi-Albert), including the presence of background traffic of messages. The results obtained include the identification of the uniformly random interconnectivity scheme as being considerably more efficient than the scale-free counterpart.
Submitted 9 February, 2007; v1 submitted 13 December, 2006;
originally announced December 2006.
-
Analyzing Trails in Complex Networks
Authors:
Luciano da Fontoura Costa,
Francisco Aparecido Rodrigues,
Gonzalo Travieso
Abstract:
Even more interesting than the intricate organization of complex networks is the dynamical behavior of the systems which such structures underlie. Among the many types of dynamics, one particularly interesting category involves the evolution of trails left by moving agents progressing through random walks and dilating processes in a complex network. The emergence of trails is present in many dynamical processes, such as pedestrian traffic, information flow and metabolic pathways. Important problems related to trails include the reconstruction of the trail and the identification of its source, when complete knowledge of the trail is missing. In addition, the following of trails in multi-agent systems represents a particularly interesting situation related to pedestrian dynamics and swarming intelligence. The present work addresses these three issues while taking into account permanent and transient marks left in the visited nodes. Different topologies are considered for trail reconstruction and trail source identification, including four complex network models and four real networks, namely the Internet, the US airlines network, an email network and the scientific collaboration network of complex network researchers. Our results show that the topology of the network influences trail reconstruction, source identification and agent dynamics.
Submitted 22 August, 2007; v1 submitted 8 September, 2006;
originally announced September 2006.
-
The Spread of Opinions and Proportional Voting
Authors:
Gonzalo Travieso,
Luciano da Fontoura Costa
Abstract:
Election results are determined by numerous social factors that affect the formation of opinion of the voters, including the network of interactions between them and the dynamics of opinion influence. In this work we study the result of proportional elections using an opinion dynamics model similar to simple opinion spreading over a complex network. Erdos-Renyi, Barabasi-Albert, regular lattices and randomly augmented lattices are considered as models of the underlying social networks. The model reproduces the power law behavior of the number of candidates with a given number of votes found in real elections with the correct slope, a cutoff for larger numbers of votes and a plateau for small numbers of votes. It is found that the small-world property of the underlying network is fundamental for the emergence of the power law regime.
Submitted 6 March, 2006;
originally announced March 2006.
-
Fast Community Identification by Hierarchical Growth
Authors:
Francisco A. Rodrigues,
Gonzalo Travieso,
Luciano da F. Costa
Abstract:
A new method for community identification is proposed which is founded on the analysis of successive neighborhoods, reached through hierarchical growth from a starting vertex, and on the definition of a community as a subgraph whose number of inner connections is larger than its number of outer connections. In order to determine the precision and speed of the method, it is compared with one of the most popular community identification approaches, namely Girvan and Newman's algorithm. Although the hierarchical growth method is not as precise as Girvan and Newman's method, it is potentially faster than most community finding algorithms.
Submitted 21 February, 2006;
originally announced February 2006.
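The community criterion stated in this abstract (more inner connections than outer connections) can be checked directly for a candidate vertex set. The sketch below covers only that criterion and not the hierarchical growth procedure itself; the function name `is_community` and the adjacency-dict input are hypothetical.

```python
def is_community(adjacency, members):
    """Check whether `members` has more inner edges than outer edges.

    `adjacency` maps each node to a list of its neighbours (undirected graph).
    """
    members = set(members)
    inner_endpoints = 0  # each inner edge is seen twice, once from each endpoint
    outer = 0            # edges with exactly one endpoint inside `members`
    for node in members:
        for neighbour in adjacency[node]:
            if neighbour in members:
                inner_endpoints += 1
            else:
                outer += 1
    return inner_endpoints // 2 > outer


# Toy graph: two triangles (0, 1, 2) and (3, 4, 5) joined by the edge 2-3.
two_triangles = {
    0: [1, 2],
    1: [0, 2],
    2: [0, 1, 3],
    3: [2, 4, 5],
    4: [3, 5],
    5: [3, 4],
}
left_ok = is_community(two_triangles, {0, 1, 2})
bridge_ok = is_community(two_triangles, {2, 3})
```

Each triangle has three inner edges and one outer edge, so it satisfies the criterion, whereas the bridging pair {2, 3} has one inner edge and four outer edges and does not.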
-
Strength Distribution in Derivative Networks
Authors:
Luciano da Fontoura Costa,
Gonzalo Travieso
Abstract:
This article describes a complex network model whose weights are proportional to the difference between uniformly distributed "fitness" values assigned to the nodes. It is shown both analytically and experimentally that the strength density (i.e. the density of the weighted node degree) for this model, called derivative complex networks, follows a power law with exponent $γ<1$ if the fitness has an upper limit and $γ>1$ if the fitness has no upper limit but a positive lower limit. Possible implications for neuronal network topology and dynamics are also discussed.
Submitted 11 January, 2005;
originally announced January 2005.