Understanding Missing Links in Bipartite Networks With MissBiN

Published: 01 June 2022

Abstract

The analysis of bipartite networks is critical in a variety of application domains, such as exploring entity co-occurrences in intelligence analysis and investigating gene expression in bioinformatics. One important task is missing link prediction, which infers the existence of unseen links from the currently observed ones. In this article, we propose a visual analysis system, MissBiN, that keeps analysts in the loop when making sense of link prediction results. MissBiN incorporates a novel method for link prediction in a bipartite network that leverages the network's bi-cliques. It also provides an interactive visualization for understanding the algorithm's outputs. The design of MissBiN is based on three high-level analysis questions (what, why, and how) regarding missing links, which are distilled from the literature and expert interviews. We conducted quantitative experiments to assess the performance of the proposed link prediction algorithm, and interviewed two experts from different domains to demonstrate the effectiveness of MissBiN as a whole. We also provide a comprehensive usage scenario to illustrate the usefulness of the tool in an intelligence analysis application.
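
As a rough illustration of the task described in the abstract, the sketch below ranks unobserved links in a toy bipartite network. It is not the MissBiN algorithm, which this page does not specify; it uses a simple path-counting baseline as a stand-in, and the data and the names `edges`, `is_biclique`, `three_path_score`, and `rank_missing_links` are hypothetical.

```python
from itertools import product

# Toy bipartite network (hypothetical data): people on one side, places on the other.
edges = {
    "alice": {"cafe", "dock", "bank"},
    "bob": {"cafe", "dock", "bank"},
    "carol": {"cafe", "dock"},
    "dave": {"gym"},
}

def is_biclique(left, right, adj):
    """True if every node in `left` is linked to every node in `right`."""
    return all(set(right) <= adj[u] for u in left)

def three_path_score(u, v, adj):
    """Count paths u - v2 - u2 - v, a bipartite analogue of common neighbors."""
    score = 0
    for u2, nbrs in adj.items():
        if u2 != u and v in nbrs:
            # Each right-side node shared by u and u2 closes one path of length three.
            score += len(adj[u] & nbrs)
    return score

def rank_missing_links(adj):
    """Score every unobserved (left, right) pair, highest score first."""
    rights = set().union(*adj.values())
    candidates = [(u, v) for u, v in product(adj, sorted(rights)) if v not in adj[u]]
    scored = [(three_path_score(u, v, adj), u, v) for u, v in candidates]
    return sorted(scored, key=lambda t: (-t[0], t[1], t[2]))

ranking = rank_missing_links(edges)
print(ranking[0])  # best-scoring candidate link
```

In this toy data, alice and bob form a bi-clique with all three of cafe, dock, and bank, while carol is linked to only two of those places. The top-ranked candidate is the single link that would extend that bi-clique to include carol, which gives some intuition for why bi-clique structure can be informative for link prediction.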


Published in: IEEE Transactions on Visualization and Computer Graphics, Volume 28, Issue 6, June 2022, 251 pages

Publisher: IEEE Educational Activities Department, United States
