research-article
Open access

Automatic Core-Developer Identification on GitHub: A Validation Study

Published: 30 September 2023

Abstract

Many open-source software projects are self-organized and do not maintain official lists with information on developer roles. Consequently, knowing which developers hold core and maintainer roles is relevant but often tacit knowledge. We propose a method to automatically identify core developers based on the role permissions required for privileged events triggered in GitHub issues and pull requests. In an empirical study on 25 GitHub projects, (1) we validate the set of automatically identified core developers against a sample of project-reported developer lists, and (2) we use our set of identified core developers to assess the accuracy of state-of-the-art unsupervised developer classification methods. Our results indicate that the set of core developers that we extracted from privileged issue events is sound, and that the accuracy of state-of-the-art unsupervised classification methods depends mainly on the data source (commit data versus issue data) rather than on the network-construction method (directed versus undirected, etc.). Looking ahead, our results can guide researchers and practitioners in choosing appropriate unsupervised classification methods, and our method can help create reliable ground-truth data for training supervised classification methods.
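The idea of inferring core developers from privileged events can be illustrated with a minimal sketch. It is not the authors' implementation: the set of event types treated as privileged, the record layout (`actor`, `event`), and the function name `identify_core_developers` are all assumptions made for illustration. The abstract does not list the events the study actually used.

```python
from collections import defaultdict

# Assumption: event types that typically require elevated repository
# permissions. This set is illustrative only, not the paper's list.
PRIVILEGED_EVENTS = {"merged", "labeled", "unlabeled", "locked", "milestoned"}


def identify_core_developers(events, min_privileged=1):
    """Return the actors who triggered at least `min_privileged` privileged events.

    `events` is an iterable of dicts with the hypothetical keys
    "actor" (user login) and "event" (event type).
    """
    counts = defaultdict(int)
    for event in events:
        if event["event"] in PRIVILEGED_EVENTS:
            counts[event["actor"]] += 1
    return {actor for actor, n in counts.items() if n >= min_privileged}


# Hypothetical sample data, not taken from any real project.
sample_events = [
    {"actor": "alice", "event": "merged"},
    {"actor": "bob", "event": "commented"},
    {"actor": "carol", "event": "labeled"},
    {"actor": "bob", "event": "closed"},
]

core = identify_core_developers(sample_events)
print(sorted(core))
```

In this sketch, `closed` is deliberately left out of the privileged set because an issue's author can usually close their own issue without special permissions. Whether an event counts as privileged is a design decision that would need to be checked against the platform's permission model.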

Cited By

  • (2024) Predicting the First Response Latency of Maintainers and Contributors in Pull Requests. IEEE Transactions on Software Engineering 50:10 (2529-2543). DOI: 10.1109/TSE.2024.3443741. Online publication date: 1-Oct-2024
  • (2024) Analyzing the Tower of Babel with Kaiaulu. Journal of Systems and Software 210:C. DOI: 10.1016/j.jss.2024.111967. Online publication date: 25-Jun-2024
  • (2024) An exploratory study of software artifacts on GitHub from the lens of documentation. Information and Software Technology 169:C. DOI: 10.1016/j.infsof.2024.107425. Online publication date: 2-Jul-2024

    Published In

    ACM Transactions on Software Engineering and Methodology  Volume 32, Issue 6
    November 2023
    949 pages
    ISSN: 1049-331X
    EISSN: 1557-7392
    DOI: 10.1145/3625557
    Editor: Mauro Pezzè

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 30 September 2023
    Online AM: 22 April 2023
    Accepted: 15 March 2023
    Revised: 03 February 2023
    Received: 23 November 2022
    Published in TOSEM Volume 32, Issue 6

    Author Tags

    1. Open-source software projects
    2. developer classification
    3. developer networks

    Qualifiers

    • Research-article

    Funding Sources

    • German Research Foundation
