Abstract
Communication between developers and testers can be a rich source of insights into software development processes and practices, which may not be easily discoverable from other means like retrospectives or project roadmaps. With the objective of deriving and capitalizing on potential development-related insights, we analyzed developer-tester communication in an industrial setting. We conducted a case study at a software-intensive Agile company, within the context of the development of one of their flagship products from 2016 to 2018. We applied Latent Dirichlet Allocation (LDA) to analyze communication between developers and testers, and then invited two case-company practitioners to study the results for insights into their developments processes: The findings reveal the case company’s efforts to improve their product stability, the growing emphasis on addressing end-user concerns and other quality-related issues. The practitioners interpreted these findings as indicators of evolution in their development process. Based on these findings and the state of the art, we propose an insight classification to highlight insights discoverable from developer-tester communication: Recognizing LDA’s potential for deriving insights, the practitioners are keen on incorporating it into their software development practices. The findings from this study serve as evidence for use and benefits of text-mining techniques like LDA in industrial setting, which other practitioners could adapt to elicit their own context-influenced insights. Furthermore, the insight classification can serve as a foundation for further investigation into the extent and type of insights discoverable from developer-tester communication.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
- 2.
- 3.
- 4.
At the case company, the term issue is used to refer to bugs, anomalies, defects etc.
- 5.
- 6.
- 7.
- 8.
- 9.
References
Haiduc, S., Arnaoudova, V., Marcus, A., Antoniol, G.: The use of text retrieval and natural language processing in software engineering. In: Proceedings - International Conference on Software Engineering, pp. 898–899 (2016)
Schütze, H., Manning, C.D., Raghavan, P.: Introduction to Information Retrieval. Cambridge University Press, Cambridge (2008)
Buse, R.P.L., Zimmermann, T.: Information needs for software development analytics. In: In 2012, the 34th International Conference on Software Engineering (ICSE), pp. 987–996 (2012)
Sun, X., Liu, X., Li, B., Duan, Y., Yang, H., Hu, J.: Exploring topic models in software engineering data analysis: a survey. In: 2016 IEEE/ACIS 17th International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing, SNPD 2016, pp. 357–362. Institute of Electrical and Electronics Engineers Inc. (2016)
Thomas, S.W., Hassan, A.E., Blostein, D.: Mining unstructured software repositories. In: Mens, T., Serebrenik, A., Cleve, A. (eds.) Evolving Software Systems, pp. 139–162. Springer, Heidelberg (2014). https://doi.org/10.1007/978-3-642-45398-4_5
Bettenburg, N., Adams, B.: Workshop on mining unstructured data (MUD): because “mining unstructured data is like fishing in muddy waters”! In: Proceedings - Working Conference on Reverse Engineering, WCRE, pp. 277–278. IEEE (2010)
Di Sorbo, A., Panichella, S., Visaggio, C.A., Di Penta, M., Canfora, G., Gall, H.C.: Development emails content analyzer: intention mining in developer discussions. In: Proceedings - 2015 30th IEEE/ACM International Conference on Automated Software Engineering ASE 2015, pp. 12–23 (2016)
Shihab, E., Bettenburg, N., Adams, B., Hassan, A.E.: On the central role of mailing lists in open source projects: an exploratory study. In: Nakakoji, K., Murakami, Y., McCready, E. (eds.) JSAI-isAI 2009. LNCS (LNAI and LNB), vol. 6284, pp. 91–103. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-14888-0_9
Vassallo, C., Panichella, S., Di Penta, M., Canfora, G.: CODES: mining source code descriptions from developers discussions. In: Proceedings of the 22nd International Conference on Program Comprehension, pp. 106–109 (2014)
Soliman, M., Galster, M., Salama, A.R., Riebisch, M.: Architectural knowledge for technology decisions in developer communities: an exploratory study with StackOverflow. In: Proceedings - 2016 13th Working IEEE/IFIP Conference on Software Architecture, WICSA 2016, pp. 128–133. IEEE (2016)
Anvik, J., Hiew, L., Murphy, G.C.: Who should fix this bug? In: Proceedings - International Conference on Software Engineering 2006, pp. 361–370 (2006)
Canfora, G., Di Penta, M., Oliveto, R., Panichella, S.: Who is going to mentor newcomers in open source projects? Proceedings of ACM SIGSOFT 20th International Symposium on the Foundations of Software Engineering, FSE 2012, pp. 1–11 (2012)
Panichella, S., Aponte, J., Di Penta, M., Marcus, A., Canfora, G.: Mining source code descriptions from developer communications. In: IEEE International Conference on Program Comprehension, pp. 63–72 (2012)
Panichella, A., Dit, B., Oliveto, R., Di Penta, M., Poshynanyk, D., De Lucia, A.: How to effectively use topic models for software engineering tasks? An approach based on genetic algorithms. In: Proceedings - International Conference on Software Engineering, pp. 522–531 (2013)
Nazar, N., Hu, Y., Jiang, H.: Summarizing software artifacts: a literature review. J. Comput. Sci. Technol. 31(5), 883–909 (2016). https://doi.org/10.1007/s11390-016-1671-1
Chen, T.-H., Thomas, S.W., Hassan, A.E.: A survey on the use of topic models when mining software repositories. Empir. Softw. Eng. 21(5), 1843–1919 (2015). https://doi.org/10.1007/s10664-015-9402-8
Sinoara, R.A., Scheicher, R.B., Rezende, S.O.: Evaluation of latent dirichlet allocation for document organization in different levels of semantic complexity. In: 2017 IEEE Symposium Series on Computational Intelligence, SSCI 2017 – Proceedings, pp. 1–8 (2018)
Blei, D.M.: Introduction to probabilistic topic models. Commun. ACM. 55, 77–84 (2012)
Lima, M., Ahmed, I., Conte, T., Nascimento, E., Oliveira, E., Gadelha, B.: Land of lost knowledge: an initial investigation into projects lost knowledge. In: International Symposium on Empirical Software Engineering and Measurement, Septemer 2019 (2019)
Blei, D.M., Lafferty, J.D.: Topic models. In: Text Mining, pp. 101–124. Chapman and Hall/CRC (2009)
Thomas, S.W., Adams, B., Hassan, A.E., Blostein, D.: Studying software evolution using topic models. Sci. Comput. Program. 80, 457–479 (2014)
Thomas, S.W., Adams, B., Hassan, A.E., Blostein, D.: Modeling the evolution of topics in source code histories. In: Proceedings of the 8th Working Conference on Mining Software Repositories, pp. 173–182 (2011)
Zhang, T., Yang, G., Lee, B., Lua, E.K.: A novel developer ranking algorithm for automatic bug triage using topic model and developer relations. In: Proceedings - Asia-Pacific Software Engineering Conference, APSEC, pp. 223–230. IEEE (2014)
Bertram, D., Voida, A., Greenberg, S., Walker, R.: Communication, collaboration, and bugs: the social nature of issue tracking in small, collocated teams. In: Proceedings of the 2010 ACM Conference on Computer Supported Cooperative Work, pp. 291–300 (2010)
Runeson, P., Höst, M.: Guidelines for conducting and reporting case study research in software engineering. Empir. Softw. Eng. 14, 131–164 (2009)
Ram, P., et al.: An empirical investigation into industrial use of software metrics programs. In: Morisio, M., Torchiano, M., Jedlitschka, A. (eds.) PROFES 2020. LNCS, vol. 12562, pp. 419–433. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-64148-1_26
Silge, J., Robinson, D.: Text Mining with R: A Tidy Approach. O’Reilly Media, Inc. (2017)
Allahyari, M., Kochut, K.: Automatic topic labeling using ontology-based topic models. In: Proceedings - 2015 IEEE 14th International Conference on Machine Learning and Applications, ICMLA 2015, pp. 259–264 (2016)
Krishna, R., Agrawal, A., Rahman, A., Sobran, A., Menzies, T.: What is the connection between issues, bugs, and enhancements? In: 2018 IEEE/ACM 40th International Conference on Software Engineering: Software Engineering in Practice Track (ICSE-SEIP), pp. 306–315 (2018)
Bizer, C., Boncz, P., Brodie, M.L., Erling, O.: The meaningful use of big data: four perspectives - four challenges. SIGMOD Rec. 40, 56–60 (2011)
Holmström Olsson, H., Bosch, J.: Towards data-driven product development: a multiple case study on post-deployment data usage in software-intensive embedded systems. In: Fitzgerald, B., Conboy, K., Power, K., Valerdi, R., Morgan, L., Stol, K.-J. (eds.) LESS 2013. LNBIP, vol. 167, pp. 152–164. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-44930-7_10
Silva, C., Mariane, C.: Reusing software engineering knowledge from developer communication. In: ESEC/FSE 2020 - Proceedings of the 28th ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering, pp. 1682–1685 (2020)
Asuncion, H.U., Asuncion, A.U., Taylor, R.N.: Software traceability with topic modeling. In: Proceedings - International Conference on Software Engineering, pp. 95–104 (2010)
Hindle, A., Bird, C., Zimmermann, T., Nagappan, N.: Relating requirements to implementation via topic analysis: do topics extracted from requirements make sense to managers and developers? In: 2012 28th IEEE International Conference on Software Maintenance (ICSM), pp. 243–252 (2012)
Hindle, A., Bird, C., Zimmermann, T., Nagappan, N.: Do topics make sense to managers and developers? Empir. Softw. Eng. 20(2), 479–515 (2014). https://doi.org/10.1007/s10664-014-9312-1
Gezici, B., Tarhan, A., Chouseinoglou, O.: Internal and external quality in the evolution of mobile software: an exploratory study in open-source market. Inf. Softw. Technol. 112, 178–200 (2019)
Pettinato, M., Gil, J.P., Galeas, P., Russo, B.: Log mining to re-construct system behavior: an exploratory study on a large telescope system. Inf. Softw. Technol. 114, 121–136 (2019)
Noei, E., Zhang, F., Zou, Y.: Too many user-reviews, what should app developers look at first? IEEE Trans. Softw. Eng. 1–12 (2019)
Sbalchiero, S., Eder, M.: Topic modeling, long texts and the best number of topics. Some problems and solutions. Qual. Quant. 54, 1095–1108 (2020)
Salman, I., Turhan, B., Vegas, S.: A controlled experiment on time pressure and confirmation bias in functional software testing. Empir. Softw. Eng. 24, 1727–1761 (2019)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Ram, P., Rodríguez, P., Abherve, A., Bagnato, A., Oivo, M. (2021). Capitalizing on Developer-Tester Communication – A Case Study. In: Ardito, L., Jedlitschka, A., Morisio, M., Torchiano, M. (eds) Product-Focused Software Process Improvement. PROFES 2021. Lecture Notes in Computer Science(), vol 13126. Springer, Cham. https://doi.org/10.1007/978-3-030-91452-3_17
Download citation
DOI: https://doi.org/10.1007/978-3-030-91452-3_17
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-91451-6
Online ISBN: 978-3-030-91452-3
eBook Packages: Computer ScienceComputer Science (R0)