Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3540250.3549151acmconferencesArticle/Chapter ViewAbstractPublication PagesfseConference Proceedingsconference-collections
research-article

Semi-supervised pre-processing for learning-based traceability framework on real-world software projects

Published: 09 November 2022 Publication History

Abstract

The traceability of software artifacts has been recognized as an important factor to support various activities in software development processes. However, traceability can be difficult and time-consuming to create and maintain manually, thereby automated approaches have gained much attention. Unfortunately, existing automated approaches for traceability suffer from practical issues. This paper aims to gain an understanding of the potential challenges for the underperforming of the state-of-the-art, ML-based trace link classifiers applied in real-world projects. By investigating different industrial datasets, we found that two critical (and classic) challenges, i.e. data imbalance and sparse problems, lie in real-world projects’ traceability automation. To overcome these challenges, we developed a framework called SPLINT to incorporate hybrid textual similarity measures and semi-supervised learning strategies as enhancements to the learning-based traceability approaches. We carried out experiments with six open-source platforms and ten industry datasets. The results confirm that SPLINT is able to operate at higher performance on two communities’ datasets. Specifically, the industrial datasets, which significantly suffer from data imbalance and sparsity problems, show an increase in F2-score over 14% and AUC over 8% on average. The adjusted class-balancing and self-training policies used in SPLINT (CBST-Adjust) also work effectively for the selection of pseudo-labels on minor classes from unlabeled trace sets, demonstrating SPLINT’s practicability.

References

[1]
Ahron Abadi, Mordechai Nisenson, and Yahalomit Simionovici. 2008. A Traceability Technique for Specifications. In Proceedings of the 16th International Conference on Program Comprehension (ICPC ’08). IEEE Computer Society, 103–112. https://doi.org/10.1109/ICPC.2008.30
[2]
Nasir Ali, Yann-Gaël Guéhéneuc, and Giuliano Antoniol. 2013. Trustrace: Mining Software Repositories to Improve the Accuracy of Requirement Traceability Links. IEEE Transactions on Software Engineering, 39, 5 (2013), 725–741. https://doi.org/10.1109/TSE.2012.71
[3]
Nauman Bin Ali and Kai Petersen. 2012. A Consolidated Process for Software Process Simulation: State of the Art and Industry Experience. In Proceedings of the 38th Euromicro Conference on Software Engineering and Advanced Applications (SEAA ’12). IEEE Computer Society, 327–336.
[4]
Giuliano Antoniol, Gerardo Canfora, Gerardo Casazza, Andrea De Lucia, and Ettore Merlo. 2002. Recovering Traceability Links between Code and Documentation. IEEE Transactions on Software Engineering, 28, 10 (2002), 970–983.
[5]
Giuliano Antoniol, Gerardo Canfora, Andrea De Lucia, and Ettore Merlo. 1999. Recovering Code to Documentation Links in OO Systems. In Proceedings of the 6th Working Conference on Reverse Engineering (WCRE ’99). IEEE Computer Society, 136–144. https://doi.org/10.1109/WCRE.1999.806954
[6]
Giuliano Antoniol, Jane Cleland-Huang, Jane Huffman Hayes, and Michael Vierhauser. 2017. Grand Challenges of Traceability: The Next Ten Years. CoRR, abs/1710.03129 (2017).
[7]
Hazeline U. Asuncion, Arthur U. Asuncion, and Richard N. Taylor. 2010. Software Traceability with Topic Modeling. In Proceedings of the 32nd International Conference on Software Engineering (ICSE ’10). 95–104. https://doi.org/10.1145/1806799.1806817
[8]
Hazeline U. Asuncion, Arthur U. Asuncion, and Richard N. Taylor. 2010. Software traceability with topic modeling. In Proceedings of the 32nd ACM/IEEE International Conference on Software Engineering (ICSE ’10). ACM, 95–104. https://doi.org/10.1145/1806799.1806817
[9]
Thazin Win Win Aung, Huan Huo, and Yulei Sui. 2020. A Literature Review of Automatic Traceability Links Recovery for Software Change Impact Analysis. In Proceedings of the 28th International Conference on Program Comprehension (ICPC ’20). ACM, 14–24. https://doi.org/10.1145/3387904.3389251
[10]
Emma Effa Bella, Marie-Pierre Gervais, Reda Bendraou, Laurent Wouters, and Ali Koudri. 2018. Semi-Supervised Approach for Recovering Traceability Links in Complex Systems. In Proceedings of 23rd International Conference on Engineering of Complex Computer Systems (ICECCS ’18). IEEE Computer Society, 193–196. https://doi.org/10.1109/ICECCS2018.2018.00030
[11]
Markus Borg. 2014. Embrace your issues: compassing the software engineering landscape using bug reports. In ACM/IEEE International Conference on Automated Software Engineering (ASE ’14). ACM, 891–894. https://doi.org/10.1145/2642937.2653469
[12]
Markus Borg, Per Runeson, and Anders Ardö. 2014. Recovering from a Decade: A Systematic Mapping of Information Retrieval Approaches to Software Traceability. Empirical Software Engineering, 19, 6 (2014), 1565–1616. https://doi.org/10.1007/s10664-013-9255-y
[13]
Andrew P Bradley. 1997. The Use of the Area Under the ROC Curve in the Evaluation of Machine Learning Algorithms. Pattern recognition, 30, 7 (1997), 1145–1159. https://doi.org/10.1016/S0031-3203(96)00142-2
[14]
William B. Cavnar. 1995. Using An N-Gram-Based Document Representation With A Vector Processing Retrieval Model. NIST SPECIAL PUBLICATION SP, 269–269.
[15]
2006. Semi-Supervised Learning, Olivier Chapelle, Bernhard Schölkopf, and Alexander Zien (Eds.). The MIT Press.
[16]
Nitesh V. Chawla, Kevin W. Bowyer, Lawrence O. Hall, and W. Philip Kegelmeyer. 2002. SMOTE: Synthetic Minority Over-sampling Technique. Journal of Artificial Intelligence Research, 16 (2002), 321–357. https://doi.org/10.1613/jair.953
[17]
Bihuan Chen, Linlin Chen, Chen Zhang, and Xin Peng. 2020. BUILDFAST: History-Aware Build Outcome Prediction for Fast Feedback and Reduced Cost in Continuous Integration. In Proceedings of the 35th International Conference on Automated Software Engineering (ASE ’20). IEEE, 42–53. https://doi.org/10.1145/3324884.3416616
[18]
Xiaofan Chen and John C. Grundy. 2011. Improving Automated Documentation to Code Traceability by Combining Retrieval Techniques. In Proceedings of the 26th International Conference on Automated Software Engineering (ASE ’11). IEEE Computer Society, 223–232.
[19]
Jane Cleland-Huang. 2012. Traceability in Agile Projects. In Software and Systems Traceability. 265–275.
[20]
Jane Cleland-Huang. 2015. Toward Meaningful Industrial-Academic Partnerships. IEEE Software, 32, 1 (2015), 18–21. https://doi.org/10.1109/MS.2015.20
[21]
Jane Cleland-Huang, Adam Czauderna, Marek Gibiec, and John Emenecker. 2010. A Machine Learning Approach for Tracing Regulatory Codes to Product Specific Requirements. In Proceedings of the 32nd International Conference on Software Engineering (ICSE ’10). ACM, 155–164. https://doi.org/10.1145/1806799.1806825
[22]
Angelo Corallo, Maria Elena Latino, Marta Menegoli, and Pierpaolo Pontrandolfo. 2020. A Systematic Literature Review to Explore Traceability and Lifecycle Relationship. International Journal of Production Research, 58, 15 (2020), 4789–4807. https://doi.org/10.1080/00207543.2020.1771455
[23]
Liming Dong, He Zhang, Wei Liu, Zhiluo Weng, and Hongyu Kuang. 2022. SPLINT Package for "Semi-supervised Pre-processing for Learning-Based Traceability Framework on Real- World Software Projects". https://doi.org/10.5281/zenodo.7111136
[24]
Tianbao Du, Guohua Shen, Zhi-qiu Huang, Yaoshen Yu, and Dexiang Wu. 2020. Automatic Traceability Link Recovery via Active Learning. Frontiers of Information Technology & Electronic Engineering, 21, 8 (2020), 1217–1225. https://doi.org/10.1631/FITEE.1900222
[25]
Alexander Egyed, Florian Graf, and Paul Grünbacher. 2010. Effort and Quality of Recovering Requirements-to-Code Traces: Two Exploratory Experiments. In Proceedings of the 18th International Requirements Engineering Conference (RE ’10). IEEE Computer Society, 221–230. https://doi.org/10.1109/RE.2010.34
[26]
Davide Falessi, Massimiliano Di Penta, Gerardo Canfora, and Giovanni Cantone. 2017. Estimating the number of remaining links in traceability recovery. Empirical Software Engineering, 22, 3 (2017), 996–1027. https://doi.org/10.1007/s10664-016-9460-6
[27]
Malcom Gethers, Rocco Oliveto, Denys Poshyvanyk, and Andrea De Lucia. 2011. On Integrating Orthogonal Information Retrieval Methods to Improve Traceability Recovery. In Proceedings of the 27th International Conference on Software Maintenance (ICSM ’11). IEEE Computer Society, 133–142. https://doi.org/10.1109/ICSM.2011.6080780
[28]
Orlena Gotel, Jane Cleland-Huang, Jane Huffman Hayes, Andrea Zisman, Alexander Egyed, Paul Grünbacher, Alex Dekhtyar, Giuliano Antoniol, and Jonathan I. Maletic. 2012. The Grand Challenge of Traceability (v1.0). Springer, 343–409.
[29]
O. C. Z. Gotel and Anthony Finkelstein. 1994. An Analysis of the Requirements Traceability Problem. In Proceedings of the 1st International Conference on Requirements Engineering. IEEE Computer Society, 94–101. https://doi.org/10.1109/ICRE.1994.292398
[30]
Jin Guo, Jinghui Cheng, and Jane Cleland-Huang. 2017. Semantically Enhanced Software Traceability Using Deep Learning techniques. In Proceedings of the 39th International Conference on Requirements Engineering (ICSE ’17). 3–14. https://doi.org/10.1109/ICSE.2017.9
[31]
Jiannan Guo, Haochen Shi, Yangyang Kang, Kun Kuang, Siliang Tang, Zhuoren Jiang, Changlong Sun, Fei Wu, and Yueting Zhuang. 2021. Semi-supervised Active Learning for Semi-supervised Models: Exploit Adversarial Examples with Graph-based Virtual Labels. In Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision (ICCV ’21). IEEE, 2876–2885. https://doi.org/10.1109/ICCV48922.2021.00289
[32]
Mouna Hammoudi, Christoph Mayr-Dorn, Atif Mashkoor, and Alexander Egyed. 2021. A Traceability Dataset for Open Source Systems. In Proceedings of the 18th International Conference on Mining Software Repositories (MSR ’21). IEEE, 555–559. https://doi.org/10.1109/MSR52588.2021.00073
[33]
Jane Huffman Hayes, Alex Dekhtyar, and Senthil Karthikeyan Sundaram. 2006. Advancing Candidate Link Generation for Requirements Tracing: The Study of Methods. IEEE Transactions on Software Engineering, 32, 1 (2006), 4–19.
[34]
Minsung Hyun, Jisoo Jeong, and Nojun Kwak. 2020. Class-Imbalanced Semi-Supervised Learning. CoRR, abs/2002.06815 (2020).
[35]
Harsurinder Kaur, Husanbir Singh Pannu, and Avleen Kaur Malhi. 2019. A Systematic Review on Imbalanced Data Challenges in Machine Learning: Applications and Solutions. Comput. Surveys, 52, 4 (2019), 79:1–79:36. https://doi.org/10.1145/3343440
[36]
Nilam Kaushik, Ladan Tahvildari, and Mark Moore. 2011. Reconstructing Traceability between Bugs and Test Cases: An Experimental Study. In Proceedings of the 18th Working Conference on Reverse Engineering (WCRE ’11). 411–414. https://doi.org/10.1109/WCRE.2011.58
[37]
Jaehyung Kim, Youngbum Hur, Sejun Park, Eunho Yang, Sung Ju Hwang, and Jinwoo Shin. 2020. Distribution Aligning Refinery of Pseudo-label for Imbalanced Semi-supervised Learning. In Proceedings of 33rd Annual Conference on Neural Information Processing Systems (NeurIPS ’20).
[38]
Bartosz Krawczyk. 2016. Learning from imbalanced data: open challenges and future directions. Progress in Artificial Intelligence, 5, 4 (2016), 221–232. https://doi.org/10.1007/s13748-016-0094-0
[39]
Tien-Duy B Le, Mario Linares-Vásquez, David Lo, and Denys Poshyvanyk. 2015. RCLinker: Automated Linking of Issue Reports and Commits Leveraging Rich Contextual Information. In Proceedings of the 23rd International Conference on Program Comprehension (ICPC ’15). 36–47. https://doi.org/10.1109/ICPC.2015.13
[40]
Dong-Hyun Lee. 2013. Pseudo-Label : The Simple and Efficient Semi-Supervised Learning Method for Deep Neural Networks. ICML 2013 Workshop : Challenges in Representation Learning, 07.
[41]
Stefan Lessmann, Bart Baesens, Christophe Mues, and Swantje Pietsch. 2008. Benchmarking Classification Models for Software Defect Prediction: A Proposed Framework and Novel Findings. IEEE Transactions on Software Engineering, 34, 4 (2008), 485–496. https://doi.org/10.1109/TSE.2008.35
[42]
Yue Li, He Zhang, Liming Dong, Bohan Liu, and Jinyu Ma. 2020. Constructing a Hybrid Software Process Simulation Model in Practice: An Exemplar from Industry. In Proceedings of the 15th International Conference on Software and System Processes (ICSSP ’20).
[43]
Jinfeng Lin, Yalin Liu, Qingkai Zeng, Meng Jiang, and Jane Cleland-Huang. 2021. Traceability Transformed: Generating more Accurate Links with Pre-Trained BERT Models. In Proceedings of the 43rd International Conference on Software Engineering (ICSE ’21). IEEE, 324–335. https://doi.org/10.1109/ICSE43902.2021.00040
[44]
Bohan Liu, He Zhang, Lanxin Yang, Liming Dong, Haifeng Shen, and Kaiwen Song. 2020. An Experimental Evaluation of Imbalanced Learning and Time-Series Validation in the Context of CI/CD Prediction. In Proceedings of the 24th Evaluation and Assessment in Software Engineering (EASE ’20). ACM, 21–30. https://doi.org/10.1145/3383219.3383222
[45]
Sugandha Lohar, Sorawit Amornborvornwong, Andrea Zisman, and Jane Cleland-Huang. 2013. Improving Trace Accuracy through Data-Driven Configuration and Composition of Tracing Features. In Joint Meeting of the European Software Engineering Conference and the ACM SIGSOFT Symposium on the Foundations of Software Engineering (ESEC/FSE ’13). ACM, 378–388. https://doi.org/10.1145/2491411.2491432
[46]
Marco Lormans and Arie van Deursen. 2006. Can LSI help Reconstructing Requirements Traceability in Design and Test? In Proceedings of 10th European Conference on Software Maintenance and Reengineering (CSMR ’06). IEEE Computer Society, 47–56. https://doi.org/10.1109/CSMR.2006.13
[47]
Andrian Marcus, Jonathan I. Maletic, and Andrey Sergeyev. 2005. Recovery of Traceability Links between Software Documentation and Source Code. International Journal of Software Engineering and Knowledge Engineering, 15, 5 (2005), 811–836. https://doi.org/10.1142/S0218194005002543
[48]
Salome Maro, Miroslaw Staron, and Jan-Philipp Steghöfer. 2017. Challenges of Establishing Traceability in the Automotive Domain. In Software Quality. Complexity and Challenges of Software Engineering in Emerging Technologies - 9th International Conference (SWQD ’17). Springer, 153–172. https://doi.org/10.1007/978-3-319-49421-0_11
[49]
Chris Mills, Javier Escobar-Avila, Aditya Bhattacharya, Grigoriy Kondyukov, Shayok Chakraborty, and Sonia Haiduc. 2019. Tracing with Less Data: Active Learning for Classification-Based Traceability Link Recovery. In Proceedings of the 35th International Conference on Software Maintenance and Evolution (ICSME ’19). 103–113. https://doi.org/10.1109/ICSME.2019.00020
[50]
Chris Mills, Javier Escobar-Avila, and Sonia Haiduc. 2018. Automatic Traceability Maintenance via Machine Learning Classification. In Proceedings of the 34th International Conference on Software Maintenance and Evolution (ICSME ’18). IEEE Computer Society, 369–380. https://doi.org/10.1109/ICSME.2018.00045
[51]
Kevin Moran, David N. Palacio, Carlos Bernal-Cárdenas, Daniel McCrystal, Denys Poshyvanyk, Chris Shenefiel, and Jeff Johnson. 2020. Improving the Effectiveness of Traceability Link Recovery Using Hierarchical Bayesian Networks. In Proceedings of the 42nd International Conference on Software Engineering (ICSE ’20). ACM, 873–885.
[52]
Christian Neumuller and Paul Grünbacher. 2006. Automating Software Traceability in Very Small Companies: A Case Study and Lessons Learned. In Proceedings of the 21st International Conference on Automated Software Engineering (ASE ’06). 145–156.
[53]
Rocco Oliveto, Malcom Gethers, Denys Poshyvanyk, and Andrea De Lucia. 2020. On the Equivalence of Information Retrieval Methods for Automated Traceability Link Recovery: A Ten-Year Retrospective. In Proceedings of 28th International Conference on Program Comprehension (ICPC ’20). ACM, 1. https://doi.org/10.1145/3387904.3394491
[54]
Michael C. Panis. 2010. Successful Deployment of Requirements Traceability in a Commercial Engineering Organization...Really. In Proceedings of the 18th International Requirements Engineering Conference (RE ’10). 303–307.
[55]
Reza Meimandi Parizi, Sai Peck Lee, and Mohammad Dabbagh. 2014. Achievements and Challenges in State-of-the-Art Software Traceability Between Test and Code Artifacts. IEEE Transactions on Reliability, 63, 4 (2014), 913–926.
[56]
Julian Aron Prenner and Romain Robbes. 2021. Making the most of small Software Engineering datasets with modern machine learning. CoRR, abs/2106.15209 (2021).
[57]
Michael Rath, David Lo, and Patrick Mäder. 2018. Analyzing Requirements and Traceability Information to Improve Bug Localization. In Proceedings of the 15th International Conference on Mining Software Repositories (MSR ’18). 442–453.
[58]
Michael Rath and Patrick Mäder. 2019. The SEOSS 33 Dataset—Requirements, Bug Reports, Code History, and Trace Links for Entire Projects. Data in brief, 25 (2019), 104005. https://doi.org/10.1016/j.dib.2019.104005
[59]
Michael Rath, Jacob Rendall, Jin L. C. Guo, Jane Cleland-Huang, and Patrick Mäder. 2018. Traceability in the Wild: Automatically Augmenting Incomplete Trace Links. In Proceedings of the 40th International Conference on Software Engineering (ICSE ’18). ACM, 834–845. https://doi.org/10.1145/3180155.3180207
[60]
Patrick Rempel and Patrick Mäder. 2017. Preventing Defects: The Impact of Requirements Traceability Completeness on Software Quality. IEEE Transactions on Software Engineering, 43, 8 (2017), 777–797.
[61]
Alberto D. Rodriguez, Jane Cleland-Huang, and Davide Falessi. 2021. Leveraging Intermediate Artifacts to Improve Automated Trace Link Retrieval. In Proceedings of the 37th International Conference on Software Maintenance and Evolution. IEEE, 81–92. https://doi.org/10.1109/ICSME52107.2021.00014
[62]
Hang Ruan, Bihuan Chen, Xin Peng, and Wenyun Zhao. 2019. DeepLink: Recovering Issue-Commit Links Based on Deep Learning. Journal of Systems and Software, 158 (2019), 110406. https://doi.org/10.1016/j.jss.2019.110406
[63]
Ana Stanescu and Doina Caragea. 2014. Semi-Supervised Self-training Approaches for Imbalanced Splice Site Datasets. BICoB ’14.
[64]
Yan Sun, Qing Wang, and Ye Yang. 2017. FRLink: Improving the Recovery of Missing Issue-Commit Links by Revisiting File Relevance. Information and Software Technology, 84 (2017), 33–47. https://doi.org/10.1016/j.infsof.2016.11.010
[65]
Michael Unterkalmsteiner, Tony Gorschek, Robert Feldt, and Niklas Lavesson. 2016. Large-scale information retrieval in software engineering - an experience report from industrial application. Empirical Software Engineering, 21, 6 (2016), 2324–2365. https://doi.org/10.1007/s10664-015-9410-8
[66]
Robert Watkins and Mark Neal. 1994. Why and How of Requirements Tracing. IEEE Software, 11, 4 (1994), 104–106.
[67]
Chen Wei, Kihyuk Sohn, Clayton Mellina, Alan L. Yuille, and Fan Yang. 2021. CReST: A Class-Rebalancing Self-Training Framework for Imbalanced Semi-Supervised Learning. CoRR, abs/2102.09559 (2021), https://doi.org/10.1109/CVPR46437.2021.01071
[68]
Stefan Winkler and Jens von Pilgrim. 2010. A Survey of Traceability in Requirements Engineering and Model-Driven Development. Software and Systems Modeling, 9, 4 (2010), 529–565.
[69]
Suresh Yadla, Jane Huffman Hayes, and Alex Dekhtyar. 2005. Tracing requirements to defect reports: an application of information retrieval techniques. Innovations in Systems and Software Engineering, 1, 2 (2005), 116–124. https://doi.org/10.1007/s11334-005-0011-3
[70]
Yuzhe Yang and Zhi Xu. 2020. Rethinking the Value of Labels for Improving Class-Imbalanced Learning. In Proceedings of 34th the Annual Conference on Neural Information Processing Systems (NeurIPS ’20).
[71]
Yuzhe Yang, Kaiwen Zha, Ying-Cong Chen, Hao Wang, and Dina Katabi. 2021. Delving into Deep Imbalanced Regression. In Proceedings of the 38th International Conference on Machine Learning (ICML ’21). PMLR, 11842–11851.
[72]
Hongbo Zhang, Bineng Zhong, Qing Lei, Ji-Xiang Du, Jialin Peng, Duansheng Chen, and Xiao Ke. 2017. Sparse Representation-Based Semi-Supervised Regression for People Counting. ACM Transactions on Multimedia Computing Communications and Applications, 13, 4 (2017), 47:1–47:17. https://doi.org/10.1145/3106156
[73]
Waleed Zogaan, Palak Sharma, Mehdi Mirakhorli, and Venera Arnaoudova. 2017. Datasets from Fifteen Years of Automated Requirements Traceability Research: Current State, Characteristics, and Quality. In Proceedings of the 25th International Requirements Engineering Conference. IEEE Computer Society, 110–121. https://doi.org/10.1109/RE.2017.80
[74]
Yang Zou, Zhiding Yu, B. V. K. Vijaya Kumar, and Jinsong Wang. 2018. Unsupervised Domain Adaptation for Semantic Segmentation via Class-Balanced Self-Training. In Proceedings of the 15th European Conference on Computer Vision (ECCV ’18). Springer, 297–313. https://doi.org/10.1007/978-3-030-01219-9_18
[75]
Yang Zou, Zhiding Yu, Xiaofeng Liu, B. V. K. Vijaya Kumar, and Jinsong Wang. 2019. Confidence Regularized Self-Training. In 2019 IEEE/CVF International Conference on Computer Vision (ICCV ’19). IEEE, 5981–5990. https://doi.org/10.1109/ICCV.2019.00608

Cited By

View all
  • (2025)MPLinker: Multi-template Prompt-tuning with adversarial training for Issue–commit Link recoveryJournal of Systems and Software10.1016/j.jss.2025.112351223(112351)Online publication date: May-2025
  • (2024)Deep semi-supervised learning for recovering traceability links between issues and commitsJournal of Systems and Software10.1016/j.jss.2024.112109216(112109)Online publication date: Oct-2024
  • (2024)Measuring software engineer's contribution in practice: An industrial experience reportJournal of Software: Evolution and Process10.1002/smr.272237:1Online publication date: 4-Sep-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
ESEC/FSE 2022: Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering
November 2022
1822 pages
ISBN:9781450394130
DOI:10.1145/3540250
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 09 November 2022

Permissions

Request permissions for this article.

Check for updates

Badges

Author Tags

  1. Data Imbalance
  2. Data Sparsity
  3. Industry Practice
  4. Learning-based Model
  5. Semi-supervised Learning
  6. Software Traceability

Qualifiers

  • Research-article

Funding Sources

Conference

ESEC/FSE '22
Sponsor:

Acceptance Rates

Overall Acceptance Rate 112 of 543 submissions, 21%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)70
  • Downloads (Last 6 weeks)6
Reflects downloads up to 19 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2025)MPLinker: Multi-template Prompt-tuning with adversarial training for Issue–commit Link recoveryJournal of Systems and Software10.1016/j.jss.2025.112351223(112351)Online publication date: May-2025
  • (2024)Deep semi-supervised learning for recovering traceability links between issues and commitsJournal of Systems and Software10.1016/j.jss.2024.112109216(112109)Online publication date: Oct-2024
  • (2024)Measuring software engineer's contribution in practice: An industrial experience reportJournal of Software: Evolution and Process10.1002/smr.272237:1Online publication date: 4-Sep-2024
  • (2023)On Preparing and Assessing Data for Process Simulation Modeling: An Industrial Report2023 IEEE/ACM International Conference on Software and System Processes (ICSSP)10.1109/ICSSP59042.2023.00016(58-69)Online publication date: May-2023
  • (2023)EALink: An Efficient and Accurate Pre-Trained Framework for Issue-Commit Link Recovery2023 38th IEEE/ACM International Conference on Automated Software Engineering (ASE)10.1109/ASE56229.2023.00059(217-229)Online publication date: 11-Sep-2023

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media