
Interrogative-guided re-ranking for question-oriented software text retrieval

Published: 15 September 2014

Abstract

In many software engineering tasks, developers use question-oriented text retrieval to search for software artifacts. In this paper, we propose an interrogative-guided re-ranking approach for question-oriented software text retrieval. Because different interrogatives usually indicate different search focuses, we first label 9 kinds of question-answer pairs according to common interrogatives. We then train document classifiers on 1,826 questions and 2,460 answers from StackOverflow, apply the classifiers to our document repository, and present a re-ranking approach to improve retrieval precision. In software document classification, our classifiers achieve an average precision, recall, and F-measure of 56.2%, 90.9%, and 69.4%, respectively. Our re-ranking approach yields a 9.6% improvement in nDCG@1 over the baseline and an 8.1% improvement in nDCG@10 when more candidates are included.
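
For readers unfamiliar with the nDCG@k metric used to report the re-ranking results, the following is a minimal sketch of one common formulation (linear gain, log2 rank discount). The abstract does not say which gain function the paper uses, so this choice is an assumption, and the relevance labels and the names `dcg_at_k` and `ndcg_at_k` are hypothetical illustrations, not data or code from the paper.

```python
import math
from typing import Sequence


def dcg_at_k(relevances: Sequence[float], k: int) -> float:
    """Discounted cumulative gain over the top-k results (log2 rank discount)."""
    return sum(
        rel / math.log2(rank + 1)
        for rank, rel in enumerate(relevances[:k], start=1)
    )


def ndcg_at_k(relevances: Sequence[float], k: int) -> float:
    """DCG@k divided by the DCG@k of the ideal (descending) ordering."""
    ideal = dcg_at_k(sorted(relevances, reverse=True), k)
    return dcg_at_k(relevances, k) / ideal if ideal > 0 else 0.0


# Hypothetical graded relevance labels in ranked order (not from the paper).
baseline = [1, 3, 0, 2]   # ordering returned by an assumed baseline retriever
reranked = [3, 1, 2, 0]   # same documents after an assumed re-ranking step

for k in (1, 4):
    print(k, ndcg_at_k(baseline, k), ndcg_at_k(reranked, k))
```

nDCG@1 looks only at the top result, so it rewards moving the single most relevant document to rank 1; larger cutoffs such as nDCG@10 also credit improvements further down the list.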

      Published In

      ASE '14: Proceedings of the 29th ACM/IEEE International Conference on Automated Software Engineering
      September 2014
      934 pages
      ISBN:9781450330138
      DOI:10.1145/2642937

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Author Tags

      1. interrogative-guided
      2. ranking
      3. software text retrieval

      Qualifiers

      • Research-article

      Conference

      ASE '14
      Acceptance Rates

      ASE '14 Paper Acceptance Rate: 82 of 337 submissions (24%)
      Overall Acceptance Rate: 82 of 337 submissions (24%)

      Cited By

      • (2023) Automated Summarization of Stack Overflow Posts. 2023 IEEE/ACM 45th International Conference on Software Engineering (ICSE), pages 1853-1865. DOI: 10.1109/ICSE48619.2023.00158. Online publication date: May-2023
      • (2022) I Know What You Are Searching for: Code Snippet Recommendation from Stack Overflow Posts. ACM Transactions on Software Engineering and Methodology 32:3, pages 1-42. DOI: 10.1145/3550150. Online publication date: 21-Jul-2022
      • (2022) API-Related Developer Information Needs in Stack Overflow. IEEE Transactions on Software Engineering 48:11, pages 4485-4500. DOI: 10.1109/TSE.2021.3120203. Online publication date: 1-Nov-2022
      • (2020) Essential Sentences for Navigating Stack Overflow Answers. 2020 IEEE 27th International Conference on Software Analysis, Evolution and Reengineering (SANER), pages 229-239. DOI: 10.1109/SANER48275.2020.9054828. Online publication date: Feb-2020
      • (2018) A survey on mining stack overflow: question and answering (Q&A) community. Data Technologies and Applications 52:2, pages 190-247. DOI: 10.1108/DTA-07-2017-0054. Online publication date: 3-Apr-2018
      • (2015) Learning to rank for question-oriented software text retrieval. Proceedings of the 30th IEEE/ACM International Conference on Automated Software Engineering, pages 1-11. DOI: 10.1109/ASE.2015.24. Online publication date: 9-Nov-2015
