Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Mining and searching app reviews for requirements engineering: : Evaluation and replication studies

Published: 01 March 2023 Publication History

Abstract

App reviews provide a rich source of feature-related information that can support requirement engineering activities. Analyzing them manually to find this information, however, is challenging due to their large quantity and noisy nature. To overcome the problem, automated approaches have been proposed for ‘feature-specific analysis’. Unfortunately, the effectiveness of these approaches has been evaluated using different methods and datasets. Replicating these studies to confirm their results and to provide benchmarks of different approaches is a challenging problem. We address the problem by extending previous evaluations and performing a comparison of these approaches. In this paper, we present two empirical studies. In the first study, we evaluate opinion mining approaches; the approaches extract features discussed in app reviews and identify their associated sentiments. In the second study, we evaluate approaches searching for feature-related reviews. The approaches search for users’ feedback pertinent to a particular feature. The results of both studies show these approaches achieve lower effectiveness than reported originally, and raise an important question about their practical use.

Highlights

Study the performance of mining and searching techniques applied to app reviews.
The new dataset can improve the quality of evaluation and replication studies.
Opinion mining approaches achieve lower effectiveness than reported originally.
Existing searching tools can be useful for requirements engineering use cases.
Researchers should pay more attention to the efficiency of their approaches.

References

[1]
Martin W., Sarro F., Jia Y., Zhang Y., Harman M., A survey of app store analysis for software engineering, IEEE Trans. Softw. Eng. 43 (9) (2017) 817–847. doi.ieeecomputersociety.org/10.1109/TSE.2016.2630689.
[2]
Ceci L., Number of apps available in leading app stores as of 2021, 2021, https://www.statista.com/statistics/276623/number-of-apps-available-in-leading-app-stores/ (Accessed: 20 Jan 2021).
[3]
AlSubaihin A., Sarro F., Black S., Capra L., Harman M., App store effects on software engineering practices, IEEE Trans. Softw. Eng. (2019) 1,.
[4]
Dąbrowski J., Letier E., Perini A., Susi A., Mining user feedback for software engineering: Use cases and reference architecture, in: 30th IEEE International Requirements Engineering Conference, RE 2022, Melbourne, Australia, August (2022) 15-19, IEEE, 2022, pp. 114–126,.
[5]
Dąbrowski J., Mining App Reviews to Support Software Engineering, (Ph.D. thesis) University College London, 2022, https://discovery.ucl.ac.uk/id/eprint/10149747/.
[6]
D. Pagano, W. Maalej, User feedback in the appstore: An empirical study, in: 2013 21st IEEE International Requirements Engineering Conference, RE, 2013, pp. 125–134.
[7]
Dąbrowski J., Letier E., Perini A., Susi A., Analysing app reviews for software engineering: A systematic literature review, Empir. Softw. Eng. 27 (2022) 43,.
[8]
J.O. Johanssen, A. Kleebaum, B. Bruegge, B. Paech, How do practitioners capture and utilize user feedback during continuous software engineering?, in: 2019 IEEE 27th International Requirements Engineering Conference, 2019.
[9]
A. Begel, T. Zimmermann, Analyze this! 145 questions for data scientists in software engineering, in: 36th International Conference on Software Engineering, 2014, pp. 12–13.
[10]
Buse R.P.L., Zimmermann T., Information needs for software development analytics, in: 34th International Conference on Software Engineering, 2012, pp. 987–996,.
[11]
Groen E.C., Seyff N., Ali R., Dalpiaz F., Doerr J., Guzman E., Hosseini M., Marco J., Oriol M., Perini A., Stade M., The crowd in requirements engineering: The landscape and challenges, IEEE Softw. 34 (2) (2017) 44–52,.
[12]
App Annie, 2020, https://www.appannie.com/ (Accessed: 20 Jan 2022).
[13]
Dąbrowski J., Letier E., Perini A., Susi A., Finding and analyzing app reviews related to specific features: A research preview, in: 25th International Working Conference, REFSQ 2019, Essen, Germany, March (2019) 18-21, Proceedings, 2019, pp. 183–189,.
[14]
Iacob C., Faily S., Harrison R., Maram: Tool support for mobile app review management, in: Proceedings of the 8th EAI International Conference on Mobile Computing, Applications and Services, MobiCASE ’16, ICST (Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering, 2016, pp. 42–50.
[15]
Johann T., Stanik C., A.M.A.B., Maalej W., Safe: A simple approach for feature extraction from app descriptions and app reviews, in: 2017 IEEE 25th International Requirements Engineering Conference, 2017, pp. 21–30,.
[16]
Liu B., Sentiment analysis and opinion mining, Synthesis Lectures on Human Language Technologies, Morgan & Claypool Publishers, 2012,.
[17]
E. Guzman, W. Maalej, How do users like this feature? a fine grained sentiment analysis of app reviews, in: 2014 IEEE 22nd International Requirements Engineering Conference, RE, 2014, pp. 153–162.
[18]
Dragoni M., Federici M., Rexha A., An unsupervised aspect extraction strategy for monitoring real-time reviews stream, Inf. Process. Manage. 56 (3) (2019) 1103–1118,.
[19]
Dąbrowski J., Letier E., Perini A., Susi A., Mining user opinions to support requirement engineering: An empirical study, in: Dustdar S., Yu E., Salinesi C., Rieu D., Pant V. (Eds.), Advanced Information Systems Engineering, Springer International Publishing, Cham, 2020, pp. 401–416.
[20]
Gu X., Kim S., What parts of your apps are loved by users? (t), in: 30th International Conference on Automated Software Engineering, 2015, pp. 760–770,.
[21]
Shah F., Sirts K., Pfahl D., Simulating the impact of annotation guidelines and annotated data on extracting app features from app reviews, in: Proceedings of the 14th International Conference on Software Technologies, ICSOFT 2019, SCITEPRESS - Science and Technology Publications, Lda, Setubal, PRT, 2019, pp. 384–396,.
[22]
Shah F.A., Sirts K., Pfahl D., [Is the safe] approach too simple for app feature extraction? A replication study, in: 25th International Working Conference, REFSQ 2019, Essen, Germany, March (2019) 18-21, Proceedings, 2019, pp. 21–36,.
[23]
McCandless M., Hatcher E., Gospodnetic O., Lucene in Action, Second Edition: Covers Apache Lucene 3.0, Manning Publications Co., USA, 2010.
[24]
Dąrowski J., Replication package for a journal paper in information systems - mining and searching app reviews for requirements engineering, 2022, https://github.com/jsdabrowski/IS-22/.
[25]
Batory D.S., Feature models, grammars, and propositional formulas, in: Obbink J.H., Pohl K. (Eds.), Software Product Lines, 9th International Conference, in: 3714 of Lecture Notes in Computer Science, SPLC 2005, Rennes, France, September (2005) 26-29, Proceedings, Springer, 2005, pp. 7–20,.
[26]
Wiegers K.E., Beatty J., Software Requirements 3, Microsoft Press, USA, 2013.
[27]
Kang K., Cohen S., Hess J., Novak W., Peterson A., Feature-Oriented Domain Analysis (FODA) Feasibility Study, Software Engineering Institute, Carnegie Mellon University, Pittsburgh, PA, 1990.
[28]
Groen E.C., Kopczyńska S., Hauer M.P., Krafft T.D., Doerr J., Users – the hidden software product quality experts?: A study on how app users report quality aspects in online reviews, in: 2017 IEEE 25th International Requirements Engineering Conference, RE, 2017, pp. 80–89,.
[29]
Jha N., Mahmoud A., Mining non-functional requirements from app store reviews, Empir. Softw. Eng. 24 (6) (2019) 3659–3695.
[30]
Lim S., Henriksson A., Zdravkovic J., Data-driven requirements elicitation: A systematic literature review, SN Comput. Sci. 2 (1) (2021) 16.
[31]
Dalpiaz F., Parente M., [Re-SWOT:] From user feedback to requirements via competitor analysis, in: 25th International Working Conference, REFSQ 2019, Essen, Germany, March (2019) 18-21, Proceedings, 2019, pp. 55–70,.
[32]
Manning C.D., Schütze H., Foundations of Statistical Natural Language Processing, MIT Press, Cambridge, MA, USA, 1999.
[33]
Thelwall M., Buckley K., Paltoglou G., Cai D., Kappas A., Sentiment strength detection in short informal text, J. Am. Soc. Inf. Sci. Technol. 61 (12) (2010) 2544–2558. https://onlinelibrary.wiley.com/doi/abs/10.1002/asi.21416.
[34]
NLTK: Sample usage for collocations, 2022, https://www.nltk.org/howto/collocations.html.
[35]
Apache lucene 7.4.0 documentation, 2022, https://lucene.apache.org/core/7_4_0/index.html.
[36]
Vu P.M., Nguyen T.T., Pham H.V., Nguyen T.T., Mining user opinions in mobile app reviews: A keyword-based approach (t), in: Proceedings of the 2015 30th IEEE/ACM International Conference on Automated Software Engineering, ASE, ASE ’15, 2015, pp. 749–759,.
[37]
McAuley J., Targett C., Shi Q., van den Hengel A., Image-based recommendations on styles and substitutes, in: 38th International Conference on Research and Development in Information Retrieval, ACM, 2015, pp. 43–52,.
[38]
Pustejovsky J., Stubbs A., Natural Language Annotation for Machine Learning - a Guide to Corpus-Building for Applications, O’Reilly, 2012.
[39]
Cunningham H., Maynard D., Tablan V., Ursu C., Bontcheva K., Developing Language Processing Components with GATE Version 8, University of Sheffield Department of Computer Science, 2014.
[40]
Croft B., Metzler D., Strohman T., Search Engines: Information Retrieval in Practice, first ed., Addison-Wesley Publishing Company, USA, 2009.
[41]
L. Hoon, R. Vasa, J.-G. Schneider, K. Mouzakis, A preliminary analysis of vocabulary in mobile app user reviews, in: Proceedings of the 24th Australian Computer-Human Interaction Conference, OzCHI ’12, ACM, New York, NY, USA, 2012, pp. 245–248, https://doi.org/10.1145/2414536.2414578.
[42]
Lin B., Zampetti F., Bavota G., Di Penta M., Lanza M., Oliveto R., Sentiment analysis for software engineering: How far can we go?, in: 40th International Conference on Software Engineering, 2018, pp. 94–104,.
[43]
Calefato F., Lanubile F., Maiorano F., Novielli N., Sentiment polarity detection for software development, Empirical Softw. Engg. 23 (3) (2018) 1352–1382,.
[44]
Berry D.M., Cleland-Huang J., Ferrari A., Maalej W., Mylopoulos J., Zowghi D., Panel: Context-dependent evaluation of tools for nl re tasks: Recall vs. precision, and beyond, in: 2017 IEEE 25th International Requirements Engineering Conference, RE, 2017, pp. 570–573,.
[45]
Wohlin C., Runeson P., Hst M., Ohlsson M.C., Regnell B., Wessln A., Experimentation in Software Engineering, Springer Publishing, Company, Incorporated, 2012.
[46]
Al-Subaihin A.A., Software Engineering in the Age of App Stores: Feature-Based Analyses to Guide Mobile Software Engineers, (Ph.D. thesis,doctoral thesis) University College London, 2019.
[47]
Manning C.D., Raghavan P., Schütze H., Introduction to Information Retrieval, Cambridge University Press, USA, 2008.
[48]
Korfhage R.R., Information Storage and Retrieval, John Wiley & Sons, 1997.
[49]
Achimugu P., Selamat A., Ibrahim R., Mahrin M.N., A systematic literature review of software requirements prioritization research, Inf. Softw. Technol. 56 (6) (2014) 568–585,.
[50]
Martin W., Harman M., Jia Y., Sarro F., Zhang Y., The app sampling problem for app store mining, in: 2015 IEEE/ACM 12th Working Conference on Mining Software Repositories, 2015, pp. 123–133,.

Cited By

View all
  • (2024)Replication in Requirements Engineering: The NLP for RE CaseACM Transactions on Software Engineering and Methodology10.1145/365866933:6(1-33)Online publication date: 27-Jun-2024
  • (2024)Prioritizing user requirements for digital products using explainable artificial intelligenceFuture Generation Computer Systems10.1016/j.future.2024.04.037158:C(167-182)Online publication date: 1-Sep-2024

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Information Systems
Information Systems  Volume 114, Issue C
Mar 2023
357 pages

Publisher

Elsevier Science Ltd.

United Kingdom

Publication History

Published: 01 March 2023

Author Tags

  1. Mining user reviews
  2. Software engineering
  3. Feature extraction
  4. Sentiment analysis
  5. Searching for feature-related reviews
  6. Empirical study

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 09 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Replication in Requirements Engineering: The NLP for RE CaseACM Transactions on Software Engineering and Methodology10.1145/365866933:6(1-33)Online publication date: 27-Jun-2024
  • (2024)Prioritizing user requirements for digital products using explainable artificial intelligenceFuture Generation Computer Systems10.1016/j.future.2024.04.037158:C(167-182)Online publication date: 1-Sep-2024

View Options

View options

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media