Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3183440.3195081acmconferencesArticle/Chapter ViewAbstractPublication PagesicseConference Proceedingsconference-collections
poster

Duplicate finder toolkit

Published: 27 May 2018 Publication History

Abstract

Software documentation is a significant component of modern software systems. Each year it becomes more and more complicated, just as the software itself. One of the aspects that negatively impact documentation quality is the presence of textual duplicates. Textual duplicates encountered in software documentation are inherently imprecise, i.e. in a single document the same information may be presented many times with different levels of detail and in various contexts. Documentation maintenance is an acute problem, and there is a strong demand for automation tools in this domain.
In this study we present the Duplicate Finder Toolkit, a tool which assists an expert with duplicate maintenance-related tasks. Our tool can facilitate the maintenance process in a number of ways: 1) detection of both exact and near duplicates 2) duplicate visualization via heat maps 3) duplicate analysis - comparison of several duplicate instances, evaluation of their differences, exploration of duplicate context 4) duplicate manipulation and extraction.

References

[1]
H.A. Basit, S.J. Puglisi, W.F. Smyth, A. Turpin, and S. Jarzabek. 2007. Efficient Token Based Clone Detection with Flexible Tokenization. In ESEC-FSE companion '07. 513--516.
[2]
Paul G. Bassett. 1997. Framing Software Reuse: Lessons from the Real World. Prentice-Hall, Inc., Upper Saddle River, NJ, USA.
[3]
Michihiro Horie and Shigeru Chiba. 2010. Tool Support for Crosscutting Concerns of API Documentation. In Proc of AOSD '10. 97--108.
[4]
E. Juergens, F. Deissenboeck, M. Feilkas, B. Hummel, B. Schaetz, S. Wagner, C. Domann, and J. Streit. 2010. Can clone detection support quality assessments of requirements specifications?. In proc of ICSE'10, Vol. 2. 79--88.
[5]
D.V. Koznov and K.Y. Romanovsky. 2008. A method for software product lines documentation development. Programming and Computer Software 34, 4 (8 2008), 216--224.
[6]
D. V. Luciv, D. V. Koznov, G. A. Chernishev, and A. N. Terekhov. 2017. Detecting Near Duplicates in Software Documentation. ArXiv e-prints (Nov. 2017). arXiv:cs.SE/1711.04705
[7]
Milan Nosál' and Jaroslav Porubän. 2014. Reusable software documentation with phrase annotations. CEJCS 4, 4 (2014), 242--258.
[8]
M. A. Oumaziz, A. Charpentier, J. Falleri, and X. Blanc. 2017. Documentation Reuse: Hot or Not? An Empirical Study. Springer, Cham, 12--27.
[9]
David Lorge Parnas. 2011. Precise Documentation: The Key to Better Software. Springer Berlin Heidelberg, Berlin, Heidelberg, 125--148.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
ICSE '18: Proceedings of the 40th International Conference on Software Engineering: Companion Proceeedings
May 2018
231 pages
ISBN:9781450356633
DOI:10.1145/3183440
  • Conference Chair:
  • Michel Chaudron,
  • General Chair:
  • Ivica Crnkovic,
  • Program Chairs:
  • Marsha Chechik,
  • Mark Harman
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 27 May 2018

Check for updates

Author Tags

  1. copy-paste
  2. meaningful search
  3. near duplicates
  4. software clone detection
  5. software documents

Qualifiers

  • Poster

Conference

ICSE '18
Sponsor:

Acceptance Rates

Overall Acceptance Rate 276 of 1,856 submissions, 15%

Upcoming Conference

ICSE 2025

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)4
  • Downloads (Last 6 weeks)0
Reflects downloads up to 13 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Calculating Similarity of Javadoc CommentsProgramming and Computing Software10.1134/S036176882401004350:1(85-89)Online publication date: 22-May-2024
  • (2021)CentrisProceedings of the 43rd International Conference on Software Engineering10.1109/ICSE43902.2021.00083(860-872)Online publication date: 22-May-2021
  • (2021)Visualization of ClonesCode Clone Analysis10.1007/978-981-16-1927-4_8(107-120)Online publication date: 4-Aug-2021
  • (2019)Interactive Near Duplicate Search in Software DocumentationProgramming and Computer Software10.1134/S036176881906004545:6(346-355)Online publication date: 3-Dec-2019
  • (2019)Extraction of Archetype from Near Duplicates in Software Documentation2019 Actual Problems of Systems and Software Engineering (APSSE)10.1109/APSSE47353.2019.00023(126-130)Online publication date: Nov-2019

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media