Analyzing Code Comments to Boost Program Comprehension

Shinyama, Yusuke; Arahori, Yoshitaka; Gondow, Katsuhiko

doi:10.1109/APSEC.2018.00047

arXiv:1905.02050 (cs)

[Submitted on 6 May 2019 (v1), last revised 17 Mar 2022 (this version, v2)]

Abstract: We are trying to find source code comments that help programmers understand a nontrivial part of source code. One such example is a comment explaining that assigning zero is a way to "clear" a buffer. Such comments are invaluable to programmers, and identifying them correctly would be of great help. Toward this goal, we developed a method to discover explanatory code comments in source code. We first propose eleven distinct categories of code comments. We then developed a decision-tree based classifier that can identify explanatory comments with 60% precision and 80% recall. We analyzed 2,000 GitHub projects written in two languages: Java and Python. This task is novel in that it focuses on a microscopic comment ("local comment") within a method or function, in contrast to prior efforts that focused on API- or method-level comments. We also investigated how different categories of comments are used in different projects. Our key finding is that there are two dominant types of comments: preconditional and postconditional. Our findings also suggest that many English code comments have a certain grammatical structure that is consistent across different projects.
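
To make the notion of a "local comment" concrete, the following is a minimal sketch of how comments inside a function body might be collected from Python source. It is an illustration only, not the authors' extraction method or classifier: the function name `local_comments`, the sample snippet, and the rule used to decide what counts as "inside" a function (any line after the `def` line up to the function's last statement) are all assumptions made here.

```python
import ast
import io
import tokenize

# Hypothetical sample: one module-level comment and two local comments.
SAMPLE = (
    "# module-level comment\n"
    "def clear(buf):\n"
    "    # clear the buffer\n"
    "    buf[0] = 0  # zero marks it empty\n"
    "    return buf\n"
)


def local_comments(source):
    """Return (line number, text) pairs for comments inside function bodies.

    Assumption: a comment is "local" if it sits on a line after the `def`
    line and no later than the last statement of that function.
    """
    tree = ast.parse(source)
    spans = [
        (node.lineno + 1, node.end_lineno)
        for node in ast.walk(tree)
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
    ]
    found = []
    for tok in tokenize.generate_tokens(io.StringIO(source).readline):
        if tok.type != tokenize.COMMENT:
            continue
        line = tok.start[0]
        if any(first <= line <= last for first, last in spans):
            # Drop the leading '#' and surrounding whitespace.
            found.append((line, tok.string.lstrip("#").strip()))
    return found


if __name__ == "__main__":
    for line, text in local_comments(SAMPLE):
        print(line, text)
```

A classifier such as the decision tree mentioned in the abstract would operate on comments gathered in some comparable way; how the authors actually extract and label them is described in the paper itself, not here.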

Subjects:	Software Engineering (cs.SE)
Cite as:	arXiv:1905.02050 [cs.SE]
	(or arXiv:1905.02050v2 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.1905.02050
Journal reference:	Proceedings of 2018 25th Asia-Pacific Software Engineering Conference (APSEC), pp. 325-334
Related DOI:	https://doi.org/10.1109/APSEC.2018.00047

Submission history

From: Yusuke Shinyama
[v1] Mon, 6 May 2019 13:55:41 UTC (210 KB)
[v2] Thu, 17 Mar 2022 02:59:53 UTC (209 KB)
