Medical code prediction with multi-view convolution and description-regularized label-dependent attention

Sadoughi, Najmeh; Finley, Greg P.; Fone, James; Murali, Vignesh; Korenevski, Maxim; Baryshnikov, Slava; Axtmann, Nico; Miller, Mark; Suendermann-Oeft, David

arXiv:1811.01468 (cs)

[Submitted on 5 Nov 2018]

Abstract: A ubiquitous task in processing electronic medical data is the assignment of standardized codes representing diagnoses and/or procedures to free-text documents such as medical reports. This is a difficult natural language processing task that requires parsing long, heterogeneous documents and selecting a set of appropriate codes from tens of thousands of possibilities, many of which have very few positive training samples. We present a deep learning system that advances the state of the art for the MIMIC-III dataset, achieving a new best micro F1-measure of 55.85%, significantly outperforming the previous best result (Mullenbach et al. 2018). We achieve this through a number of enhancements, including two major novel contributions: multi-view convolutional channels, which effectively learn to adjust kernel sizes throughout the input; and attention regularization, mediated by natural-language code descriptions, which helps overcome sparsity for thousands of uncommon codes. These and other modifications are selected to address difficulties inherent to both automated coding specifically and deep learning generally. Finally, we investigate our accuracy results in detail to individually measure the impact of these contributions and point the way towards future algorithmic improvements.
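The headline number in the abstract is a micro-averaged F1 score over multi-label code assignments. As a minimal sketch of how such a score is typically computed (not the authors' evaluation code), the snippet below pools true positives, false positives, and false negatives across all documents before computing precision and recall. The function name `micro_f1` and the toy code sets are assumptions made purely for illustration.

```python
def micro_f1(gold, predicted):
    """Micro-averaged F1 for multi-label predictions.

    `gold` and `predicted` are parallel lists; each element is the set of
    codes assigned to one document. Counts are pooled over all documents,
    so frequent codes weigh more than rare ones.
    """
    tp = fp = fn = 0
    for gold_codes, pred_codes in zip(gold, predicted):
        gold_codes, pred_codes = set(gold_codes), set(pred_codes)
        tp += len(gold_codes & pred_codes)   # codes correctly predicted
        fp += len(pred_codes - gold_codes)   # predicted but not in gold
        fn += len(gold_codes - pred_codes)   # in gold but missed
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy data (hypothetical, not taken from MIMIC-III): three documents.
gold = [{"401.9", "428.0"}, {"250.00"}, {"38.93", "401.9"}]
predicted = [{"401.9"}, {"250.00", "428.0"}, {"38.93", "401.9"}]
score = micro_f1(gold, predicted)
print(f"micro F1 = {score:.4f}")
```

Because counts are pooled before averaging, micro F1 is dominated by common codes. A macro-averaged score, which averages per-code F1, would weight the thousands of rare codes mentioned in the abstract more heavily.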

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:1811.01468 [cs.CL]
	(or arXiv:1811.01468v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1811.01468

Submission history

From: Najmeh Sadoughi
[v1] Mon, 5 Nov 2018 00:54:03 UTC (1,029 KB)
