A Robust Classification Framework for Medical Patents Based on Deep Learning

Wang, Siyuan; Long, Mei; Shi, Xiaoyu; He, Xianbo; Shang, Mingsheng

doi:10.1007/978-3-030-70665-4_25

Siyuan Wang^8,9,
Mei Long¹⁰,
Xiaoyu Shi⁹,
Xianbo He⁸ &
…
Mingsheng Shang⁹

Part of the book series: Lecture Notes on Data Engineering and Communications Technologies ((LNDECT,volume 88))

Included in the following conference series:

The International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery

111 Accesses

Abstract

With the repaid development of bioinformatics and pharmaceutical engineering, pharmaceutical company and institutes increasingly pay attention to intellectual property protection via medical patents. As a result, how to classify the massive medical patents accurately without manual intervention is an important challenge for academia and industrials. To address it, we propose a deep learning based classification framework for medical patents, which consists of three components (i.e., text processing, feature extraction, and prototype clustering). Different from the existing classification method based on machine learning, the proposed framework enjoys the robust characteristic for the external samples, while it can guarantee high precision. In detail, for the text processors, a professional medical text thesaurus is built via the GloVe method, which can learn more specialized vocabulary in the medical field. In the feature extraction, a hybrid deep learning model is proposed to extract the features of patent texts, which integers a one-dimensional convolutional neural network (CNN) and two bidirectional long-short-term sequence network (Bi-LSTM), propose an improved distance-based center loss function (DCL). Finally, extensive experiments are conducted on the Chinese medical patents dataset supported by the company. It demonstrates that our proposed method shows the significant superiority in the classification precision and robustness, compared with other existing multi-classification methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 229.00; Price excludes VAT (USA)

Softcover Book: USD 299.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Enhancing patent text classification with Bi-LSTM technique and alpine skiing optimization for improved diagnostic accuracy

Article 08 May 2024

PatentNet: multi-label classification of patent documents using deep learning based language understanding

Article Open access 18 December 2021

Greek Patent Classification Using Deep Learning

Notes

1.
https://www.yaozh.com/

References

Wu, B., Miao, Y.N., Peng, X.Q., et al.: Patent protection strategy of technical standards in medical innovation. Chin. J. New Drugs 27(5), 494–497 (2018)
Google Scholar
Han, J.W., Kamber, M.: Data Mining: Concepts and Techniques. Morgan Kaufmann Publishers, New York (2001)
MATH Google Scholar
Yong, Z., Li, Y., Xia, S.: An improved KNN text classification algorithm based on clustering. J. Comput. 4(3), 230–237 (2009)
Google Scholar
Sun, A., Lim, E.P., Liu, Y.: On strategies for imbalanced text classification using SVM: a comparative study. Decis. Support Syst. 48(1), 191–201 (2009)
Article Google Scholar
Hu, J., Li, S., et al.: A patent classification model based on convolutional neural networks and rand forest. Sci. Technol. Eng. 18(6), 268–272 (2018)
Google Scholar
Yang, H.M., Zhang, X.Y., Yin, F.C., Liu, L.: Robust classification with convolutional prototype learning. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3474–3482 (2018)
Google Scholar
Kim, Y.: Convolutional neural networks for sentence classification. Eprint Arxiv (2014)
Google Scholar
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
Article Google Scholar
Zhang, S., Zheng, D.Q., Hu, C., Yang, M.: Bidirectional long short-term memory networks for relation classification. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, pp. 207–212, August 2016
Google Scholar
Hodge, V.J., Austin, J.: A survey of outlier detection methodologies. Artif. Intell. Rev. 22, 85–126 (2004)
Article Google Scholar
Song, J., Huang, X., Qin, S., et al.: A bi-directional sampling based on K-means method for imbalance text classification. In: International Conference on Computer and Information Science (ICIS), pp. 1–5 (2016)
Google Scholar
Han, E.-H., Karypis, G., Kumar, V.: Text categorization using weight adjusted k-nearest neighbor classification. In: Cheung, D., Williams, G.J., Li, Q. (eds.) PAKDD. LNCS, vol. 2035, pp. 53–65. Springer, Heidelberg (2001). https://doi.org/10.1007/3-540-45357-1_9
Chapter Google Scholar
Crammer, K., Gilad-Bachrach, R., Navot, A., et al.: Margin analysis of the LVQ Algorithm. In: Advances in Neural Information Processing Systems, pp. 462–469 (2003)
Google Scholar
Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space, Computer Science, January 2013
Google Scholar

Download references

Acknowledgment

This work was supported in part by the National Natural Science Foundation of China under Grant 61602434, in part by Chongqing research program of technology innovation and application under grant cstc2019jscx-zdztzxX0019, in part by Chongqing research program of key standard technologies innovation of key industries under grant cstc2017zdcy-zdyfX0076, in part by Youth Innovation Promotion Association CAS, No. 2017393, in part by Nanchong major scientific and technological achievements conversion project 18SXHZ0386, Scientific research Fund for talents of the China West Normal University, No. 17YC149 and in part by Education program of the Ministry of Education project 201702049002.

Author information

Authors and Affiliations

The Computer School of China West Normal University, Nanchong, 637002, Sichuan, China
Siyuan Wang & Xianbo He
Chongqing Key Laboratory of Big Data and Intelligent Computing, Chongqing Institute of Green and Intelligent Technology, Chinese Academy of Sciences, Chongqing, 400714, China
Siyuan Wang, Xiaoyu Shi & Mingsheng Shang
Chongqing Zhubajie Network Co., Ltd., Chongqing, 401120, China
Mei Long

Authors

Siyuan Wang
View author publications
You can also search for this author in PubMed Google Scholar
Mei Long
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoyu Shi
View author publications
You can also search for this author in PubMed Google Scholar
Xianbo He
View author publications
You can also search for this author in PubMed Google Scholar
Mingsheng Shang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

College of Engineering, Design and Physical Sciences, Brunel University London, Uxbridge, UK
Hongying Meng
School of Electronical Information and Artificial Engineering, Shaanxi University of Science and Technology, Xi’an, China
Tao Lei
College of Engineering, Design and Physical Sciences, Brunel University London, Uxbridge, UK
Maozhen Li
College of Electrical and Information, Hunan University, Changsha, China
Kenli Li
Division of Intelligent Future Technologies, Mälardalen University, Västerås, Västmanlands Län, Sweden
Ning Xiong
School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore, Singapore
Lipo Wang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wang, S., Long, M., Shi, X., He, X., Shang, M. (2021). A Robust Classification Framework for Medical Patents Based on Deep Learning. In: Meng, H., Lei, T., Li, M., Li, K., Xiong, N., Wang, L. (eds) Advances in Natural Computation, Fuzzy Systems and Knowledge Discovery. ICNC-FSKD 2020. Lecture Notes on Data Engineering and Communications Technologies, vol 88. Springer, Cham. https://doi.org/10.1007/978-3-030-70665-4_25

Download citation

DOI: https://doi.org/10.1007/978-3-030-70665-4_25
Published: 27 June 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-70664-7
Online ISBN: 978-3-030-70665-4
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics

A Robust Classification Framework for Medical Patents Based on Deep Learning

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Enhancing patent text classification with Bi-LSTM technique and alpine skiing optimization for improved diagnostic accuracy

PatentNet: multi-label classification of patent documents using deep learning based language understanding

Greek Patent Classification Using Deep Learning

Notes

References

Acknowledgment

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

A Robust Classification Framework for Medical Patents Based on Deep Learning

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Enhancing patent text classification with Bi-LSTM technique and alpine skiing optimization for improved diagnostic accuracy

PatentNet: multi-label classification of patent documents using deep learning based language understanding

Greek Patent Classification Using Deep Learning

Notes

References

Acknowledgment

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation