Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

A Robust Classification Framework for Medical Patents Based on Deep Learning

  • Conference paper
  • First Online:
Advances in Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD 2020)

Abstract

With the repaid development of bioinformatics and pharmaceutical engineering, pharmaceutical company and institutes increasingly pay attention to intellectual property protection via medical patents. As a result, how to classify the massive medical patents accurately without manual intervention is an important challenge for academia and industrials. To address it, we propose a deep learning based classification framework for medical patents, which consists of three components (i.e., text processing, feature extraction, and prototype clustering). Different from the existing classification method based on machine learning, the proposed framework enjoys the robust characteristic for the external samples, while it can guarantee high precision. In detail, for the text processors, a professional medical text thesaurus is built via the GloVe method, which can learn more specialized vocabulary in the medical field. In the feature extraction, a hybrid deep learning model is proposed to extract the features of patent texts, which integers a one-dimensional convolutional neural network (CNN) and two bidirectional long-short-term sequence network (Bi-LSTM), propose an improved distance-based center loss function (DCL). Finally, extensive experiments are conducted on the Chinese medical patents dataset supported by the company. It demonstrates that our proposed method shows the significant superiority in the classification precision and robustness, compared with other existing multi-classification methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 229.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 299.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    https://www.yaozh.com/

References

  1. Wu, B., Miao, Y.N., Peng, X.Q., et al.: Patent protection strategy of technical standards in medical innovation. Chin. J. New Drugs 27(5), 494–497 (2018)

    Google Scholar 

  2. Han, J.W., Kamber, M.: Data Mining: Concepts and Techniques. Morgan Kaufmann Publishers, New York (2001)

    MATH  Google Scholar 

  3. Yong, Z., Li, Y., Xia, S.: An improved KNN text classification algorithm based on clustering. J. Comput. 4(3), 230–237 (2009)

    Google Scholar 

  4. Sun, A., Lim, E.P., Liu, Y.: On strategies for imbalanced text classification using SVM: a comparative study. Decis. Support Syst. 48(1), 191–201 (2009)

    Article  Google Scholar 

  5. Hu, J., Li, S., et al.: A patent classification model based on convolutional neural networks and rand forest. Sci. Technol. Eng. 18(6), 268–272 (2018)

    Google Scholar 

  6. Yang, H.M., Zhang, X.Y., Yin, F.C., Liu, L.: Robust classification with convolutional prototype learning. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3474–3482 (2018)

    Google Scholar 

  7. Kim, Y.: Convolutional neural networks for sentence classification. Eprint Arxiv (2014)

    Google Scholar 

  8. Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)

    Article  Google Scholar 

  9. Zhang, S., Zheng, D.Q., Hu, C., Yang, M.: Bidirectional long short-term memory networks for relation classification. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, pp. 207–212, August 2016

    Google Scholar 

  10. Hodge, V.J., Austin, J.: A survey of outlier detection methodologies. Artif. Intell. Rev. 22, 85–126 (2004)

    Article  Google Scholar 

  11. Song, J., Huang, X., Qin, S., et al.: A bi-directional sampling based on K-means method for imbalance text classification. In: International Conference on Computer and Information Science (ICIS), pp. 1–5 (2016)

    Google Scholar 

  12. Han, E.-H., Karypis, G., Kumar, V.: Text categorization using weight adjusted k-nearest neighbor classification. In: Cheung, D., Williams, G.J., Li, Q. (eds.) PAKDD. LNCS, vol. 2035, pp. 53–65. Springer, Heidelberg (2001). https://doi.org/10.1007/3-540-45357-1_9

    Chapter  Google Scholar 

  13. Crammer, K., Gilad-Bachrach, R., Navot, A., et al.: Margin analysis of the LVQ Algorithm. In: Advances in Neural Information Processing Systems, pp. 462–469 (2003)

    Google Scholar 

  14. Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space, Computer Science, January 2013

    Google Scholar 

Download references

Acknowledgment

This work was supported in part by the National Natural Science Foundation of China under Grant 61602434, in part by Chongqing research program of technology innovation and application under grant cstc2019jscx-zdztzxX0019, in part by Chongqing research program of key standard technologies innovation of key industries under grant cstc2017zdcy-zdyfX0076, in part by Youth Innovation Promotion Association CAS, No. 2017393, in part by Nanchong major scientific and technological achievements conversion project 18SXHZ0386, Scientific research Fund for talents of the China West Normal University, No. 17YC149 and in part by Education program of the Ministry of Education project 201702049002.

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Wang, S., Long, M., Shi, X., He, X., Shang, M. (2021). A Robust Classification Framework for Medical Patents Based on Deep Learning. In: Meng, H., Lei, T., Li, M., Li, K., Xiong, N., Wang, L. (eds) Advances in Natural Computation, Fuzzy Systems and Knowledge Discovery. ICNC-FSKD 2020. Lecture Notes on Data Engineering and Communications Technologies, vol 88. Springer, Cham. https://doi.org/10.1007/978-3-030-70665-4_25

Download citation

Publish with us

Policies and ethics