Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Chinese Question Classification Based on ERNIE and Feature Fusion

  • Conference paper
  • First Online:
Natural Language Processing and Chinese Computing (NLPCC 2020)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 12431))

Abstract

Question classification (QC) is a basic task of question answering (QA) system. This task effectively narrows the range of candidate answers and improves the operating efficiency of the system by providing semantic restrictions for the subsequent steps of information retrieval and answer extraction. Due to the small number of words in the question, it is difficult to extract deep semantic information for the existing QC methods. In this work, we propose a QC method based on ERNIE and feature fusion. We approach this problem by first using ERNIE to generate word vectors, which we then use to input into the feature extraction model. Next, we propose to combine the hybrid neural network (CNN-BILSTM, which extracts features independently), highway network and DCU (Dilated Composition Units) module as the feature extraction model. Experimental results on Fudan university’s question classification data set and NLPCC(QA)-2018 data set show that our method can improve the accuracy, recall rate and F1 of the QC task.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    http://code.google.com/p/fudannlp/w/edit/QuestionClassification.

References

  1. Xu, J., Zhang, D., Li, S., Wang, H.: Research on question classification via bilingual information. J. Chin. Inf. Process. 31(05), 171–177 (2017). ISBN 1003-0077

    Google Scholar 

  2. Zhang, D., Li, S., Wang, J.: Semi-supervised question classification with jointly learning question and answer representations. J. Chin. Inf. Process. 31(01), 1–7 (2017). ISBN 1003-0077

    Google Scholar 

  3. Barigou, F.: Impact of instance selection on kNN-based text categorization. J. Inf. Process. Syst. 14 (2018)

    Google Scholar 

  4. Fan, Z., Su, L., Liu, X., Wang, S.: Multi-label Chinese question classification based on word2vec. pp. 546–550 (2017)

    Google Scholar 

  5. Yu, B., Xu, Q., Zhang, P.: Question classification based on MAC-LSTM, p. 75 (2018)

    Google Scholar 

  6. Yang, S., Gao, C.: Enriching basic features via multilayer bag-of-words binding for Chinese question classification. CAAI Trans. Intell. Tech. 2(3), 133–140 (2017)

    Article  Google Scholar 

  7. Wang. D., Nyberg, E.: A long short-term memory model for answer sentence selection in question answering. In: Meeting of the Association for Computational Linguistics & the International Joint Conference on Natural Language Processing (2015)

    Google Scholar 

  8. Zhou, C., Sun, C., Liu, Z., Lau, F.C.M.: A C-LSTM neural network for text classification. Comput. Sci. 1(4), 39–44 (2015)

    Google Scholar 

  9. Wen, Y., Zhang, W., Luo, R., Wang, J.: Learning text representation using recurrent convolutional neural network with highway layers (2016)

    Google Scholar 

  10. Liu, J., Yang, Y., Lv, S., Wang, J., Chen, H.: Attention-based BiGRU-CNN for Chinese question classification. J. Ambient Intell. Hum. Comput. (2) (2019). https://doi.org/10.1007/s12652-019-01344-9

  11. Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. Computer Science (2013)

    Google Scholar 

  12. Pennington, J., Socher, R., Manning, C.: Glove: global vectors for word representation. In: Conference on Empirical Methods in Natural Language Processing (2014)

    Google Scholar 

  13. Devlin, J., Chang, M.-W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding (2018)

    Google Scholar 

  14. Sun, Y., Wang, S., Li, Y., Feng, S., Wu, H.: ERNIE: enhanced representation through knowledge integration (2019)

    Google Scholar 

  15. Srivastava, R.K., Greff, K., Schmidhuber, J.: Training very deep networks. Computer Science (2015)

    Google Scholar 

  16. Yi, T., Tuan, L.A., Hui, S.C.: Multi-granular sequence encoding via dilated compositional units for reading comprehension. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (2018)

    Google Scholar 

  17. Kim, Y.: Convolutional Neural Networks for Sentence Classification. Eprint Arxiv (2014)

    Google Scholar 

Download references

Acknowledgements

This research work has been partially supported by two NSFC grants, No. 61972003 and No. 61672040.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jianyong Duan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Liu, G., Yuan, Q., Duan, J., Kou, J., Wang, H. (2020). Chinese Question Classification Based on ERNIE and Feature Fusion. In: Zhu, X., Zhang, M., Hong, Y., He, R. (eds) Natural Language Processing and Chinese Computing. NLPCC 2020. Lecture Notes in Computer Science(), vol 12431. Springer, Cham. https://doi.org/10.1007/978-3-030-60457-8_28

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-60457-8_28

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-60456-1

  • Online ISBN: 978-3-030-60457-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics