Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Knowledge Graph and Deep Learning-based Text-to-GraphQL Model for Intelligent Medical Consultation Chatbot

Published: 06 July 2022 Publication History

Abstract

Text-to-GraphQL (Text2GraphQL) is a task that converts the user's questions into Graph + QL (Query Language) when a graph database is given. That is a task of semantic parsing that transforms natural language problems into logical expressions, which will bring more efficient direct communication between humans and machines. The existing related work mainly focuses on Text-to-SQL tasks, and there is no available semantic parsing method and data set for the graph database. In order to fill the gaps in this field to serve the medical Human–Robot Interactions (HRI) better, we propose this task and a pipeline solution for the Text2GraphQL task. This solution uses the Adapter pre-trained by “the linking of GraphQL schemas and the corresponding utterances” as an external knowledge introduction plug-in. By inserting the Adapter into the language model, the mapping between logical language and natural language can be introduced faster and more directly to better realize the end-to-end human–machine language translation task. In the study, the proposed Text2GraphQL task model is mainly constructed based on an improved pipeline composed of a Language Model, Pre-trained Adapter plug-in, and Pointer Network. This enables the model to copy objects' tokens from utterances, generate corresponding GraphQL statements for graph database retrieval, and builds an adjustment mechanism to improve the final output. And the experiments have proved that our proposed method has certain competitiveness on the counterpart datasets (Spider, ATIS, GeoQuery, and 39.net) converted from the Text2SQL task, and the proposed method is also practical in medical scenarios.

References

[1]
Abd-Alrazaq AA, Alajlani M, Alalwan AA, Bewick BM, Gardner P, and Househ M An overview of the features of chatbots in mental health: A scoping review International Journal of Medical Informatics 2019 132
[2]
Amith M, Roberts K, and Tao C Conceiving an application ontology to model patient human papillomavirus vaccine counseling for dialogue management BMC Bioinformatics 2019 20 21 1-16
[3]
Ba, L. J., Kiros, J. R., & Hinton, G. E. (2016). Layer normalization. CoRR abs/1607.06450 (2016). ArXiv Preprint ArXiv:1607.06450, 178.
[4]
Bahdanau, D., Cho, K., & Bengio, Y. (2014). Neural machine translation by jointly learning to align and translate. ArXiv Preprint ArXiv:1409.0473.
[5]
Bogin, B., Berant, J., & Gardner, M. (2019). Representing Schema Structure with Graph Neural Networks for Text-to-SQL Parsing. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, 4560–4565.
[6]
Chao, G.-L., & Lane, I. (2019). BERT-DST: Scalable end-to-end dialogue state tracking with bidirectional encoder representations from transformer. Proc. Interspeech 2019, 1468–1472.
[7]
Che, W., Shao, Y., Liu, T., & Ding, Y. (2016). Semeval-2016 task 9: Chinese semantic dependency parsing. Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016), 1074–1080.
[8]
Chintagunta, B., Katariya, N., Amatriain, X., & Kannan, A. (2021). Medically aware GPT-3 as a data generator for medical dialogue summarization. Machine Learning for Healthcare Conference, 354–372.
[9]
Chuan C-H and Morgan S Creating and Evaluating Chatbots as eligibility assistants for clinical trials: An active deep learning approach towards user-centered classification ACM Transactions on Computing for Healthcare 2020 2 1 1-19
[10]
Conneau, A., & Lample, G. (2019). Cross-lingual language model pretraining. Proceedings of the 33rd International Conference on Neural Information Processing Systems, 7059–7069.
[11]
Cui, F., Cui, Q., & Song, Y. (2020). A Survey on Learning-Based Approaches for Modeling and Classification of Human-Machine Dialog Systems. IEEE Transactions on Neural Networks and Learning Systems.
[12]
Dahl, D. A., Bates, M., Brown, M., Fisher, W., Hunicke-Smith, K., Pallett, D., Pao, C., Rudnicky, A., & Shriberg, E. (1994). Expanding the scope of the ATIS task: The ATIS-3 corpus. Human Language Technology: Proceedings of a Workshop Held at Plainsboro, New Jersey, March 8–11, 1994. https://aclanthology.org/H94-1010
[13]
Dai, Z., Wang, X., Ni, P., Li, Y., Li, G., & Bai, X. (2019). Named entity recognition using BERT BiLSTM CRF for Chinese electronic health records. 2019 12th International Congress on Image and Signal Processing, Biomedical Engineering and Informatics (Cisp-Bmei), 1–5.
[14]
Finegan-Dollak, C., Kummerfeld, J. K., Zhang, L., Ramanathan, K., Sadasivam, S., Zhang, R., & Radev, D. (2018). Improving Text-to-SQL Evaluation Methodology. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 351–360.
[15]
Gan Y, Chen X, Xie J, Purver M, Woodward JR, Drake J, and Zhang Q Natural SQL: Making SQL easier to infer from natural language specifications Findings of the Association for Computational Linguistics: EMNLP 2021 2021 2030-2042
[16]
Golden B, Bodin L, Doyle T, and Stewart W Jr Approximate traveling salesman algorithms Operations Research 1980 28 3-part-ii 694-711
[17]
Gotab P, Béchet F, and Damnati G Active learning for rule-based and corpus-based spoken language understanding models IEEE Workshop on Automatic Speech Recognition & Understanding 2009 2009 444-449
[18]
Guo, J., Zhan, Z., Gao, Y., Xiao, Y., Lou, J.-G., Liu, T., & Zhang, D. (2019). Towards Complex Text-to-SQL in Cross-Domain Database with Intermediate Representation. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, 4524–4535.
[19]
He, H., & Choi, J. (2020). Establishing strong baselines for the new decade: Sequence tagging, syntactic and semantic parsing with BERT. The Thirty-Third International Flairs Conference.
[20]
He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 770–778.
[21]
Hong T, Kwon O-W, and Kim Y-K End-to-end task-oriented dialog system through template slot value generation Proc. Interspeech 2020 2020 3900-3904
[22]
Huang, J., Wang, Y., Wang, Y., Dong, Y., & Xiao, Y. (2021). Relation Aware Semi-autoregressive Semantic Parsing for NL2SQL. ArXiv Preprint ArXiv:2108.00804.
[23]
Hwang, W., Yim, J., Park, S., & Seo, M. (2019). A comprehensive exploration on wikisql with table-aware word contextualization. ArXiv Preprint ArXiv:1902.01069.
[24]
Iyer, S., Konstas, I., Cheung, A., Krishnamurthy, J., & Zettlemoyer, L. (2017). Learning a Neural Semantic Parser from User Feedback. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 963–973.
[25]
Kandpal, P., Jasnani, K., Raut, R., & Bhorge, S. (2020). Contextual Chatbot for healthcare purposes (using deep learning). 2020 Fourth World Conference on Smart Trends in Systems, Security and Sustainability (WorldS4), 625–634.
[26]
Khilji, A. F. U. R., Laskar, S. R., Pakray, P., Kadir, R. A., Lydia, M. S., & Bandyopadhyay, S. (2020). Healfavor: Dataset and a prototype system for healthcare chatbot. 2020 International Conference on Data Science, Artificial Intelligence, and Business Analytics (DATABIA), 1–4.
[27]
Lacson RC, Barzilay R, and Long WJ Automatic analysis of medical dialogue in the home hemodialysis domain: Structure induction and summarization Journal of Biomedical Informatics 2006 39 5 541-555
[28]
Lai, T. M., Tran, Q. H., Bui, T., & Kihara, D. (2020). A simple but effective bert model for dialog state tracking on resource-limited systems. ICASSP 2020–2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 8034–8038.
[29]
Lee J, Yoon W, Kim S, Kim D, Kim S, So CH, and Kang J BioBERT: A pre-trained biomedical language representation model for biomedical text mining Bioinformatics 2020 36 4 1234-1240
[30]
Lee, S., Zhu, Q., Takanobu, R., Zhang, Z., Zhang, Y., Li, X., Li, J., Peng, B., Li, X., Huang, M., & others. (2019). ConvLab: Multi-domain end-to-end dialog system platform. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 64–69.
[31]
Lee, D. (2019). Clause-wise and recursive decoding for complex and cross-domain text-to-SQL generation. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), 6047–6053.
[32]
Li Y, Ni P, Peng J, Zhu J, Dai Z, Li G, and Bai X A joint model of clinical domain classification and slot filling based on RCNN and BiGRU-CRF IEEE International Conference on Big Data (big Data) 2019 2019 6133-6135
[33]
Li Y, Ni P, and Chang V Application of deep reinforcement learning in stock trading strategies and stock forecasting Computing 2020 102 6 1305-1322
[34]
Li Y, Qian K, Shi W, and Yu Z End-to-end trainable non-collaborative dialog system Proceedings of the AAAI Conference on Artificial Intelligence 2020 34 8293-8302
[35]
Li, Y., Ni, P., & Chang, V. (2019a). An empirical research on the investment strategy of stock market based on deep reinforcement learning model. COMPLEXIS, 52–58.
[36]
Li, Y., Ni, P., Li, G., & Chang, V. (2020b). Effective piecewise CNN with attention mechanism for distant supervision on relation extraction task. COMPLEXIS, 53–60.
[37]
Lin, X. V., Socher, R., & Xiong, C. (2020). Bridging textual and tabular data for cross-domain text-to-SQL semantic parsing. Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: Findings, 4870–4888.
[38]
Maltoudoglou L, Paisios A, Lenc L, Martínek J, Král P, and Papadopoulos H Well-calibrated confidence measures for multi-label text classification with a large number of labels Pattern Recognition 2022 122
[39]
Melamud, O., & Shivade, C. (2019). Towards automatic generation of shareable synthetic clinical notes using neural language models. Proceedings of the 2nd Clinical Natural Language Processing Workshop, 35–45.
[40]
Mellah Y, Rhouati A, Ettifouri EH, Bouchentouf T, and Belkasmi MG SQL generation from natural language: A sequence-to-sequence model powered by the transformers architecture and association rules Journal of Computer Science 2021 17 5 480-489
[41]
Mrkšić, N., Séaghdha, D. Ó., Wen, T.-H., Thomson, B., & Young, S. (2017). Neural belief tracker: Data-driven dialogue state tracking. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 1777–1788.
[42]
Névéol A, Dalianis H, Velupillai S, Savova G, and Zweigenbaum P Clinical natural language processing in languages other than english: Opportunities and challenges Journal of Biomedical Semantics 2018 9 1 1-13
[43]
Ni P, Li Y, Zhu J, Peng J, Dai Z, Li G, and Bai X Disease diagnosis prediction of emr based on BiGRU-ATT-capsnetwork model IEEE International Conference on Big Data (big Data) 2019 2019 6166-6168
[44]
Ni P, Li Y, and Chang V Recommendation and sentiment analysis based on consumer review and rating International Journal of Business Intelligence Research (IJBIR) 2020 11 2 11-27
[45]
Ni P, Li Y, and Chang V Research on text classification based on automatically extracted keywords International Journal of Enterprise Information Systems (IJEIS) 2020 16 4 1-16
[46]
Ni P, Li G, Hung PC, and Chang V StaResGRU-CNN with CMedLMs: A stacked residual GRU-CNN with pre-trained biomedical language models for predictive intelligence Applied Soft Computing 2021 113
[47]
Ni P, Li Y, Li G, and Chang V A hybrid siamese neural network for natural language inference in cyber-physical systems ACM Transactions on Internet Technology (TOIT) 2021 21 2 1-25
[48]
Ni, P., Li, Y., Li, G., & Chang, V. (2020c). Natural language understanding approaches based on joint task of intent detection and slot filling for IoT voice interaction. Neural Computing and Applications, 1–18.
[49]
Nobel JM, Puts S, Weiss J, Aerts HJ, Mak RH, Robben SG, and Dekker AL T-staging pulmonary oncology from radiological reports using natural language processing: Translating into a multi-language setting Insights into Imaging 2021 12 1 1-11
[50]
Nouri, E., & Hosseini-Asl, E. (2018). Toward scalable neural dialogue state tracking model. ArXiv Preprint ArXiv:1812.00899.
[51]
Oepen, S., Kuhlmann, M., Miyao, Y., Zeman, D., Cinková, S., Flickinger, D., Hajic, J., & Uresova, Z. (2015). Semeval 2015 task 18: Broad-coverage semantic dependency parsing. Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015), 915–926.
[52]
Peng J, Ni P, Zhu J, Dai Z, Li Y, Li G, and Bai X Automatic generation of electronic medical record based on GPT2 model IEEE International Conference on Big Data (big Data) 2019 2019 6180-6182
[53]
Polanyi, L., Culy, C., Van Den Berg, M., Thione, G. L., & Ahn, D. (2004). A rule based approach to discourse parsing. Proceedings of the 5th SIGdial Workshop on Discourse and Dialogue at HLT-NAACL 2004, 108–117.
[54]
Poon, H. (2013). Grounded unsupervised semantic parsing. Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 933–943. https://aclanthology.org/P13-1092
[55]
Ramasamy, L., & Žabokrtskỳ, Z. (2011). Tamil dependency parsing: Results using rule based and corpus based approaches. International Conference on Intelligent Text Processing and Computational Linguistics, 82–95.
[56]
Reddy JEP, Bhuwaneshwar CN, Palakurthi S, and Chavan A AI-IoT based healthcare prognosis interactive system IEEE International Conference for Innovation in Technology (INOCON) 2020 2020 1-5
[57]
Ren, L., Xie, K., Chen, L., & Yu, K. (2018). Towards Universal Dialogue State Tracking. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, 2780–2786.
[58]
Rubin, O., & Berant, J. (2021). SmBoP: Semi-autoregressive bottom-up semantic parsing. Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 311–324.
[59]
Scholak, T., Schucher, N., & Bahdanau, D. (2021). PICARD: Parsing incrementally for constrained auto-regressive decoding from language models. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 9895–9901.
[60]
Van Rooij I, Stege U, and Schactman A Convex hull and tour crossings in the Euclidean traveling salesperson problem: Implications for human performance studies Memory & Cognition 2003 31 2 215-220
[61]
Vilares, D., Gómez-Rodríguez, C., & Alonso, M. A. (2017). Universal, unsupervised (rule-based), uncovered sentiment analysis. Knowledge-Based Systems, 118, 45–55.
[62]
Vinyals, O., Fortunato, M., & Jaitly, N. (2015). Pointer networks. Proceedings of the 28th International Conference on Neural Information Processing Systems-Volume 2, 2692–2700.
[63]
Wang Z, Poon J, and Poon S Tcm translator: A sequence generation approach for prescribing herbal medicines IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 2019 2019 2474-2480
[64]
Wang R, Tang D, Duan N, Wei Z, Huang X-J, Ji J, Cao G, Jiang D, and Zhou M K-Adapter: Infusing knowledge into pre-trained models with adapters Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 2021 1405-1418
[65]
Wang, H., Li, M., & Chen, L. (2020). PG-GSQL: Pointer-generator network with guide decoding for cross-domain context-dependent text-to-SQL generation. Proceedings of the 28th International Conference on Computational Linguistics, 370–380.
[66]
Wen, T., Vandyke, D., Mrkšíc, N., Gašíc, M., Rojas-Barahona, L., Su, P., Ultes, S., & Young, S. (2017). A network-based end-to-end trainable task-oriented dialogue system. 15th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2017-Proceedings of Conference, 1, 438–449.
[67]
Wolf, T., Sanh, V., Chaumond, J., & Delangue, C. (2019). Transfertransfo: A transfer learning approach for neural network based conversational agents. ArXiv Preprint ArXiv:1901.08149.
[68]
Xiong Y, Tang B, Chen Q, Wang X, and Yan J A study on automatic generation of Chinese discharge summary IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 2019 2019 1681-1687
[69]
Xu, X., Liu, C., & Song, D. (2017). Sqlnet: Generating structured queries from natural language without reinforcement learning. arXiv preprint arXiv:1711.04436.
[70]
Yu, T., Li, Z., Zhang, Z., Zhang, R., & Radev, D. (2018a). TypeSQL: Knowledge-based type-aware neural text-to-SQL generation. Proceedings of the 2018a Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers), 588–594.
[71]
Yu, T., Yasunaga, M., Yang, K., Zhang, R., Wang, D., Li, Z., & Radev, D. (2018b). SyntaxSQLNet: Syntax tree networks for complex and cross-domain text-to-SQL task. Proceedings of the 2018b Conference on Empirical Methods in Natural Language Processing, 1653–1663.
[72]
Yu, T., Zhang, R., Yang, K., Yasunaga, M., Wang, D., Li, Z., Ma, J., Li, I., Yao, Q., Roman, S., & others. (2018c). Spider: A large-scale human-labeled dataset for complex and cross-domain semantic parsing and text-to-SQL task. Proceedings of the 2018c Conference on Empirical Methods in Natural Language Processing, 3911–3921.
[73]
Yu, T., Zhang, R., Yasunaga, M., Tan, Y. C., Lin, X. V., Li, S., Er, H., Li, I., Pang, B., Chen, T., & others. (2019). SParC: Cross-domain semantic parsing in context. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, 4511–4523.
[74]
Yuan, Q., Ni, P., Liu, J., Tong, X., Lu, H., Li, G., & Guan, S. (2021). An encoder-decoder architecture with graph convolutional networks for abstractive summarization. 2021 IEEE 4th International Conference on Big Data and Artificial Intelligence (BDAI), 91–97.
[75]
Zelle, J. M., & Mooney, R. J. (1996). Learning to parse database queries using inductive logic programming. Proceedings of the National Conference on Artificial Intelligence, 1050–1055.
[76]
Zhang, R., Yu, T., Er, H., Shim, S., Xue, E., Lin, X. V., Shi, T., Xiong, C., Socher, R., & Radev, D. (2019). Editing-based SQL query generation for cross-domain context-dependent questions. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), 5338–5349.
[77]
Zhang, T., Zhang, L., Ye, W., Li, B., Sun, J., Zhu, X., Zhao, W., & Zhang, S. (2021). Point, disambiguate and copy: Incorporating bilingual dictionaries for neural machine translation. Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), 3970–3979.
[78]
Zhong, V., Xiong, C., & Socher, R. (2017). Seq2sql: Generating structured queries from natural language using reinforcement learning. ArXiv Preprint ArXiv:1709.00103.
[79]
Zhong, V., Xiong, C., & Socher, R. (2018). Global-locally self-attentive encoder for dialogue state tracking. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 1458–1467.
[80]
Zhong, V., Lewis, M., Wang, S. I., & Zettlemoyer, L. (2020). Grounded adaptation for zero-shot executable semantic parsing. Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), 6869–6882.
[81]
Zhu, J., Ni, P., Li, Y., Peng, J., Dai, Z., Li, G., & Bai, X. (2019). A word2vec based on Chinese medical knowledge. IEEE International Conference on Big Data (big Data),2019, 6263–6265.
[82]
Zhu, J., Xia, Y., Wu, L., He, D., Qin, T., Zhou, W., Li, H., & Liu, T.-Y. (2020). Incorporating bert into neural machine translation. ArXiv Preprint ArXiv:2002.06823.

Cited By

View all
  • (2024)Intelligent recruitment decision support combining Transformer model and knowledge graphProceedings of the 2024 International Conference on Machine Intelligence and Digital Applications10.1145/3662739.3672315(596-601)Online publication date: 30-May-2024
  • (2024)Social Robotics Business and ComputingInformation Systems Frontiers10.1007/s10796-023-10413-626:1(1-8)Online publication date: 1-Feb-2024

Index Terms

  1. Knowledge Graph and Deep Learning-based Text-to-GraphQL Model for Intelligent Medical Consultation Chatbot
            Index terms have been assigned to the content through auto-classification.

            Recommendations

            Comments

            Information & Contributors

            Information

            Published In

            cover image Information Systems Frontiers
            Information Systems Frontiers  Volume 26, Issue 1
            Feb 2024
            358 pages

            Publisher

            Kluwer Academic Publishers

            United States

            Publication History

            Published: 06 July 2022
            Accepted: 10 May 2022

            Author Tags

            1. Text-to-GraphQL
            2. Semantic parsing
            3. Knowledge graph
            4. Natural language processing
            5. Deep learning
            6. Health informatics

            Qualifiers

            • Research-article

            Contributors

            Other Metrics

            Bibliometrics & Citations

            Bibliometrics

            Article Metrics

            • Downloads (Last 12 months)0
            • Downloads (Last 6 weeks)0
            Reflects downloads up to 04 Oct 2024

            Other Metrics

            Citations

            Cited By

            View all
            • (2024)Intelligent recruitment decision support combining Transformer model and knowledge graphProceedings of the 2024 International Conference on Machine Intelligence and Digital Applications10.1145/3662739.3672315(596-601)Online publication date: 30-May-2024
            • (2024)Social Robotics Business and ComputingInformation Systems Frontiers10.1007/s10796-023-10413-626:1(1-8)Online publication date: 1-Feb-2024

            View Options

            View options

            Get Access

            Login options

            Media

            Figures

            Other

            Tables

            Share

            Share

            Share this Publication link

            Share on social media