Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3660853.3660921acmotherconferencesArticle/Chapter ViewAbstractPublication PagesaicconfConference Proceedingsconference-collections
research-article

An End-to-End Framework for Multi-Docs Chatbot using Llama2

Published: 23 June 2024 Publication History
  • Get Citation Alerts
  • Abstract

    The evolution of conversational agents, in particular the case with chatbots, has experienced huge boosts in recent years, enabling a variety of tasks and allowing users to enjoy much more interaction. This research presents a sequential model for a Chatbot of multiple documents that is based on the best of the Llama2 mod-el. The document classification framework intends to offer a user-oriented as well as a versatile conversational approach that draws on data from several fields. Through proper implementation of state-of-the-art natural language processing technology, the chatbot can understand users' inquiries, retrieve the required in-formation from the uploaded files, and respond fluently and understandably. It provides document management processes, like file handling of PDF, DOCX, etc., which enables the user to work with almost all file types and formats. And that directly uses Hugging Face Transformers in such processes as text embed-ding and conversational generation. One of the key components of the system is the FAISS tool that allows for vector storage and retrieval keeping the chatbot operating efficiently in the process of searching and retrieving information from vast document collections. In summary, the work provided here lays out the foundations of the multi-doc system which is a powerful tool that can be used to improve the deployments and information search tasks, with the effect of boost-ing user engagement and productivity.

    References

    [1]
    "Llama: Leveraging Language Models for Multi-turn Multi-modal Conversation Assistant" by Jiasen Lu, Stefan Lee, Dhruv Batra, Devi Parikh, Marcus Rohrbach. (Conference: ECCV 2020)
    [2]
    "Improving Conversational Question Answering Systems with Candidate Verification and Entailment Generation" by Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen, Omer Levy, Mike Lewis, Luke Zettlemoyer, Veselin Stoyanov. (Conference: ACL 2020)
    [3]
    "TransferTransfo: A Transfer Learning Approach for Neural Network Based Conversational Agents" by Thomas Scialom, Sylvain Gelly. (Conference: ICLR 2020)
    [4]
    "BERT for Joint Intent Classification and Slot Filling" by Devendra Singh Sachan, Manish Shrivastava, Avneet Kaur, Manohar Kuse. (Conference: ACL 2019)
    [5]
    "Hierarchical Question-Image Co-Attention for Visual Question Answering" by Zhou Yu, Jun Yu, Chenchao Xiang, Jianping Fan, and Dacheng Tao. (Conference: AAAI 2019)
    [6]
    "Attention is All You Need" by Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin. (Conference: NeurIPS 2017)
    [7]
    "Multi-view Conversational Context Embedding for Response Retrieval in Retrieval-based Chatbots" by Zhen Xu, Bingquan Liu, Bolei He, Wu Yang, Shu Wu, Liang Wang. (Conference: AAAI 2019)
    [8]
    "End-to-End Task-Completion Neural Dialogue Systems" by Jason D. Williams, Kavosh Asadi, Geoffrey Zweig. (Conference: ACL 2017)
    [9]
    "Deal or No Deal? End-to-End Learning for Negotiation Dialogues" by Mike Lewis, Denis Yarats, Yann Dauphin, Devi Parikh, Dhruv Batra. (Conference: EMNLP 2017)
    [10]
    "Convolutional Sequence to Sequence Learning" by Jonas Gehring, Michael Auli, David Grangier, Denis Yarats, Yann N. Dauphin. (Conference: ICML 2017)
    [11]
    "Neural Generative Question Answering" by Bowen Li, Jianfeng Gao, Michel Galley, Chris Brockett, Xiaodong Liu, Bill Dolan. (Conference: ACL 2016)
    [12]
    "Neural Responding Machine for Short-Text Conversation" by Lifeng Shang, Zhengdong Lu, Hang Li. (Conference: ACL 2015)
    [13]
    "Towards End-to-End Learning for Dialog State Tracking and Management using Deep Reinforcement Learning" by Xiujun Li, Zachary C. Lipton, Bhuwan Dhingra, Lihong Li, Jianfeng Gao, Yun-Nung Chen, Faisal Ahmed, Li Deng. (Conference: ACL 2017)
    [14]
    "Task-Oriented Dialog Systems that Consider Multiple Appropriate Responses under the Same Context" by Hiroaki Sugiyama, Yutaka Matsuo. (Conference: AAAI 2017)
    [15]
    "Neural Belief Tracker: Data-Driven Dialogue State Tracking" by Pei-Hao Su, Milica Gasic, Nikola Mrksic, Lina M. Rojas-Barahona, Stefan Ultes, David Vandyke, Tsung-Hsien Wen, Steve Young. (Conference: SIGDIAL 2016)
    [16]
    "End-to-End Memory Networks" by Sainbayar Sukhbaatar, Arthur Szlam, Jason Weston, Rob Fergus. (Conference: NIPS 2015)
    [17]
    "Towards End-to-End Learning for Dialog State Tracking and Management using Deep Reinforcement Learning" by Xiujun Li, Zachary C. Lipton, Bhuwan Dhingra, Lihong Li, Jianfeng Gao, Yun-Nung Chen, Faisal Ahmed, Li Deng. (Conference: ACL 2017)
    [18]
    "Neural Belief Tracker: Data-Driven Dialogue State Tracking" by Pei-Hao Su, Milica Gasic, Nikola Mrksic, Lina M. Rojas-Barahona, Stefan Ultes, David Vandyke, Tsung-Hsien Wen, Steve Young. (Conference: SIGDIAL 2016)
    [19]
    "End-to-End Memory Networks" by Sainbayar Sukhbaatar, Arthur Szlam, Jason Weston, Rob Fergus. (Conference: NIPS 2015)
    [20]
    "Neural Machine Translation by Jointly Learning to Align and Translate" by Dzmitry Bahdanau, Kyunghyun Cho, Yoshua Bengio. (Conference: ICLR 2015)

    Index Terms

    1. An End-to-End Framework for Multi-Docs Chatbot using Llama2

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Other conferences
      AICCONF '24: Proceedings of the Cognitive Models and Artificial Intelligence Conference
      May 2024
      367 pages
      ISBN:9798400716928
      DOI:10.1145/3660853
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 23 June 2024

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. Chatbot
      2. Generative AI
      3. LLM
      4. Natural Language Processing

      Qualifiers

      • Research-article
      • Research
      • Refereed limited

      Funding Sources

      • We thank to receive the funding for this research work from RSF (Research Support funding) from Symbiosis Institute of Technology, Symbiosis International (Deemed University) (SIU),Lavale Campus, Pune, Maharashtra,412115, India.

      Conference

      AICCONF '24

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • 0
        Total Citations
      • 9
        Total Downloads
      • Downloads (Last 12 months)9
      • Downloads (Last 6 weeks)9
      Reflects downloads up to 26 Jul 2024

      Other Metrics

      Citations

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      HTML Format

      View this article in HTML Format.

      HTML Format

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media