research-article

An End-to-End Framework for Multi-Docs Chatbot using Llama2

Authors:

Tanupriya Choudhury,

Turgut OzsevenAuthors Info & Claims

AICCONF '24: Proceedings of the Cognitive Models and Artificial Intelligence Conference

Pages 232 - 236

https://doi.org/10.1145/3660853.3660921

Published: 23 June 2024 Publication History

Abstract

The evolution of conversational agents, in particular the case with chatbots, has experienced huge boosts in recent years, enabling a variety of tasks and allowing users to enjoy much more interaction. This research presents a sequential model for a Chatbot of multiple documents that is based on the best of the Llama2 mod-el. The document classification framework intends to offer a user-oriented as well as a versatile conversational approach that draws on data from several fields. Through proper implementation of state-of-the-art natural language processing technology, the chatbot can understand users' inquiries, retrieve the required in-formation from the uploaded files, and respond fluently and understandably. It provides document management processes, like file handling of PDF, DOCX, etc., which enables the user to work with almost all file types and formats. And that directly uses Hugging Face Transformers in such processes as text embed-ding and conversational generation. One of the key components of the system is the FAISS tool that allows for vector storage and retrieval keeping the chatbot operating efficiently in the process of searching and retrieving information from vast document collections. In summary, the work provided here lays out the foundations of the multi-doc system which is a powerful tool that can be used to improve the deployments and information search tasks, with the effect of boost-ing user engagement and productivity.

References

[1]

"Llama: Leveraging Language Models for Multi-turn Multi-modal Conversation Assistant" by Jiasen Lu, Stefan Lee, Dhruv Batra, Devi Parikh, Marcus Rohrbach. (Conference: ECCV 2020)

[2]

"Improving Conversational Question Answering Systems with Candidate Verification and Entailment Generation" by Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen, Omer Levy, Mike Lewis, Luke Zettlemoyer, Veselin Stoyanov. (Conference: ACL 2020)

[3]

"TransferTransfo: A Transfer Learning Approach for Neural Network Based Conversational Agents" by Thomas Scialom, Sylvain Gelly. (Conference: ICLR 2020)

[4]

"BERT for Joint Intent Classification and Slot Filling" by Devendra Singh Sachan, Manish Shrivastava, Avneet Kaur, Manohar Kuse. (Conference: ACL 2019)

[5]

"Hierarchical Question-Image Co-Attention for Visual Question Answering" by Zhou Yu, Jun Yu, Chenchao Xiang, Jianping Fan, and Dacheng Tao. (Conference: AAAI 2019)

[6]

"Attention is All You Need" by Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin. (Conference: NeurIPS 2017)

[7]

"Multi-view Conversational Context Embedding for Response Retrieval in Retrieval-based Chatbots" by Zhen Xu, Bingquan Liu, Bolei He, Wu Yang, Shu Wu, Liang Wang. (Conference: AAAI 2019)

[8]

"End-to-End Task-Completion Neural Dialogue Systems" by Jason D. Williams, Kavosh Asadi, Geoffrey Zweig. (Conference: ACL 2017)

[9]

"Deal or No Deal? End-to-End Learning for Negotiation Dialogues" by Mike Lewis, Denis Yarats, Yann Dauphin, Devi Parikh, Dhruv Batra. (Conference: EMNLP 2017)

[10]

"Convolutional Sequence to Sequence Learning" by Jonas Gehring, Michael Auli, David Grangier, Denis Yarats, Yann N. Dauphin. (Conference: ICML 2017)

[11]

"Neural Generative Question Answering" by Bowen Li, Jianfeng Gao, Michel Galley, Chris Brockett, Xiaodong Liu, Bill Dolan. (Conference: ACL 2016)

[12]

"Neural Responding Machine for Short-Text Conversation" by Lifeng Shang, Zhengdong Lu, Hang Li. (Conference: ACL 2015)

[13]

"Towards End-to-End Learning for Dialog State Tracking and Management using Deep Reinforcement Learning" by Xiujun Li, Zachary C. Lipton, Bhuwan Dhingra, Lihong Li, Jianfeng Gao, Yun-Nung Chen, Faisal Ahmed, Li Deng. (Conference: ACL 2017)

[14]

"Task-Oriented Dialog Systems that Consider Multiple Appropriate Responses under the Same Context" by Hiroaki Sugiyama, Yutaka Matsuo. (Conference: AAAI 2017)

[15]

"Neural Belief Tracker: Data-Driven Dialogue State Tracking" by Pei-Hao Su, Milica Gasic, Nikola Mrksic, Lina M. Rojas-Barahona, Stefan Ultes, David Vandyke, Tsung-Hsien Wen, Steve Young. (Conference: SIGDIAL 2016)

[16]

"End-to-End Memory Networks" by Sainbayar Sukhbaatar, Arthur Szlam, Jason Weston, Rob Fergus. (Conference: NIPS 2015)

[17]

"Towards End-to-End Learning for Dialog State Tracking and Management using Deep Reinforcement Learning" by Xiujun Li, Zachary C. Lipton, Bhuwan Dhingra, Lihong Li, Jianfeng Gao, Yun-Nung Chen, Faisal Ahmed, Li Deng. (Conference: ACL 2017)

[18]

"Neural Belief Tracker: Data-Driven Dialogue State Tracking" by Pei-Hao Su, Milica Gasic, Nikola Mrksic, Lina M. Rojas-Barahona, Stefan Ultes, David Vandyke, Tsung-Hsien Wen, Steve Young. (Conference: SIGDIAL 2016)

[19]

"End-to-End Memory Networks" by Sainbayar Sukhbaatar, Arthur Szlam, Jason Weston, Rob Fergus. (Conference: NIPS 2015)

[20]

"Neural Machine Translation by Jointly Learning to Align and Translate" by Dzmitry Bahdanau, Kyunghyun Cho, Yoshua Bengio. (Conference: ICLR 2015)

Index Terms

An End-to-End Framework for Multi-Docs Chatbot using Llama2
1. Information systems
  1. Information retrieval
    1. Retrieval tasks and goals
      1. Document filtering

Recommendations

Chatbot with Touch and Graphics: An Interaction of Users for Emotional Expression and Turn-taking
CUI '20: Proceedings of the 2nd Conference on Conversational User Interfaces

Use of chatbots for emotional exchange is recently increasing in various domains. However, as existing chatbots have been considered in terms of natural language processing techniques for interaction with text-based chatting, chatbot interaction with ...
Action-a-Bot: Exploring Human-Chatbot Conversations for Actionable Instruction Giving and Following
CSCW'22 Companion: Companion Publication of the 2022 Conference on Computer Supported Cooperative Work and Social Computing

Conversation serves as one critical mechanism for knowledge-sharing and instruction-giving in collaborative work. Conversation allows people to take turns to make contributions, plan joint actions, align shared understanding of work status and resolve ...
How to Build a Chatbot: Chatbot Framework and its Capabilities
ICMLC '18: Proceedings of the 2018 10th International Conference on Machine Learning and Computing

Artificial Intelligence continues to grow in popularity on various industrial platforms, becoming especially prominent in Chatbot technology. A great deal of recent research was focused on social and assistive Chatbot, and its technology has been ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

AICCONF '24: Proceedings of the Cognitive Models and Artificial Intelligence Conference

May 2024

367 pages

ISBN:9798400716928

DOI:10.1145/3660853

Copyright © 2024 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 23 June 2024

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Funding Sources

We thank to receive the funding for this research work from RSF (Research Support funding) from Symbiosis Institute of Technology, Symbiosis International (Deemed University) (SIU),Lavale Campus, Pune, Maharashtra,412115, India.

Conference

AICCONF '24

AICCONF '24: Cognitive Models and Artificial Intelligence Conference

May 25 - 26, 2024

İstanbul, Turkiye

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
9
Total Downloads

Downloads (Last 12 months)9
Downloads (Last 6 weeks)9

Reflects downloads up to 26 Jul 2024

Other Metrics

View Author Metrics

Citations

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

HTML Format

View this article in HTML Format.

Media

Figures

Other

Tables

View Table of Contents