Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
0% found this document useful (0 votes)
27 views

Smart_AI_Voice_Assistant_through_Generative_Text_Transformer_and_NLP_Implementation_in_Python

Its a Research Paper

Uploaded by

jales34489
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
27 views

Smart_AI_Voice_Assistant_through_Generative_Text_Transformer_and_NLP_Implementation_in_Python

Its a Research Paper

Uploaded by

jales34489
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 6

2024 4th International Conference on Intelligent Technologies (CONIT)

Karnataka, India. Jun 21-23, 2024

Smart AI Voice Assistant through Generative Text


Transformer and NLP Implementation in Python
Mallela Uday Kiran Busi Hemanth Reddy
Devesh Bajpai
School of Computer Science and School of Computer Science and
School of Computer Science and
2024 4th International Conference on Intelligent Technologies (CONIT) | 979-8-3503-4990-0/24/$31.00 ©2024 IEEE | DOI: 10.1109/CONIT61985.2024.10626557

Engineering Engineering
Engineering
JAIN (Deemed-to-be University) JAIN (Deemed-to-be University)
JAIN (Deemed-to-be University)
Ramanagara, India. Ramanagara, India.
Ramanagara, India.
udaykiran9123@gmail.com hemanthreddy00605@gmail.com
bajpaidevesh604@gmail.com
Suresh Kumar Natarajan
School of Computer Science and
Engineering
JAIN (Deemed-to-be University)
Ramanagara, India.
sureshkumar0707@gmail.com

Abstract—In our project, we employed Natural Language


Processing (NLP), Artificial Intelligence (AI), Generative Text A. Background
Transformer (GTT), and Speech Recognition technologies. The proliferation of AI applications has witnessed a
Through the utilization of these advanced technologies, we paradigm shift, especially in the realm of voice assistants.
aimed to enhance the capabilities and functionalities of our Smart AI emerges as a noteworthy advancement, not only
system. NLP facilitated improved language understanding and responding to predefined commands but comprehending and
interaction, while AI contributed to the overall intelligence and generating human-like responses through the amalgamation of
decision-making processes. GTT played a crucial role in NLP and ASR. This integration allows for a more natural and
generating coherent and contextually relevant text, adding a context-aware interaction, significantly enhancing the user
layer of sophistication to our applications. Additionally, the experience. As technology progresses, the demand for
integration of speech recognition technology enabled seamless
intelligent voice assistants that seamlessly integrate into our
interaction through voice commands, enhancing the overall user
daily lives becomes more pronounced, and Smart AI
experience. By leveraging these cutting-edge technologies
synergistically, our project aimed to deliver a more intelligent, addresses this need with sophistication.
efficient, and user-friendly system, showcasing the
transformative potential of combining NLP, AI, GTT, and B. Technological Foundations:
speech recognition in an integrated framework. This potent At the core of Smart AI lies the robust combination of NLP
blend transcends mere functionality, aiming to transform user and ASR technologies. NLP empowers the assistant to
experience and pave the way for intelligent companions understand and interpret human language with a nuanced
seamlessly integrated into our lives. By leveraging the understanding of context, sentiment, and intent. This goes
transformative power of this unified framework, we envision a beyond mere keyword recognition, enabling Smart AI to
future where technology not only serves us but also empowers
engage in meaningful conversations and execute complex
and enriches our interactions with the world around us.
tasks based on user commands. Complementing this, ASR
Keywords—NLP, AI, GTT, Speech Recognition, Contextual technology equips Smart AI with the capability to convert
Relevance, Unified Framework, Empowerment spoken language into text, forming the backbone of its
auditory perception. Together, these technologies form a
cohesive foundation that distinguishes Smart AI as an
I. INTRODUCTION advanced voice assistant.
In the dynamic landscape of artificial intelligence (AI), the
integration of advanced technologies has given rise to C. Automatic Speech Recognition (ASR):
innovative solutions, and one such groundbreaking The ASR component of Smart AI represents a
development is the Smart AI Advanced Voice Assistant. This breakthrough in voice recognition systems. By employing
research paper delves into the intricate design, development, sophisticated algorithms, it accurately transcribes spoken
and functionalities of Smart AI. This intelligent voice assistant words into textual form, laying the groundwork for precise
leverages the synergies of Natural Language Processing and effective communication. This facilitates seamless
(NLP), Automatic Speech Recognition (ASR), and cutting- interaction between users and the voice assistant, overcoming
edge AI algorithms. In an era where human-computer challenges associated with variations in accent, language, and
interaction has transcended traditional boundaries, Smart AI pronunciation. The accuracy and efficiency of ASR contribute
stands as a testament to the transformative potential of significantly to the overall efficacy of Smart AI, making it a
converging technologies. versatile and user-friendly voice assistant.

979-8-3503-4990-0/24/$31.00 ©2024 IEEE 1


Authorized licensed use limited to: REVA UNIVERSITY. Downloaded on October 23,2024 at 14:11:12 UTC from IEEE Xplore. Restrictions apply.
D. Natural Language Processing (NLP): client for sending and receiving mail. The paper emphasizes
NLP plays a pivotal role in elevating Smart AI beyond Speech Text and Speech Recognition technologies alongside
conventional voice assistants. Through advanced language Artificial Intelligence in creating technical bots. Overall, it
models and semantic understanding, NLP enables Smart AI to focuses on the integration of voice control with email
not only comprehend the explicit meaning of user queries but communication, showcasing advancements in AI-driven
also infer implicit nuances. This proficiency in contextual interactions [3]. The paper discusses the integration of voice
understanding allows the assistant to generate responses that assistants into daily life, emphasizing their role in simplifying
align with the user's intent, fostering a more intuitive and tasks through speech-to-text and text-to-speech conversion.
natural interaction. The synergy between ASR and NLP Popular voice assistants like Amazon Alexa, Apple Siri, and
ensures that Smart AI transcends mere speech recognition, Google Voice Search are highlighted for their widespread
engaging in meaningful conversations that mirror human adoption and reliance on internet connectivity. The proposed
communication patterns. model aims to operate offline, offering efficient task
management without the need for internet access. It
emphasizes the importance of voice communication between
E. Research Objectives:
users and their devices, showcasing the evolution of
The primary objectives of this research paper are to dissect technology towards smarter personal assistants. Overall, the
the technological underpinnings of Smart AI and evaluate its paper explores the landscape of voice assistant technology and
performance in real-world scenarios. By conducting a introduces a novel approach to task management without
comprehensive analysis of the NLP and ASR components, we internet dependency [4].
aim to provide insights into the strengths and limitations of
Smart AI as an advanced voice assistant. Additionally, we The paper introduces a system proposal for enhancing the
seek to explore potential enhancements and applications, capabilities of the Jarvis AI voice assistant. It employs
contributing to the ongoing discourse on the evolution of AI- techniques like speech recognition, natural language
driven voice interfaces. Through empirical evaluation and processing, and machine learning algorithms to enable Jarvis
critical examination, this research endeavours to shed light on to perform diverse tasks via voice commands. These tasks
the transformative impact of Smart AI and its implications for include email sending, appointment scheduling, and phone
the future of human-computer interaction. calls. Personalized user profiles will be integrated for a
tailored user experience, alongside features like sentiment
analysis and emotion recognition to better understand user
II. LITERATURE SURVEY
needs. Implementation will be carried out using Python
The paper discusses the proliferation of Artificial programming language and various open-source libraries and
Intelligence (AI), particularly in the realm of Natural APIs. Overall, the paper outlines a comprehensive approach
Language Processing (NLP), focusing on voice assistants. It to advancing Jarvis's functionality and user interaction in the
highlights their integration with cloud computing and realm of AI voice assistants [5]. The paper discusses the rising
widespread adoption in households, schools, and universities. demand for efficient technology interaction, focusing on
Voice assistants are portrayed as a transformative innovation Voice-based Virtual Assistants powered by speech
in AI, initially popularized through smartphones and laptops, recognition and AI. It highlights the utilization of the GPT-3
now expanding to home automation systems and smart language processing model by OpenAI for intelligent
speakers. The paper aims to explore the evolving landscape of responses. Users can interact with the virtual assistant through
human-computer interaction facilitated by voice assistants. spoken commands, functioning as a chat interface. Security
However, it also promises to address the challenges and measures including encryption and authentication are
limitations inherent in current voice assistant technologies. integrated to protect personal data from unauthorized access.
Overall, it suggests that while voice assistants have immense These measures safeguard privacy and prevent exposure of
potential to positively impact people's lives, there are sensitive information like bank account details. Overall, the
significant hurdles that need to be addressed for optimal paper emphasizes the convergence of speech recognition, AI,
functionality and acceptance [1]. The paper explores the and security features in advancing Virtual Assistant
growing significance of voice control in transforming daily technology for enhanced user experience and data protection
life, particularly through voice assistants commonly found in [6].
smartphones and laptops. It delves into the mechanics of AI-
based voice assistants, which recognize human speech and The paper explores the evolving landscape of Virtual
respond through integrated voices. The process involves Assistants, highlighting advancements in voice recognition
capturing audio from a microphone, converting it to text, and and natural language processing [7]. It predicts increased
utilizing the GTTS (Google Text-to-Speech) engine to render integration of virtual assistants in commercial activities as
the text into an English audio file. This audio file is then speech recognition technology progresses. The main objective
played using Python programming language's play sound of developing Voice-Based Virtual Assistants is to enhance
package. Overall, the paper provides insight into the technical convenience and provide personalized assistance to users.
workings of voice assistants and their integration into daily Virtual assistants learn from user interactions, improving their
routines [2]. ability to anticipate user needs over time. They are utilized for
tasks such as calendar management, information retrieval, and
The paper discusses the rising prominence of voice home automation. These technologies rely on massive
assistants and aims to develop an AI-based email voice volumes of data to power AI platforms like machine learning
assistant system. It highlights the process of converting voice and speech recognition. Speech recognition, now a standard
input to text and sending it as an email message using Python feature on smartphones and wearables, has undergone
in PyCharm IDE. Authentication involves providing the user's significant advancements in dictation and voice commands.
email ID and Gmail password. The SMTP protocol facilitates The paper emphasizes the design of efficient speech
email transmission over the internet, with MTA serving as the recognition systems for mobile devices, aiming for accuracy

2
Authorized licensed use limited to: REVA UNIVERSITY. Downloaded on October 23,2024 at 14:11:12 UTC from IEEE Xplore. Restrictions apply.
and low latency [8]. The paper discusses the ease of creating x as the input mathematical expression provided by the
voice assistants in Python and customizing them to meet user.
various needs. It highlights Python's Speech Recognition API
f(x) as the function representing the computation
for converting speech to text, enabling convenient tasks like
performed on the input expression x.
email sending, PDF reading, and WhatsApp messaging
through voice commands. Additionally, it emphasizes the y as the output/result of the computation.
efficiency of voice assistants in performing everyday tasks,
suggesting a potential role in reducing workload and saving Based on the description provided, we can formulate a
time. The project outlined in the paper includes various hypothetical mathematical equation as follows:
functionalities such as opening applications, playing music, y=f(x)
searching Wikipedia, and accessing weather forecasts [9].
While acknowledging its limitations, the paper underscores This equation represents the transformation of the input
the broader goal of artificial intelligence to perform tasks as mathematical expression x into an output/result y through the
efficiently as, or better than, humans. Overall, it explores the function f, which encapsulates the computational process
development of virtual assistants as a step towards achieving facilitated by the Wolfram Alpha API. While this equation
artificial intelligence's overarching objective of automating serves as a generic representation of the mathematical
and enhancing human tasks [10]. computation process within the assistant, specific details of
the function f and the nature of x would need to be elaborated
based on the actual implementation and capabilities of the
III. METHODOLOGY assistant.
The proposed system is designed to operate through voice
commands provided by the user, offering a versatile and The extensibility of the system is highlighted by the
interactive assistant that can perform a variety of tasks. This acknowledgement that coding choices play a pivotal role.
sophisticated assistant employs cutting-edge technology to Depending on the coding approach taken, the system can be
open installed applications, conduct searches on Google, tailored to process data in specific ways or incorporate
Wikipedia, and YouTube, perform complex mathematical additional functionalities. This adaptability ensures that the
computations, and more. The flexibility of the system allows assistant can evolve and grow to meet changing user needs and
for further customization and the addition of functionalities as technological advancements. in conclusion, the proposed
needed, depending on the coding approach. One of the key system represents a sophisticated voice-controlled assistant
features of the system is its utilization of Google's Speech that seamlessly integrates advanced speech recognition,
Recognition API for voice input [11]. This API enables the information retrieval from prominent online sources,
assistant to accurately transcribe spoken words into text, mathematical computation capabilities, and audio playback
laying the foundation for seamless communication between functionalities. Through the strategic use of Google's APIs
the user and the system. The use of voice input enhances user and Wolfram Alpha API, the system delivers a comprehensive
experience, making interactions more natural and convenient. and user-friendly experience, setting the stage for future
enhancements and developments in voice-controlled
In tandem with voice input, the system leverages technology.
Google's Text-to-Speech API for generating spoken responses
or converting textual information into audible content [12]. A. Project Design
This bidirectional communication approach not only allows
The project design draws inspiration from various open-
the assistant to understand user queries but also ensures that
source user interfaces (UIs) and incorporates insights gained
the user receives information in a clear and easily digestible
through reviewing existing Virtual Personal Assistants
manner. For information retrieval, the assistant taps into the
(VPAs). Valuable lessons from predecessors contribute to
vast resources available on the internet. It can perform
creating a unique variant for the assistant, setting it apart from
searches on Google and Wikipedia, offering users real-time
the current landscape. In addition to design considerations, the
and comprehensive information based on their inquiries.
development process involved careful planning. The creators
Additionally, the integration with YouTube enables the
established a roadmap covering overall design principles to
assistant to fetch and present relevant video content,
detailed function implementation, ensuring a cohesive and
expanding the range of available resources for user queries.
well-thought-out development process. Continuous
Mathematical problem-solving is a core capability of refinement is crucial. The implementation and improvement
the assistant, facilitated by the Wolfram Alpha API. By phases are considered essential to make the project user-
incorporating this API, the system can handle a diverse array friendly. This iterative approach allows the team to address
of mathematical expressions and computations, providing challenges, incorporate user feedback, and enhance the overall
users with accurate and timely answers to their mathematical experience, showcasing a commitment to delivering a product
queries. This feature enhances the utility of the assistant for that exceeds expectations.
both educational and practical purposes. To enrich the user Extensive research using reputable sources such as Google
experience further, the system includes a play sound API
and Wikipedia provides ideas and insights. This exploration
capable of playing audio files stored in the system. The
forms a foundation for understanding existing projects,
compatibility with supported audio formats ensures that users
designs, and best practices, contributing to informed decisions
can enjoy a variety of auditory content, whether it be music,
and features aligned with user needs. Comparisons with
notifications, or other sound-related features.
existing models are critical. Analysing design structures
We can formulate an equation representing a mathematical identifies strengths, weaknesses, and areas for improvement.
problem-solving scenario. The goal is not just replication but innovation, creating a
solution that stands out in usability and efficiency.
Let's denote, Comparative analysis refines the project for better user

3
Authorized licensed use limited to: REVA UNIVERSITY. Downloaded on October 23,2024 at 14:11:12 UTC from IEEE Xplore. Restrictions apply.
satisfaction. Implementation requires additional packages for 6. Pyaudio: This voice engine in Python enhances the
functionality. Carefully chosen and integrated into the system, overall voice processing capabilities of the assistant
these packages contribute to overall robustness and versatility. [16].
Their strategic inclusion ensures adaptability and capability to
In summary, the creation of our virtual assistant is a
evolve with emerging technologies and user needs.
dynamic and evolving process. The inclusion of technologies
In summary, the project's design and implementation are such as GTTS, a diverse set of commands, and essential
guided by a thorough analysis of existing UIs and VPAs. The packages like Selenium, Wolfram API [17], Play Sound API,
team, armed with a specific roadmap, embraces continuous and Pyaudio demonstrates our commitment to building a
improvement to refine features and enhance the user sophisticated and versatile assistant [18]. As we progress, the
experience. Drawing inspiration from external sources, emphasis remains on continuous learning and improvement,
conducting meticulous research, and comparing with existing ensuring that our virtual assistant stays at the forefront of
models ensures the project meets industry standards while innovation and user satisfaction.
introducing innovative elements, making it unique and user-
friendly. The integration of additional packages fortifies IV. PROPOSED SYSTEM
capabilities, paving the way for a dynamic Virtual Personal
Assistant. This The project's architecture depicts the flow of control
in the system. It also shows the required software and
B. Innovation In Project hardware used in the project.
Embarking on the creation of a virtual assistant is not only
enjoyable but also represents an exciting venture into the
realm of learning about working principles, algorithms, and
more. The process involves understanding the intricate flow
that contributes to making a virtual assistant efficient, as
depicted in the attached image. Our journey began with
enthusiasm, and the learning experience is anticipated to be
ongoing. A pivotal element in our virtual assistant is the
integration of Google Text-to-Speech (GTTS), commonly
known as GTTS [13]. This technology plays a crucial role in
Fig. 1. Natural Language Processing
converting text information into speech, enabling our assistant
to both produce output and record user input. By incorporating We started this project by dividing it into phases. It has had
GTTS, our virtual assistant gains the ability to communicate a total of 3 Phases each one consisting of different domains
audibly, enhancing the overall user experience. and functionalities. Below is the pictorial representation of
Furthermore, we have implemented a set of diverse that A.
commands, detailed in the implementation section. The How it works: When given a voice command or a text-
flexibility of our system allows us to process information based command the assistant processes the information using
based on these commands, adapting to user needs. Depending text to speech and speech recognition system, and then it
on the coding approach, we can fine-tune the assistant's performs the task accordingly if it is programmed for that task
functionality or seamlessly introduce additional features. Have a look at how it works when we ask the assistant about
As mentioned earlier, a sound package is an integral part “Today’s Weather” (it just shows the work process): Fig1.2
of our project. This package facilitates the playback of saved
sounds on the computer. However, it is important to note that
this functionality is subject to the limitation of supported
formats.
For the successful implementation of our virtual assistant,
several additional packages are required:
1. GTTS (Google Text-to-Speech): This package
processes text information into speech, enabling the
assistant to communicate audibly. Fig. 2. Process of weather details from the internet (a) call for the weather
by assistant (b) process of weather details from the internet
2. Speech Recognition System: Utilized to recognize
commands given to the assistant and process the
information accordingly [14].
3. Selenium: Essential for web-based tasks, particularly
for efficient searching capabilities [15].
4. Wolfram API: Integrated for calculations and related
functions, enhancing the assistant's capabilities in
handling mathematical queries.
5. Play Sound API: This package is crucial for playing
saved sounds on the system, contributing to a more
dynamic and engaging user interaction.
Fig. 3. Virtual Assistant (Requirements)

4
Authorized licensed use limited to: REVA UNIVERSITY. Downloaded on October 23,2024 at 14:11:12 UTC from IEEE Xplore. Restrictions apply.
A. Comparisons executed searches, and displayed results. The implementation
The project underwent thorough research and relies on Smart AI, a key contributor to the assistant's
development, focusing initially on meeting the fundamental accuracy. As demonstrated in the figures, the assistant adeptly
requirements of a Virtual Personal Assistant (VPA). responds to commands such as requesting temperature
Progressing from the basics, our team diligently worked to information, showcasing its dynamic functionality. Simple
enhance the assistant's efficiency and capabilities. Fig1.3, the greetings, like 'Hi' or 'Hello,' elicit friendly responses from the
accompanying visual representation, succinctly illustrates the assistant, fostering a user-friendly interaction. The visual
essential requirements critical for the Virtual Assistant's representations offer insights into the seamless execution of
functionality. By systematically addressing these core needs, tasks and the assistant's adept handling of diverse commands,
we aim to ensure the assistant not only complies with industry highlighting its versatility and responsiveness.
standards but surpasses user expectations. The graph serves as
a visual guide, outlining key components vital for a robust
VPA. As the project evolves, continual efforts are directed
towards maintaining these fundamental requirements while
actively seeking opportunities for improvement, ensuring that
our Virtual Assistant excels in terms of functionality, user
experience, and overall performance. Fig. 5. Response of assistant “To Take a Screeshot”

B. Sequence Diagram
A sequence diagram visually represents the chronological
order of system operations and data processing. It delineates
the specific sequence, illustrating steps such as text
recognition followed by subsequent processing. This
graphical depiction provides a clear understanding of the
sequential flow within the system, outlining the systematic
execution of tasks. The diagram serves as a valuable tool to
comprehend the orderly progression of actions, enhancing
insights into the functioning of the system and its data
processing procedures.
Fig. 6. Asking Assistant for the Opening YouTube

Fig. 7. To Track the Location

VI. CONCLUSION
In conclusion, Smart AI represents a groundbreaking
achievement in advanced artificial intelligence, ushering in a
new era of human-computer interaction. Its incorporation of
cutting-edge technologies like machine learning and natural
language processing demonstrates unparalleled
sophistication, impacting diverse industries and reshaping
productivity landscapes. While providing significant benefits,
the rise of potent AI assistants like Smart AI necessitates a
Fig. 4. Sequence Diagram careful exploration of ethical considerations, spanning privacy
concerns to responsible deployment. Navigating Smart AI's
V. RESULTS AND DISCUSSION transformative potential requires striking a delicate balance
between technological advancement and ethical safeguards,
This section illustrates the project's implementation, urging collective efforts from researchers, developers, and
encompassing essential elements and visual aspects like policymakers. Beyond showcasing AI capabilities, Smart AI
interface and themes. Seeking to understand the assistant's catalyzes ongoing discussions on responsible AI
capabilities, inquiries were made, revealing its multifunctional development, marking a crucial step in harnessing the full
nature. The assistant can provide weather reports, read news, potential of these technologies. As we embrace this
open applications, execute internet searches, and more. The transformative journey, Smart AI sets the stage for a future
input and corresponding results are visually depicted in Fig where advanced AI harmoniously integrates with ethical
1.5(a), Fig1.5(b), Fig1.5(c), and Fig1.5(d). Notably, the principles, shaping a positive trajectory for the coexistence of
assistant autonomously launched the Chrome application, intelligent systems in our society.

5
Authorized licensed use limited to: REVA UNIVERSITY. Downloaded on October 23,2024 at 14:11:12 UTC from IEEE Xplore. Restrictions apply.
Future enhancements of Smart AI can significantly elevate Renewable Systems (ICEARS), Tuticorin, India, 2023, pp. 822-827,
its functionality and user experience. Integrating advanced doi: 10.1109/ICEARS56392.2023.10085043.
natural language processing (NLP) algorithms would improve [7] Fanni, S. C., Febi, M., Aghakhanyan, G., & Neri, E. (2023). Natural
language processing. In Introduction to Artificial Intelligence (pp. 87-
the assistant's understanding of nuanced language, resulting in 99). Cham: Springer International Publishing.
more accurate responses. Enhancing the assistant's ability to [8] S. P. Yadav, A. Gupta, C. Dos Santos Nascimento, V. Hugo C. de
manage multi-turn conversations would also facilitate Albuquerque, M. S. Naruka and S. Singh Chauhan, "Voice-Based
smoother, more natural interactions. Incorporating predictive Virtual-Controlled Intelligent Personal Assistants," 2023 International
analytics could enable Smart AI to offer personalized Conference on Computational Intelligence, Communication
recommendations and automate routine tasks, tailoring its Technology and Networking (CICTN), Ghaziabad, India, 2023, pp.
563-568, doi: 10.1109/CICTN57981.2023.10141447..
functionality to individual user needs. Additionally,
[9] He, T., Jazizadeh, F., & Arpan, L. (2022). AI-powered virtual assistants
integrating augmented reality (AR) capabilities could provide nudging occupants for energy saving: proactive smart speakers for
a more immersive experience, useful for virtual tours, remote HVAC control. Building Research & Information, 50(4), 394-409.
assistance, and educational tools. Ethical considerations [10] Dalal, P., Sharma, T., Garg, Y., Gambhir, P., & Khandelwal, Y. (2023,
remain crucial. Future versions should include robust privacy March). “JARVIS”-AI Voice Assistant. In 2023 1st International
features to ensure user data protection and transparency in data Conference on Innovations in High-Speed Communication and Signal
processing. By pursuing these enhancements, Smart AI can Processing (IHCSP) (pp. 273-280). IEEE.
evolve to provide more sophisticated, personalized, and [11] Lin, P. C., Yankson, B., Chauhan, V., & Tsukada, M. (2022). Building
ethical solutions, meeting the diverse needs of its users. a speech recognition system with privacy identification information
based on Google Voice for social robots. The Journal of
Supercomputing, 78(13), 15060-15088.
REFERENCES [12] Kumar, R., Gupta, M., Shrama, P., Soni, N., & Rawat, K. (2024,
March). NLP-Based text-to-speech and speech-to-text virtual assistant.
[1] P. Kunekar, A. Deshmukh, S. Gajalwad, A. Bichare, K. Gunjal and S.
In AIP Conference Proceedings (Vol. 3072, No. 1). AIP Publishing.
Hingade, "AI-based Desktop Voice Assistant," 2023 5th Biennial
International Conference on Nascent Technologies in Engineering [13] Le, P. N., Vu, H. M. L., & Tran, M. N. (2022). Improving EFL students'
(ICNTE), Navi Mumbai, India,2023, pp. 1-4,doi: intonation in-text using shadowing technique with the implementation
10.1109/ICNTE56631.2023.10146699B.. of Google text-to-speech. AsiaCALL Online Journal, 13(1), 93-121.
[2] S. Subhash, P. N. Srivatsa, S. Siddesh, A. Ullas and B. Santhosh, [14] Pawar, A. B., Gawali, P., Gite, M., Jawale, M. A., & William, P. (2022,
"Artificial Intelligence-based Voice Assistant," 2020 Fourth World April). Challenges for hate speech recognition system: approach based
Conference on Smart Trends in Systems, Security and Sustainability on solution. In 2022 International conference on sustainable computing
(WorldS4), London, UK, 2020, pp. 593-596, doi: and data communication systems (ICSCDS) (pp. 699-704). IEEE.
10.1109/WorldS450073.2020.9210344M [15] Singh, S., Arora, D. K., Dar, I. N., Moghni, A., Kumar, S., & Kumar,
[3] K. G. Maheshwari, R. Meenakshi, G. NaliniPriya, K. Anandasayanam, A. (2022, February). ARIA The Bot. In 2022 2nd International
B. Hariram and G. Maheswara Pandian, "Dynamic AI based Email Conference on Innovative Practices in Technology and Management
Voice Assistant for Web Services," 2022 International Conference on (ICIPTM) (Vol. 2, pp. 167-174). IEEE.
Smart Technologies and Systems for Next Generation Computing [16] Shoeb, M., Kolluru, V. R., Naga Venkat Sai, M., Mustafa Baig, M., &
(ICSTSN), Villupuram, India, 2022, pp. 1-4, doi: Razia, S. (2022, May). Implementation of Artificial Intelligence Based
10.1109/ICSTSN53084.2022.9761287 Sustainable Smart Voice Assistance. In ICCCE 2021: Proceedings of
[4] A. Jagan, M. Ghouse Pasha, D. Nandini, H. Susanna and D. N. the 4th International Conference on Communications and Cyber
Parvathi, "Personal Voice Assistant Using Computer Vision," 2023 Physical Engineering (pp. 1073-1081). Singapore: Springer Nature
International Conference on Research Methodologies in Knowledge Singapore.
Management, Artificial Intelligence and Telecommunication [17] Pandey, D., Maitrey, S., & Seth, D. (2022, December). Artificially
Engineering (RMKMATE), Chennai, India, 2023, pp. 1-8, doi: developed intelligent system using Python. In AIP Conference
10.1109/RMKMATE59243.2023.10369477. Proceedings (Vol. 2597, No. 1). AIP Publishing.
[5] M. Gupta, R. Kumar and H. Sardalia, "Voice Assistant Technology: [18] Zheng, J., & Fischer, M. (2023). Dynamic prompt-based virtual
The Case of Jarvis AI," 2023 4th International Conference for assistant framework for BIM information search. Automation in
Emerging Technology (INCET), Belgaum, India,2023, pp. 1-5, doi: Construction, 155, 105067.
10.1109/INCET57972.2023.10170362
[6] C. Simon and M. Rajeswari, "Voice-based Virtual Assistant with
Security," 2023 Second International Conference on Electronics and

6
Authorized licensed use limited to: REVA UNIVERSITY. Downloaded on October 23,2024 at 14:11:12 UTC from IEEE Xplore. Restrictions apply.

You might also like