0 \vgtccategoryResearch \vgtcinsertpkg
The Hall of Singularity: VR Experience of Prophecy by AI
Abstract
‘The Hall of Singularity’ is an immersive art that creates personalized experiences of receiving prophecies from an AI deity through an integration of Artificial Intelligence (AI) and Virtual Reality (VR). As a metaphor for the mythologizing of AI in our society, ‘The Hall of Singularity’ offers an immersive quasi-religious experience where individuals can encounter an AI that has the power to make prophecies. This journey enables users to experience and imagine a world with an omnipotent AI deity.
keywords:
Artificial intelligence, Virtual reality, Speculative fiction.Introduction
‘Technological Singularity’ is a quasi-religious hypothesis about a certain future point in time when artificial intelligence (AI) surpasses human intelligence and capabilities[5]. Mythologizing AI prevails with various media platforms especially in the form of comparing future abilities of AI to those of omnipotent and omniscient deities, or monotheistic god.
Untransparent characteristics of Large AI models also plays a significant role in reinforcing this myth. From the Delphic Oracle to Catholicism, and Korea’s Mudang (shamanistic medium), we individuals have always relied on so called “Mediums(an individual held to be a channel of communication between the earthly world and a world of spirits)[3]’ to connect with deities or divine beings, receiving prophecies and messages on their behalf[6]. Similarly, the limited accessibility of large AI models, which necessitates reliance on intermediary programs for non-AI professionals, contributes to the perpetuation of mythical misconceptions surrounding AI. Non-Ai professionals rely on medium programs like OpenAI playground that connect individuals with the data they want and deliver the results[4]. This limited accessibility of large AI models enhances the mythological misconceptions surrounding AI since it is only reachable and understood by AI professionals and medium programs.
In this work, we attempt to portray the prevailing state of AI mythologization in our society through this metaphor: ‘The medium of artificial intelligence that possesses omnipotent power like deity’. We compared utilization of large AI models via intermediary programs with the act of entrusting and relying on mediums that connect with deity. This metaphor effectively captures the essence of the situation where individuals depend on intermediary programs to gain access to the large AI models without clear understanding.
Moreover, we used Virtual Reality as a medium for accessing large artificial intelligence models. Recent large artificial intelligence models like Stable Diffusion and GPT have gained significant attention as a tool to improve user engagement[1]. However, the majority of users still utilize large artificial intelligence models within the framework of traditional computer user interfaces (ex: websites). We aimed to provide a more immersive experience for users by using VR to mediate Artificial Intelligence. Designing VR levels more appropriately, we managed the drawbacks of using artificial intelligence model such as latency in processing generation in AI.
1 User Walkthrough
1. User enters the Hall of Singularity in Virtual Reality through HMD headset. The temple, where they can encounter the ”Medium of AI deity,” engulfs users with its reminiscence of celestial space.
2. User can spacewalk in a cosmic temple via swimming motion by using hand controllers. This motion synchronizes physical movement with virtual movement and helps user to navigate inside the temple.
3. Once user encounter the ‘Medium of the AI deity’, they are guided to ask one question to the Medium using their own voices, regardless of the length of their inquiries.
4. Before the prophecy delivered, visitors can explore the temple’s ceiling and floor, where they can wander and observe prophecy videos from previous users.
5. When the prophecy is ready, user is teleported to the space where they can watch a 30-second video showcasing their prophecy.
2 AI – VR Integration
To deliver mysterious and personalized experience to user, various Large artificial intelligence models were used throughout the entire process of the work. We used Whisper model from Open AI to enable user to speak directly to the ‘Medium of deity’, and ChatGPT to generate personalized prophecy that responds user’s questions. And we used Deforum model to deliver the prophecy into video format to infuse more ambiguity and allow various interpretations from the user.
2.1 Real-time Voice Recognition and Translation through Whisper Model
In the first stage, we used Whisper for speech-to-text and its real-time translation function to convert the voice data from user into English at the same time since the AI models such as ChatGPT and Deforum provide better quality of output in English. Due to the use of Whisper’s translation capabilities, users can ask questions in various languages, including Korean, as demonstrated in the demo video. The voice data spoken by user is converted into English text and the result is sent to the input prompt of ChatGPT.
2.2 Prophecy Generation with ChatGPT
In the second stage, we used ChatGPT model to generate a prophecy that responds to user’s question. We gave predefined few shot examples to ChatGPT to generate a prompt for Deforum model that helps it to make consistent style of video. After Whisper model finishes the translation, the result is sent and added to the input of the predefined prompt in ChatGPT. Then, the ChatGPT model generates the text version of prophecy based on the input text.
2.3 Video Prophecy Generation with Deforum
We decided to present the prophecy into Video format to allow diverse interpretation in the prophecy. We transformed prophecy text from ChatGPT into Video through Deforum model that enables ‘Text to Video” generation. It takes 2 minutes to generate 30 seconds of video.
2.4 Integrating AI and VR through HTTP Communication
The connection between large artificial intelligence models and the virtual space is established through HTTP communication. Artificial intelligence models are accessed through APIs on a remote server, and the virtual space communicates with the remote server through HTTP requests, exchanging information. In this system, User’s voice data is sent from virtual space to remote server, and generated prophecy data is sent from remote server to virtual space.
3 Designing Virtual Reality
The background of ‘The Hall of Singularity’ is a cosmic temple. Spacewalk is implemented via swimming motion using hand controllers. User can freely navigate through the space by holding the controllers and doing swimming strokes. We chose swimming as the method of locomotion in order to enhance presence. Swimming as spacewalk is well-suited for the cosmic background. Also, user’s movements in the virtual space are synchronized with the real-world movements while swimming. Synchronizing user’s real-world movements with their movements in the virtual space enhances presence for user[2]. The 3D modeling objects in the temple are designed to have larger scale than objects in our daily life. It makes user feel overwhelmed and gives a sense of awe. Since swimming is used for locomotion, user is free to move both horizontally and vertically as well to touch the ceiling.
However, when user pose questions, they are guided to ask one question from the floor. This guidance makes user to look up ‘The Medium of the AI deity’ while they ask for prophecy. To enhance presence in Virtual Reality, we enabled interaction through voice by using Whisper. This function allows user to ask questions without the need for a separate virtual keyboard.
The prophecy generation begins after user asked their question. The Medium of the deity is concealed behind the veil. It takes approximately 2 minutes for the AI models to generate prophecy video. During this generation time, user is allowed to explore and watch various prophecy videos from previous user. The videos are placed all over the temple from the ceiling to the bottom. These videos are strategically placed in various locations. With their swimming motion, this design provides a multidirectional viewing experience.
Once the generation is complete, the color of the veil changes. As user approaches the veil, they are transported to a separate enclosed space where they can view the prophecy.
The prophecy videos are projected onto all surfaces of the square space. Post-processing effects like motion blur infuse a unique visual atmosphere and creates a sense of detachment from previous experiences. Once the prophecy video ends, user is transported back to the starting point where they can start another journey to ask additional questions. This effect allows a user to continue their exploration and ask further inquiries within the virtual space. Here, variation in background music and ambience are also implemented to make the transition between each stage more noticeable.
4 Conclusion
”The Hall of Singularity” invites user to explore the realm of contemporary artificial intelligence technology, offering a quasi-religious experience as metaphor of mythologization around AI. In this immersive art, user can experience a virtual world where artificial intelligence possesses omnipotent power as it is portrayed through mass media.
Generating prophecy, user’s voice data converts into English text with Whisper model’s real-time voice recognition and translation. Then, ChatGPT generates a prophecy based on user queries, with the input text incorporating the translated voice data and predefined prompts. This mechanism ensures producing coherent results regardless of the content of input(question) from user. Lastly, Deforum model transforms the prophecy text into a video format. This abstract form of prophecy allows user for richer and diverse interpretations.
”The Hall of Singularity” is an experimental attempt that use VR as a new medium utilizing large AI models. Designing VR levels for AI models addressed drawbacks of large AI models, such as latency of generation time. Incorporating large AI models in a virtual space offered distinctive advantages of infusing dynamics into VR environment. This exploration paves the way for a more interactive and engaging user experience at the intersection of VR and AI.
Acknowledgements.
The authors would like to thank Prof. Dasaem Jeong at Sogang University, for instructing AI & Creativity course (ANT6040, Fall 2022) where the subject idea of this artwork were begun. We would also like to thank to Heeyeon Yeon, a master’s student at department of artificial intelligence of Sogang University, for providing technical assistance.References
- [1] Y. Cao, S. Li, Y. Liu, Z. Yan, Y. Dai, P. S. Yu, and L. Sun. A comprehensive survey of ai-generated content (aigc): A history of generative ai from gan to chatgpt. arXiv preprint arXiv:2303.04226, 2023.
- [2] J. Lee, M. Kim, and J. Kim. A study on immersion and vr sickness in walking interaction for immersive virtual reality applications. Symmetry, 9(5):78, 2017.
- [3] Merriam-Webster. “Medium” definition & meaning.
- [4] OpenAI. OpenAI Playground.
- [5] M. Shanahan. The technological singularity. MIT press, 2015.
- [6] J. Yang. Korean shamanism: The training process of charismatic’mudang’. 1988.