CoHRT: A Collaboration System for Human-Robot Teamwork
Abstract
Collaborative robots are increasingly deployed alongside humans in factories, hospitals, schools, and other domains to enhance teamwork and efficiency. Systems that seamlessly integrate humans and robots into cohesive teams for coordinated and efficient task execution are needed, enabling studies on how robot collaboration policies affect team performance and teammates’ perceived fairness, trust, and safety. Such a system can also be utilized to study the impact of a robot’s normative behavior on team collaboration. Additionally, it allows for investigation into how the legibility and predictability of robot actions affect human-robot teamwork and perceived safety and trust. Existing systems are limited, typically involving one human and one robot, and thus offer limited insight into broader team dynamics. Many rely on games or virtual simulations, neglecting the impact of a robot’s physical presence. Most tasks are turn-based, hindering simultaneous execution and affecting efficiency. This paper introduces CoHRT (Collaboration System for Human-Robot Teamwork), which facilitates multi-human-robot teamwork through seamless collaboration, coordination, and communication. CoHRT utilizes a server-client-based architecture, a vision-based system to track task environments, and a simple interface for team action coordination. It allows for the design of tasks considering the human teammates’ physical and mental workload and varied skill levels across the team members. We used CoHRT to design a collaborative block manipulation and jigsaw puzzle-solving task in a team of one Franka Emika Panda robot and two humans. The system enables recording multi-modal collaboration data to develop adaptive collaboration policies for robots. To further utilize CoHRT, we outline potential research directions in diverse human-robot collaborative tasks.
I Introduction
Collaborative robots (cobots) are increasingly being deployed to work alongside humans in various domains, including manufacturing, healthcare, and education [51, 41, 26, 53]. These cobots have the potential to revolutionize efficiency and productivity by seamlessly integrating into human-robot teams and facilitating coordinated task execution [29, 33, 43, 48, 52, 4]. By sharing physical workspaces and collaborating on complex tasks, cobots can augment human capabilities, alleviate physical and cognitive burdens, and complete intricate objectives that would be challenging for either humans or robots alone to accomplish effectively [6, 7, 19, 24, 42, 30]. However, realizing the full potential of human-robot collaboration hinges on developing systems that enable efficient teamwork, coordination, and communication between humans and robots [18, 13, 47]. Such a system allows the robot to perceive the human teammates’ behavior, understand their instruction, and leverage its collaborative efforts accordingly [36, 35, 34, 37, 50, 60, 61].
The development of collaboration systems must not only focus on efficiency and productivity but also prioritize the physical, cognitive, and social safety of human team members [63, 5, 49, 57]. This includes designing robots with predictable and legible behaviors, implementing robust safety mechanisms, and considering the influence of social and cultural norms on perceived safety in human-robot interactions [21, 59, 55]. Furthermore, as these collaborative systems evolve, it is crucial to explore how they can be designed to enhance trust, fairness, and overall acceptance of robots in shared workspaces [20, 39, 40, 11, 12, 38].
Recent research has explored various aspects of human-robot teamwork, such as communication, synchronization, and task allocation [28, 31, 32, 56, 27]. These studies highlight the importance of designing effective collaboration strategies considering human-robot teams’ unique dynamics and challenges [54, 25, 47]. However, existing human-robot collaboration systems have limitations that hinder the robot’s ability to facilitate seamless and efficient teamwork. Many of these systems are confined to dyadic interactions involving only one human and one robot [11, 9], failing to capture the dynamics and complexities of larger team settings, where multiple humans and robots must coordinate their actions and adapt to diverse individual strengths, preferences, and constraints.
Additionally, several systems rely on game-based or virtual simulation environments, neglecting the potential impact of a robot’s physical embodiment on collaboration dynamics, such as spatial awareness, non-verbal communication, and shared situational understanding [15, 14, 17, 16, 44]. Moreover, some works utilize complex systems, such as motion capture systems, to perceive and comprehend human behavior and determine the collaboration effort required by the robot [62]. In such a system, participants must be equipped with sophisticated sensors or apparatus, which may increase participants’ cognitive load and discomfort, thus affecting their collaboration experiences. Furthermore, many tasks in these systems are turn-based, restricting simultaneous execution and potentially hindering team efficiency by preventing concurrent action and fluid task handoffs [11, 39].
To address these limitations, we introduce CoHRT (Collaboration System for Human-Robot Teamwork), a system designed to facilitate multi-human-robot teamwork through seamless collaboration, coordination, and communication. CoHRT leverages a vision-based system for tracking the environment state and team members’ actions, enabling real-time monitoring. It also incorporates a simple interface for action coordination among team members, allowing for efficient communication and synchronization of efforts. Importantly, CoHRT allows for the design of tasks that accommodate varying skill levels and constraints across team members, enabling studies on team performance, trust, fairness, and robot acceptance in diverse settings that more accurately reflect real-world scenarios. We demonstrate the capability of the CoHRT system through a collaborative task involving one Franka Emika Panda robot and two human participants. The task requires the team to solve a jigsaw puzzle and stack blocks, creating a scenario that demands a mental and physical workload.
The CoHRT system is designed to be extensible to larger teams and diverse task domains, allowing for the exploration of various team compositions, task complexities, and environmental constraints. By enabling the collection of multi-modal collaboration datasets, CoHRT can facilitate the development of adaptive collaborative policies that optimize team performance, enhance trust, and promote acceptance of robotic teammates. We expect CoHRT to be a valuable resource for the broader human-robot collaboration research community.
II The CoHRT System
II-A The Robot
The CoHRT system utilizes the Franka Emika Panda robot, a state-of-the-art manipulator robot designed for safe and efficient interaction with humans in various industrial and research settings. With its seven joints and lightweight design, the Panda robot offers remarkable versatility and agility, enabling it to execute intricate manipulations and adapt to diverse tasks. The robot’s sophisticated control software and torque-controlled joints ensure fluid, accurate, and responsive movements. Moreover, the Panda robot prioritizes safety through integrated features like collision detection and force sensing, allowing direct human collaboration without protective barriers. These features, along with its user-friendly interface and programming capabilities, make the Franka Emika Panda robot an ideal choice for studying human-robot collaborative teamwork. In CoHRT, the manipulator performs pick-and-place operations, which are fundamental to most mobile manipulators [39]. These basic pick-and-place capabilities can be leveraged to develop various collaborative task scenarios. By maintaining simplicity, the CoHRT system facilitates a clearer understanding of the effect of robot collaborative policy on human teammates.
II-B The Collaborative Task
The CoHRT system enables the design of two types of tasks: tasks involving physical workloads, like block manipulation, and tasks requiring mental workload, such as solving puzzles and math problems. By offering this flexibility, the CoHRT system can simulate real-world scenarios, such as assembly tasks frequently employed in human-robot collaboration research [39, 23]. In this context, the mentally demanding task is comparable to a planning task, while the physically demanding task parallels the plan execution.
II-C CoHRT Architecture
The CoHRT system leverages a server-client architecture, where a control server acts as the central coordinator, facilitating collaboration and coordination among the human-robot team (Fig. 2). This section describes the modules of the CoHRT architecture. Throughout, we consider a scenario in which CoHRT is used to develop a block-stacking task and a picture jigsaw puzzle-solving task for the human teammates, while the robot’s task is limited to block stacking.
II-C1 Server Program
The server program acts as the central controller, facilitating communication and coordination between the robot and participants. It also executes robot collaboration strategies and controls the robot by planning its trajectory. We describe each of its modules below.
Collaboration Strategy Executor runs a particular collaboration procedure relying on the system state detected by the state observer module. It also communicates with the trajectory planner to plan the trajectory of the current robot task and send it to the robot controller module.
Trajectory Planner determines the set of robot waypoints based on the current state by determining the block fetch location and stack height.
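To illustrate how a block fetch location and a stack height could determine a waypoint sequence, a minimal sketch follows. It is not the CoHRT planner: the block height, clearance, coordinates, and the names `Waypoint` and `plan_pick_and_place` are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import List, Tuple

BLOCK_HEIGHT_M = 0.05  # assumed block height in meters (hypothetical value)
CLEARANCE_M = 0.10     # assumed safe travel margin above the tallest pile (hypothetical)


@dataclass(frozen=True)
class Waypoint:
    x: float
    y: float
    z: float
    gripper_closed: bool


def plan_pick_and_place(
    fetch_xy: Tuple[float, float],
    fetch_level: int,
    stack_xy: Tuple[float, float],
    stack_height: int,
) -> List[Waypoint]:
    """Return waypoints for one pick-and-place action.

    fetch_level: number of blocks lying under the block to fetch.
    stack_height: number of blocks already in the target stack.
    """
    fx, fy = fetch_xy
    sx, sy = stack_xy
    pick_z = (fetch_level + 0.5) * BLOCK_HEIGHT_M    # center of the block to grasp
    place_z = (stack_height + 0.5) * BLOCK_HEIGHT_M  # center of the new top slot
    # Travel above whichever pile is taller once the block is on it.
    travel_z = max(fetch_level + 1, stack_height + 1) * BLOCK_HEIGHT_M + CLEARANCE_M
    return [
        Waypoint(fx, fy, travel_z, False),  # hover over the inventory
        Waypoint(fx, fy, pick_z, False),    # descend to the block
        Waypoint(fx, fy, pick_z, True),     # grasp
        Waypoint(fx, fy, travel_z, True),   # lift
        Waypoint(sx, sy, travel_z, True),   # move over the stack
        Waypoint(sx, sy, place_z, True),    # descend to the placement height
        Waypoint(sx, sy, place_z, False),   # release
        Waypoint(sx, sy, travel_z, False),  # retreat
    ]
```

In this sketch only the placement height depends on the current stack state, which is why the planner needs the stack height from the state observer before each action.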
Robot Controller implements low-level planning, including inverse kinematics (IK) solvers and joint angle calculators. We rely on the Python frankx [3] library for low-level trajectory planning to move the robot to a target waypoint.
State Observer provides the current task state. To detect the puzzle state, we rely on the client program, which sends the current puzzle state whenever the participant takes an action (e.g., moving a puzzle piece to a grid). The stack state changes when the participant or the robot stacks a block. In our system, a block can be in one of three manipulation states: unstacked, working, and stacked. Each stack, in turn, can be in one of two states: complete and incomplete. We implement a vision-based approach that detects the current stack state by reading the AprilTags [58] attached to each block.
We utilize an Azure Kinect (RGB-D) [45] camera to capture high-resolution RGB images of the stacks, which are then processed using the Python pupil-apriltags library [1] to detect and decode the tags. The unique identifier of each AprilTag allows us to distinguish individual blocks and their positions within the stack. Their detected positions enable us to determine relative block locations and accurately detect stack configurations, even when blocks are partially occluded. Based on these detections, the system updates each block’s state (unstacked or stacked) and the stack state (complete or incomplete). When the robot or a participant selects a block for manipulation, the system updates the block’s manipulator property (Human or Robot) and sets the block to a “working” state. The system maintains a history of recent states and enforces that only the topmost unstacked block can be chosen for manipulation. Upon stacking, the system detects the block’s AprilTag and leverages its position to determine whether the placement is correct. Because each AprilTag is unique, the system can look up block properties such as color. If the system confirms correct stacking, it updates the block’s state to “stacked.” Finally, the block states are updated in the client interface, providing comprehensive, real-time tracking of the task’s progress.
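The step from tag detections to a stack state can be sketched as below. This is an illustrative assumption about the logic, not the CoHRT code: detections are passed in as plain `(tag_id, center_y)` pairs so the example runs without a camera (pupil-apriltags detections expose `tag_id` and `center` fields that could supply these values), and the tag-to-color table, the target pattern, and the function names are hypothetical.

```python
from typing import Dict, List, Sequence, Tuple

# One detection: (tag_id, vertical pixel coordinate of the tag center).
Detection = Tuple[int, float]


def observed_order(detections: Sequence[Detection]) -> List[int]:
    """Order tag ids bottom-to-top.

    Image y grows downward, so the largest y belongs to the bottom block.
    """
    ordered = sorted(detections, key=lambda d: d[1], reverse=True)
    return [tag_id for tag_id, _ in ordered]


def stack_state(
    detections: Sequence[Detection],
    tag_colors: Dict[int, str],    # hypothetical lookup: tag id -> block color
    target_pattern: List[str],     # required colors, bottom-to-top
) -> Dict[str, object]:
    """Summarize a stack from the tags currently visible in its region."""
    # Ignore tags that do not belong to a known block.
    colors = [tag_colors[t] for t in observed_order(detections) if t in tag_colors]
    # The stack is correct so far if it matches a prefix of the target pattern.
    correct = colors == target_pattern[: len(colors)]
    complete = correct and len(colors) == len(target_pattern)
    return {
        "colors": colors,
        "correct": correct,
        "state": "complete" if complete else "incomplete",
    }
```

A block whose tag appears at the expected position in a correct prefix would then move from “working” to “stacked” in the state table.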
Client Handler facilitates communication with the participants by sending the task configuration at the beginning of the task and exchanging commands between server and client during the task.
II-C2 Client Program
The client program provides a client interface for facilitating communication between client and server. It also implements the picture jigsaw puzzle-solving task. It has the following modules.
Client Interface is the graphical user interface (GUI) implemented using the PyQt5 [2] library that renders the puzzle-solving task and the current stack state. Participants can interact with this interface using a standard keyboard. The jigsaw puzzle requires participants to move picture pieces between puzzle grids. To stack a block, the participant first selects the block by clicking. The GUI visualizes the current stack state by showing a human or robot icon on each block together with the text “working” or “done.” Note that when a block is stacked by either the robot or a participant, it is automatically detected by the state observer module of the server, and the state is updated in the client interface. This interface thus provides a way to communicate actions between the robot and participants. Participants can also watch the robot directly to infer its actions, but doing so while solving the puzzle can be stressful; the client interface therefore visualizes the robot’s actions as well.
Client Interface Renderer renders the client interface whenever the state of the system changes.
Server Handler is responsible for exchanging messages between the server and client program by implementing a read-and-write procedure to read from and write to the server.
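A read-and-write procedure of this kind could look like the sketch below. The paper does not specify the transport or message format, so TCP-style stream sockets, length-prefixed JSON framing, and the message fields used in the usage note are all assumptions made for illustration.

```python
import json
import socket
import struct


def send_message(sock: socket.socket, message: dict) -> None:
    """Write one message: a 4-byte big-endian length, then the JSON payload."""
    payload = json.dumps(message).encode("utf-8")
    sock.sendall(struct.pack("!I", len(payload)) + payload)


def _recv_exact(sock: socket.socket, n: int) -> bytes:
    """Read exactly n bytes, since recv() may return fewer than requested."""
    buf = bytearray()
    while len(buf) < n:
        chunk = sock.recv(n - len(buf))
        if not chunk:
            raise ConnectionError("socket closed before the full message arrived")
        buf.extend(chunk)
    return bytes(buf)


def recv_message(sock: socket.socket) -> dict:
    """Read one length-prefixed JSON message."""
    (length,) = struct.unpack("!I", _recv_exact(sock, 4))
    return json.loads(_recv_exact(sock, length).decode("utf-8"))
```

For example, a client might send a hypothetical request such as `{"type": "allocate", "participant": "P1", "block": "S1-3"}` with `send_message` and wait for the server’s reply with `recv_message`. The length prefix keeps message boundaries intact when several updates arrive back to back on the same connection.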
II-C3 Interaction and Synchronization
CoHRT implements a synchronized coordination mechanism to prevent conflicts between team members. During the initial setup phase, the client program receives the task configuration, including puzzle size, picture, stack size, and color pattern. The participant initiates the task using the client interface. Once the task has been initiated, the robot collaborates according to the selected strategy. The CoHRT server listens continuously for client requests. An allocation request is sent to the server whenever the robot or a participant intends to fetch and place a block. If the block is available, the server reserves it, updates the state accordingly, and communicates the update to the client GUI for visualization.
Conflicts can arise when the robot and a participant request the same block for manipulation at the same time. To handle them, CoHRT implements a locking mechanism that ensures only one request is granted. Note that each participant has a different stack to manipulate, so there is no conflict between the two participants. We identify the critical sections in the program, such as the current stack state, and guard them with locks so that they are accessed by one request at a time. Every selection request first goes to the server, whose locking mechanism confirms one request and rejects the other. This mechanism ensures synchronization and coordination among team members, while the client interface ensures that the coordination is legible to the participants. In this way, CoHRT manages resource allocation and prevents conflicts in block selection between the robot and human participants.
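The locking behavior described above can be sketched as follows. This is a minimal illustration, assuming a threaded server and an in-memory state table; the class name `BlockAllocator`, its methods, and the block identifiers are hypothetical rather than taken from CoHRT.

```python
import threading
from typing import Dict, Iterable, Optional, Tuple


class BlockAllocator:
    """Grants each block to at most one requester (Human or Robot)."""

    def __init__(self, block_ids: Iterable[str]) -> None:
        self._lock = threading.Lock()  # guards the shared stack state
        self._state: Dict[str, str] = {b: "unstacked" for b in block_ids}
        self._owner: Dict[str, Optional[str]] = {b: None for b in self._state}

    def request(self, block_id: str, requester: str) -> bool:
        """Reserve a block; return False if it is unknown or already taken."""
        with self._lock:  # check and update happen as one atomic step
            if self._state.get(block_id) != "unstacked":
                return False
            self._state[block_id] = "working"
            self._owner[block_id] = requester
            return True

    def mark_stacked(self, block_id: str) -> None:
        """Called once the state observer confirms correct placement."""
        with self._lock:
            if self._state.get(block_id) == "working":
                self._state[block_id] = "stacked"

    def snapshot(self) -> Dict[str, Tuple[str, Optional[str]]]:
        """Copy of the state table, e.g. for sending to the client GUI."""
        with self._lock:
            return {b: (s, self._owner[b]) for b, s in self._state.items()}
```

Because the availability check and the state update sit inside the same locked region, two simultaneous requests for one block cannot both observe it as unstacked, so exactly one of them is confirmed.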
III CoHRT Implementation
III-A An example collaborative task
To evaluate the CoHRT system, we design a collaborative task involving one robot and two humans (P1 and P2) in a team setting (Fig. 1). The team’s objective is to construct two stacks, each consisting of seven blocks arranged in a specific color pattern, and to solve two jigsaw puzzles of varying complexity ( or grids). Each block in a stack has a unique color. The human participants are assigned two tasks: first, to solve a picture jigsaw puzzle, and second, to stack blocks in a designated area. The robot’s role is to collaborate with the participants solely in the stacking task. Note that the participants do not collaborate but can work simultaneously without interrupting one another, which differs from turn-based approaches [10, 15, 39]. The puzzle-solving task makes it more mentally demanding for the participants. Each participant is randomly assigned to a or a . To make the task physically demanding, we introduce the manipulation task, where a participant fetches a block of a specific color from another station (Station 2) and places it in their designated stacking location (S1 and S2) at Station 1. The robot is attached to the stacking station (Station 1) and performs the task exclusively by fetching a block from the same station and placing it in either of the participants’ stacks. The manipulator is not mobile, so we place two inventories of blocks, one for each participant, at Station 1. However, participants must fetch blocks from Station 2, making the task more challenging. The designed task is analogous to assembly or building tasks commonly used in many human-robot collaborative studies [39, 23, 22, 46]. The puzzle-solving task can be considered a planning task. This team scenario is common in many real-life applications, such as factory environments, where each worker has a dedicated task that requires planning and execution. Each participant’s actions include moving puzzle blocks between grids during the puzzle-solving phase. 
During the manipulation phase, they pick and place blocks from one station to another or remain idle. The robot’s actions involve picking a block and placing it in either participant’s stack, or remaining idle. The robot collaborates equally with both participants by alternately stacking blocks for each of them.
III-B Collaborative Task Execution
Fig. 3 shows an illustrative example of the task execution in the human-robot team. The client interface displays the current state of the puzzle and the stack. The participant first solves the jigsaw puzzle by moving the pieces in the interface. When the participant finishes solving the puzzle, block stacking starts. To stack a block, the participant first clicks on the available block within the GUI, then retrieves the physical block from a separate inventory station and places it in the designated location at the stacking station. The robot is attached to the stacking station. It aids the participants by fetching blocks from one of two inventories located at the same station, with each inventory assigned to a specific participant. The robot alternates its collaboration efforts between the participants. It continues collaborating for as long as it can provide equal collaboration effort, where equal collaboration means that the robot stacks an equal number of blocks for both participants. The teamwork finishes when both jigsaw puzzles are solved and both stacks are built.
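One possible reading of this alternating, equal-effort strategy is sketched below. The rule for when the robot stays idle is an assumption of this sketch (the robot starts a new round only while both stacks still have a block available), and the function name, the participant labels used as keys, and the counters are hypothetical.

```python
from typing import Dict, Optional


def next_robot_target(
    robot_counts: Dict[str, int],  # blocks the robot has stacked per participant
    available: Dict[str, int],     # unreserved blocks left per participant's stack
) -> Optional[str]:
    """Return the participant whose stack the robot should serve next, or None to idle."""
    p_low = min(robot_counts, key=robot_counts.get)
    p_high = max(robot_counts, key=robot_counts.get)
    if robot_counts[p_low] < robot_counts[p_high]:
        # Mid-round: the robot owes one block to the participant it has helped less.
        return p_low if available[p_low] > 0 else None
    # Counts are equal: start a new round only if both stacks can still be served,
    # otherwise the robot could not keep its effort equal.
    if all(n > 0 for n in available.values()):
        return sorted(robot_counts)[0]  # deterministic choice of who goes first
    return None
```

With two participants and two available blocks each, repeated calls yield P1, P2, P1, P2 and then `None`, at which point the robot remains idle.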
IV CoHRT Evaluation Plan
To assess the effectiveness of the CoHRT system in improving team performance and communication efficiency, we propose a comprehensive evaluation plan incorporating both quantitative and qualitative measures. Our evaluation will focus on team fluency, task performance, and user experience. We will employ several critical metrics from Hoffman’s work evaluating fluency in human-robot collaboration [25]. The plan includes measuring the team’s total time to complete the collaborative task, including puzzle-solving and block-stacking components (task completion time). We will track the duration for which the robot remains inactive while waiting for human actions or decisions (robot idle time) and the periods when human participants are not actively engaged in task-related activities (human idle time). The evaluation will also assess the percentage of time during which both human participants and the robot are simultaneously active (concurrent activity), the time lag between the end of one agent’s action and the beginning of another’s (functional delay), and the regularity and predictability of action transitions between humans and the robot (human-robot rhythm). By analyzing these metrics, we aim to provide insights into the effectiveness and efficiency of each collaboration strategy in facilitating smooth and seamless human-robot interaction. To evaluate the overall usability of the CoHRT system, we plan to utilize the System Usability Scale (SUS) [8]. Additionally, we will conduct post-task interviews to gather in-depth insights into participants’ experiences, challenges, and suggestions for improvement, providing a rich qualitative complement to our quantitative data.
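Three of these measures can be computed directly from logged activity intervals, as in the sketch below. It is a simplified illustration rather than the evaluation code: it assumes each agent’s activity is logged as non-overlapping `(start, end)` intervals in seconds, treats a single human for brevity, and uses hypothetical function names. Functional delay and human-robot rhythm are not covered.

```python
from typing import Dict, List, Tuple

Interval = Tuple[float, float]  # (start, end) of one action, in seconds


def total_duration(intervals: List[Interval]) -> float:
    """Total active time, assuming the intervals of one agent do not overlap."""
    return sum(end - start for start, end in intervals)


def overlap(a: List[Interval], b: List[Interval]) -> float:
    """Total time during which an interval from a and one from b are both active."""
    total = 0.0
    for s1, e1 in a:
        for s2, e2 in b:
            total += max(0.0, min(e1, e2) - max(s1, s2))
    return total


def fluency_metrics(
    robot: List[Interval], human: List[Interval], task_time: float
) -> Dict[str, float]:
    """Idle times (seconds) and concurrent activity (percent of task time)."""
    return {
        "robot_idle_time": task_time - total_duration(robot),
        "human_idle_time": task_time - total_duration(human),
        "concurrent_activity_pct": 100.0 * overlap(robot, human) / task_time,
    }
```

For a 10-second task in which the robot is active during (0, 4) and (6, 10) and the human during (2, 7), the robot idles for 2 seconds, the human for 5 seconds, and both are active concurrently for 30% of the task.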
V Extendability and Future Work
CoHRT shows potential for generalizability and extendability to various task scenarios and team compositions. The server-client-based architecture can be utilized to integrate multiple human participants and robots, enabling the system to scale to larger teams and more complex collaborative tasks. We used CoHRT to design a collaborative task involving two humans and one robot. The system can potentially support collaboration scenarios involving more than two humans by instantiating multiple client interfaces. CoHRT facilitates seamless collaboration and coordination among the team members and system entities, enabling its extension to real-world tasks in manufacturing, healthcare, and education. However, further research and testing would be needed to fully assess the system’s capabilities and limitations in these scenarios. In addition to supporting multiple human participants, the system also has the potential to incorporate heterogeneous robots. The CoHRT server can integrate a new robot by instantiating a new robot controller that utilizes robot-specific APIs. This flexibility will enable the system to leverage the capabilities of multiple robots, each with its own set of skills and functionalities, to address more complex and diverse collaborative tasks. For example, a mobile manipulator can be added to CoHRT to complement the skills of the existing manipulator, providing increased mobility and adaptability to different task environments. A mobile manipulator combines a robotic arm’s dexterity and precision with a mobile base’s mobility, enabling the robot to navigate and interact with its surroundings more effectively. This adaptability is particularly valuable when the collaborative task requires the robot to move between different workstations or adapt to changing task demands.
The system’s adaptability also extends to the design of collaborative tasks. While we demonstrate CoHRT with a collaborative block stacking task, we can utilize the system to create new tasks such as item sorting and table decluttering. By adjusting the task parameters, perception algorithms, and robot control strategies, the system can be tailored to address specific task requirements and objectives. This allows researchers to explore a variety of collaborative scenarios and investigate the effectiveness of different robot collaboration strategies in diverse task contexts. In future work, we will explore the extensibility of CoHRT to larger teams and diverse tasks. We plan to leverage the CoHRT system and the designed tasks to investigate how humans perceive fairness, trust, and safety in collaborative human-robot teams. An intriguing avenue of exploration involves examining the impact of various robot strategies for allocating collaborative efforts on human teammates’ perceptions of fairness and trust. Additionally, we intend to collect interaction data to gain insights into teammates’ capabilities, which can then be used to develop robot strategies that consider the unique needs of the teammates. By pursuing this line of research, we aim to create more personalized and adaptive robot collaboration strategies that enhance the overall user experience and promote trust and acceptance of human-robot collaboration systems. Furthermore, we will open-source the system for the research community, providing detailed documentation and support to facilitate its adoption in human-robot collaboration research.
VI Conclusion
In this work, we present CoHRT, a system for seamless human-robot collaboration that addresses the critical limitations of existing systems. CoHRT facilitates multi-human-robot teamwork with synchronized coordination and communication through its server-client architecture and integrated modules. Its flexibility allows for diverse collaborative tasks. By enabling efficient collaboration in mentally and physically demanding tasks, CoHRT opens avenues for research into human-robot team dynamics. The system can be leveraged to study the perception of fairness, trust, and safety in human-robot collaborative tasks, ultimately promoting the development of more user-centric collaboration strategies and the widespread adoption of human-robot collaboration systems.
References
- [1] Pupil apriltags documentation. https://pupil-apriltags.readthedocs.io/en/latest/index.html. Accessed on June 26, 2024.
- [2] PyQt5. https://pypi.org/project/PyQt5/. Accessed on June 26, 2024.
- [3] Pantor Engineering AB. Franka Emika FrankaX Documentation. https://pantor.github.io/frankx/. Accessed on June 26, 2024.
- Admoni et al. [2014] Henny Admoni, Anca Dragan, Siddhartha S Srinivasa, and Brian Scassellati. Deliberate delays during robot-to-human handovers improve compliance with gaze communication. In Proceedings of the 2014 ACM/IEEE international conference on Human-robot interaction, pages 49–56, 2014.
- Akalin et al. [2022] Neziha Akalin, Annica Kristoffersson, and Amy Loutfi. Do you feel safe with your robot? factors influencing perceived safety in human-robot interaction based on subjective and objective measures. International Journal of Human-Computer Studies, 158:102744, 2022. ISSN 1071-5819. doi: https://doi.org/10.1016/j.ijhcs.2021.102744. URL https://www.sciencedirect.com/science/article/pii/S1071581921001622.
- Bestmann et al. [2024] Benjamin O. Bestmann, Alex Chow, Alyssa Kubota, and Laurel D. Riek. Garry: The gait rehabilitation robotic system. HRI ’24, page 875–879, New York, NY, USA, 2024. Association for Computing Machinery. ISBN 9798400703225. doi: 10.1145/3610977.3637475. URL https://doi.org/10.1145/3610977.3637475.
- Bouzida et al. [2024] Anya Bouzida, Alyssa Kubota, Dagoberto Cruz-Sandoval, Elizabeth W. Twamley, and Laurel D. Riek. Carmen: A cognitively assistive robot for personalized neurorehabilitation at home. In Proceedings of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, HRI ’24, page 55–64, New York, NY, USA, 2024. Association for Computing Machinery. ISBN 9798400703225. doi: 10.1145/3610977.3634971. URL https://doi.org/10.1145/3610977.3634971.
- Brooke et al. [1996] John Brooke et al. Sus-a quick and dirty usability scale. Usability evaluation in industry, 189(194):4–7, 1996.
- Chang et al. [2020a] Mai Lee Chang, Taylor Kessler Faulkner, Thomas Benjamin Wei, Elaine Schaertl Short, Gokul Anandaraman, and Andrea Lockerd Thomaz. Tasc: Teammate algorithm for shared cooperation. In 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 11229–11236. IEEE, 2020a.
- Chang et al. [2020b] Mai Lee Chang, Zachary Pope, Elaine Schaertl Short, and Andrea Lockerd Thomaz. Defining fairness in human-robot teams. In 2020 29th IEEE International Conference on Robot and Human Interactive Communication (RO-MAN), pages 1251–1258, 2020b. doi: 10.1109/RO-MAN47096.2020.9223594.
- Chang et al. [2020c] Mai Lee Chang, Zachary Pope, Elaine Schaertl Short, and Andrea Lockerd Thomaz. Defining fairness in human-robot teams. In 2020 29th IEEE International Conference on Robot and Human Interactive Communication (RO-MAN), pages 1251–1258. IEEE, 2020c.
- Chang et al. [2021] Mai Lee Chang, Greg Trafton, J Malcolm McCurry, and Andrea Lockerd Thomaz. Unfair! perceptions of fairness in human-robot teams. In 2021 30th IEEE International Conference on Robot & Human Interactive Communication (RO-MAN), pages 905–912. IEEE, 2021.
- Chen et al. [2018] Min Chen, Stefanos Nikolaidis, Harold Soh, David Hsu, and Siddhartha Srinivasa. Planning with trust for human-robot collaboration. In Proceedings of the 2018 ACM/IEEE International Conference on Human-Robot Interaction, pages 307–315, 2018.
- Claure et al. [2020a] Hector Claure, Victoria Ning, Malte F. Jung, and Stefanos Nikolaidis. Ai-mediated team interaction: How an ai agent influences human teammates’ perception and behavior in a cooperative game. In Proceedings of the 2020 ACM/IEEE International Conference on Human-Robot Interaction, pages 401–410, 2020a.
- Claure et al. [2020b] Houston Claure, Yifang Chen, Jignesh Modi, Malte Jung, and Stefanos Nikolaidis. Multi-armed bandits with fairness constraints for distributing resources to human teammates. In Proceedings of the 2020 ACM/IEEE International Conference on Human-Robot Interaction, pages 299–308, 2020b.
- Claure et al. [2023] Houston Claure, Seyun Kim, René F. Kizilcec, and Malte Jung. The social consequences of machine allocation behavior: Fairness, interpersonal perceptions and performance. Computers in Human Behavior, 146:107628, 2023. ISSN 0747-5632. doi: https://doi.org/10.1016/j.chb.2022.107628. URL https://www.sciencedirect.com/science/article/pii/S0747563222004484.
- Claure et al. [2024] Houston Claure, Kate Candon, Olivia Clark, and Marynel Vázquez. Multiplayer space invaders: A platform for studying evolving fairness perceptions in human-robot interaction. In Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, pages 347–350, 2024.
- Gombolay et al. [2017] Matthew Gombolay, Anna Bair, Cindy Huang, and Julie Shah. Computational design of mixed-initiative human–robot teaming that considers human factors: situational awareness, workload, and workflow preferences. The International journal of robotics research, 36(5-7):597–617, 2017.
- Gombolay et al. [2015] Matthew C Gombolay, Reymundo A Gutierrez, Shanelle G Clarke, Giancarlo F Sturla, and Julie A Shah. Coordination of human-robot teaming with human task preferences. In 2015 AAAI Fall Symposium Series, 2015.
- Green et al. [2022] Haley N. Green, Md Mofijul Islam, Shahira Ali, and Tariq Iqbal. Who’s laughing nao? examining perceptions of failure in a humorous robot partner. In 2022 17th ACM/IEEE International Conference on Human-Robot Interaction (HRI), pages 313–322, 2022. doi: 10.1109/HRI53351.2022.9889353.
- Habibian and Losey [2022] Soheil Habibian and Dylan P Losey. Encouraging human interaction with robot teams: Legible and fair subtask allocations. IEEE Robotics and Automation Letters, 7(3):6685–6692, 2022.
- Hayes and Scassellati [2015] Bradley Hayes and Brian Scassellati. Effective robot teammate behaviors for supporting sequential manipulation tasks. In 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 6374–6380. IEEE, 2015.
- Hayes and Scassellati [2016] Bradley Hayes and Brian Scassellati. Autonomously constructing hierarchical task networks for planning and human-robot collaboration. In 2016 IEEE International Conference on Robotics and Automation (ICRA), pages 5469–5476. IEEE, 2016.
- Hinds et al. [2004] Pamela J Hinds, Teresa L Roberts, and Hank Jones. Whose job is it anyway? a study of human-robot interaction in a collaborative task. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pages 183–190, 2004.
- Hoffman [2019a] Guy Hoffman. Evaluating fluency in human–robot collaboration. IEEE Transactions on Human-Machine Systems, 49(3):209–218, 2019a. doi: 10.1109/THMS.2019.2904558.
- Hoffman [2019b] Guy Hoffman. Evaluating fluency in human–robot collaboration. IEEE Transactions on Human-Machine Systems, 49(3):209–218, 2019b.
- Hoffman and Weinberg [2010] Guy Hoffman and Gil Weinberg. Synchronization in human-robot musicianship. In 19th International Symposium in Robot and Human Interactive Communication, pages 718–724. IEEE, 2010.
- Iqbal and Riek [2015] Tariq Iqbal and Laurel D Riek. A method for automatic detection of psychomotor entrainment. IEEE Transactions on Affective Computing, 7(1):3–16, 2015.
- Iqbal and Riek [2017] Tariq Iqbal and Laurel D Riek. Coordination dynamics in multihuman multirobot teams. IEEE Robotics and Automation Letters, 2(3):1712–1717, 2017.
- Iqbal and Riek [2021] Tariq Iqbal and Laurel D. Riek. Temporal anticipation and adaptation methods for fluent human-robot teaming. In 2021 IEEE International Conference on Robotics and Automation (ICRA), pages 3736–3743, 2021. doi: 10.1109/ICRA48506.2021.9561763.
- Iqbal et al. [2015] Tariq Iqbal, Michael J Gonzales, and Laurel D Riek. Joint action perception to enable fluent human-robot teamwork. In 2015 24th IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN), pages 400–406. IEEE, 2015.
- Iqbal et al. [2016] Tariq Iqbal, Samantha Rack, and Laurel D Riek. Movement coordination in human–robot teams: a dynamical systems approach. IEEE Transactions on Robotics, 32(4):909–919, 2016.
- Iqbal et al. [2019] Tariq Iqbal, Shen Li, Christopher Fourie, Bradley Hayes, and Julie A Shah. Fast online segmentation of activities from partial trajectories. In 2019 International Conference on Robotics and Automation (ICRA), pages 5019–5025. IEEE, 2019.
- Islam et al. [2022] Md Mofijul Islam, Reza Mirzaiee, Alexi Gladstone, Haley Green, and Tariq Iqbal. CAESAR: An embodied simulator for generating multimodal referring expression datasets. Advances in Neural Information Processing Systems, 35:21001–21015, 2022.
- Islam et al. [2023a] Md Mofijul Islam, Alexi Gladstone, and Tariq Iqbal. PATRON: Perspective-aware multitask model for referring expression grounding using embodied multimodal cues. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 37, pages 971–979, 2023a.
- Islam et al. [2023b] Md Mofijul Islam, Alexi Gladstone, Riashat Islam, and Tariq Iqbal. EQA-MX: Embodied question answering using multimodal expression. In The Twelfth International Conference on Learning Representations, 2023b.
- Islam et al. [2023c] Md Mofijul Islam, Mohammad Samin Yasar, and Tariq Iqbal. MAVEN: A memory augmented recurrent approach for multimodal fusion. IEEE Transactions on Multimedia, 25:3694–3708, 2023c. doi: 10.1109/TMM.2022.3164261.
- Joosse et al. [2017] M Joosse, S van Waveren, C Zaga, and V Evers. Groups in conflict at the airport: How people think a robot should act. In CSCW’17 workshop on robots in groups and teams, volume 26, 2017.
- Jung et al. [2020a] Malte F. Jung, Dominic Difranzo, Solace Shen, Brett Stoll, Houston Claure, and Austin Lawrence. Robot-assisted tower construction—a method to study the impact of a robot’s allocation behavior on interpersonal dynamics and collaboration in groups. J. Hum.-Robot Interact., 10(1), oct 2020a. doi: 10.1145/3394287. URL https://doi.org/10.1145/3394287.
- Jung et al. [2020b] Malte F Jung, Friederike Eyssel, Selma Šabanović, and Marlena Fraune. Robots in groups and teams: A literature review. In Proceedings of the 2020 ACM/IEEE International Conference on Human-Robot Interaction, pages 11–21, 2020b.
- Kubota et al. [2023] Alyssa Kubota, Rainee Pei, Ethan Sun, Dagoberto Cruz-Sandoval, Soyon Kim, and Laurel D. Riek. Get SMART: Collaborative goal setting with cognitively assistive robots. In Proceedings of the 2023 ACM/IEEE International Conference on Human-Robot Interaction, HRI ’23, pages 44–53, New York, NY, USA, 2023. Association for Computing Machinery. ISBN 9781450399647. URL https://doi.org/10.1145/3568162.3576993.
- Kwon and Jung [2018] Minae Kwon and Malte F Jung. Emotional labor in human-robot interaction. Proceedings of the 2018 ACM/IEEE International Conference on Human-Robot Interaction, pages 161–162, 2018.
- Lee et al. [2023] Hee Rin Lee, Fei Sun, Tariq Iqbal, and Brenda Roberts. Reimagining robots for dementia: From robots for care-receivers/giver to robots for carepartners. In Proceedings of the 2023 ACM/IEEE International Conference on Human-Robot Interaction, pages 475–484, 2023.
- Mailapalli et al. [2022] Tejus Mailapalli, Xi Wang, and Min Kyung Lee. Modeling fairness in human-robot teams. In Proceedings of the 2022 ACM/IEEE International Conference on Human-Robot Interaction, pages 454–459, 2022.
- Microsoft [Accessed 2024] Microsoft. Azure Kinect DK. https://azure.microsoft.com/en-us/products/kinect-dk. Accessed on June 26, 2024.
- Morioka and Sakakibara [2010] Masashiro Morioka and Shinsuke Sakakibara. A new cell production assembly system with human–robot cooperation. CIRP Annals, 59(1):9–12, 2010.
- Nikolaidis et al. [2017] Stefanos Nikolaidis, Yu Xiang Zhu, David Hsu, and Siddhartha Srinivasa. Human-robot mutual adaptation in shared autonomy. In Proceedings of the 2017 ACM/IEEE International Conference on Human-Robot Interaction, pages 294–302, 2017.
- Rahman et al. [2022] Sejuti Rahman, Sujan Sarker, AKM Nadimul Haque, Monisha Mushtary Uttsha, Md Fokhrul Islam, and Swakshar Deb. AI-driven stroke rehabilitation systems and assessment: a systematic review. IEEE Transactions on Neural Systems and Rehabilitation Engineering, 31:192–207, 2022.
- Rubagotti et al. [2022] Matteo Rubagotti, Inara Tusseyeva, Sara Baltabayeva, Danna Summers, and Anara Sandygulova. Perceived safety in physical human–robot interaction—a survey. Robotics and Autonomous Systems, 151:104047, 2022. ISSN 0921-8890. doi: https://doi.org/10.1016/j.robot.2022.104047. URL https://www.sciencedirect.com/science/article/pii/S0921889022000173.
- Samyoun et al. [2022] Sirat Samyoun, Md Mofijul Islam, Tariq Iqbal, and John Stankovic. M3Sense: Affect-agnostic multitask representation learning using multimodal wearable sensors. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., 6(2), Jul. 2022. doi: 10.1145/3534600. URL https://doi.org/10.1145/3534600.
- Sanneman et al. [2021] Lindsay Sanneman, Christopher Fourie, Julie A Shah, et al. The state of industrial robotics: Emerging technologies, challenges, and key research directions. Foundations and Trends® in Robotics, 8(3):225–306, 2021.
- Sarker et al. [2021] Sujan Sarker, Lafifa Jamal, Syeda Faiza Ahmed, and Niloy Irtisam. Robotics and artificial intelligence in healthcare during COVID-19 pandemic: A systematic review. Robotics and Autonomous Systems, 146:103902, 2021.
- Sarker et al. [2024] Sujan Sarker, Md. Tanvir Arafat, Aiman Lameesa, Mahbuba Afrin, Redowan Mahmud, Md. Abdur Razzaque, and Tariq Iqbal. Fold: Fog-dew infrastructure-aided optimal workload distribution for cloud robotic operations. Internet of Things, 26:101185, 2024. ISSN 2542-6605. doi: 10.1016/j.iot.2024.101185. URL https://www.sciencedirect.com/science/article/pii/S2542660524001264.
- Sebo et al. [2020] Sarah Sebo, Brett Stoll, Brian Scassellati, and Malte F Jung. Robots in groups and teams: a literature review. Proceedings of the ACM on Human-Computer Interaction, 4(CSCW2):1–36, 2020.
- Seraj et al. [2024] Esmaeil Seraj, Kin Man Lee, Zulfiqar Zaidi, Qingyu Xiao, Zhaoxin Li, Arthur Nascimento, Sanne van Waveren, Pradyumna Tambwekar, Rohan Paleja, Devleena Das, et al. Interactive and explainable robot learning: A comprehensive review. Foundations and Trends® in Robotics, 12:75–349. Now Publishers, Inc., 2024.
- Shah et al. [2011] Julie Shah, James Wiken, Brian Williams, and Cynthia Breazeal. Improved human-robot team performance using Chaski, a human-inspired plan execution system. In Proceedings of the 6th International Conference on Human-Robot Interaction, pages 29–36, 2011.
- Van Waveren et al. [2023] Sanne Van Waveren, Rasmus Rudling, Iolanda Leite, Patric Jensfelt, and Christian Pek. Increasing perceived safety in motion planning for human-drone interaction. In Proceedings of the 2023 ACM/IEEE International Conference on Human-Robot Interaction, pages 446–455, 2023.
- Wang and Olson [2016] John Wang and Edwin Olson. AprilTag 2: Efficient and robust fiducial detection. In Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), October 2016.
- Yang et al. [2024] Yue Yang, Letian Chen, Zulfiqar Zaidi, Sanne van Waveren, Arjun Krishna, and Matthew Gombolay. Enhancing safety in learning from demonstration algorithms via control barrier function shielding. In Proceedings of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, HRI ’24, pages 820–829, New York, NY, USA, 2024. Association for Computing Machinery. ISBN 9798400703225. doi: 10.1145/3610977.3635002. URL https://doi.org/10.1145/3610977.3635002.
- Yasar and Iqbal [2023] Mohammad Samin Yasar and Tariq Iqbal. VADER: Vector-quantized generative adversarial network for motion prediction. In 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2023.
- Yasar et al. [2023] Mohammad Samin Yasar, Md Mofijul Islam, and Tariq Iqbal. IMPRINT: Interactional dynamics-aware motion prediction in teams using multimodal context. J. Hum.-Robot Interact., Oct. 2023. doi: 10.1145/3626954. URL https://doi.org/10.1145/3626954. Just Accepted.
- Yasar et al. [2024] Mohammad Samin Yasar, Md Mofijul Islam, and Tariq Iqbal. PoseTron: Enabling close-proximity human-robot collaboration through multi-human motion prediction. In Proceedings of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, HRI ’24, pages 830–839, New York, NY, USA, 2024. Association for Computing Machinery. ISBN 9798400703225. doi: 10.1145/3610977.3635006. URL https://doi.org/10.1145/3610977.3635006.
- Zacharaki et al. [2020] Angeliki Zacharaki, Ioannis Kostavelis, Antonios Gasteratos, and Ioannis Dokas. Safety bounds in human robot interaction: A survey. Safety Science, 127:104667, 2020. ISSN 0925-7535. doi: 10.1016/j.ssci.2020.104667. URL https://www.sciencedirect.com/science/article/pii/S0925753520300643.