Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–20 of 20 results for author: Novoseller, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.07946  [pdf, ps, other

    cs.HC cs.AI cs.CL cs.LG

    Re-Envisioning Command and Control

    Authors: Kaleb McDowell, Ellen Novoseller, Anna Madison, Vinicius G. Goecks, Christopher Kelshaw

    Abstract: Future warfare will require Command and Control (C2) decision-making to occur in more complex, fast-paced, ill-structured, and demanding conditions. C2 will be further complicated by operational challenges such as Denied, Degraded, Intermittent, and Limited (DDIL) communications and the need to account for many data streams, potentially across multiple domains of operation. Yet, current C2 practic… ▽ More

    Submitted 28 March, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: Accepted at the NATO Science and Technology Organization Symposium (ICMCIS) organized by the Information Systems Technology (IST) Panel, IST-205-RSY - the ICMCIS, held in Koblenz, Germany, 23-24 April 2024

    ACM Class: I.2.6; I.2.7; J.7

  2. arXiv:2402.06501  [pdf, other

    cs.LG cs.AI cs.CL cs.HC

    Scalable Interactive Machine Learning for Future Command and Control

    Authors: Anna Madison, Ellen Novoseller, Vinicius G. Goecks, Benjamin T. Files, Nicholas Waytowich, Alfred Yu, Vernon J. Lawhern, Steven Thurman, Christopher Kelshaw, Kaleb McDowell

    Abstract: Future warfare will require Command and Control (C2) personnel to make decisions at shrinking timescales in complex and potentially ill-defined situations. Given the need for robust decision-making processes and decision-support tools, integration of artificial and human intelligence holds the potential to revolutionize the C2 operations process to ensure adaptability and efficiency in rapidly cha… ▽ More

    Submitted 28 March, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: Accepted at the NATO Science and Technology Organization Symposium (ICMCIS) organized by the Information Systems Technology (IST) Panel, IST-205-RSY - the ICMCIS, held in Koblenz, Germany, 23-24 April 2024

    ACM Class: I.2.6; I.2.7; J.7

  3. arXiv:2401.10941  [pdf, other

    cs.HC cs.LG cs.SI

    Crowd-PrefRL: Preference-Based Reward Learning from Crowds

    Authors: David Chhan, Ellen Novoseller, Vernon J. Lawhern

    Abstract: Preference-based reinforcement learning (RL) provides a framework to train agents using human feedback through pairwise preferences over pairs of behaviors, enabling agents to learn desired behaviors when it is difficult to specify a numerical reward function. While this paradigm leverages human feedback, it currently treats the feedback as given by a single human user. Meanwhile, incorporating pr… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

  4. arXiv:2307.16348  [pdf, other

    cs.LG cs.AI cs.RO

    Rating-based Reinforcement Learning

    Authors: Devin White, Mingkang Wu, Ellen Novoseller, Vernon J. Lawhern, Nicholas Waytowich, Yongcan Cao

    Abstract: This paper develops a novel rating-based reinforcement learning approach that uses human ratings to obtain human guidance in reinforcement learning. Different from the existing preference-based and ranking-based reinforcement learning paradigms, based on human relative preferences over sample pairs, the proposed rating-based reinforcement learning approach is based on human evaluation of individua… ▽ More

    Submitted 29 January, 2024; v1 submitted 30 July, 2023; originally announced July 2023.

    Comments: This is an extended version of the paper "Rating-based Reinforcement Learning" accepted to the 38th Annual AAAI Conference on Artificial Intelligence

  5. arXiv:2307.12158  [pdf, other

    cs.LG cs.AI cs.HC

    DIP-RL: Demonstration-Inferred Preference Learning in Minecraft

    Authors: Ellen Novoseller, Vinicius G. Goecks, David Watkins, Josh Miller, Nicholas Waytowich

    Abstract: In machine learning for sequential decision-making, an algorithmic agent learns to interact with an environment while receiving feedback in the form of a reward signal. However, in many unstructured real-world settings, such a reward signal is unknown and humans cannot reliably craft a reward signal that correctly captures desired behavior. To solve tasks in such unstructured and open-ended enviro… ▽ More

    Submitted 22 July, 2023; originally announced July 2023.

    Comments: Paper accepted at The Many Facets of Preference Learning Workshop at the International Conference on Machine Learning (ICML), Honolulu, Hawaii, USA, 2023

    ACM Class: I.2.6; G.3

  6. arXiv:2303.13512  [pdf, other

    cs.AI

    Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition

    Authors: Stephanie Milani, Anssi Kanervisto, Karolis Ramanauskas, Sander Schulhoff, Brandon Houghton, Sharada Mohanty, Byron Galbraith, Ke Chen, Yan Song, Tianze Zhou, Bingquan Yu, He Liu, Kai Guan, Yujing Hu, Tangjie Lv, Federico Malato, Florian Leopold, Amogh Raut, Ville Hautamäki, Andrew Melnik, Shu Ishida, João F. Henriques, Robert Klassert, Walter Laurito, Ellen Novoseller , et al. (5 additional authors not shown)

    Abstract: To facilitate research in the direction of fine-tuning foundation models from human feedback, we held the MineRL BASALT Competition on Fine-Tuning from Human Feedback at NeurIPS 2022. The BASALT challenge asks teams to compete to develop algorithms to solve tasks with hard-to-specify reward functions in Minecraft. Through this competition, we aimed to promote the development of algorithms that use… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

  7. arXiv:2301.04741  [pdf, other

    cs.LG

    Efficient Preference-Based Reinforcement Learning Using Learned Dynamics Models

    Authors: Yi Liu, Gaurav Datta, Ellen Novoseller, Daniel S. Brown

    Abstract: Preference-based reinforcement learning (PbRL) can enable robots to learn to perform tasks based on an individual's preferences without requiring a hand-crafted reward function. However, existing approaches either assume access to a high-fidelity simulator or analytic model or take a model-free approach that requires extensive, possibly unsafe online environment interactions. In this paper, we stu… ▽ More

    Submitted 9 February, 2024; v1 submitted 11 January, 2023; originally announced January 2023.

    Comments: In proceedings of the 2023 IEEE International Conference on Robotics and Automation (ICRA 2023)

  8. arXiv:2207.07813  [pdf, other

    cs.RO cs.AI

    Autonomously Untangling Long Cables

    Authors: Vainavi Viswanath, Kaushik Shivakumar, Justin Kerr, Brijen Thananjeyan, Ellen Novoseller, Jeffrey Ichnowski, Alejandro Escontrela, Michael Laskey, Joseph E. Gonzalez, Ken Goldberg

    Abstract: Cables are ubiquitous in many settings and it is often useful to untangle them. However, cables are prone to self-occlusions and knots, making them difficult to perceive and manipulate. The challenge increases with cable length: long cables require more complex slack management to facilitate observability and reachability. In this paper, we focus on autonomously untangling cables up to 3 meters in… ▽ More

    Submitted 31 July, 2022; v1 submitted 15 July, 2022; originally announced July 2022.

  9. arXiv:2207.00911  [pdf, other

    cs.RO

    Learning Switching Criteria for Sim2Real Transfer of Robotic Fabric Manipulation Policies

    Authors: Satvik Sharma, Ellen Novoseller, Vainavi Viswanath, Zaynah Javed, Rishi Parikh, Ryan Hoque, Ashwin Balakrishna, Daniel S. Brown, Ken Goldberg

    Abstract: Simulation-to-reality transfer has emerged as a popular and highly successful method to train robotic control policies for a wide variety of tasks. However, it is often challenging to determine when policies trained in simulation are ready to be transferred to the physical world. Deploying policies that have been trained with very little simulation data can result in unreliable and dangerous behav… ▽ More

    Submitted 2 July, 2022; originally announced July 2022.

    Comments: CASE 2022. The first two authors contributed equally. 9 pages; 5 figures; 1 table

  10. arXiv:2206.08921  [pdf, other

    cs.RO

    Efficiently Learning Single-Arm Fling Motions to Smooth Garments

    Authors: Lawrence Yunliang Chen, Huang Huang, Ellen Novoseller, Daniel Seita, Jeffrey Ichnowski, Michael Laskey, Richard Cheng, Thomas Kollar, Ken Goldberg

    Abstract: Recent work has shown that 2-arm "fling" motions can be effective for garment smoothing. We consider single-arm fling motions. Unlike 2-arm fling motions, which require little robot trajectory parameter tuning, single-arm fling motions are very sensitive to trajectory parameters. We consider a single 6-DOF robot arm that learns fling trajectories to achieve high garment coverage. Given a garment g… ▽ More

    Submitted 24 September, 2022; v1 submitted 17 June, 2022; originally announced June 2022.

    Comments: Accepted to 2022 International Symposium on Robotics Research (ISRR)

  11. arXiv:2203.04272  [pdf, other

    cs.LG cs.AI stat.ME

    Policy-Based Bayesian Experimental Design for Non-Differentiable Implicit Models

    Authors: Vincent Lim, Ellen Novoseller, Jeffrey Ichnowski, Huang Huang, Ken Goldberg

    Abstract: For applications in healthcare, physics, energy, robotics, and many other fields, designing maximally informative experiments is valuable, particularly when experiments are expensive, time-consuming, or pose safety hazards. While existing approaches can sequentially design experiments based on prior observation history, many of these methods do not extend to implicit models, where simulation is po… ▽ More

    Submitted 8 March, 2022; originally announced March 2022.

    Comments: 15 pages, 3 figures

  12. arXiv:2109.08273  [pdf, other

    cs.RO cs.AI

    ThriftyDAgger: Budget-Aware Novelty and Risk Gating for Interactive Imitation Learning

    Authors: Ryan Hoque, Ashwin Balakrishna, Ellen Novoseller, Albert Wilcox, Daniel S. Brown, Ken Goldberg

    Abstract: Effective robot learning often requires online human feedback and interventions that can cost significant human time, giving rise to the central challenge in interactive imitation learning: is it possible to control the timing and length of interventions to both facilitate learning and limit burden on the human supervisor? This paper presents ThriftyDAgger, an algorithm for actively querying a hum… ▽ More

    Submitted 16 September, 2021; originally announced September 2021.

    Comments: CoRL 2021 Oral

  13. arXiv:2107.08942  [pdf, other

    cs.RO cs.AI cs.LG

    Untangling Dense Non-Planar Knots by Learning Manipulation Features and Recovery Policies

    Authors: Priya Sundaresan, Jennifer Grannen, Brijen Thananjeyan, Ashwin Balakrishna, Jeffrey Ichnowski, Ellen Novoseller, Minho Hwang, Michael Laskey, Joseph E. Gonzalez, Ken Goldberg

    Abstract: Robot manipulation for untangling 1D deformable structures such as ropes, cables, and wires is challenging due to their infinite dimensional configuration space, complex dynamics, and tendency to self-occlude. Analytical controllers often fail in the presence of dense configurations, due to the difficulty of grasping between adjacent cable segments. We present two algorithms that enhance robust ca… ▽ More

    Submitted 29 June, 2021; originally announced July 2021.

  14. arXiv:2106.02252  [pdf, other

    cs.RO cs.AI cs.LG

    Disentangling Dense Multi-Cable Knots

    Authors: Vainavi Viswanath, Jennifer Grannen, Priya Sundaresan, Brijen Thananjeyan, Ashwin Balakrishna, Ellen Novoseller, Jeffrey Ichnowski, Michael Laskey, Joseph E. Gonzalez, Ken Goldberg

    Abstract: Disentangling two or more cables requires many steps to remove crossings between and within cables. We formalize the problem of disentangling multiple cables and present an algorithm, Iterative Reduction Of Non-planar Multiple cAble kNots (IRON-MAN), that outputs robot actions to remove crossings from multi-cable knotted structures. We instantiate this algorithm with a learned perception system, i… ▽ More

    Submitted 4 June, 2021; originally announced June 2021.

    Comments: First three authors contributed equally

  15. arXiv:2104.00053  [pdf, other

    cs.RO cs.AI

    LazyDAgger: Reducing Context Switching in Interactive Imitation Learning

    Authors: Ryan Hoque, Ashwin Balakrishna, Carl Putterman, Michael Luo, Daniel S. Brown, Daniel Seita, Brijen Thananjeyan, Ellen Novoseller, Ken Goldberg

    Abstract: Corrective interventions while a robot is learning to automate a task provide an intuitive method for a human supervisor to assist the robot and convey information about desired behavior. However, these interventions can impose significant burden on a human supervisor, as each intervention interrupts other work the human is doing, incurs latency with each context switch between supervisor and auto… ▽ More

    Submitted 20 July, 2021; v1 submitted 31 March, 2021; originally announced April 2021.

    Comments: IEEE CASE 2021

  16. arXiv:2102.13008  [pdf, other

    cs.LG cs.HC cs.RO

    Imitation Learning with Human Eye Gaze via Multi-Objective Prediction

    Authors: Ravi Kumar Thakur, MD-Nazmus Samin Sunbeam, Vinicius G. Goecks, Ellen Novoseller, Ritwik Bera, Vernon J. Lawhern, Gregory M. Gremillion, John Valasek, Nicholas R. Waytowich

    Abstract: Approaches for teaching learning agents via human demonstrations have been widely studied and successfully applied to multiple domains. However, the majority of imitation learning work utilizes only behavioral information from the demonstrator, i.e. which actions were taken, and ignores other useful information. In particular, eye gaze information can give valuable insight towards where the demons… ▽ More

    Submitted 22 July, 2023; v1 submitted 25 February, 2021; originally announced February 2021.

    Comments: Paper accepted and selected as an oral presentation at Interactive Learning with Implicit Human Feedback Workshop at ICML 2023

    ACM Class: I.2.6; I.2.9; I.2.10

  17. ROIAL: Region of Interest Active Learning for Characterizing Exoskeleton Gait Preference Landscapes

    Authors: Kejun Li, Maegan Tucker, Erdem Bıyık, Ellen Novoseller, Joel W. Burdick, Yanan Sui, Dorsa Sadigh, Yisong Yue, Aaron D. Ames

    Abstract: Characterizing what types of exoskeleton gaits are comfortable for users, and understanding the science of walking more generally, require recovering a user's utility landscape. Learning these landscapes is challenging, as walking trajectories are defined by numerous gait parameters, data collection from human trials is expensive, and user safety and comfort must be ensured. This work proposes the… ▽ More

    Submitted 30 March, 2021; v1 submitted 9 November, 2020; originally announced November 2020.

    Comments: 6 pages + 1 page of references; 7 figures; To Appear at ICRA 2021

  18. arXiv:2003.06495  [pdf, other

    cs.RO cs.HC cs.LG

    Human Preference-Based Learning for High-dimensional Optimization of Exoskeleton Walking Gaits

    Authors: Maegan Tucker, Myra Cheng, Ellen Novoseller, Richard Cheng, Yisong Yue, Joel W. Burdick, Aaron D. Ames

    Abstract: Optimizing lower-body exoskeleton walking gaits for user comfort requires understanding users' preferences over a high-dimensional gait parameter space. However, existing preference-based learning methods have only explored low-dimensional domains due to computational limitations. To learn user preferences in high dimensions, this work presents LineCoSpar, a human-in-the-loop preference-based fram… ▽ More

    Submitted 8 August, 2020; v1 submitted 13 March, 2020; originally announced March 2020.

    Comments: 8 pages, 9 figures, 2 tables, to appear at IROS 2020

  19. arXiv:1909.12316  [pdf, other

    cs.RO

    Preference-Based Learning for Exoskeleton Gait Optimization

    Authors: Maegan Tucker, Ellen Novoseller, Claudia Kann, Yanan Sui, Yisong Yue, Joel Burdick, Aaron D. Ames

    Abstract: This paper presents a personalized gait optimization framework for lower-body exoskeletons. Rather than optimizing numerical objectives such as the mechanical cost of transport, our approach directly learns from user preferences, e.g., for comfort. Building upon work in preference-based interactive learning, we present the CoSpar algorithm. CoSpar prompts the user to give pairwise preferences betw… ▽ More

    Submitted 25 May, 2020; v1 submitted 26 September, 2019; originally announced September 2019.

    Comments: To appear at ICRA 2020

  20. arXiv:1908.01289  [pdf, other

    cs.LG cs.AI stat.ML

    Dueling Posterior Sampling for Preference-Based Reinforcement Learning

    Authors: Ellen R. Novoseller, Yibing Wei, Yanan Sui, Yisong Yue, Joel W. Burdick

    Abstract: In preference-based reinforcement learning (RL), an agent interacts with the environment while receiving preferences instead of absolute feedback. While there is increasing research activity in preference-based RL, the design of formal frameworks that admit tractable theoretical analysis remains an open challenge. Building upon ideas from preference-based bandit learning and posterior sampling in… ▽ More

    Submitted 29 June, 2020; v1 submitted 4 August, 2019; originally announced August 2019.

    Comments: To appear in Conference on Uncertainty in Artificial Intelligence (UAI), 2020. 9 pages before references and appendix; 51 pages total; 7 figures; 4 tables. This replacement incorporates reviewer comments, and in comparison to version 1, extends the theoretical and empirical analyses and adds mathematical detail. Code: https://github.com/ernovoseller/DuelingPosteriorSampling