
Showing 1–15 of 15 results for author: Lawhern, V J

Searching in archive cs.
  1. arXiv:2402.06501  [pdf, other]

    cs.LG cs.AI cs.CL cs.HC

    Scalable Interactive Machine Learning for Future Command and Control

    Authors: Anna Madison, Ellen Novoseller, Vinicius G. Goecks, Benjamin T. Files, Nicholas Waytowich, Alfred Yu, Vernon J. Lawhern, Steven Thurman, Christopher Kelshaw, Kaleb McDowell

    Abstract: Future warfare will require Command and Control (C2) personnel to make decisions at shrinking timescales in complex and potentially ill-defined situations. Given the need for robust decision-making processes and decision-support tools, integration of artificial and human intelligence holds the potential to revolutionize the C2 operations process to ensure adaptability and efficiency in rapidly cha…

    Submitted 28 March, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: Accepted at the NATO Science and Technology Organization Symposium (ICMCIS) organized by the Information Systems Technology (IST) Panel, IST-205-RSY - the ICMCIS, held in Koblenz, Germany, 23-24 April 2024

    ACM Class: I.2.6; I.2.7; J.7

  2. arXiv:2401.10941  [pdf, other]

    cs.HC cs.LG cs.SI

    Crowd-PrefRL: Preference-Based Reward Learning from Crowds

    Authors: David Chhan, Ellen Novoseller, Vernon J. Lawhern

    Abstract: Preference-based reinforcement learning (RL) provides a framework to train agents using human feedback through pairwise preferences over pairs of behaviors, enabling agents to learn desired behaviors when it is difficult to specify a numerical reward function. While this paradigm leverages human feedback, it currently treats the feedback as given by a single human user. Meanwhile, incorporating pr…

    Submitted 17 January, 2024; originally announced January 2024.

  3. arXiv:2307.16348  [pdf, other]

    cs.LG cs.AI cs.RO

    Rating-based Reinforcement Learning

    Authors: Devin White, Mingkang Wu, Ellen Novoseller, Vernon J. Lawhern, Nicholas Waytowich, Yongcan Cao

    Abstract: This paper develops a novel rating-based reinforcement learning approach that uses human ratings to obtain human guidance in reinforcement learning. Different from the existing preference-based and ranking-based reinforcement learning paradigms, based on human relative preferences over sample pairs, the proposed rating-based reinforcement learning approach is based on human evaluation of individua…

    Submitted 29 January, 2024; v1 submitted 30 July, 2023; originally announced July 2023.

    Comments: This is an extended version of the paper "Rating-based Reinforcement Learning" accepted to the 38th Annual AAAI Conference on Artificial Intelligence

  4. arXiv:2304.09050  [pdf]

    q-bio.NC cs.LG stat.ML

    Decoding Neural Activity to Assess Individual Latent State in Ecologically Valid Contexts

    Authors: Stephen M. Gordon, Jonathan R. McDaniel, Kevin W. King, Vernon J. Lawhern, Jonathan Touryan

    Abstract: There exist very few ways to isolate cognitive processes, historically defined via highly controlled laboratory studies, in more ecologically valid contexts. Specifically, it remains unclear as to what extent patterns of neural activity observed under such constraints actually manifest outside the laboratory in a manner that can be used to make an accurate inference about the latent state, associa…

    Submitted 18 April, 2023; originally announced April 2023.

    Journal ref: Journal of Neural Engineering, vol. 20(4), 2023

  5. arXiv:2202.12950  [pdf, other]

    eess.SP cs.AI cs.LG

    2021 BEETL Competition: Advancing Transfer Learning for Subject Independence & Heterogenous EEG Data Sets

    Authors: Xiaoxi Wei, A. Aldo Faisal, Moritz Grosse-Wentrup, Alexandre Gramfort, Sylvain Chevallier, Vinay Jayaram, Camille Jeunet, Stylianos Bakas, Siegfried Ludwig, Konstantinos Barmpas, Mehdi Bahri, Yannis Panagakis, Nikolaos Laskaris, Dimitrios A. Adamos, Stefanos Zafeiriou, William C. Duong, Stephen M. Gordon, Vernon J. Lawhern, Maciej Śliwowski, Vincent Rouanne, Piotr Tempczyk

    Abstract: Transfer learning and meta-learning offer some of the most promising avenues to unlock the scalability of healthcare and consumer technologies driven by biosignal data. This is because current methods cannot generalise well across human subjects' data and handle learning from different heterogeneously collected data sets, thus limiting the scale of training data. On the other side, developments in…

    Submitted 14 February, 2022; originally announced February 2022.

    Comments: PrePrint of the NeurIPS2021 BEETL Competition Submitted to Proceedings of Machine Learning Research (PMLR)

  6. arXiv:2102.13008  [pdf, other]

    cs.LG cs.HC cs.RO

    Imitation Learning with Human Eye Gaze via Multi-Objective Prediction

    Authors: Ravi Kumar Thakur, MD-Nazmus Samin Sunbeam, Vinicius G. Goecks, Ellen Novoseller, Ritwik Bera, Vernon J. Lawhern, Gregory M. Gremillion, John Valasek, Nicholas R. Waytowich

    Abstract: Approaches for teaching learning agents via human demonstrations have been widely studied and successfully applied to multiple domains. However, the majority of imitation learning work utilizes only behavioral information from the demonstrator, i.e. which actions were taken, and ignores other useful information. In particular, eye gaze information can give valuable insight towards where the demons…

    Submitted 22 July, 2023; v1 submitted 25 February, 2021; originally announced February 2021.

    Comments: Paper accepted and selected as an oral presentation at Interactive Learning with Implicit Human Feedback Workshop at ICML 2023

    ACM Class: I.2.6; I.2.9; I.2.10

  7. arXiv:1910.04281  [pdf, other]

    cs.LG cs.AI stat.ML

    Integrating Behavior Cloning and Reinforcement Learning for Improved Performance in Dense and Sparse Reward Environments

    Authors: Vinicius G. Goecks, Gregory M. Gremillion, Vernon J. Lawhern, John Valasek, Nicholas R. Waytowich

    Abstract: This paper investigates how to efficiently transition and update policies, trained initially with demonstrations, using off-policy actor-critic reinforcement learning. It is well-known that techniques based on Learning from Demonstrations, for example behavior cloning, can lead to proficient policies given limited data. However, it is currently unclear how to efficiently update that policy using r…

    Submitted 3 April, 2020; v1 submitted 9 October, 2019; originally announced October 2019.

    Comments: 9 pages, 5 Figures. AAMAS 2020

  8. arXiv:1810.11545  [pdf, other]

    cs.AI cs.HC cs.RO

    Efficiently Combining Human Demonstrations and Interventions for Safe Training of Autonomous Systems in Real-Time

    Authors: Vinicius G. Goecks, Gregory M. Gremillion, Vernon J. Lawhern, John Valasek, Nicholas R. Waytowich

    Abstract: This paper investigates how to utilize different forms of human interaction to safely train autonomous systems in real-time by learning from both human demonstrations and interventions. We implement two components of the Cycle-of-Learning for Autonomous Systems, which is our framework for combining multiple modalities of human interaction. The current effort employs human demonstrations to teach a…

    Submitted 28 November, 2018; v1 submitted 26 October, 2018; originally announced October 2018.

    Comments: 9 pages, 6 figures

  9. arXiv:1808.09572  [pdf, other]

    cs.AI cs.HC cs.RO

    Cycle-of-Learning for Autonomous Systems from Human Interaction

    Authors: Nicholas R. Waytowich, Vinicius G. Goecks, Vernon J. Lawhern

    Abstract: We discuss different types of human-robot interaction paradigms in the context of training end-to-end reinforcement learning algorithms. We provide a taxonomy to categorize the types of human interaction and present our Cycle-of-Learning framework for autonomous systems that combines different human-interaction modalities with reinforcement learning. Two key concepts provided by our Cycle-of-Learn…

    Submitted 9 October, 2018; v1 submitted 28 August, 2018; originally announced August 2018.

    Comments: Presented at AI-HRI AAAI-FSS, 2018 (arXiv:1809.06606)

    Report number: AI-HRI/2018/05

  10. arXiv:1805.04740  [pdf, ps, other]

    cs.LG cs.HC stat.ML

    Agreement Rate Initialized Maximum Likelihood Estimator for Ensemble Classifier Aggregation and Its Application in Brain-Computer Interface

    Authors: Dongrui Wu, Vernon J. Lawhern, Stephen Gordon, Brent J. Lance, Chin-Teng Lin

    Abstract: Ensemble learning is a powerful approach to construct a strong learner from multiple base learners. The most popular way to aggregate an ensemble of classifiers is majority voting, which assigns a sample to the class that most base classifiers vote for. However, improved performance can be obtained by assigning weights to the base classifiers according to their accuracy. This paper proposes an agr…

    Submitted 12 May, 2018; originally announced May 2018.

    Journal ref: IEEE Int'l. Conf. on Systems, Man and Cybernetics, pp. 724-729, Budapest, Hungary, 2016

  11. arXiv:1805.04737  [pdf, ps, other]

    cs.LG cs.HC stat.ML

    Offline EEG-Based Driver Drowsiness Estimation Using Enhanced Batch-Mode Active Learning (EBMAL) for Regression

    Authors: Dongrui Wu, Vernon J. Lawhern, Stephen Gordon, Brent J. Lance, Chin-Teng Lin

    Abstract: There are many important regression problems in real-world brain-computer interface (BCI) applications, e.g., driver drowsiness estimation from EEG signals. This paper considers offline analysis: given a pool of unlabeled EEG epochs recorded during driving, how do we optimally select a small number of them to label so that an accurate regression model can be built from them to label the rest? Acti…

    Submitted 12 May, 2018; originally announced May 2018.

    Journal ref: IEEE Int'l. Conf. on Systems, Man and Cybernetics, pp. 730-736, Budapest, Hungary, 2016

  12. arXiv:1704.08533  [pdf, ps, other]

    cs.HC cs.LG

    EEG-Based User Reaction Time Estimation Using Riemannian Geometry Features

    Authors: Dongrui Wu, Brent J. Lance, Vernon J. Lawhern, Stephen Gordon, Tzyy-Ping Jung, Chin-Teng Lin

    Abstract: Riemannian geometry has been successfully used in many brain-computer interface (BCI) classification problems and demonstrated superior performance. In this paper, for the first time, it is applied to BCI regression problems, an important category of BCI applications. More specifically, we propose a new feature extraction approach for Electroencephalogram (EEG) based BCI regression problems: a spa…

    Submitted 27 April, 2017; originally announced April 2017.

    Comments: arXiv admin note: text overlap with arXiv:1702.02914

    Journal ref: IEEE Trans. on Neural Systems and Rehabilitation Engineering, 25(11), pp. 2157-2168, 2017

  13. Switching EEG Headsets Made Easy: Reducing Offline Calibration Effort Using Active Weighted Adaptation Regularization

    Authors: Dongrui Wu, Vernon J. Lawhern, W. David Hairston, Brent J. Lance

    Abstract: Electroencephalography (EEG) headsets are the most commonly used sensing devices for Brain-Computer Interface. In real-world applications, there are advantages to extrapolating data from one user session to another. However, these advantages are limited if the data arise from different hardware systems, which often vary between application spaces. Currently, this creates a need to recalibrate clas…

    Submitted 9 February, 2017; originally announced February 2017.

    Journal ref: IEEE Trans. on Neural Systems and Rehabilitation Engineering, 24(11), pp. 1125-1137 (2016)

  14. Driver Drowsiness Estimation from EEG Signals Using Online Weighted Adaptation Regularization for Regression (OwARR)

    Authors: Dongrui Wu, Vernon J. Lawhern, Stephen Gordon, Brent J. Lance, Chin-Teng Lin

    Abstract: One big challenge that hinders the transition of brain-computer interfaces (BCIs) from laboratory settings to real-life applications is the availability of high-performance and robust learning algorithms that can effectively handle individual differences, i.e., algorithms that can be applied to a new subject with zero or very little subject-specific calibration data. Transfer learning and domain a…

    Submitted 9 February, 2017; originally announced February 2017.

    Comments: in press

    Journal ref: IEEE Trans. on Fuzzy Systems, 25(6), pp. 1522-1535, 2017

  15. arXiv:1611.08024  [pdf, other]

    cs.LG q-bio.NC stat.ML

    EEGNet: A Compact Convolutional Network for EEG-based Brain-Computer Interfaces

    Authors: Vernon J. Lawhern, Amelia J. Solon, Nicholas R. Waytowich, Stephen M. Gordon, Chou P. Hung, Brent J. Lance

    Abstract: Brain computer interfaces (BCI) enable direct communication with a computer, using neural activity as the control signal. This neural signal is generally chosen from a variety of well-studied electroencephalogram (EEG) signals. For a given BCI paradigm, feature extractors and classifiers are tailored to the distinct characteristics of its expected EEG control signal, limiting its application to th…

    Submitted 15 May, 2018; v1 submitted 23 November, 2016; originally announced November 2016.

    Comments: 30 pages, 10 figures. Added additional feature relevance analyses. Minor change to EEGNet architecture. Source code can be found at https://github.com/vlawhern/arl-eegmodels