
Showing 1–50 of 174 results for author: Patel, S

Searching in archive cs.
  1. arXiv:2406.06474  [pdf, other]

    cs.AI cs.CL

    Towards a Personal Health Large Language Model

    Authors: Justin Cosentino, Anastasiya Belyaeva, Xin Liu, Nicholas A. Furlotte, Zhun Yang, Chace Lee, Erik Schenck, Yojan Patel, Jian Cui, Logan Douglas Schneider, Robby Bryant, Ryan G. Gomes, Allen Jiang, Roy Lee, Yun Liu, Javier Perez, Jameson K. Rogers, Cathy Speed, Shyam Tailor, Megan Walker, Jeffrey Yu, Tim Althoff, Conor Heneghan, John Hernandez, Mark Malhotra, et al. (9 additional authors not shown)

    Abstract: In health, most large language model (LLM) research has focused on clinical tasks. However, mobile and wearable devices, which are rarely integrated into such tasks, provide rich, longitudinal data for personal health monitoring. Here we present Personal Health Large Language Model (PH-LLM), fine-tuned from Gemini for understanding and reasoning over numerical time-series personal health data. We…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 72 pages

  2. arXiv:2406.06464  [pdf, other]

    cs.AI cs.CL

    Transforming Wearable Data into Health Insights using Large Language Model Agents

    Authors: Mike A. Merrill, Akshay Paruchuri, Naghmeh Rezaei, Geza Kovacs, Javier Perez, Yun Liu, Erik Schenck, Nova Hammerquist, Jake Sunshine, Shyam Tailor, Kumar Ayush, Hao-Wei Su, Qian He, Cory Y. McLean, Mark Malhotra, Shwetak Patel, Jiening Zhan, Tim Althoff, Daniel McDuff, Xin Liu

    Abstract: Despite the proliferation of wearable health trackers and the importance of sleep and exercise to health, deriving actionable personalized insights from wearable data remains a challenge because doing so requires non-trivial open-ended analysis of these data. The recent rise of large language model (LLM) agents, which can use tools to reason about and interact with the world, presents a promising…

    Submitted 11 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

    Comments: 38 pages

  3. arXiv:2406.02554  [pdf, other]

    eess.AS cs.AI cs.CL cs.CV cs.LG cs.MM

    Hear Me, See Me, Understand Me: Audio-Visual Autism Behavior Recognition

    Authors: Shijian Deng, Erin E. Kosloski, Siddhi Patel, Zeke A. Barnett, Yiyang Nan, Alexander Kaplan, Sisira Aarukapalli, William T. Doan, Matthew Wang, Harsh Singh, Pamela R. Rollins, Yapeng Tian

    Abstract: In this article, we introduce a novel problem of audio-visual autism behavior recognition, which includes social behavior recognition, an essential aspect previously omitted in AI-assisted autism screening research. We define the task at hand as one that is audio-visual autism behavior recognition, which uses audio and visual cues, including any speech present in the audio, to recognize autism-rel…

    Submitted 22 March, 2024; originally announced June 2024.

  4. arXiv:2406.01915  [pdf, other]

    cs.RO cs.HC

    Enhancing Human-Robot Collaborative Assembly in Manufacturing Systems Using Large Language Models

    Authors: Jonghan Lim, Sujani Patel, Alex Evans, John Pimley, Yifei Li, Ilya Kovalenko

    Abstract: The development of human-robot collaboration has the ability to improve manufacturing system performance by leveraging the unique strengths of both humans and robots. On the shop floor, human operators contribute with their adaptability and flexibility in dynamic situations, while robots provide precision and the ability to perform repetitive tasks. However, the communication gap between human ope…

    Submitted 21 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

  5. arXiv:2406.00010  [pdf, other]

    cs.IR cs.CL

    EnterpriseEM: Fine-tuned Embeddings for Enterprise Semantic Search

    Authors: Kamalkumar Rathinasamy, Jayarama Nettar, Amit Kumar, Vishal Manchanda, Arun Vijayakumar, Ayush Kataria, Venkateshprasanna Manjunath, Chidambaram GS, Jaskirat Singh Sodhi, Shoeb Shaikh, Wasim Akhtar Khan, Prashant Singh, Tanishq Dattatray Ige, Vipin Tiwari, Rajab Ali Mondal, Harshini K, S Reka, Chetana Amancharla, Faiz ur Rahman, Harikrishnan P A, Indraneel Saha, Bhavya Tiwary, Navin Shankar Patel, Pradeep T S, Balaji A J, et al. (2 additional authors not shown)

    Abstract: Enterprises grapple with the significant challenge of managing proprietary unstructured data, hindering efficient information retrieval. This has led to the emergence of AI-driven information retrieval solutions, designed to adeptly extract relevant insights to address employee inquiries. These solutions often leverage pre-trained embedding models and generative models as foundational components.…

    Submitted 18 May, 2024; originally announced June 2024.

    ACM Class: I.2.7

  6. arXiv:2405.19338  [pdf, other]

    eess.SP cs.AI cs.CV

    Accurate Patient Alignment without Unnecessary Imaging Dose via Synthesizing Patient-specific 3D CT Images from 2D kV Images

    Authors: Yuzhen Ding, Jason M. Holmes, Hongying Feng, Baoxin Li, Lisa A. McGee, Jean-Claude M. Rwigema, Sujay A. Vora, Daniel J. Ma, Robert L. Foote, Samir H. Patel, Wei Liu

    Abstract: In radiotherapy, 2D orthogonally projected kV images are used for patient alignment when 3D-on-board imaging (OBI) is unavailable. But tumor visibility is constrained due to the projection of the patient's anatomy onto a 2D plane, potentially leading to substantial setup errors. In treatment rooms with 3D-OBI such as cone beam CT (CBCT), the field of view (FOV) of CBCT is limited with unnecessarily high imag…

    Submitted 1 April, 2024; originally announced May 2024.

    Comments: 17 pages, 8 figures and tables

  7. arXiv:2405.04657  [pdf, other]

    cs.LG cs.AI q-bio.BM

    ACEGEN: Reinforcement learning of generative chemical agents for drug discovery

    Authors: Albert Bou, Morgan Thomas, Sebastian Dittert, Carles Navarro Ramírez, Maciej Majewski, Ye Wang, Shivam Patel, Gary Tresadern, Mazen Ahmad, Vincent Moens, Woody Sherman, Simone Sciabola, Gianni De Fabritiis

    Abstract: In recent years, reinforcement learning (RL) has emerged as a valuable tool in drug design, offering the potential to propose and optimize molecules with desired properties. However, striking a balance between capabilities, flexibility, reliability, and efficiency remains challenging due to the complexity of advanced RL algorithms and the significant reliance on specialized code. In this work, we…

    Submitted 3 June, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

  8. arXiv:2404.18867  [pdf, other]

    cs.HC

    Feminist Interaction Techniques: Deterring Non-Consensual Screenshots with Interaction Techniques

    Authors: Li Qiwei, Francesca Lameiro, Shefali Patel, Cristi Isaula-Reyes, Eytan Adar, Eric Gilbert, Sarita Schoenebeck

    Abstract: Non-consensual Intimate Media (NCIM) refers to the distribution of sexual or intimate content without consent. NCIM is common and causes significant emotional, financial, and reputational harm. We developed Hands-Off, an interaction technique for messaging applications that deters non-consensual screenshots. Hands-Off requires recipients to perform a hand gesture in the air, above the device, to u…

    Submitted 29 April, 2024; originally announced April 2024.

  9. arXiv:2404.17347  [pdf, other]

    cs.SE cs.HC

    InspectorRAGet: An Introspection Platform for RAG Evaluation

    Authors: Kshitij Fadnis, Siva Sankalp Patel, Odellia Boni, Yannis Katsis, Sara Rosenthal, Benjamin Sznajder, Marina Danilevsky

    Abstract: Large Language Models (LLM) have become a popular approach for implementing Retrieval Augmented Generation (RAG) systems, and a significant amount of effort has been spent on building good models and metrics. In spite of increased recognition of the need for rigorous evaluation of RAG systems, few tools exist that go beyond the creation of model output and automatic calculation. We present Inspect…

    Submitted 26 April, 2024; originally announced April 2024.

  10. arXiv:2404.13502  [pdf, other]

    cs.DS

    Optimal Non-Adaptive Tolerant Junta Testing via Local Estimators

    Authors: Shivam Nadimpalli, Shyamal Patel

    Abstract: We give a non-adaptive algorithm that makes $2^{\tilde{O}(\sqrt{k\log(1/\varepsilon_2 - \varepsilon_1)})}$ queries to a Boolean function $f:\{\pm 1\}^n \rightarrow \{\pm 1\}$ and distinguishes between $f$ being $\varepsilon_1$-close to some $k$-junta versus $\varepsilon_2$-far from every $k$-junta. At the heart of our algorithm is a local mean estimation procedure for Boolean functions that may be…

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: To appear in STOC 2024

  11. arXiv:2404.11103  [pdf, ps, other]

    cs.DS

    Distribution-Free Testing of Decision Lists with a Sublinear Number of Queries

    Authors: Xi Chen, Yumou Fei, Shyamal Patel

    Abstract: We give a distribution-free testing algorithm for decision lists with $\tilde{O}(n^{11/12}/\varepsilon^3)$ queries. This is the first sublinear algorithm for this problem, which shows that, unlike halfspaces, testing is strictly easier than learning for decision lists. Complementing the algorithm, we show that any distribution-free tester for decision lists must make $\tilde{\Omega}(\sqrt{n})$ queries, o…

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: To appear in STOC 2024

  12. arXiv:2404.04729  [pdf, other]

    cs.CR

    Towards a low carbon proof-of-work blockchain

    Authors: Agron Gemajli, Shivam Patel, Phillip G. Bradford

    Abstract: Proof of Work (PoW) blockchains burn a lot of energy. Proof-of-work algorithms are expensive by design and often only serve to compute blockchains. In some sense, carbon-based and non-carbon based regional electric power is fungible. So the total carbon and non-carbon electric power mix plays a role. Thus, generally PoW algorithms have large CO$_2$ footprints solely for computing blockchains. A pr…

    Submitted 6 April, 2024; originally announced April 2024.

  13. arXiv:2404.04525  [pdf, other]

    cs.CL cs.AI cs.LG

    IITK at SemEval-2024 Task 10: Who is the speaker? Improving Emotion Recognition and Flip Reasoning in Conversations via Speaker Embeddings

    Authors: Shubham Patel, Divyaksh Shukla, Ashutosh Modi

    Abstract: This paper presents our approach for the SemEval-2024 Task 10: Emotion Discovery and Reasoning its Flip in Conversations. For the Emotion Recognition in Conversations (ERC) task, we utilize a masked-memory network along with speaker participation. We propose a transformer-based speaker-centric model for the Emotion Flip Reasoning (EFR) task. We also introduce Probable Trigger Zone, a region of the…

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: Accepted at SemEval 2024, NAACL 2024; 10 Pages

  14. arXiv:2404.03130  [pdf, other]

    cs.HC

    Biodegradable Interactive Materials

    Authors: Zhihan Zhang, Mallory Parker, Kuotian Liao, Jerry Cao, Anandghan Waghmare, Joseph Breda, Chris Matsumura, Serena Eley, Eleftheria Roumeli, Shwetak Patel, Vikram Iyer

    Abstract: The sense of touch is fundamental to how we interact with the physical and digital world. Conventional interactive surfaces and tactile interfaces use electronic sensors embedded into objects, however this approach poses serious challenges both for environmental sustainability and a future of truly ubiquitous interaction systems where information is encoded into everyday objects. In this work, we…

    Submitted 3 April, 2024; originally announced April 2024.

  15. arXiv:2404.01904  [pdf, ps, other]

    cs.IT

    Construction of quantum codes from $(\gamma,\Delta)$-cyclic codes

    Authors: Om Prakash, Shikha Patel, Habibul Islam

    Abstract: Let $\mathbb{F}_q$ be the finite field of $q=p^m$ elements where $p$ is a prime and $m$ is a positive integer. This paper considers $(\gamma,\Delta)$-cyclic codes over a class of finite commutative non-chain rings $\mathscr{R}_{q,s}=\mathbb{F}_q[v_1,v_2,\dots,v_s]/\langle v_i-v_i^2,v_iv_j=v_jv_i=0\rangle$ where $\gamma$ is an automorphism of $\mathscr{R}_{q,s}$, $\Delta$ is a $\gamma$-derivation of $\mathscr{R}_{q,s}$ and…

    Submitted 2 April, 2024; originally announced April 2024.

    MSC Class: 12Y05; 16Z05; 94B05; 94B35; 94B15

  16. arXiv:2403.09810  [pdf, other]

    cs.HC cs.AI cs.LG

    LabelAId: Just-in-time AI Interventions for Improving Human Labeling Quality and Domain Knowledge in Crowdsourcing Systems

    Authors: Chu Li, Zhihan Zhang, Michael Saugstad, Esteban Safranchik, Minchu Kulkarni, Xiaoyu Huang, Shwetak Patel, Vikram Iyer, Tim Althoff, Jon E. Froehlich

    Abstract: Crowdsourcing platforms have transformed distributed problem-solving, yet quality control remains a persistent challenge. Traditional quality control measures, such as prescreening workers and refining instructions, often focus solely on optimizing economic output. This paper explores just-in-time AI interventions to enhance both labeling quality and domain-specific knowledge among crowdworkers. W…

    Submitted 14 March, 2024; originally announced March 2024.

  17. arXiv:2403.07410  [pdf, ps, other]

    cs.DC

    Polylog-Competitive Deterministic Local Routing and Scheduling

    Authors: Bernhard Haeupler, Shyamal Patel, Antti Roeyskoe, Cliff Stein, Goran Zuzic

    Abstract: This paper addresses point-to-point packet routing in undirected networks, which is the most important communication primitive in most networks. The main result proves the existence of routing tables that guarantee a polylog-competitive completion-time $\textbf{deterministically}$: in any undirected network, it is possible to give each node simple stateless deterministic local forwarding rules, su…

    Submitted 13 March, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: To appear at STOC 2024

  18. arXiv:2403.02522  [pdf, other]

    cs.LG cs.AI

    HeAR -- Health Acoustic Representations

    Authors: Sebastien Baur, Zaid Nabulsi, Wei-Hung Weng, Jake Garrison, Louis Blankemeier, Sam Fishman, Christina Chen, Sujay Kakarmath, Minyoi Maimbolwa, Nsala Sanjase, Brian Shuma, Yossi Matias, Greg S. Corrado, Shwetak Patel, Shravya Shetty, Shruthi Prabhakara, Monde Muyoyeta, Diego Ardila

    Abstract: Health acoustic sounds such as coughs and breaths are known to contain useful health signals with significant potential for monitoring health and disease, yet are underexplored in the medical machine learning community. The existing deep learning systems for health acoustics are often narrowly trained and evaluated on a single task, which is limited by data and may hinder generalization to other t…

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: 4 tables, 4 figures, 6 supplementary tables, 3 supplementary figures

  19. arXiv:2402.14815  [pdf]

    cs.CY cs.AI cs.CV cs.LG

    Demographic Bias of Expert-Level Vision-Language Foundation Models in Medical Imaging

    Authors: Yuzhe Yang, Yujia Liu, Xin Liu, Avanti Gulhane, Domenico Mastrodicasa, Wei Wu, Edward J Wang, Dushyant W Sahani, Shwetak Patel

    Abstract: Advances in artificial intelligence (AI) have achieved expert-level performance in medical imaging applications. Notably, self-supervised vision-language foundation models can detect a broad spectrum of pathologies without relying on explicit training annotations. However, it is crucial to ensure that these AI models do not mirror or amplify human biases, thereby disadvantaging historically margin…

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: Code and data are available at https://github.com/YyzHarry/vlm-fairness

  20. arXiv:2402.08832  [pdf, other]

    cs.LG cs.AI cs.CY

    Intelligent Agricultural Management Considering N$_2$O Emission and Climate Variability with Uncertainties

    Authors: Zhaoan Wang, Shaoping Xiao, Jun Wang, Ashwin Parab, Shivam Patel

    Abstract: This study examines how artificial intelligence (AI), especially Reinforcement Learning (RL), can be used in farming to boost crop yields, fine-tune nitrogen use and watering, and reduce nitrate runoff and greenhouse gases, focusing on Nitrous Oxide (N$_2$O) emissions from soil. Facing climate change and limited agricultural knowledge, we use Partially Observable Markov Decision Processes (POMDPs)…

    Submitted 13 February, 2024; originally announced February 2024.

  21. arXiv:2402.00076  [pdf, ps, other]

    cs.AI

    Exploitation Strategies in Conditional Markov Chain Search: A case study on the three-index assignment problem

    Authors: Sahil Patel, Daniel Karapetyan

    Abstract: The Conditional Markov Chain Search (CMCS) is a framework for automated design of metaheuristics for discrete combinatorial optimisation problems. Given a set of algorithmic components such as hill climbers and mutations, CMCS decides in which order to apply those components. The decisions are dictated by the CMCS configuration that can be learnt offline. CMCS does not have an acceptance criterion…

    Submitted 30 January, 2024; originally announced February 2024.

    Comments: 14 pages

  22. arXiv:2401.07889  [pdf]

    cs.LG cs.AI eess.SP

    Machine Learning Techniques to Identify Hand Gestures amidst Forearm Muscle Signals

    Authors: Ryan Cho, Sunil Patel, Kyu Taek Cho, Jaejin Hwang

    Abstract: This study investigated the use of forearm EMG data for distinguishing eight hand gestures, employing the Neural Network and Random Forest algorithms on data from ten participants. The Neural Network achieved 97 percent accuracy with 1000-millisecond windows, while the Random Forest achieved 85 percent accuracy with 200-millisecond windows. Larger window sizes improved gesture classification due t…

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: 21 pages, 7 figures

  23. Scaling Computational Fluid Dynamics: In Situ Visualization of NekRS using SENSEI

    Authors: Victor A. Mateevitsi, Mathis Bode, Nicola Ferrier, Paul Fischer, Jens Henrik Göbbert, Joseph A. Insley, Yu-Hsiang Lan, Misun Min, Michael E. Papka, Saumil Patel, Silvio Rizzi, Jonathan Windgassen

    Abstract: In the realm of Computational Fluid Dynamics (CFD), the demand for memory and computation resources is extreme, necessitating the use of leadership-scale computing platforms for practical domain sizes. This intensive requirement renders traditional checkpointing methods ineffective due to the significant slowdown in simulations while saving state data to disk. As we progress towards exascale and G…

    Submitted 18 December, 2023; v1 submitted 15 December, 2023; originally announced December 2023.

  24. arXiv:2312.07627  [pdf]

    cs.CV cs.LG cs.SI

    Multimodal Sentiment Analysis: Perceived vs Induced Sentiments

    Authors: Aditi Aggarwal, Deepika Varshney, Saurabh Patel

    Abstract: Social media has created a global network where people can easily access and exchange vast information. This information gives rise to a variety of opinions, reflecting both positive and negative viewpoints. GIFs stand out as a multimedia format offering a visually engaging way for users to communicate. In this research, we propose a multimodal framework that integrates visual and textual features…

    Submitted 12 December, 2023; originally announced December 2023.

  25. arXiv:2312.06652  [pdf, other]

    cs.AI cs.CL

    Building Domain-Specific LLMs Faithful To The Islamic Worldview: Mirage or Technical Possibility?

    Authors: Shabaz Patel, Hassan Kane, Rayhan Patel

    Abstract: Large Language Models (LLMs) have demonstrated remarkable performance across numerous natural language understanding use cases. However, this impressive performance comes with inherent limitations, such as the tendency to perpetuate stereotypical biases or fabricate non-existent facts. In the context of Islam and its representation, accurate and factual representation of its beliefs and teachings…

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: Accepted for Muslims in ML workshop at the 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  26. arXiv:2312.03259  [pdf, other]

    cs.LG

    f-FERM: A Scalable Framework for Robust Fair Empirical Risk Minimization

    Authors: Sina Baharlouei, Shivam Patel, Meisam Razaviyayn

    Abstract: Training and deploying machine learning models that meet fairness criteria for protected groups are fundamental in modern artificial intelligence. While numerous constraints and regularization terms have been proposed in the literature to promote fairness in machine learning tasks, most of these methods are not amenable to stochastic optimization due to the complex and nonlinear structure of const…

    Submitted 7 April, 2024; v1 submitted 5 December, 2023; originally announced December 2023.

    Comments: 24 pages, 5 figures

    Journal ref: ICLR 2024

  27. arXiv:2312.00164  [pdf, other]

    cs.CY cs.AI

    Towards Accurate Differential Diagnosis with Large Language Models

    Authors: Daniel McDuff, Mike Schaekermann, Tao Tu, Anil Palepu, Amy Wang, Jake Garrison, Karan Singhal, Yash Sharma, Shekoofeh Azizi, Kavita Kulkarni, Le Hou, Yong Cheng, Yun Liu, S Sara Mahdavi, Sushant Prakash, Anupam Pathak, Christopher Semturs, Shwetak Patel, Dale R Webster, Ewa Dominowska, Juraj Gottweis, Joelle Barral, Katherine Chou, Greg S Corrado, Yossi Matias, et al. (3 additional authors not shown)

    Abstract: An accurate differential diagnosis (DDx) is a cornerstone of medical care, often reached through an iterative process of interpretation that combines clinical history, physical examination, investigations and procedures. Interactive interfaces powered by Large Language Models (LLMs) present new opportunities to both assist and automate aspects of this process. In this study, we introduce an LLM op…

    Submitted 30 November, 2023; originally announced December 2023.

  28. arXiv:2311.17133  [pdf, other]

    cs.LG cs.AI

    Deployment of a Robust and Explainable Mortality Prediction Model: The COVID-19 Pandemic and Beyond

    Authors: Jacob R. Epifano, Stephen Glass, Ravi P. Ramachandran, Sharad Patel, Aaron J. Masino, Ghulam Rasool

    Abstract: This study investigated the performance, explainability, and robustness of deployed artificial intelligence (AI) models in predicting mortality during the COVID-19 pandemic and beyond. The first study of its kind, we found that Bayesian Neural Networks (BNNs) and intelligent training techniques allowed our models to maintain performance amidst significant data shifts. Our results emphasize the imp…

    Submitted 28 November, 2023; originally announced November 2023.

  29. arXiv:2311.16213  [pdf, other]

    eess.IV cs.CV cs.LG

    Seeing Beyond Cancer: Multi-Institutional Validation of Object Localization and 3D Semantic Segmentation using Deep Learning for Breast MRI

    Authors: Arda Pekis, Vignesh Kannan, Evandros Kaklamanos, Anu Antony, Snehal Patel, Tyler Earnest

    Abstract: The clinical management of breast cancer depends on an accurate understanding of the tumor and its anatomical context to adjacent tissues and landmark structures. This context may be provided by semantic segmentation methods; however, previous works have been largely limited to a singular focus on the tumor alone and rarely other tissue types. In contrast, we present a method that exploits tissue-…

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: 9 pages, 2 figures, to appear in SPIE: Medical Imaging 2024

    ACM Class: I.4.6; J.3

  30. From Classification to Clinical Insights: Towards Analyzing and Reasoning About Mobile and Behavioral Health Data With Large Language Models

    Authors: Zachary Englhardt, Chengqian Ma, Margaret E. Morris, Xuhai "Orson" Xu, Chun-Cheng Chang, Lianhui Qin, Daniel McDuff, Xin Liu, Shwetak Patel, Vikram Iyer

    Abstract: Passively collected behavioral health data from ubiquitous sensors holds significant promise to provide mental health professionals insights from patients' daily lives; however, developing analysis tools to use this data in clinical practice requires addressing challenges of generalization across devices and weak or ambiguous correlations between the measured signals and an individual's mental hea…

    Submitted 25 November, 2023; v1 submitted 21 November, 2023; originally announced November 2023.

  31. arXiv:2311.09611  [pdf, other]

    cs.HC

    DeltaLCA: Comparative Life-Cycle Assessment for Electronics Design

    Authors: Zhihan Zhang, Felix Hähnlein, Yuxuan Mei, Zachary Englhardt, Shwetak Patel, Adriana Schulz, Vikram Iyer

    Abstract: Reducing the environmental footprint of electronics and computing devices requires new tools that empower designers to make informed decisions about sustainability during the design process itself. This is not possible with current tools for life cycle assessment (LCA) which require substantial domain expertise and time to evaluate the numerous chips and other components that make up a device. We…

    Submitted 16 November, 2023; originally announced November 2023.

  32. arXiv:2311.05371  [pdf, other]

    cs.CV cs.AI

    Training Robust Deep Physiological Measurement Models with Synthetic Video-based Data

    Authors: Yuxuan Ou, Yuzhe Zhang, Yuntang Wang, Shwetak Patel, Daniel McDuff, Yuzhe Yang, Xin Liu

    Abstract: Recent advances in supervised deep learning techniques have demonstrated the possibility to remotely measure human physiological vital signs (e.g., photoplethysmograph, heart rate) just from facial videos. However, the performance of these methods heavily relies on the availability and diversity of real labeled data. Yet, collecting large-scale real-world data with high-quality labels is typically…

    Submitted 15 November, 2023; v1 submitted 9 November, 2023; originally announced November 2023.

  33. Indoor Localization for an Autonomous Model Car: A Marker-Based Multi-Sensor Fusion Framework

    Authors: Xibo Li, Shruti Patel, David Stronzek-Pfeifer, Christof Büskens

    Abstract: Global navigation satellite systems readily provide accurate position information when localizing a robot outdoors. However, an analogous standard solution does not exist yet for mobile robots operating indoors. This paper presents an integrated framework for indoor localization and experimental validation of an autonomous driving system based on an advanced driver-assistance system (ADAS) model c…

    Submitted 8 October, 2023; originally announced October 2023.

  34. arXiv:2309.10160  [pdf, other]

    physics.med-ph cs.AI

    RadOnc-GPT: A Large Language Model for Radiation Oncology

    Authors: Zhengliang Liu, Peilong Wang, Yiwei Li, Jason Holmes, Peng Shu, Lian Zhang, Chenbin Liu, Ninghao Liu, Dajiang Zhu, Xiang Li, Quanzheng Li, Samir H. Patel, Terence T. Sio, Tianming Liu, Wei Liu

    Abstract: This paper presents RadOnc-GPT, a large language model specialized for radiation oncology through advanced tuning methods. RadOnc-GPT was finetuned on a large dataset of radiation oncology patient records from the Mayo Clinic in Arizona. The model employs instruction tuning on three key tasks - generating radiotherapy treatment regimens, determining optimal radiation modalities, and providing diag…

    Submitted 5 November, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

  35. arXiv:2308.14583  [pdf, other]

    cs.CV

    Learning to Read Analog Gauges from Synthetic Data

    Authors: Juan Leon-Alcazar, Yazeed Alnumay, Cheng Zheng, Hassane Trigui, Sahejad Patel, Bernard Ghanem

    Abstract: Manually reading and logging gauge data is time inefficient, and the effort increases according to the number of gauges available. We present a computer vision pipeline that automates the reading of analog gauges. We propose a two-stage CNN pipeline that identifies the key structural components of an analog gauge and outputs an angular reading. To facilitate the training of our approach, a synthet…

    Submitted 28 August, 2023; originally announced August 2023.

    Journal ref: Winter Conference on Applications of Computer Vision 2024

  36. arXiv:2308.14089  [pdf, other]

    cs.CL cs.AI cs.LG

    MedAlign: A Clinician-Generated Dataset for Instruction Following with Electronic Medical Records

    Authors: Scott L. Fleming, Alejandro Lozano, William J. Haberkorn, Jenelle A. Jindal, Eduardo P. Reis, Rahul Thapa, Louis Blankemeier, Julian Z. Genkins, Ethan Steinberg, Ashwin Nayak, Birju S. Patel, Chia-Chun Chiang, Alison Callahan, Zepeng Huo, Sergios Gatidis, Scott J. Adams, Oluseyi Fayanju, Shreya J. Shah, Thomas Savage, Ethan Goh, Akshay S. Chaudhari, Nima Aghaeepour, Christopher Sharp, Michael A. Pfeffer, Percy Liang, et al. (5 additional authors not shown)

    Abstract: The ability of large language models (LLMs) to follow natural language instructions with human-level fluency suggests many opportunities in healthcare to reduce administrative burden and improve quality of care. However, evaluating LLMs on realistic text generation tasks for healthcare remains challenging. Existing question answering datasets for electronic health record (EHR) data fail to capture…

    Submitted 24 December, 2023; v1 submitted 27 August, 2023; originally announced August 2023.

  37. arXiv:2308.05950  [pdf, other]

    cs.DC cs.CR

    Blockchain-Based Transferable Digital Rights of Land

    Authors: Ras Dwivedi, Sumit Patel, Prof. Sandeep Shukla

    Abstract: Land, being a scarce and valuable resource, is in high demand, especially in densely populated areas of older cities. Development authorities require land for infrastructure projects and other amenities, while landowners hold onto their land for both its usage and its financial value. Transferable Development Rights (TDRs) serve as a mechanism to separate the development rights associated with the…

    Submitted 11 August, 2023; originally announced August 2023.

    Comments: 5 pages, Paper presented in https://easychair.org/cfp/ICSF2023

  38. arXiv:2307.07529  [pdf, other]

    cs.LG cs.AI cs.MA

    Learning Multiple Coordinated Agents under Directed Acyclic Graph Constraints

    Authors: Jaeyeon Jang, Diego Klabjan, Han Liu, Nital S. Patel, Xiuqi Li, Balakrishnan Ananthanarayanan, Husam Dauod, Tzung-Han Juang

    Abstract: This paper proposes a novel multi-agent reinforcement learning (MARL) method to learn multiple coordinated agents under directed acyclic graph (DAG) constraints. Unlike existing MARL approaches, our method explicitly exploits the DAG structure between agents to achieve more effective learning performance. Theoretically, we propose a novel surrogate value function based on a MARL model with synthet…

    Submitted 13 July, 2023; originally announced July 2023.

  39. arXiv:2307.03817  [pdf, other]

    cs.SE cs.AI

    Exploring and Characterizing Large Language Models For Embedded System Development and Debugging

    Authors: Zachary Englhardt, Richard Li, Dilini Nissanka, Zhihan Zhang, Girish Narayanswamy, Joseph Breda, Xin Liu, Shwetak Patel, Vikram Iyer

    Abstract: Large language models (LLMs) have shown remarkable abilities to generate code, however their ability to develop software for embedded systems, which requires cross-domain knowledge of hardware and software has not been studied. In this paper we develop an extensible, open source hardware-in-the-loop framework to systematically evaluate leading LLMs (GPT-3.5, GPT-4, PaLM 2) to assess their capabili…

    Submitted 21 November, 2023; v1 submitted 7 July, 2023; originally announced July 2023.

  40. arXiv:2306.03331  [pdf, ps, other]

    cs.CV cs.LG

    A Robust Likelihood Model for Novelty Detection

    Authors: Ranya Almohsen, Shivang Patel, Donald A. Adjeroh, Gianfranco Doretto

    Abstract: Current approaches to novelty or anomaly detection are based on deep neural networks. Despite their effectiveness, neural networks are also vulnerable to imperceptible deformations of the input data. This is a serious issue in critical applications, or when data alterations are generated by an adversarial attack. While this is a known problem that has been studied in recent years for the case of s…

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: CVPR Workshop on Computer Vision in the Wild, 2023

  41. arXiv:2306.01938  [pdf, other]

    cs.CV cs.RO

    Self-supervised Interest Point Detection and Description for Fisheye and Perspective Images

    Authors: Marcela Mera-Trujillo, Shivang Patel, Yu Gu, Gianfranco Doretto

    Abstract: Keypoint detection and matching is a fundamental task in many computer vision problems, from shape reconstruction, to structure from motion, to AR/VR applications and robotics. It is a well-studied problem with remarkable successes such as SIFT, and more recent deep learning approaches. While great robustness is exhibited by these techniques with respect to noise, illumination variation, and rigid…

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: CVPR Workshop on Omnidirectional Computer Vision, 2023

  42. arXiv:2305.15525  [pdf, other]

    cs.CL cs.LG

    Large Language Models are Few-Shot Health Learners

    Authors: Xin Liu, Daniel McDuff, Geza Kovacs, Isaac Galatzer-Levy, Jacob Sunshine, Jiening Zhan, Ming-Zher Poh, Shun Liao, Paolo Di Achille, Shwetak Patel

    Abstract: Large language models (LLMs) can capture rich representations of concepts that are useful for real-world tasks. However, language alone is limited. While existing LLMs excel at text-based inferences, health applications require that models be grounded in numerical data (e.g., vital signs, laboratory values in clinical domains; steps, movement in the wellness domain) that is not easily or readily e…

    Submitted 24 May, 2023; originally announced May 2023.

  43. arXiv:2305.10404  [pdf, ps, other]

    cs.IT

    $\mathbb{F}_q\mathcal{R}$-skew cyclic codes and their application to quantum codes

    Authors: Om Prakash, Shikha Patel, Habibul Islam

    Abstract: Let $p$ be a prime and $\mathbb{F}_q$ be the finite field of order $q=p^m$. In this paper, we study $\mathbb{F}_q\mathcal{R}$-skew cyclic codes where $\mathcal{R}=\mathbb{F}_q+u\mathbb{F}_q$ with $u^2=u$. To characterize $\mathbb{F}_q\mathcal{R}$-skew cyclic codes, we first establish their algebraic structure and then discuss the dual-containing properties by considering a non-degenerate inner pro…

    Submitted 17 May, 2023; originally announced May 2023.

    Comments: 17 pages. This paper is under modification in the Quantum Information Processing

    MSC Class: 11T71; 11T06; 94B05; 94B15

  44. arXiv:2305.06161  [pdf, other]

    cs.CL cs.AI cs.PL cs.SE

    StarCoder: may the source be with you!

    Authors: Raymond Li, Loubna Ben Allal, Yangtian Zi, Niklas Muennighoff, Denis Kocetkov, Chenghao Mou, Marc Marone, Christopher Akiki, Jia Li, Jenny Chim, Qian Liu, Evgenii Zheltonozhskii, Terry Yue Zhuo, Thomas Wang, Olivier Dehaene, Mishig Davaadorj, Joel Lamy-Poirier, João Monteiro, Oleh Shliazhko, Nicolas Gontier, Nicholas Meade, Armel Zebaze, Ming-Ho Yee, Logesh Kumar Umapathi, Jian Zhu , et al. (42 additional authors not shown)

    Abstract: The BigCode community, an open-scientific collaboration working on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder and StarCoderBase: 15.5B parameter models with 8K context length, infilling capabilities and fast large-batch inference enabled by multi-query attention. StarCoderBase is trained on 1 trillion tokens sourced from The Stack, a large colle…

    Submitted 13 December, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

  45. arXiv:2304.10647  [pdf, ps, other]

    cs.DS

    New Lower Bounds for Adaptive Tolerant Junta Testing

    Authors: Xi Chen, Shyamal Patel

    Abstract: We prove a $k^{-\Omega(\log(\varepsilon_2 - \varepsilon_1))}$ lower bound for adaptively testing whether a Boolean function is $\varepsilon_1$-close to or $\varepsilon_2$-far from $k$-juntas. Our results provide the first superpolynomial separation between tolerant and non-tolerant testing for a natural property of boolean functions under the adaptive setting. Furthermore, our techniques generalize to…

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: 22 pages

  46. arXiv:2304.07610  [pdf, other]

    stat.ME cs.LG stat.AP

    A tutorial on the Bayesian statistical approach to inverse problems

    Authors: Faaiq G. Waqar, Swati Patel, Cory M. Simon

    Abstract: Inverse problems are ubiquitous in the sciences and engineering. Two categories of inverse problems concerning a physical system are (1) estimate parameters in a model of the system from observed input-output pairs and (2) given a model of the system, reconstruct the input to it that caused some observed output. Applied inverse problems are challenging because a solution may (i) not exist, (ii) no…

    Submitted 15 April, 2023; originally announced April 2023.

    Comments: v0.0

    MSC Class: 60-01; ACM Class: G.3

    Journal ref: APL Machine Learning. 1, 041101 (2023)

  47. arXiv:2303.12059  [pdf, other]

    cs.CV

    Motion Matters: Neural Motion Transfer for Better Camera Physiological Measurement

    Authors: Akshay Paruchuri, Xin Liu, Yulu Pan, Shwetak Patel, Daniel McDuff, Soumyadip Sengupta

    Abstract: Machine learning models for camera-based physiological measurement can have weak generalization due to a lack of representative training data. Body motion is one of the most significant sources of noise when attempting to recover the subtle cardiac pulse from a video. We explore motion transfer as a form of data augmentation to introduce motion variation while preserving physiological changes of i…

    Submitted 6 November, 2023; v1 submitted 21 March, 2023; originally announced March 2023.

    Comments: Accepted to WACV 2024, 17 pages, 6 figures, 15 tables

  48. arXiv:2303.11573  [pdf, other]

    cs.CV

    BigSmall: Efficient Multi-Task Learning for Disparate Spatial and Temporal Physiological Measurements

    Authors: Girish Narayanswamy, Yujia Liu, Yuzhe Yang, Chengqian Ma, Xin Liu, Daniel McDuff, Shwetak Patel

    Abstract: Understanding of human visual perception has historically inspired the design of computer vision architectures. As an example, perception occurs at different scales both spatially and temporally, suggesting that the extraction of salient visual information may be made more effective by paying attention to specific features at varying scales. Visual changes in the body due to physiological processe…

    Submitted 17 November, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

  49. arXiv:2303.10445  [pdf, other]

    cs.SD cs.HC cs.LG eess.AS

    EarCough: Enabling Continuous Subject Cough Event Detection on Hearables

    Authors: Xiyuxing Zhang, Yuntao Wang, Jingru Zhang, Yaqing Yang, Shwetak Patel, Yuanchun Shi

    Abstract: Cough monitoring can enable new individual pulmonary health applications. Subject cough event detection is the foundation for continuous cough monitoring. Recently, the rapid growth in smart hearables has opened new opportunities for such needs. This paper proposes EarCough, which enables continuous subject cough event detection on edge computing hearables by leveraging the always-on active noise…

    Submitted 18 March, 2023; originally announced March 2023.

    Comments: This paper has been accepted by ACM CHI 2023

  50. arXiv:2303.10435  [pdf, other]

    cs.HC cs.CV cs.LG

    Modeling the Trade-off of Privacy Preservation and Activity Recognition on Low-Resolution Images

    Authors: Yuntao Wang, Zirui Cheng, Xin Yi, Yan Kong, Xueyang Wang, Xuhai Xu, Yukang Yan, Chun Yu, Shwetak Patel, Yuanchun Shi

    Abstract: A computer vision system using low-resolution image sensors can provide intelligent services (e.g., activity recognition) but preserve unnecessary visual privacy information from the hardware level. However, preserving visual privacy and enabling accurate machine recognition have adversarial needs on image resolution. Modeling the trade-off of privacy preservation and machine recognition performan…

    Submitted 18 March, 2023; originally announced March 2023.

    Comments: This paper has been accepted by the ACM CHI 2023