-
A General-Purpose Self-Supervised Model for Computational Pathology
Authors:
Richard J. Chen,
Tong Ding,
Ming Y. Lu,
Drew F. K. Williamson,
Guillaume Jaume,
Bowen Chen,
Andrew Zhang,
Daniel Shao,
Andrew H. Song,
Muhammad Shaban,
Mane Williams,
Anurag Vaidya,
Sharifa Sahai,
Lukas Oldenburg,
Luca L. Weishaupt,
Judy J. Wang,
Walt Williams,
Long Phi Le,
Georg Gerber,
Faisal Mahmood
Abstract:
Tissue phenotyping is a fundamental computational pathology (CPath) task in learning objective characterizations of histopathologic biomarkers in anatomic pathology. However, whole-slide imaging (WSI) poses a complex computer vision problem in which the large-scale image resolutions of WSIs and the enormous diversity of morphological phenotypes preclude large-scale data annotation. Current efforts…
▽ More
Tissue phenotyping is a fundamental computational pathology (CPath) task in learning objective characterizations of histopathologic biomarkers in anatomic pathology. However, whole-slide imaging (WSI) poses a complex computer vision problem in which the large-scale image resolutions of WSIs and the enormous diversity of morphological phenotypes preclude large-scale data annotation. Current efforts have proposed using pretrained image encoders with either transfer learning from natural image datasets or self-supervised pretraining on publicly-available histopathology datasets, but have not been extensively developed and evaluated across diverse tissue types at scale. We introduce UNI, a general-purpose self-supervised model for pathology, pretrained using over 100 million tissue patches from over 100,000 diagnostic haematoxylin and eosin-stained WSIs across 20 major tissue types, and evaluated on 33 representative CPath clinical tasks in CPath of varying diagnostic difficulties. In addition to outperforming previous state-of-the-art models, we demonstrate new modeling capabilities in CPath such as resolution-agnostic tissue classification, slide classification using few-shot class prototypes, and disease subtyping generalization in classifying up to 108 cancer types in the OncoTree code classification system. UNI advances unsupervised representation learning at scale in CPath in terms of both pretraining data and downstream evaluation, enabling data-efficient AI models that can generalize and transfer to a gamut of diagnostically-challenging tasks and clinical workflows in anatomic pathology.
△ Less
Submitted 29 August, 2023;
originally announced August 2023.
-
Alexa, play with robot: Introducing the First Alexa Prize SimBot Challenge on Embodied AI
Authors:
Hangjie Shi,
Leslie Ball,
Govind Thattai,
Desheng Zhang,
Lucy Hu,
Qiaozi Gao,
Suhaila Shakiah,
Xiaofeng Gao,
Aishwarya Padmakumar,
Bofei Yang,
Cadence Chung,
Dinakar Guthy,
Gaurav Sukhatme,
Karthika Arumugam,
Matthew Wen,
Osman Ipek,
Patrick Lange,
Rohan Khanna,
Shreyas Pansare,
Vasu Sharma,
Chao Zhang,
Cris Flagg,
Daniel Pressel,
Lavina Vaz,
Luke Dai
, et al. (17 additional authors not shown)
Abstract:
The Alexa Prize program has empowered numerous university students to explore, experiment, and showcase their talents in building conversational agents through challenges like the SocialBot Grand Challenge and the TaskBot Challenge. As conversational agents increasingly appear in multimodal and embodied contexts, it is important to explore the affordances of conversational interaction augmented wi…
▽ More
The Alexa Prize program has empowered numerous university students to explore, experiment, and showcase their talents in building conversational agents through challenges like the SocialBot Grand Challenge and the TaskBot Challenge. As conversational agents increasingly appear in multimodal and embodied contexts, it is important to explore the affordances of conversational interaction augmented with computer vision and physical embodiment. This paper describes the SimBot Challenge, a new challenge in which university teams compete to build robot assistants that complete tasks in a simulated physical environment. This paper provides an overview of the SimBot Challenge, which included both online and offline challenge phases. We describe the infrastructure and support provided to the teams including Alexa Arena, the simulated environment, and the ML toolkit provided to teams to accelerate their building of vision and language models. We summarize the approaches the participating teams took to overcome research challenges and extract key lessons learned. Finally, we provide analysis of the performance of the competing SimBots during the competition.
△ Less
Submitted 9 August, 2023;
originally announced August 2023.
-
Dual-Attention Neural Transducers for Efficient Wake Word Spotting in Speech Recognition
Authors:
Saumya Y. Sahai,
Jing Liu,
Thejaswi Muniyappa,
Kanthashree M. Sathyendra,
Anastasios Alexandridis,
Grant P. Strimel,
Ross McGowan,
Ariya Rastrow,
Feng-Ju Chang,
Athanasios Mouchtaris,
Siegfried Kunzmann
Abstract:
We present dual-attention neural biasing, an architecture designed to boost Wake Words (WW) recognition and improve inference time latency on speech recognition tasks. This architecture enables a dynamic switch for its runtime compute paths by exploiting WW spotting to select which branch of its attention networks to execute for an input audio frame. With this approach, we effectively improve WW s…
▽ More
We present dual-attention neural biasing, an architecture designed to boost Wake Words (WW) recognition and improve inference time latency on speech recognition tasks. This architecture enables a dynamic switch for its runtime compute paths by exploiting WW spotting to select which branch of its attention networks to execute for an input audio frame. With this approach, we effectively improve WW spotting accuracy while saving runtime compute cost as defined by floating point operations (FLOPs). Using an in-house de-identified dataset, we demonstrate that the proposed dual-attention network can reduce the compute cost by $90\%$ for WW audio frames, with only $1\%$ increase in the number of parameters. This architecture improves WW F1 score by $16\%$ relative and improves generic rare word error rate by $3\%$ relative compared to the baselines.
△ Less
Submitted 4 April, 2023; v1 submitted 2 April, 2023;
originally announced April 2023.
-
Observation on the bias current variation of a single mask triple GEM chamber
Authors:
S. Chatterjee,
A. Sen,
R. Paul,
S. Sahai,
S. Das,
S. Biswas
Abstract:
Gas Electron Multiplier (GEM) detector, one of the advanced members of the Micro Pattern Gas Detector (MPGD) group, is widely used in High Energy Physics (HEP) experiments. The high rate handling capability and spatial resolution make it a desired tracking detector for high rate HEP experiments. Investigation of the long-term stability is an essential criterion for any tracking device used in HEP…
▽ More
Gas Electron Multiplier (GEM) detector, one of the advanced members of the Micro Pattern Gas Detector (MPGD) group, is widely used in High Energy Physics (HEP) experiments. The high rate handling capability and spatial resolution make it a desired tracking detector for high rate HEP experiments. Investigation of the long-term stability is an essential criterion for any tracking device used in HEP experiments. To investigate the long-term stability of a Single Mask~(SM) triple GEM detector prototype, it is irradiated continuously using a $^{55}$Fe X-ray source of energy 5.9 keV. The chamber is operated with Ar/CO$_2$ gas mixture in continuous flow mode. The gain and energy resolution of the chamber are calculated from the 5.9 keV X-ray peak and studied as a function of time. The applied voltage, divider current and also the environmental parameters (ambient temperature, pressure and relative humidity) are recorded continuously. It is observed that at a fixed applied voltage, the divider current of the detector is changing with time and as a result, the gain of the detector also changes. A systematic investigation is carried out to understand the probable reasons behind the observed variation in divider current and also to find its possible remedies. The details of the experimental setup, methodology and results are discussed in this article.
△ Less
Submitted 4 May, 2023; v1 submitted 29 March, 2023;
originally announced March 2023.
-
Alexa, Let's Work Together: Introducing the First Alexa Prize TaskBot Challenge on Conversational Task Assistance
Authors:
Anna Gottardi,
Osman Ipek,
Giuseppe Castellucci,
Shui Hu,
Lavina Vaz,
Yao Lu,
Anju Khatri,
Anjali Chadha,
Desheng Zhang,
Sattvik Sahai,
Prerna Dwivedi,
Hangjie Shi,
Lucy Hu,
Andy Huang,
Luke Dai,
Bofei Yang,
Varun Somani,
Pankaj Rajan,
Ron Rezac,
Michael Johnston,
Savanna Stiff,
Leslie Ball,
David Carmel,
Yang Liu,
Dilek Hakkani-Tur
, et al. (5 additional authors not shown)
Abstract:
Since its inception in 2016, the Alexa Prize program has enabled hundreds of university students to explore and compete to develop conversational agents through the SocialBot Grand Challenge. The goal of the challenge is to build agents capable of conversing coherently and engagingly with humans on popular topics for 20 minutes, while achieving an average rating of at least 4.0/5.0. However, as co…
▽ More
Since its inception in 2016, the Alexa Prize program has enabled hundreds of university students to explore and compete to develop conversational agents through the SocialBot Grand Challenge. The goal of the challenge is to build agents capable of conversing coherently and engagingly with humans on popular topics for 20 minutes, while achieving an average rating of at least 4.0/5.0. However, as conversational agents attempt to assist users with increasingly complex tasks, new conversational AI techniques and evaluation platforms are needed. The Alexa Prize TaskBot challenge, established in 2021, builds on the success of the SocialBot challenge by introducing the requirements of interactively assisting humans with real-world Cooking and Do-It-Yourself tasks, while making use of both voice and visual modalities. This challenge requires the TaskBots to identify and understand the user's need, identify and integrate task and domain knowledge into the interaction, and develop new ways of engaging the user without distracting them from the task at hand, among other challenges. This paper provides an overview of the TaskBot challenge, describes the infrastructure support provided to the teams with the CoBot Toolkit, and summarizes the approaches the participating teams took to overcome the research challenges. Finally, it analyzes the performance of the competing TaskBots during the first year of the competition.
△ Less
Submitted 13 September, 2022;
originally announced September 2022.
-
Algorithm Fairness in AI for Medicine and Healthcare
Authors:
Richard J. Chen,
Tiffany Y. Chen,
Jana Lipkova,
Judy J. Wang,
Drew F. K. Williamson,
Ming Y. Lu,
Sharifa Sahai,
Faisal Mahmood
Abstract:
In the current development and deployment of many artificial intelligence (AI) systems in healthcare, algorithm fairness is a challenging problem in delivering equitable care. Recent evaluation of AI models stratified across race sub-populations have revealed inequalities in how patients are diagnosed, given treatments, and billed for healthcare costs. In this perspective article, we summarize the…
▽ More
In the current development and deployment of many artificial intelligence (AI) systems in healthcare, algorithm fairness is a challenging problem in delivering equitable care. Recent evaluation of AI models stratified across race sub-populations have revealed inequalities in how patients are diagnosed, given treatments, and billed for healthcare costs. In this perspective article, we summarize the intersectional field of fairness in machine learning through the context of current issues in healthcare, outline how algorithmic biases (e.g. - image acquisition, genetic variation, intra-observer labeling variability) arise in current clinical workflows and their resulting healthcare disparities. Lastly, we also review emerging technology for mitigating bias via federated learning, disentanglement, and model explainability, and their role in AI-SaMD development.
△ Less
Submitted 23 March, 2022; v1 submitted 1 October, 2021;
originally announced October 2021.
-
A Machine Learning Model for Nowcasting Epidemic Incidence
Authors:
Saumya Yashmohini Sahai,
Saket Gurukar,
Wasiur R. KhudaBukhsh,
Srinivasan Parthasarathy,
Grzegorz A. Rempala
Abstract:
Due to delay in reporting, the daily national and statewide COVID-19 incidence counts are often unreliable and need to be estimated from recent data. This process is known in economics as nowcasting. We describe in this paper a simple random forest statistical model for nowcasting the COVID - 19 daily new infection counts based on historic data along with a set of simple covariates, such as the cu…
▽ More
Due to delay in reporting, the daily national and statewide COVID-19 incidence counts are often unreliable and need to be estimated from recent data. This process is known in economics as nowcasting. We describe in this paper a simple random forest statistical model for nowcasting the COVID - 19 daily new infection counts based on historic data along with a set of simple covariates, such as the currently reported infection counts, day of the week, and time since first reporting. We apply the model to adjust the daily infection counts in Ohio, and show that the predictions from this simple data-driven method compare favorably both in quality and computational burden to those obtained from the state-of-the-art hierarchical Bayesian model employing a complex statistical algorithm.
△ Less
Submitted 5 April, 2021;
originally announced April 2021.
-
DrugDBEmbed : Semantic Queries on Relational Database using Supervised Column Encodings
Authors:
Bortik Bandyopadhyay,
Pranav Maneriker,
Vedang Patel,
Saumya Yashmohini Sahai,
Ping Zhang,
Srinivasan Parthasarathy
Abstract:
Traditional relational databases contain a lot of latent semantic information that have largely remained untapped due to the difficulty involved in automatically extracting such information. Recent works have proposed unsupervised machine learning approaches to extract such hidden information by textifying the database columns and then projecting the text tokens onto a fixed dimensional semantic v…
▽ More
Traditional relational databases contain a lot of latent semantic information that have largely remained untapped due to the difficulty involved in automatically extracting such information. Recent works have proposed unsupervised machine learning approaches to extract such hidden information by textifying the database columns and then projecting the text tokens onto a fixed dimensional semantic vector space. However, in certain databases, task-specific class labels may be available, which unsupervised approaches are unable to lever in a principled manner. Also, when embeddings are generated at individual token level, then column encoding of multi-token text column has to be computed by taking the average of the vectors of the tokens present in that column for any given row. Such averaging approach may not produce the best semantic vector representation of the multi-token text column, as observed while encoding paragraphs or documents in natural language processing domain. With these shortcomings in mind, we propose a supervised machine learning approach using a Bi-LSTM based sequence encoder to directly generate column encodings for multi-token text columns of the DrugBank database, which contains gold standard drug-drug interaction (DDI) labels. Our text data driven encoding approach achieves very high Accuracy on the supervised DDI prediction task for some columns and we use those supervised column encodings to simulate and evaluate the Analogy SQL queries on relational data to demonstrate the efficacy of our technique.
△ Less
Submitted 5 July, 2020;
originally announced July 2020.
-
Verification of Quantitative Hyperproperties Using Trace Enumeration Relations
Authors:
Shubham Sahai,
Rohit Sinha,
Pramod Subramanyan
Abstract:
Many important cryptographic primitives offer probabilistic guarantees of security that can be specified as quantitative hyperproperties; these are specifications that stipulate the existence of a certain number of traces in the system satisfying certain constraints. Verification of such hyperproperties is extremely challenging because they involve simultaneous reasoning about an unbounded number…
▽ More
Many important cryptographic primitives offer probabilistic guarantees of security that can be specified as quantitative hyperproperties; these are specifications that stipulate the existence of a certain number of traces in the system satisfying certain constraints. Verification of such hyperproperties is extremely challenging because they involve simultaneous reasoning about an unbounded number of different traces. In this paper, we introduce a technique for verification of quantitative hyperproperties based on the notion of trace enumeration relations. These relations allow us to reduce the problem of trace-counting into one of model-counting of formulas in first-order logic. We also introduce a set of inference rules for machine-checked reasoning about the number of satisfying solutions to first-order formulas (aka model counting). Putting these two components together enables semi-automated verification of quantitative hyperproperties on infinite state systems. We use our methodology to prove confidentiality of access patterns in Path ORAMs of unbounded size, soundness of a simple interactive zero-knowledge proof protocol as well as other applications of quantitative hyperproperties studied in past work.
△ Less
Submitted 14 May, 2020; v1 submitted 10 May, 2020;
originally announced May 2020.
-
Composition Tableaux basis for Schur functors and the Plücker algebra
Authors:
Shubhankar Sahai
Abstract:
We show that combinatorial objects called row-strict composition tableaux, introduced by Mason and Remmel in 2014 and closely related to the quasi-symmetric Schur functions of Haglund-Luoto-Mason-van Willigenburg, form a basis for Schur functors of finite free modules over arbitrary commutative rings. When the ring is the complex numbers, this produces a new basis for the irreducible polynomial re…
▽ More
We show that combinatorial objects called row-strict composition tableaux, introduced by Mason and Remmel in 2014 and closely related to the quasi-symmetric Schur functions of Haglund-Luoto-Mason-van Willigenburg, form a basis for Schur functors of finite free modules over arbitrary commutative rings. When the ring is the complex numbers, this produces a new basis for the irreducible polynomial representations of $\operatorname{GL}_n(\mathbb{C})$. Moreover, in this case it also produces new basis for the Plücker algebra, a subalgebra of the polynomial ring over $\mathbb{C}$ in $n^2$ variables, which is of independent combinatorial and geometric interests. As an aside we also show that these results hold for other combinatorial objects called reverse row strict tableau.
△ Less
Submitted 12 January, 2021; v1 submitted 26 November, 2018;
originally announced November 2018.
-
Expectation propagation as a way of life: A framework for Bayesian inference on partitioned data
Authors:
Aki Vehtari,
Andrew Gelman,
Tuomas Sivula,
Pasi Jylänki,
Dustin Tran,
Swupnil Sahai,
Paul Blomstedt,
John P. Cunningham,
David Schiminovich,
Christian Robert
Abstract:
A common divide-and-conquer approach for Bayesian computation with big data is to partition the data, perform local inference for each piece separately, and combine the results to obtain a global posterior approximation. While being conceptually and computationally appealing, this method involves the problematic need to also split the prior for the local inferences; these weakened priors may not p…
▽ More
A common divide-and-conquer approach for Bayesian computation with big data is to partition the data, perform local inference for each piece separately, and combine the results to obtain a global posterior approximation. While being conceptually and computationally appealing, this method involves the problematic need to also split the prior for the local inferences; these weakened priors may not provide enough regularization for each separate computation, thus eliminating one of the key advantages of Bayesian methods. To resolve this dilemma while still retaining the generalizability of the underlying local inference method, we apply the idea of expectation propagation (EP) as a framework for distributed Bayesian inference. The central idea is to iteratively update approximations to the local likelihoods given the state of the other approximations and the prior. The present paper has two roles: we review the steps that are needed to keep EP algorithms numerically stable, and we suggest a general approach, inspired by EP, for approaching data partitioning problems in a way that achieves the computational benefits of parallelism while allowing each local update to make use of relevant information from the other sites. In addition, we demonstrate how the method can be applied in a hierarchical context to make use of partitioning of both data and parameters. The paper describes a general algorithmic framework, rather than a specific algorithm, and presents an example implementation for it.
△ Less
Submitted 30 November, 2019; v1 submitted 15 December, 2014;
originally announced December 2014.
-
Facile synthesis and step by step enhancement of blue photoluminescence from Ag-doped ZnS quantum dots
Authors:
Sonal Sahai,
Mushahid Husain,
Virendra Shanker,
Nahar Singh,
D. Haranath
Abstract:
Our results pertaining to the step by step enhancement of photoluminescence (PL) intensity from ZnS:Ag,Al quantum dots (QDs) are presented. Initially, these QDs were synthesized using a simple co-precipitation technique involving a surfactant, polyvinylpyrrolidone (PVP), in de-ionised water. It was observed that the blue PL originated from ZnS:Ag,Al QDs was considerably weak and not suitable for a…
▽ More
Our results pertaining to the step by step enhancement of photoluminescence (PL) intensity from ZnS:Ag,Al quantum dots (QDs) are presented. Initially, these QDs were synthesized using a simple co-precipitation technique involving a surfactant, polyvinylpyrrolidone (PVP), in de-ionised water. It was observed that the blue PL originated from ZnS:Ag,Al QDs was considerably weak and not suitable for any practical display application. Upon UV (365 nm) photolysis, the PL intensity augmented to ~170% and attained a saturation value after ~100 minutes of exposure. This is attributed to the photo-corrosion mechanism exerted by high-flux UV light on ZnS:Ag,Al QDs. Auxiliary enhancement of PL intensity to 250% has been evidenced by subjecting the QDs to high temperatures (200oC) and pressures (~120 bars) in a sulphur-rich atmosphere, which is due to the improvement in crystallanity of ZnS QDs. The origin of the bright blue PL has been discussed. The results were supported by x-ray phase analysis, high-resolution electron microscopy and compositional evaluation.
△ Less
Submitted 16 November, 2012;
originally announced November 2012.
-
Fabrication and Electro-optic Properties of MWCNT Driven Novel Electroluminescent Lamp
Authors:
D. Haranath,
Sonal Sahai,
Mushahid Husain,
Savvi Mishra,
Virendra Shanker
Abstract:
We present a novel, cost-effective and facile technique, wherein multi-walled carbon nano-tubes (CNTs) were used to transform a photoluminescent material to exhibit stable and efficient electroluminescence (EL) at low-voltages. As a case study, a commercially available ZnS:Cu phosphor (P-22G) was combined with a very low concentration of CNTs dispersed in ethanol and its alternating current driven…
▽ More
We present a novel, cost-effective and facile technique, wherein multi-walled carbon nano-tubes (CNTs) were used to transform a photoluminescent material to exhibit stable and efficient electroluminescence (EL) at low-voltages. As a case study, a commercially available ZnS:Cu phosphor (P-22G) was combined with a very low concentration of CNTs dispersed in ethanol and its alternating current driven electroluminescence (AC-EL) is demonstrated. The role of CNTs has been understood as a local electric field enhancer and facilitator in the hot carrier injection inside the ZnS crystal to produce EL in the hybrid material. The mechanism of EL is discussed using an internal field emission model, intra-CNT impact excitation and the recombination of electrons and holes through the impurity states.
△ Less
Submitted 16 November, 2012; v1 submitted 31 July, 2012;
originally announced July 2012.
-
Valence band and core-level analysis of highly luminescent ZnO nanocrystals for designing ultrafast optical sensors
Authors:
Amish G. Joshi,
Sonal Sahai,
Namita Gandhi,
Y. G. Radha Krishna,
D. Haranath
Abstract:
Highly luminescent ZnO:Na nanocrystals of size ~2 nm were synthesized using a improved sol-lyophilization process. The surface analysis such as survey scan, core-level and valence band spectra of ZnO:Na nanocrystals were studied using x-ray photoelectron spectroscopy (XPS) to establish the presence of Na+ ions. The observed increase in band gap from 3.30 (bulk) to 4.16 eV (nano), is attributed to…
▽ More
Highly luminescent ZnO:Na nanocrystals of size ~2 nm were synthesized using a improved sol-lyophilization process. The surface analysis such as survey scan, core-level and valence band spectra of ZnO:Na nanocrystals were studied using x-ray photoelectron spectroscopy (XPS) to establish the presence of Na+ ions. The observed increase in band gap from 3.30 (bulk) to 4.16 eV (nano), is attributed to the quantum confinement of the motion of electron and holes in all three directions. The photoluminescence and decay measurements have complemented and supported our study to design an efficient and ultrafast responsive optical sensing device.
△ Less
Submitted 14 May, 2010;
originally announced May 2010.