-
DiCTI: Diffusion-based Clothing Designer via Text-guided Input
Authors:
Ajda Lampe,
Julija Stopar,
Deepak Kumar Jain,
Shinichiro Omachi,
Peter Peer,
Vitomir Štruc
Abstract:
Recent developments in deep generative models have opened up a wide range of opportunities for image synthesis, leading to significant changes in various creative fields, including the fashion industry. While numerous methods have been proposed to benefit buyers, particularly in virtual try-on applications, there has been relatively less focus on facilitating fast prototyping for designers and customers seeking to order new designs. To address this gap, we introduce DiCTI (Diffusion-based Clothing Designer via Text-guided Input), a straightforward yet highly effective approach that allows designers to quickly visualize fashion-related ideas using text inputs only. Given an image of a person and a description of the desired garments as input, DiCTI automatically generates multiple high-resolution, photorealistic images that capture the expressed semantics. By leveraging a powerful diffusion-based inpainting model conditioned on text inputs, DiCTI is able to synthesize convincing, high-quality images with varied clothing designs that viably follow the provided text descriptions, while being able to process very diverse and challenging inputs, captured in completely unconstrained settings. We evaluate DiCTI in comprehensive experiments on two different datasets (VITON-HD and Fashionpedia) and in comparison to the state-of-the-art (SoTA). The results of our experiments show that DiCTI convincingly outperforms the SoTA competitor in generating higher quality images with more elaborate garments and superior text prompt adherence, both according to standard quantitative evaluation measures and human ratings, generated as part of a user study.
Submitted 4 July, 2024;
originally announced July 2024.
-
Modeling the Real World with High-Density Visual Particle Dynamics
Authors:
William F. Whitney,
Jacob Varley,
Deepali Jain,
Krzysztof Choromanski,
Sumeet Singh,
Vikas Sindhwani
Abstract:
We present High-Density Visual Particle Dynamics (HD-VPD), a learned world model that can emulate the physical dynamics of real scenes by processing massive latent point clouds containing 100K+ particles. To enable efficiency at this scale, we introduce a novel family of Point Cloud Transformers (PCTs) called Interlacers leveraging intertwined linear-attention Performer layers and graph-based neighbour attention layers. We demonstrate the capabilities of HD-VPD by modeling the dynamics of high degree-of-freedom bi-manual robots with two RGB-D cameras. Compared to the previous graph neural network approach, our Interlacer dynamics is twice as fast with the same prediction quality, and can achieve higher quality using 4x as many particles. We illustrate how HD-VPD can evaluate motion plan quality with robotic box pushing and can grasping tasks. See videos and particle dynamics rendered by HD-VPD at https://sites.google.com/view/hd-vpd.
Submitted 28 June, 2024;
originally announced June 2024.
-
Structured Unrestricted-Rank Matrices for Parameter Efficient Fine-tuning
Authors:
Arijit Sehanobish,
Avinava Dubey,
Krzysztof Choromanski,
Somnath Basu Roy Chowdhury,
Deepali Jain,
Vikas Sindhwani,
Snigdha Chaturvedi
Abstract:
Recent efforts to scale Transformer models have demonstrated rapid progress across a wide range of tasks (Wei et al., 2022). However, fine-tuning these models for downstream tasks is expensive due to their large parameter counts. Parameter-efficient fine-tuning (PEFT) approaches have emerged as a viable alternative by allowing us to fine-tune models by updating only a small number of parameters. In this work, we propose a general framework for parameter efficient fine-tuning (PEFT), based on structured unrestricted-rank matrices (SURM) which can serve as a drop-in replacement for popular approaches such as Adapters and LoRA. Unlike other methods like LoRA, SURMs provide more flexibility in finding the right balance between compactness and expressiveness. This is achieved by using low displacement rank matrices (LDRMs), which have not been used in this context before. SURMs remain competitive with baselines, often providing significant quality improvements while using a smaller parameter budget. SURMs achieve 5-7% accuracy gains on various image classification tasks while replacing low-rank matrices in LoRA. They also result in up to a 12x reduction in the number of parameters in adapters (with virtually no loss in quality) on the GLUE benchmark.
Submitted 25 June, 2024;
originally announced June 2024.
-
Improving and Evaluating Machine Learning Methods for Forensic Shoeprint Matching
Authors:
Divij Jain,
Saatvik Kher,
Lena Liang,
Yufeng Wu,
Ashley Zheng,
Xizhen Cai,
Anna Plantinga,
Elizabeth Upton
Abstract:
We propose a machine learning pipeline for forensic shoeprint pattern matching that improves on the accuracy and generalisability of existing methods. We extract 2D coordinates from shoeprint scans using edge detection and align the two shoeprints with iterative closest point (ICP). We then extract similarity metrics to quantify how well the two prints match and use these metrics to train a random forest that generates a probabilistic measurement of how likely two prints are to have originated from the same outsole. We assess the generalisability of machine learning methods trained on lab shoeprint scans to more realistic crime scene shoeprint data by evaluating the accuracy of our methods on several shoeprint scenarios: partial prints, prints with varying levels of blurriness, prints with different amounts of wear, and prints from different shoe models. We find that models trained on one type of shoeprint yield extremely high levels of accuracy when tested on shoeprint pairs of the same scenario but fail to generalise to other scenarios. We also discover that models trained on a variety of scenarios predict almost as accurately as models trained on specific scenarios.
Submitted 2 April, 2024;
originally announced May 2024.
-
PolygloToxicityPrompts: Multilingual Evaluation of Neural Toxic Degeneration in Large Language Models
Authors:
Devansh Jain,
Priyanshu Kumar,
Samuel Gehman,
Xuhui Zhou,
Thomas Hartvigsen,
Maarten Sap
Abstract:
Recent advances in large language models (LLMs) have led to their extensive global deployment, and ensuring their safety calls for comprehensive and multilingual toxicity evaluations. However, existing toxicity benchmarks are overwhelmingly focused on English, posing serious risks to deploying LLMs in other languages. We address this by introducing PolygloToxicityPrompts (PTP), the first large-scale multilingual toxicity evaluation benchmark of 425K naturally occurring prompts spanning 17 languages. We overcome the scarcity of naturally occurring toxicity in web-text and ensure coverage across languages with varying resources by automatically scraping over 100M web-text documents. Using PTP, we investigate research questions to study the impact of model size, prompt language, and instruction and preference-tuning methods on toxicity by benchmarking over 60 LLMs. Notably, we find that toxicity increases as language resources decrease or model size increases. Although instruction- and preference-tuning reduce toxicity, the choice of preference-tuning method does not have any significant impact. Our findings shed light on crucial shortcomings of LLM safeguarding and highlight areas for future research.
Submitted 20 May, 2024; v1 submitted 15 May, 2024;
originally announced May 2024.
-
Embodied AI with Two Arms: Zero-shot Learning, Safety and Modularity
Authors:
Jake Varley,
Sumeet Singh,
Deepali Jain,
Krzysztof Choromanski,
Andy Zeng,
Somnath Basu Roy Chowdhury,
Avinava Dubey,
Vikas Sindhwani
Abstract:
We present an embodied AI system which receives open-ended natural language instructions from a human, and controls two arms to collaboratively accomplish potentially long-horizon tasks over a large workspace. Our system is modular: it deploys state-of-the-art Large Language Models for task planning, Vision-Language models for semantic perception, and Point Cloud transformers for grasping. With semantic and physical safety in mind, these modules are interfaced with a real-time trajectory optimizer and a compliant tracking controller to enable human-robot proximity. We demonstrate performance for the following tasks: bi-arm sorting, bottle opening, and trash disposal tasks. These are done zero-shot where the models used have not been trained with any real world data from this bi-arm robot, scenes or workspace. Composing both learning- and non-learning-based components in a modular fashion with interpretable inputs and outputs allows the user to easily debug points of failure and fragilities. One may also in-place swap modules to improve the robustness of the overall platform, for instance with imitation-learned policies.
Submitted 4 April, 2024;
originally announced April 2024.
-
Studying Differential Mental Health Expressions in India
Authors:
Khushi Shelat,
Sunny Rai,
Devansh R Jain,
Kishen Sivabalan,
Young Min Cho,
Maitreyi Redkar,
Samindara Sawant,
Sharath Chandra Guntuku
Abstract:
Psychosocial stressors and the symptomatology of mental disorders vary across cultures. However, current understandings of mental health expressions on social media are predominantly derived from studies in WEIRD (Western, Educated, Industrialized, Rich, and Democratic) contexts. In this paper, we analyze mental health posts on Reddit made by individuals in India, to identify variations in online depression language specific to the Indian context compared to users from the Rest of the World (ROW). Unlike in Western samples, we observe that mental health discussions in India additionally express sadness, use negation, are present-focused, and are related to work and achievement. Illness is uniquely correlated to India, indicating the association between depression and physical health in Indian patients. Two clinical psychologists validated the findings from social media posts and found 95% of the top 20 topics associated with mental health discussions to be prevalent in Indians. Significant linguistic variations in online mental health-related language in India compared to ROW emphasize the importance of developing precision-targeted interventions that are culturally appropriate.
Submitted 16 June, 2024; v1 submitted 18 February, 2024;
originally announced February 2024.
-
SwissNYF: Tool Grounded LLM Agents for Black Box Setting
Authors:
Somnath Sendhil Kumar,
Dhruv Jain,
Eshaan Agarwal,
Raunak Pandey
Abstract:
While Large Language Models (LLMs) have demonstrated enhanced capabilities in function-calling, these advancements primarily rely on accessing the functions' responses. This methodology is practical for simpler APIs but faces scalability issues with irreversible APIs that significantly impact the system, such as a database deletion API. Similarly, processes requiring extensive time for each API call and those necessitating forward planning, like automated action pipelines, present complex challenges. Furthermore, scenarios often arise where a generalized approach is needed because algorithms lack direct access to the specific implementations of these functions or secrets to use them. Traditional tool planning methods are inadequate in these cases, compelling the need to operate within black-box environments. Unlike their performance in tool manipulation, LLMs excel in black-box tasks, such as program synthesis. Therefore, we harness the program synthesis capabilities of LLMs to strategize tool usage in black-box settings, ensuring solutions are verified prior to implementation. We introduce TOPGUN, an ingeniously crafted approach leveraging program synthesis for black box tool planning. It is accompanied by SwissNYF, a comprehensive suite that integrates black-box algorithms for planning and verification tasks, addressing the aforementioned challenges and enhancing the versatility and effectiveness of LLMs in complex API interactions. The public code for SwissNYF is available at https://github.com/iclr-dummy-user/SwissNYF.
Submitted 15 February, 2024;
originally announced February 2024.
-
Buffer-layer-controlled Nickeline vs Zinc-Blende/Wurtzite-type MnTe growths on c-plane Al2O3 substrates
Authors:
Deepti Jain,
Hee Taek Yi,
Alessandro R. Mazza,
Kim Kisslinger,
Myung-Geun Han,
Matthew Brahlek,
Seongshik Oh
Abstract:
In the recent past, MnTe has proven to be a crucial component of the intrinsic magnetic topological insulator (IMTI) family [MnTe]m[Bi2Te3]n, which hosts a wide range of magneto-topological properties depending on the choice of m and n. However, bulk crystal growth allows only a few combinations of m and n for these IMTIs due to the strict limitations of the thermodynamic growth conditions. One way to overcome this challenge is to utilize the atomic layer-by-layer molecular beam epitaxy (MBE) technique, which allows arbitrary sequences of [MnTe]m and [Bi2Te3]n to be formed beyond the thermodynamic limit. For such MBE growth, finding optimal growth templates and conditions for the parent building block, MnTe, is a key requirement. Here, we report that two different hexagonal phases of MnTe, the nickeline (NC) and zinc-blende/wurtzite (ZB-WZ) structures with distinct in-plane lattice constants of 4.20 ± 0.04 Å and 4.39 ± 0.04 Å, respectively, can be selectively grown on c-plane Al2O3 substrates using different buffer layers and growth temperatures. Moreover, we provide the first comparative studies of different MnTe phases using atomic-resolution scanning transmission electron microscopy and show that ZB and WZ-like stacking sequences can easily alternate between the two. Surprisingly, the In2Se3 buffer layer, despite its lattice constant (4.02 Å) being closer to that of the NC phase, fosters the ZB-WZ phase instead, whereas Bi2Te3, sharing the same lattice constant (4.39 Å) with the ZB-WZ phase, fosters the NC phase. These discoveries suggest that lattice matching is not always the most critical factor determining the preferred phase during epitaxial growth. Overall, this will deepen our understanding of epitaxial growth modes for chalcogenide materials and accelerate progress toward new IMTI phases as well as other magneto-topological applications.
Submitted 25 January, 2024;
originally announced January 2024.
-
SoundShift: Exploring Sound Manipulations for Accessible Mixed-Reality Awareness
Authors:
Ruei-Che Chang,
Chia-Sheng Hung,
Bing-Yu Chen,
Dhruv Jain,
Anhong Guo
Abstract:
Mixed-reality (MR) soundscapes blend real-world sound with virtual audio from hearing devices, presenting intricate auditory information that is hard to discern and differentiate. This is particularly challenging for blind or visually impaired individuals, who rely on sounds and descriptions in their everyday lives. To understand how complex audio information is consumed, we analyzed online forum posts within the blind community, identifying prevailing challenges, needs, and desired solutions. We synthesized the results and propose SoundShift for increasing MR sound awareness, which includes six sound manipulations: Transparency Shift, Envelope Shift, Position Shift, Style Shift, Time Shift, and Sound Append. To evaluate the effectiveness of SoundShift, we conducted a user study with 18 blind participants across three simulated MR scenarios, where participants identified specific sounds within intricate soundscapes. We found that SoundShift increased MR sound awareness and minimized cognitive load. Finally, we developed three real-world example applications to demonstrate the practicality of SoundShift.
Submitted 26 May, 2024; v1 submitted 19 January, 2024;
originally announced January 2024.
-
Electrodynamics of the quantum anomalous Hall state in a magnetically doped topological insulator
Authors:
Zhenisbek Tagay,
Hee Taek Yi,
Deepti Jain,
Seongshik Oh,
N. P. Armitage
Abstract:
Magnetically doped topological insulators have been extensively studied over the past decade as a material platform to exhibit quantum anomalous Hall effect. Most material realizations are magnetically doped and despite material advances suffer from large disorder effects. In such systems, it is believed that magnetic disorder leads to a spatially varying Dirac mass gap and chemical potential fluctuations, and hence quantized conductance is only observed at very low temperatures. Here, we use a recently developed high-precision time-domain terahertz (THz) polarimeter to study the low-energy electrodynamic response of Cr-doped (Bi,Sb)$_2$Te$_3$ thin films. These films have been recently shown to exhibit a dc quantized anomalous Hall response up to T = 2 K at zero gate voltage. We show that the real part of the THz range Hall conductance $σ_{xy}(ω)$ is slightly smaller than $e^2/h$ down to T = 2 K with an unconventional decreasing dependence on frequency. The imaginary (dissipative) part of $σ_{xy}(ω)$ is small, but increasing as a function of omega. We connect both aspects of our data to a simple model for effective magnetic gap disorder. Our work highlights the different effect that disorder can have on the dc vs. ac quantum anomalous Hall effect.
Submitted 13 January, 2024;
originally announced January 2024.
-
Hunting imaging biomarkers in pulmonary fibrosis: Benchmarks of the AIIB23 challenge
Authors:
Yang Nan,
Xiaodan Xing,
Shiyi Wang,
Zeyu Tang,
Federico N Felder,
Sheng Zhang,
Roberta Eufrasia Ledda,
Xiaoliu Ding,
Ruiqi Yu,
Weiping Liu,
Feng Shi,
Tianyang Sun,
Zehong Cao,
Minghui Zhang,
Yun Gu,
Hanxiao Zhang,
Jian Gao,
Pingyu Wang,
Wen Tang,
Pengxin Yu,
Han Kang,
Junqiang Chen,
Xing Lu,
Boyu Zhang,
Michail Mamalakis
, et al. (16 additional authors not shown)
Abstract:
Airway-related quantitative imaging biomarkers are crucial for examination, diagnosis, and prognosis in pulmonary diseases. However, the manual delineation of airway trees remains prohibitively time-consuming. While significant efforts have been made towards enhancing airway modelling, current publicly available datasets concentrate on lung diseases with moderate morphological variations. The intricate honeycombing patterns present in the lung tissues of fibrotic lung disease patients exacerbate the challenges, often leading to various prediction errors. To address this issue, the 'Airway-Informed Quantitative CT Imaging Biomarker for Fibrotic Lung Disease 2023' (AIIB23) competition was organized in conjunction with the official 2023 International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI). The airway structures were meticulously annotated by three experienced radiologists. Competitors were encouraged to develop automatic airway segmentation models with high robustness and generalization abilities, followed by exploring the most correlated QIB of mortality prediction. A training set of 120 high-resolution computerised tomography (HRCT) scans was publicly released with expert annotations and mortality status. The online validation set incorporated 52 HRCT scans from patients with fibrotic lung disease and the offline test set included 140 cases from fibrosis and COVID-19 patients. The results have shown that the capacity of extracting airway trees from patients with fibrotic lung disease could be enhanced by introducing voxel-wise weighted general union loss and continuity loss. In addition to the competitive image biomarkers for prognosis, a strong airway-derived biomarker (Hazard ratio>1.5, p<0.0001) was revealed for survival prognostication compared with existing clinical measurements, clinician assessment and AI-based biomarkers.
Submitted 16 April, 2024; v1 submitted 21 December, 2023;
originally announced December 2023.
-
Reply to the "Comment on `Effect of density and nucleon-nucleon potential on the fusion cross section within the relativistic mean field formalism'"
Authors:
M. Bhuyan,
Raj Kumar,
Shilpa Rana,
D. Jain,
S. K. Patra,
B. V. Carlson
Abstract:
In reply to the Comment made by M. V. Chushnyakova et al. on our paper [Phys. Rev. C 101, 044603 (2020)], we argue that the calculations, results and conclusions of our paper remain valid. We have shown here the calculations for one reaction using the deformed densities and the R3Y nucleon-nucleon potential obtained within the relativistic mean-field (RMF) formalism. Suitable clarifications and justifications are given to address all the points raised in the Comment.
Submitted 16 December, 2023;
originally announced December 2023.
-
Solar flare catalog from 3 years of Chandrayaan-2 XSM observations
Authors:
Aravind Bharathi Valluvan,
Ashwin Goyal,
Devansh Jain,
Abhinna Sundar Samantaray,
Abhilash Sarwade,
Kasiviswanathan Sankarasubramanian
Abstract:
We present a catalog of 6266 solar flares detected by the X-Ray Solar Monitor onboard the Chandrayaan-2 lunar orbiter between 1.55 and 12.4 keV (1 and 8 Å) from 2019 September 12 to 2022 November 4, including 1469 type A flares. The catalog represents the first large sample, including both type A, hot thermal flares, and type B, impulsive flares, with a sub-A class sensitive instrument. We also detect 213 sub-A and 1330 A class flares. Individual flares are fit with an exponentially-modified Gaussian function and multi-flare groups are decomposed into individual flares. We validate our findings with flare catalogs made using visual inspection as well as automatic pipelines on Geostationary Operational Environmental Satellite and Solar Dynamics Observatory data. We find a clear bimodality in the ratio of the width to decay time between type A and B flares. We infer a power-law index of $α_F = 1.92 \pm 0.09$ for the background-subtracted peak flux distribution of XSM flares, which is consistent with the value $\sim 2$ reported in the literature. We also infer $α_F = 1.90 \pm 0.09$ for type B, and $α_F = 1.94 \pm 0.08$ for type A flares, which has previously not been reported in the literature. These comparable values hint at a similarity in their generative processes.
Submitted 8 January, 2024; v1 submitted 14 December, 2023;
originally announced December 2023.
-
SARA-RT: Scaling up Robotics Transformers with Self-Adaptive Robust Attention
Authors:
Isabel Leal,
Krzysztof Choromanski,
Deepali Jain,
Avinava Dubey,
Jake Varley,
Michael Ryoo,
Yao Lu,
Frederick Liu,
Vikas Sindhwani,
Quan Vuong,
Tamas Sarlos,
Ken Oslund,
Karol Hausman,
Kanishka Rao
Abstract:
We present Self-Adaptive Robust Attention for Robotics Transformers (SARA-RT): a new paradigm for addressing the emerging challenge of scaling up Robotics Transformers (RT) for on-robot deployment. SARA-RT relies on the new method of fine-tuning proposed by us, called up-training. It converts pre-trained or already fine-tuned Transformer-based robotic policies of quadratic time complexity (including massive billion-parameter vision-language-action models or VLAs), into their efficient linear-attention counterparts maintaining high quality. We demonstrate the effectiveness of SARA-RT by speeding up: (a) the class of recently introduced RT-2 models, the first VLA robotic policies pre-trained on internet-scale data, as well as (b) Point Cloud Transformer (PCT) robotic policies operating on large point clouds. We complement our results with the rigorous mathematical analysis providing deeper insight into the phenomenon of SARA.
Submitted 4 December, 2023;
originally announced December 2023.
-
Signatures of Majorana Bound States in the Diffraction Patterns of Extended Superconductor-Topological Insulator-Superconductor Josephson Junctions
Authors:
Guang Yue,
Can Zhang,
Erik D. Huemiller,
Jessica H. Montone,
Gilbert R. Arias,
Drew G. Wild,
Jered Y. Zhang,
David R. Hamilton,
Xiaoyu Yuan,
Xiong Yao,
Deepti Jain,
Jisoo Moon,
Maryam Salehi,
Nikesh Koirala,
Seongshik Oh,
Dale J. Van Harlingen
Abstract:
In an extended superconductor-topological insulator-superconductor (S-TI-S) Josephson junction in a magnetic field, localized Majorana bound states (MBS) are predicted to exist at the cores of Josephson vortices where the local phase difference across the junction is an odd-multiple of $π$. These states contribute a supercurrent with a $4π$-periodic current-phase relation (CPR) that adds to the conventional $2π$-periodic sinusoidal CPR. In this work, we present a comprehensive experimental study of the critical current vs. applied magnetic field diffraction patterns of lateral Nb-Bi$_2$Se$_3$-Nb Josephson junctions. We compare our observations to a model of the Josephson dynamics in the S-TI-S junction system to explore what features of MBS are, or are not, exhibited in these junctions. Consistent with the model, we find several distinct deviations from a Fraunhofer diffraction pattern that is expected for a uniform sin$(φ)$ CPR. In particular, we observe abrupt changes in the diffraction pattern at applied magnetic fields in which the current-carrying localized MBS are expected to enter the junction, and a lifting of the odd-numbered nodes consistent with a $4π$-periodic sin$(φ/2)$-component in the CPR. We also see that although the even-numbered nodes often remain fully-formed, we sometimes see deviations that are consistent with quasiparticle-induced fluctuations in the parity of the MBS pairs that encodes quantum information.
Submitted 21 February, 2024; v1 submitted 27 November, 2023;
originally announced November 2023.
-
Can Language Model Moderators Improve the Health of Online Discourse?
Authors:
Hyundong Cho,
Shuai Liu,
Taiwei Shi,
Darpan Jain,
Basem Rizk,
Yuyang Huang,
Zixun Lu,
Nuan Wen,
Jonathan Gratch,
Emilio Ferrara,
Jonathan May
Abstract:
Conversational moderation of online communities is crucial to maintaining civility for a constructive environment, but it is challenging to scale and harmful to moderators. The inclusion of sophisticated natural language generation modules as a force multiplier to aid human moderators is a tantalizing prospect, but adequate evaluation approaches have so far been elusive. In this paper, we establish a systematic definition of conversational moderation effectiveness grounded on moderation literature and establish design criteria for conducting realistic yet safe evaluation. We then propose a comprehensive evaluation framework to assess models' moderation capabilities independently of human intervention. With our framework, we conduct the first known study of language models as conversational moderators, finding that appropriately prompted models that incorporate insights from social science can provide specific and fair feedback on toxic behavior but struggle to influence users to increase their levels of respect and cooperation.
Submitted 6 May, 2024; v1 submitted 16 November, 2023;
originally announced November 2023.
-
The S-matrix and boundary correlators in flat space
Authors:
Diksha Jain,
Suman Kundu,
Shiraz Minwalla,
Onkar Parrikar,
Siddharth G. Prabhu,
Pushkal Shrivastava
Abstract:
We consider the path integral of a quantum field theory in Minkowski spacetime with fixed boundary values (for the elementary fields) on asymptotic boundaries. We define and study the corresponding boundary correlation functions obtained by taking derivatives of this path integral with respect to the boundary values. The S-matrix of the QFT can be extracted directly from these boundary correlation functions after smearing. We interpret this relation in terms of coherent state quantization and derive the constraints on the path-integral as a function of boundary values that follow from the unitarity of the S-matrix. We then study the locality structure of boundary correlation functions. In the massive case, we find that the boundary correlation functions for generic locations of boundary points are dominated by a saddle point which has the interpretation of particles scattering in a small elevator in the bulk, where the location of the elevator is determined dynamically, and the S-matrix can be recovered after stripping off some dynamically determined but non-local ``renormalization'' factors. In the massless case, we find that while the boundary correlation functions are generically analytic as a function on the whole manifold of locations of boundary points, they have special singularities on a sub-manifold, points on which correspond to light-like scattering in the bulk. We completely characterize this singular scattering sub-manifold, and find that the corresponding residues of the boundary correlations at these singularities are precisely given by S-matrices. This analysis parallels the analysis of bulk-point singularities in AdS/CFT and generalizes it to the case of multi-bulk point singularities.
Submitted 6 November, 2023;
originally announced November 2023.
-
Probing the Physics of Reionization Using kSZ Power Spectrum from Current and Upcoming CMB Surveys
Authors:
Divesh Jain,
Tirthankar Roy Choudhury,
Srinivasan Raghunathan,
Suvodip Mukherjee
Abstract:
The patchiness in the reionization process alters the statistics of Cosmic Microwave Background (CMB), with the kinematic Sunyaev-Zeldovich (kSZ) effect in the CMB temperature power spectrum being a notable consequence. In this work, we aim to explore the potential of future kSZ power spectrum measurements in inferring the details of the reionization process. In this pursuit, we capitalize on the recent developments in foreground mitigation techniques using the Cross-Internal Linear Combination (Cross-ILC) technique, which enables robust detection of the kSZ power spectrum with signal-to-noise ($S/N$) roughly $20-30σ$ in this decade by SPT-3G and Simons Observatory (SO); and $\geq 80σ$ by CMB-S4, substantially improving on the recent evidence for kSZ binned at $\ell=3000$ using SPT-SZ+SPTpol surveys. We use a fiducial kSZ power spectrum along with realistic error bars expected from the above technique for SPT-3G, SO, and CMB-S4 to constrain the parameter space for a physical model of reionization. We find that with the improved error bars it will be possible to place stringent constraints on reionization using solely the Cross-ILC recovered SPT-3G kSZ without imposing any prior on $τ$ in the Bayesian inference. Notably, high-fidelity kSZ measurements from CMB-S4 coupled with $τ$ measurements through LiteBIRD will enable unprecedented constraint on the midpoint of reionization with an error bar of $\sim 0.25$ and the duration of reionization with an error bar at $\sim 0.21$ exclusively using CMB data. This study highlights the need to capture kSZ power spectrum on a broad range of multipoles to gain insights into the inhomogeneous reionization era.
Submitted 12 March, 2024; v1 submitted 1 November, 2023;
originally announced November 2023.
-
Coupled metamaterial-phonon terahertz range polaritons in a topological insulator
Authors:
Sirak M. Mekonen,
Deepti Jain,
Seongshik Oh,
N. P. Armitage
Abstract:
We report terahertz time-domain spectroscopy (TDTS) experiments demonstrating strong light-matter coupling in a terahertz (THz) LC-metamaterial in which the phonon resonance of a topological insulator (TI) thin film is coupled to the photonic modes of an array of electronic split-ring resonators. As we tune the metamaterial resonance frequency through the frequency of the low frequency $α$ mode of (Bi$_x$Sb$_{1-x}$)$_2$Te$_3$ (BST), we observe strong mixing and level repulsion between phonon and metamaterial resonance. This hybrid resonance is a phonon polariton. We observe a normalized coupling strength, $η$ = $Ω_R$/$ω_c$ $\approx$ 0.09, using the measured vacuum Rabi frequency and cavity resonance. Our results demonstrate that one can tune the mechanical properties of materials by changing their electromagnetic environment and therefore modify their magnetic and topological degrees of freedom via coupling to the lattice in this fashion.
Submitted 19 May, 2024; v1 submitted 13 October, 2023;
originally announced October 2023.
-
Robotic Table Tennis: A Case Study into a High Speed Learning System
Authors:
David B. D'Ambrosio,
Jonathan Abelian,
Saminda Abeyruwan,
Michael Ahn,
Alex Bewley,
Justin Boyd,
Krzysztof Choromanski,
Omar Cortes,
Erwin Coumans,
Tianli Ding,
Wenbo Gao,
Laura Graesser,
Atil Iscen,
Navdeep Jaitly,
Deepali Jain,
Juhana Kangaspunta,
Satoshi Kataoka,
Gus Kouretas,
Yuheng Kuang,
Nevena Lazic,
Corey Lynch,
Reza Mahjourian,
Sherry Q. Moore,
Thinh Nguyen,
Ken Oslund
, et al. (10 additional authors not shown)
Abstract:
We present a deep-dive into a real-world robotic learning system that, in previous work, was shown to be capable of hundreds of table tennis rallies with a human and has the ability to precisely return the ball to desired targets. This system puts together a highly optimized perception subsystem, a high-speed low-latency robot controller, a simulation paradigm that can prevent damage in the real world and also train policies for zero-shot transfer, and automated real world environment resets that enable autonomous training and evaluation on physical robots. We complement a complete system description, including numerous design decisions that are typically not widely disseminated, with a collection of studies that clarify the importance of mitigating various sources of latency, accounting for training and deployment distribution shifts, robustness of the perception system, sensitivity to policy hyper-parameters, and choice of action space. A video demonstrating the components of the system and details of experimental results can be found at https://youtu.be/uFcnWjB42I0.
Submitted 6 September, 2023;
originally announced September 2023.
-
Disentangling patchy reionization signatures from primordial gravitational waves using CMB $E$-mode and $B$-mode polarization
Authors:
Divesh Jain,
Suvodip Mukherjee,
Tirthankar Roy Choudhury
Abstract:
The detection of large angular scale $B$-mode in the Cosmic Microwave Background (CMB) polarization signal will open a direct window into not only the primary CMB anisotropies caused by the primordial gravitational waves (PGW) originating in the epoch of inflation, but also the secondary anisotropies imprinted during the epoch of cosmic reionization. The existence of patchiness in the electron density during reionization produces a unique distortion in the CMB $B$-mode polarization, which can be distinguished from the PGW signal with the aid of spatial frequency modes. In this work, we employ an $EB$ estimator by combining $E$-mode and $B$-mode polarization for the $τ$ power spectrum signal generated in a photon-conserving semi-numerical reionization model called SCRIPT. We developed a Bayesian framework for the joint detection of the PGW and reionization signal from CMB observations and show the efficacy of this technique for upcoming CMB experiments. We find that, for our model, the $τ$ power spectrum signal effectively tracks the inhomogeneous electron density field, allowing for robust constraints on the patchy $B$-mode signal. Further, our results indicate that employing the $EB$ estimator for the $τ$ signal will facilitate ground-based CMB-S4 to detect the patchy $B$-mode signal at approximately $\geq 2σ$ confidence level while observations with space-based PICO will improve this detection to $\geq 3σ$ going as high as $\geq 7σ$ for extreme reionization models. These findings not only highlight the future potential of these experiments to provide an improved picture of the reionization process but also have important implications towards an unbiased measurement of $r$.
Submitted 25 October, 2023; v1 submitted 18 August, 2023;
originally announced August 2023.
-
Coherent States in M-Theory: A Brane Scan using the Taub-NUT
Authors:
Joydeep Chakravarty,
Keshav Dasgupta,
Diksha Jain,
Dileep P. Jatkar,
Archana Maji,
Radu Tatar
Abstract:
The Taub-NUT geometry corresponds to the Kaluza-Klein monopole solution of M-theory, and on dimensional reduction along the Taub-NUT circle direction it becomes the D6 brane of type IIA string theory. We show that the Taub-NUT geometry can be realised as a coherent state, or more appropriately as a Glauber-Sudarshan state in M-theory, once we carefully take the underlying resurgence structure into account. Using the duality chain, this in turn implies that all D-branes as well as NS5-branes can be realised as Glauber-Sudarshan states in string theory. Our analysis also leads to an intriguing possibility of realizing the gravity duals of certain non-conformal minimally-supersymmetric gauge theories by deforming a class of Glauber-Sudarshan states.
Submitted 25 October, 2023; v1 submitted 16 August, 2023;
originally announced August 2023.
-
Search for Eccentric Black Hole Coalescences during the Third Observing Run of LIGO and Virgo
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
A. G. Abac,
R. Abbott,
H. Abe,
F. Acernese,
K. Ackley,
C. Adamcewicz,
S. Adhicary,
N. Adhikari,
R. X. Adhikari,
V. K. Adkins,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
O. D. Aguiar,
I. Aguilar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
S. Albanesi,
R. A. Alfaidi
, et al. (1750 additional authors not shown)
Abstract:
Despite the growing number of confident binary black hole coalescences observed through gravitational waves so far, the astrophysical origin of these binaries remains uncertain. Orbital eccentricity is one of the clearest tracers of binary formation channels. Identifying binary eccentricity, however, remains challenging due to the limited availability of gravitational waveforms that include effects of eccentricity. Here, we present observational results for a waveform-independent search sensitive to eccentric black hole coalescences, covering the third observing run (O3) of the LIGO and Virgo detectors. We identified no new high-significance candidates beyond those that were already identified with searches focusing on quasi-circular binaries. We determine the sensitivity of our search to high-mass (total mass $M>70$ $M_\odot$) binaries covering eccentricities up to 0.3 at 15 Hz orbital frequency, and use this to compare model predictions to search results. Assuming all detections are indeed quasi-circular, for our fiducial population model, we place an upper limit for the merger rate density of high-mass binaries with eccentricities $0 < e \leq 0.3$ at $0.33$ Gpc$^{-3}$ yr$^{-1}$ at 90\% confidence level.
Submitted 7 August, 2023;
originally announced August 2023.
-
Construction of Linear Codes from the Unit Graph $G(\mathbb{Z}_{n})$
Authors:
Dr. Rupali S. Jain,
Dr. B. Surendranath Reddy,
Mr. Wajid M. Shaikh
Abstract:
In this paper, we consider the unit graph $G(\mathbb{Z}_{n})$, where $n=p_{1}^{n_{1}} \text{ or } p_{1}^{n_{1}}p_{2}^{n_{2}} \text{ or } p_{1}^{n_{1}}p_{2}^{n_{2}}p_{3}^{n_{3}}$ and $p_{1}, p_{2}, p_{3}$ are distinct primes. For any prime $q$, we construct $q$-ary linear codes from the incidence matrix of the unit graph $G(\mathbb{Z}_{n})$ and determine their parameters. We also prove that the duals of the constructed codes have minimum distance either 3 or 4. Lastly, we state two conjectures, one on the diameter of the unit graph $G(\mathbb{Z}_{n})$ and one on the linear codes constructed from the incidence matrix of the unit graph $G(\mathbb{Z}_{n})$, for any integer $n$.
Submitted 11 July, 2023;
originally announced July 2023.
-
Agile Catching with Whole-Body MPC and Blackbox Policy Learning
Authors:
Saminda Abeyruwan,
Alex Bewley,
Nicholas M. Boffi,
Krzysztof Choromanski,
David D'Ambrosio,
Deepali Jain,
Pannag Sanketi,
Anish Shankar,
Vikas Sindhwani,
Sumeet Singh,
Jean-Jacques Slotine,
Stephen Tu
Abstract:
We address a benchmark task in agile robotics: catching objects thrown at high-speed. This is a challenging task that involves tracking, intercepting, and cradling a thrown object with access only to visual observations of the object and the proprioceptive state of the robot, all within a fraction of a second. We present the relative merits of two fundamentally different solution strategies: (i) Model Predictive Control using accelerated constrained trajectory optimization, and (ii) Reinforcement Learning using zeroth-order optimization. We provide insights into various performance trade-offs including sample efficiency, sim-to-real transfer, robustness to distribution shifts, and whole-body multimodality via extensive on-hardware experiments. We conclude with proposals on fusing "classical" and "learning-based" techniques for agile robot control. Videos of our experiments may be found at https://sites.google.com/view/agile-catching
Submitted 19 October, 2023; v1 submitted 13 June, 2023;
originally announced June 2023.
-
High temperature, gate-free quantum anomalous Hall effect with an active capping layer
Authors:
Hee Taek Yi,
Deepti Jain,
Xiong Yao,
Seongshik Oh
Abstract:
Quantum anomalous Hall effect (QAHE) was discovered a decade ago, but is still not utilized beyond a handful of research groups, due to numerous limitations such as extremely low temperature, electric field-effect gating requirement, small sample sizes and environmental aging effect. Here, we present a robust platform that provides effective solutions to these problems. Specifically, on this platform, we observe QAH signatures at record high temperatures, with the Hall conductance of 1.00 $e^2/h$ at 2.0 K, 0.98 $e^2/h$ at 4.2 K, and 0.92 $e^2/h$ at 10 K, on centimeter-scale substrates, without electric-field-effect gating. The key ingredient is an active CrO$_x$ capping layer, which substantially boosts the ferromagnetism while suppressing environmental degradation. With this development, QAHE will now be accessible to much broader applications than before.
Submitted 7 June, 2023;
originally announced June 2023.
-
Barkour: Benchmarking Animal-level Agility with Quadruped Robots
Authors:
Ken Caluwaerts,
Atil Iscen,
J. Chase Kew,
Wenhao Yu,
Tingnan Zhang,
Daniel Freeman,
Kuang-Huei Lee,
Lisa Lee,
Stefano Saliceti,
Vincent Zhuang,
Nathan Batchelor,
Steven Bohez,
Federico Casarini,
Jose Enrique Chen,
Omar Cortes,
Erwin Coumans,
Adil Dostmohamed,
Gabriel Dulac-Arnold,
Alejandro Escontrela,
Erik Frey,
Roland Hafner,
Deepali Jain,
Bauyrjan Jyenis,
Yuheng Kuang,
Edward Lee
, et al. (19 additional authors not shown)
Abstract:
Animals have evolved various agile locomotion strategies, such as sprinting, leaping, and jumping. There is a growing interest in developing legged robots that move like their biological counterparts and show various agile skills to navigate complex environments quickly. Despite the interest, the field lacks systematic benchmarks to measure the performance of control policies and hardware in agility. We introduce the Barkour benchmark, an obstacle course to quantify agility for legged robots. Inspired by dog agility competitions, it consists of diverse obstacles and a time based scoring mechanism. This encourages researchers to develop controllers that not only move fast, but do so in a controllable and versatile way. To set strong baselines, we present two methods for tackling the benchmark. In the first approach, we train specialist locomotion skills using on-policy reinforcement learning methods and combine them with a high-level navigation controller. In the second approach, we distill the specialist skills into a Transformer-based generalist locomotion policy, named Locomotion-Transformer, that can handle various terrains and adjust the robot's gait based on the perceived environment and robot states. Using a custom-built quadruped robot, we demonstrate that our method can complete the course at half the speed of a dog. We hope that our work represents a step towards creating controllers that enable robots to reach animal-level agility.
Submitted 23 May, 2023;
originally announced May 2023.
-
Double roton-minima in bosonic fractional quantum Hall states
Authors:
Moumita Indra,
Deepak Jain,
Sandip Mondal
Abstract:
We have studied the spin-conserving collective excitation spectra in rotating dilute ultra-cold Bose atoms. Double roton-minima have been observed in the fractional quantum Hall (FQH) states for the two filling fractions ($ν$) of the first series of Jain's composite fermion sequences. The obtained roton-minima for $ν$ = 1/4 are at the wave-vectors 1.26 and 2.38, and the roton-minima for $ν$ = 1/6 are shifted to 1.08 and 2.06. This shift of the roton-minima is attributed to strong correlation between the particles in the bosonic FQH system. Moreover, the number of roton minima observed depends on the number of attached fluxes as well as the range of interaction between the particles.
Submitted 12 May, 2023;
originally announced May 2023.
-
Open data from the third observing run of LIGO, Virgo, KAGRA and GEO
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
R. Abbott,
H. Abe,
F. Acernese,
K. Ackley,
S. Adhicary,
N. Adhikari,
R. X. Adhikari,
V. K. Adkins,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
O. D. Aguiar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
S. Albanesi,
R. A. Alfaidi,
A. Al-Jodah,
C. Alléné,
A. Allocca
, et al. (1719 additional authors not shown)
Abstract:
The global network of gravitational-wave observatories now includes five detectors, namely LIGO Hanford, LIGO Livingston, Virgo, KAGRA, and GEO 600. These detectors collected data during their third observing run, O3, composed of three phases: O3a starting in April of 2019 and lasting six months, O3b starting in November of 2019 and lasting five months, and O3GK starting in April of 2020 and lasting 2 weeks. In this paper we describe these data and various other science products that can be freely accessed through the Gravitational Wave Open Science Center at https://gwosc.org. The main dataset, consisting of the gravitational-wave strain time series that contains the astrophysical signals, is released together with supporting data useful for their analysis and documentation, tutorials, as well as analysis software packages.
Submitted 7 February, 2023;
originally announced February 2023.
-
Mnemosyne: Learning to Train Transformers with Transformers
Authors:
Deepali Jain,
Krzysztof Marcin Choromanski,
Avinava Dubey,
Sumeet Singh,
Vikas Sindhwani,
Tingnan Zhang,
Jie Tan
Abstract:
In this work, we propose a new class of learnable optimizers, called \textit{Mnemosyne}. It is based on the novel spatio-temporal low-rank implicit attention Transformers that can learn to train entire neural network architectures, including other Transformers, without any task-specific optimizer tuning. We show that Mnemosyne: (a) outperforms popular LSTM optimizers (also with new feature engineering to mitigate catastrophic forgetting of LSTMs), (b) can successfully train Transformers while using simple meta-training strategies that require minimal computational resources, (c) matches accuracy-wise SOTA hand-designed optimizers with carefully tuned hyper-parameters (often producing top performing models). Furthermore, Mnemosyne provides space complexity comparable to that of its hand-designed first-order counterparts, which allows it to scale to training larger sets of parameters. We conduct an extensive empirical evaluation of Mnemosyne on: (a) fine-tuning a wide range of Vision Transformers (ViTs) from medium-size architectures to massive ViT-Hs (36 layers, 16 heads), (b) pre-training BERT models and (c) soft prompt-tuning large 11B+ T5XXL models. We complement our results with a comprehensive theoretical analysis of the compact associative memory used by Mnemosyne which we believe was never done before.
Submitted 16 June, 2023; v1 submitted 2 February, 2023;
originally announced February 2023.
-
Efficient Graph Field Integrators Meet Point Clouds
Authors:
Krzysztof Choromanski,
Arijit Sehanobish,
Han Lin,
Yunfan Zhao,
Eli Berger,
Tetiana Parshakova,
Alvin Pan,
David Watkins,
Tianyi Zhang,
Valerii Likhosherstov,
Somnath Basu Roy Chowdhury,
Avinava Dubey,
Deepali Jain,
Tamas Sarlos,
Snigdha Chaturvedi,
Adrian Weller
Abstract:
We present two new classes of algorithms for efficient field integration on graphs encoding point clouds. The first class, SeparatorFactorization(SF), leverages the bounded genus of point cloud mesh graphs, while the second class, RFDiffusion(RFD), uses popular epsilon-nearest-neighbor graph representations for point clouds. Both can be viewed as providing the functionality of Fast Multipole Methods (FMMs), which have had a tremendous impact on efficient integration, but for non-Euclidean spaces. We focus on geometries induced by distributions of walk lengths between points (e.g., shortest-path distance). We provide an extensive theoretical analysis of our algorithms, obtaining new results in structural graph theory as a byproduct. We also perform exhaustive empirical evaluation, including on-surface interpolation for rigid and deformable objects (particularly for mesh-dynamics modeling), Wasserstein distance computations for point clouds, and the Gromov-Wasserstein variant.
Submitted 4 October, 2023; v1 submitted 2 February, 2023;
originally announced February 2023.
-
CRC-RL: A Novel Visual Feature Representation Architecture for Unsupervised Reinforcement Learning
Authors:
Darshita Jain,
Anima Majumder,
Samrat Dutta,
Swagat Kumar
Abstract:
This paper addresses the problem of visual feature representation learning with an aim to improve the performance of end-to-end reinforcement learning (RL) models. Specifically, a novel architecture is proposed that uses a heterogeneous loss function, called CRC loss, to learn improved visual features which can then be used for policy learning in RL. The CRC loss function is a combination of three individual loss functions, namely, contrastive, reconstruction and consistency loss. The feature representation is learned in parallel to the policy learning while sharing the weight updates through a Siamese Twin encoder model. This encoder model is augmented with a decoder network and a feature projection network to facilitate computation of the above loss components. Through empirical analysis involving latent feature visualization, an attempt is made to provide an insight into the role played by this loss function in learning new action-dependent features and how they are linked to the complexity of the problems being solved. The proposed architecture, called CRC-RL, is shown to outperform the existing state-of-the-art methods on the challenging DeepMind Control Suite environments by a significant margin, thereby creating a new benchmark in this field.
Submitted 28 February, 2023; v1 submitted 31 January, 2023;
originally announced January 2023.
-
Beyond the exome: what's next in diagnostic testing for Mendelian conditions
Authors:
Monica H. Wojcik,
Chloe M. Reuter,
Shruti Marwaha,
Medhat Mahmoud,
Michael H. Duyzend,
Hayk Barseghyan,
Bo Yuan,
Philip M. Boone,
Emily E. Groopman,
Emmanuèle C. Délot,
Deepti Jain,
Alba Sanchis-Juan,
Genomics Research to Elucidate the Genetics of Rare Diseases Consortium,
Lea M. Starita,
Michael Talkowski,
Stephen B. Montgomery,
Michael J. Bamshad,
Jessica X. Chong,
Matthew T. Wheeler,
Seth I. Berger,
Anne O'Donnell-Luria,
Fritz J. Sedlazeck,
Danny E. Miller
Abstract:
Despite advances in clinical genetic testing, including the introduction of exome sequencing (ES), more than 50% of individuals with a suspected Mendelian condition lack a precise molecular diagnosis. Clinical evaluation is increasingly undertaken by specialists outside of clinical genetics, often occurring in a tiered fashion and typically ending after ES. The current diagnostic rate reflects multiple factors, including technical limitations, incomplete understanding of variant pathogenicity, missing genotype-phenotype associations, complex gene-environment interactions, and reporting differences between clinical labs. Maintaining a clear understanding of the rapidly evolving landscape of diagnostic tests beyond ES, and their limitations, presents a challenge for non-genetics professionals. Newer tests, such as short-read genome or RNA sequencing, can be challenging to order and emerging technologies, such as optical genome mapping and long-read DNA or RNA sequencing, are not available clinically. Furthermore, there is no clear guidance on the next best steps after inconclusive evaluation. Here, we review why a clinical genetic evaluation may be negative, discuss questions to be asked in this setting, and provide a framework for further investigation, including the advantages and disadvantages of new approaches that are nascent in the clinical sphere. We present a guide for the next best steps after inconclusive molecular testing based upon phenotype and prior evaluation, including when to consider referral to a consortium such as GREGoR, which is focused on elucidating the underlying cause of rare unsolved genetic disorders.
Submitted 18 January, 2023;
originally announced January 2023.
-
Holography of information in massive gravity using Dirac brackets
Authors:
Joydeep Chakravarty,
Diksha Jain,
Akhil Sivakumar
Abstract:
The principle of holography of information states that in massless gravity, it is possible to extract bulk information using asymptotic boundary operators. In our work, we study this principle in a linearized setting about empty flat space and formulate it using Dirac brackets between boundary Hamiltonian and bulk operators. We then address whether the storage of bulk information in flat space linearized massive gravity resembles that of massless gravity. For linearized massless gravity, using Dirac brackets, we recover the necessary criteria for the holography of information. In contrast, we show that the Dirac bracket of the relevant boundary observable with bulk operators vanishes for massive gravity. We use this important distinction to outline the canonical Hilbert space. This leads to split states, and consequently, one cannot use asymptotic boundary observables to extract bulk information in massive gravity. We also argue the split property directly without an explicit reference to the Hilbert space. The result reflects that we can construct local bulk operators in massive gravity, which are obscured from boundary observables due to the lack of diffeomorphism invariance. Our analysis sheds some light on evaporating black holes in the context of the islands proposal.
Submitted 29 June, 2023; v1 submitted 3 January, 2023;
originally announced January 2023.
-
Perturbative soft photon theorems in de Sitter spacetime
Authors:
Sayali Bhatkar,
Diksha Jain
Abstract:
We define a perturbative S-matrix in a local patch of de Sitter background in the limit when the curvature length scale ($\ell$) is large and study the 'soft' behavior of the scalar QED amplitudes in de Sitter spacetime in generic dimensions. We obtain the leading and subleading perturbative corrections to flat space soft photon theorems in the large $\ell$ limit, and comment on the universality of these corrections. We compare our results with the electromagnetic memory tails obtained earlier in $d=4$ using classical radiation analysis.
Submitted 30 December, 2022;
originally announced December 2022.
-
Gamma Rays Bursts: A Viable Cosmological Probe?
Authors:
Darshan Kumar,
Nisha Rani,
Deepak Jain,
Shobhit Mahajan,
Amitabha Mukherjee
Abstract:
In this work, our focus is on exploring the potential of current GRB measurements to provide reliable constraints on cosmological model parameters at high redshift. This work is divided into two parts. First, we calibrate the Amati relation in a model-independent way by using Hubble parameter measurements obtained from the differential ages of the galaxies. We further check if the Amati relation parameters evolve with the GRBs' redshift or not, using the data of Old Astrophysical Objects. The results indicate that GRBs do seem to evolve with redshift. In the second part, we test different cosmological models with the calibrated GRB data obtained by using constant and dynamical Amati relation. Our results indicate that the present quality of GRB data is not good enough to put tight constraints on the cosmological parameters. Hence we perform a joint analysis with the combined data of GRBs and Type Ia Supernovae (SNe) and find that this can considerably enhance cosmological constraints in contrast to solely relying on GRBs.
Submitted 27 June, 2023; v1 submitted 12 December, 2022;
originally announced December 2022.
-
Revisiting the epoch of cosmic acceleration
Authors:
David Dahiya,
Deepak Jain
Abstract:
We revisit the epoch of cosmic speed-up characterized by the redshift of transition from a decelerated to an accelerated phase. This redshift is termed the transition redshift ($z_t$). We use the spatially Flat and Non-Flat variants of the most common $Λ$CDM and XCDM models to put constraints on the transition redshift along with the other model parameters. The data for this analysis comes from the recent and updated Pantheon+ Supernova dataset and the Hubble parameter measurements obtained from Cosmic Chronometers. We consider both datasets with their respective covariance matrices incorporating all kinds of statistical and systematic uncertainties. We observe that using the combined datasets of H(z) and SNe, the best fit value of transition redshift lies in the range $0.61 < z_t < 0.82$ for all four dark energy models. Incidentally, we observe a positive curvature for the Non-Flat models and correlations between several model parameters.
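As a rough illustration of how a transition redshift follows from model parameters (a sketch under textbook assumptions, not the fitting pipeline used in the paper): setting the deceleration parameter to zero in a matter plus dark-energy model with constant equation of state $w$ gives $1+z_t = [-(1+3w)\,Ω_x/Ω_m]^{-1/(3w)}$, which reduces to $(2Ω_Λ/Ω_m)^{1/3}$ for $Λ$CDM ($w=-1$). A curvature term drops out of this condition. The function name and the parameter values below are illustrative and not taken from the paper.

```python
def transition_redshift(omega_m: float, omega_x: float, w: float = -1.0) -> float:
    """Redshift at which q(z) = 0 for matter plus dark energy with constant w.

    Assumes radiation is negligible; a curvature term does not enter the
    q = 0 condition, so the same expression holds for non-flat models.
    """
    if w >= -1.0 / 3.0:
        raise ValueError("acceleration requires w < -1/3")
    ratio = -(1.0 + 3.0 * w) * omega_x / omega_m
    return ratio ** (-1.0 / (3.0 * w)) - 1.0


# Illustrative flat LambdaCDM values (hypothetical, not the paper's best fit).
z_t = transition_redshift(omega_m=0.3, omega_x=0.7)
print(f"z_t = {z_t:.3f}")
```

With these illustrative values the result falls inside the range $0.61 < z_t < 0.82$ quoted in the abstract.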
Submitted 25 June, 2023; v1 submitted 9 December, 2022;
originally announced December 2022.
-
Rational distance sets on a parabola using Pythagorean triplets
Authors:
Sayak Bhattacharjee,
Divyam Jain
Abstract:
We study $N$-point rational distance sets ($\textrm{RDS}(N)$) on the parabola $y=x^2$. Previous approaches to the problem include efforts made using elliptic curves and diophantine chains, with successful analysis for $N\leq 4$. We extend the analysis for arbitrary $N$ by establishing a correspondence between $\textrm{RDS}(N)$s and Pythagorean triplets. Our main result gives sufficient and necessary conditions for the existence and nature of the $\textrm{RDS}(N)$s for arbitrary $N$. Our approach also leads to an efficient computational algorithm to construct new $\textrm{RDS}(N)$s, and we provide multiple new examples of $\textrm{RDS}(N)$s for four and five points. The correspondence with Pythagorean triplets also helps to study the density of the solutions and we reproduce density results for $N=2$ and $3$.
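To make the link to Pythagorean triplets concrete, here is a minimal sketch (not the authors' algorithm): for distinct rational $a$, $b$, the distance between $(a, a^2)$ and $(b, b^2)$ is $|a-b|\sqrt{1+(a+b)^2}$, so it is rational exactly when $1+(a+b)^2$ is the square of a rational, i.e. when $(1, a+b, c)$ is a rational Pythagorean triple. The helper names are hypothetical.

```python
from fractions import Fraction
from itertools import combinations
from math import isqrt


def is_rational_square(q: Fraction) -> bool:
    """True if q >= 0 is the square of a rational number."""
    if q < 0:
        return False
    # Fraction keeps q in lowest terms, so q is a square iff both parts are.
    n, d = q.numerator, q.denominator
    return isqrt(n) ** 2 == n and isqrt(d) ** 2 == d


def is_rational_distance_set(xs: list[Fraction]) -> bool:
    """Check that all pairwise distances between points (x, x^2) are rational."""
    return all(is_rational_square(1 + (a + b) ** 2) for a, b in combinations(xs, 2))


# (0, 0) and (3/4, 9/16): 1 + (3/4)^2 = 25/16 = (5/4)^2, so the distance is rational.
print(is_rational_distance_set([Fraction(0), Fraction(3, 4)]))  # True
# (0, 0) and (1, 1): 1 + 1 = 2 is not a rational square.
print(is_rational_distance_set([Fraction(0), Fraction(1)]))     # False
```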
Submitted 8 December, 2022;
originally announced December 2022.
-
Some positive thoughts about Negative Absolute Temperature
Authors:
Anuradha Gupta,
Deepak Jain
Abstract:
It is now widely accepted that the concept of negative absolute temperature is a real one and not just a theoretical curiosity. In this brief report, by combining the formalism used in statistical mechanics and thermodynamics, we explain some aspects of negative temperature (both mathematically and graphically) in the two-level system. We believe that these simple calculations may give undergraduate students useful and concrete insights into negative absolute temperature.
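The two-level picture can be illustrated with a short sketch (the standard textbook relation, not code from the report): for level populations $N_0$ (lower) and $N_1$ (upper) separated by an energy gap $\epsilon$, the Boltzmann ratio $N_1/N_0 = e^{-\epsilon/k_B T}$ gives $T = \epsilon / (k_B \ln(N_0/N_1))$, which turns negative under population inversion ($N_1 > N_0$). Units with $k_B = 1$ and the sample populations are assumptions made for illustration.

```python
import math


def two_level_temperature(n_lower: float, n_upper: float, gap: float = 1.0) -> float:
    """Temperature (k_B = 1) implied by the populations of a two-level system."""
    if n_lower == n_upper:
        return math.inf  # equal populations correspond to T -> +/- infinity
    return gap / math.log(n_lower / n_upper)


print(two_level_temperature(3.0, 1.0))  # positive: normal population
print(two_level_temperature(1.0, 3.0))  # negative: population inversion
```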
Submitted 15 November, 2022;
originally announced November 2022.
-
GlobalFlowNet: Video Stabilization using Deep Distilled Global Motion Estimates
Authors:
Jerin Geo James,
Devansh Jain,
Ajit Rajwade
Abstract:
Videos shot by laymen using hand-held cameras contain undesirable shaky motion. Estimating the global motion between successive frames, in a manner not influenced by moving objects, is central to many video stabilization techniques, but poses significant challenges. A large body of work uses 2D affine transformations or homography for the global motion. However, in this work, we introduce a more general representation scheme, which adapts any existing optical flow network to ignore the moving objects and obtain a spatially smooth approximation of the global motion between video frames. We achieve this by a knowledge distillation approach, where we first introduce a low pass filter module into the optical flow network to constrain the predicted optical flow to be spatially smooth. This becomes our student network, named GlobalFlowNet. Then, using the original optical flow network as the teacher network, we train the student network using a robust loss function. Given a trained GlobalFlowNet, we stabilize videos using a two stage process. In the first stage, we correct the instability in affine parameters using a quadratic programming approach constrained by a user-specified cropping limit to control loss of field of view. In the second stage, we stabilize the video further by smoothing global motion parameters, expressed using a small number of discrete cosine transform coefficients. In extensive experiments on a variety of different videos, our technique outperforms state of the art techniques in terms of subjective quality and different quantitative measures of video stability. The source code is publicly available at https://github.com/GlobalFlowNet/GlobalFlowNet.
Submitted 4 November, 2022; v1 submitted 25 October, 2022;
originally announced October 2022.
-
A framework to mitigate patchy reionization contamination on the primordial gravitational wave signal
Authors:
Divesh Jain,
Tirthankar Roy Choudhury,
Suvodip Mukherjee,
Sourabh Paul
Abstract:
One of the major goals of future cosmic microwave background (CMB) $B$-mode polarization experiments is the detection of primordial gravitational waves through an unbiased measurement of the tensor-to-scalar ratio $r$. Robust detection of this signal will require mitigating all possible contamination to the $B$-mode polarization from astrophysical origins. One such extragalactic contamination arises from the patchiness in the electron density during the reionization epoch. Along with the signature on CMB polarization, the patchy reionization can source secondary anisotropies on the CMB temperature through the kinetic Sunyaev-Zeldovich (kSZ) effect. In order to study the impact of this foreground for the upcoming CMB missions, we present a self-consistent framework to compute the CMB anisotropies based on a physically motivated model of reionization. We show that the value of $r$ can bias towards a higher value if the secondary contribution from reionization is neglected. However, combining small-scale kSZ signal, large-scale $E$-mode polarization, and $B$-mode polarization measurements, we can put constraints on the patchiness in electron density during reionization and can mitigate its impact on the value of $r$. CMB missions such as CMB-S4 and PICO may experience a bias of $>0.17σ$ which can go as high as $\sim 0.73σ$ for extreme reionization models allowed by the Planck and SPT CMB measurements. As future experiments target to measure $r$ at $5σ$, this is likely to affect the measurement significance and hence possibly affect the claim of detection of $r$, if not mitigated properly by using joint estimations of different reionization observables.
Submitted 15 April, 2023; v1 submitted 26 September, 2022;
originally announced September 2022.
-
Superconducting four-fold Fe(Te,Se) film on six-fold magnetic MnTe via hybrid symmetry epitaxy
Authors:
Xiong Yao,
Alessandro R. Mazza,
Myung-Geun Han,
Hee Taek Yi,
Deepti Jain,
Matthew Brahlek,
Seongshik Oh
Abstract:
Epitaxial Fe(Te,Se) thin films have been grown on various substrates but have never been realized on magnetic layers. Here we report the epitaxial growth of four-fold Fe(Te,Se) film on a six-fold antiferromagnetic insulator, MnTe. The Fe(Te,Se)/MnTe heterostructure shows a clear superconducting transition at around 11 K and the critical magnetic field measurement suggests the origin of the superconductivity to be bulk-like. Structural characterizations suggest that the uniaxial lattice match between Fe(Te,Se) and MnTe allows a hybrid symmetry epitaxy mode, which was recently discovered between Fe(Te,Se) and Bi2Te3. Furthermore, the Te/Fe flux ratio during deposition of the Fe(Te,Se) layer is found to be critical for its superconductivity. Now that superconducting Fe(Te,Se) can be grown on two related hexagonal platforms, Bi2Te3 and MnTe, this result opens a new possibility of combining topological superconductivity of Fe(Te,Se) with the rich physics in the intrinsic magnetic topological materials (MnTe)n(Bi2Te3)m family.
Submitted 30 August, 2022;
originally announced August 2022.
-
Constraints on the transition redshift from the calibrated Gamma-ray Burst $E_{\rm p}$-$E_{\rm iso}$ correlation
Authors:
Marco Muccino,
Orlando Luongo,
Deepak Jain
Abstract:
We constrain the deceleration-acceleration epoch, namely the transition redshift $z_{tr}$, adopting model-independent techniques that utilize a calibrated $E_{\rm p}$-$E_{\rm iso}$ correlation for gamma-ray bursts (GRBs). To do so, in addition to real data points, we employ up to $1000$ simulated observational Hubble data (OHD) points. We then calibrate the $E_{\rm p}$-$E_{\rm iso}$ correlation by means of the well-consolidated Bézier polynomial technique, interpolating OHD up to the second order. Once GRB data have been calibrated, we consider two strategies of cosmographic expansions, i.e., first we take a direct Hubble rate expansion around $z_{tr}$, and second the expansion of the deceleration parameter around the same redshift, but with a different order. Employing type Ia supernovae, baryonic acoustic oscillations and GRB data sets, from Monte Carlo analyses we infer tight constraints on $z_{tr}$ and the jerk parameters at $z=z_{tr}$, namely $j_{tr}$. Our results are extremely compatible with previous outcomes and confirm the $Λ$CDM predictions, being slightly different in terms of the jerk parameter. In this respect, we conjecture which extensions of the concordance paradigm are possible and we compare our findings with expectations provided by generic dark energy models.
Submitted 28 June, 2023; v1 submitted 29 August, 2022;
originally announced August 2022.
-
Implicit Two-Tower Policies
Authors:
Yunfan Zhao,
Qingkai Pan,
Krzysztof Choromanski,
Deepali Jain,
Vikas Sindhwani
Abstract:
We present a new class of structured reinforcement learning policy-architectures, Implicit Two-Tower (ITT) policies, where the actions are chosen based on the attention scores of their learnable latent representations with those of the input states. By explicitly disentangling action from state processing in the policy stack, we achieve two main goals: substantial computational gains and better performance. Our architectures are compatible with both discrete and continuous action spaces. By conducting tests on 15 environments from OpenAI Gym and DeepMind Control Suite, we show that ITT-architectures are particularly suited for blackbox/evolutionary optimization and the corresponding policy training algorithms outperform their vanilla unstructured implicit counterparts as well as commonly used explicit policies. We complement our analysis by showing how techniques such as hashing and lazy tower updates, critically relying on the two-tower structure of ITTs, can be applied to obtain additional computational improvements.
Submitted 25 October, 2023; v1 submitted 1 August, 2022;
originally announced August 2022.
-
i-Sim2Real: Reinforcement Learning of Robotic Policies in Tight Human-Robot Interaction Loops
Authors:
Saminda Abeyruwan,
Laura Graesser,
David B. D'Ambrosio,
Avi Singh,
Anish Shankar,
Alex Bewley,
Deepali Jain,
Krzysztof Choromanski,
Pannag R. Sanketi
Abstract:
Sim-to-real transfer is a powerful paradigm for robotic reinforcement learning. The ability to train policies in simulation enables safe exploration and large-scale data collection quickly at low cost. However, prior works in sim-to-real transfer of robotic policies typically do not involve any human-robot interaction because accurately simulating human behavior is an open problem. In this work, our goal is to leverage the power of simulation to train robotic policies that are proficient at interacting with humans upon deployment. But there is a chicken and egg problem -- how to gather examples of a human interacting with a physical robot so as to model human behavior in simulation without already having a robot that is able to interact with a human? Our proposed method, Iterative-Sim-to-Real (i-S2R), attempts to address this. i-S2R bootstraps from a simple model of human behavior and alternates between training in simulation and deploying in the real world. In each iteration, both the human behavior model and the policy are refined. For all training we apply a new evolutionary search algorithm called Blackbox Gradient Sensing (BGS). We evaluate our method on a real world robotic table tennis setting, where the objective for the robot is to play cooperatively with a human player for as long as possible. Table tennis is a high-speed, dynamic task that requires the two players to react quickly to each other's moves, making for a challenging test bed for research on human-robot interaction. We present results on an industrial robotic arm that is able to cooperatively play table tennis with human players, achieving rallies of 22 successive hits on average and 150 at best. Further, for 80% of players, rally lengths are 70% to 175% longer compared to the sim-to-real plus fine-tuning (S2R+FT) baseline. For videos of our system in action, please see https://sites.google.com/view/is2r.
Submitted 21 November, 2022; v1 submitted 13 July, 2022;
originally announced July 2022.
-
Constraints on the Transition Redshift using Hubble Phase Space Portrait
Authors:
Darshan Kumar,
Deepak Jain,
Shobhit Mahajan,
Amitabha Mukherjee,
Akshay Rana
Abstract:
One of the most significant discoveries in modern cosmology is that the universe is currently in a phase of accelerated expansion after a switch from a decelerated expansion. The redshift corresponding to this epoch is referred to as the transition redshift $z_t$. In this work we put constraints on the $z_t$ with both model-independent and model-dependent approaches. We consider 32 Hubble parameter measurements and the Pantheon sample of Type Ia Supernovae (SNe). In order to include the possible systematic effects in this analysis, we use the full covariance matrix of systematic uncertainties for the Hubble parameter measurements. We plot a Hubble Phase Space Portrait (HPSP) between $\dot{H}(z)$ and $H(z)$ in a model-independent way. From this HPSP diagram, we estimate the transition redshift as well as the current value of the equation of state parameter $ω_0$ in a model-independent way. By considering H(z) measurements, we find the best fit value of $z_t=0.591^{+0.332}_{-0.332}$ and $ω_0=-0.677^{+0.238}_{-0.238}$. We obtain the best fit value of $z_t=0.849^{+0.117}_{-0.117}$ and $ω_0=-0.870^{+0.013}_{-0.013}$ using the Pantheon database. Further, we also use a model dependent approach to determine $z_t$. Here, we consider a non-flat $Λ$CDM model as a background cosmological model. We reconstruct the cosmic triangle plot among $\log(Ω_{m0})$, $-\log(2Ω_{\Lambda0})$ and $3\log(1+z_t)$ where the constraints of each parameter are determined by the location in this triangle plot. Using $Ω_{m0}$ and $Ω_{\Lambda0}$ values, we find the best value of the transition redshift $z_t=0.619^{+0.580}_{-0.758}$, which is in good agreement with the Planck 2018 results at $1σ$ confidence level. We also simulate the observed Hubble parameter measurements in the redshift range $0<z<2$ and perform the same analysis to estimate the transition redshift.
Submitted 27 March, 2023; v1 submitted 26 May, 2022;
originally announced May 2022.
-
Self-Labeling Refinement for Robust Representation Learning with Bootstrap Your Own Latent
Authors:
Siddhant Garg,
Dhruval Jain
Abstract:
In this work, we have worked towards two major goals. Firstly, we have investigated the importance of Batch Normalisation (BN) layers in a non-contrastive representation learning framework called Bootstrap Your Own Latent (BYOL). We conducted several experiments to conclude that BN layers are not necessary for representation learning in BYOL. Moreover, BYOL only learns from the positive pairs of images but ignores other semantically similar images in the same input batch. For the second goal, we have introduced two new loss functions to determine the semantically similar pairs in the same input batch of images and reduce the distance between their representations. These loss functions are Cross-Cosine Similarity Loss (CCSL) and Cross-Sigmoid Similarity Loss (CSSL). Using the proposed loss functions, we are able to surpass the performance of Vanilla BYOL (71.04%) by training the BYOL framework using CCSL loss (76.87%) on the STL10 dataset. BYOL trained using CSSL loss performs comparably with Vanilla BYOL.
Submitted 9 April, 2022;
originally announced April 2022.
-
Notes on 5d Partition Functions -- II
Authors:
Dharmesh Jain
Abstract:
We study the large $N$ limit of partition functions for 5d supersymmetric gauge theories with fundamental matter. Depending on the matter content, we find that the scaling behaviour at the leading order can be either $N^2$ or $N^{\frac{3}{2}}$. The latter scaling reminds one of the 3d theories with M-theory duals and we discuss how to extract this behaviour from the recently proposed 3d theories associated with the compactification of 5d SCFTs on 2d surfaces.
Submitted 29 September, 2022; v1 submitted 31 March, 2022;
originally announced March 2022.
-
ProtoSound: A Personalized and Scalable Sound Recognition System for Deaf and Hard-of-Hearing Users
Authors:
Dhruv Jain,
Khoa Huynh Anh Nguyen,
Steven Goodman,
Rachel Grossman-Kahn,
Hung Ngo,
Aditya Kusupati,
Ruofei Du,
Alex Olwal,
Leah Findlater,
Jon E. Froehlich
Abstract:
Recent advances have enabled automatic sound recognition systems for deaf and hard of hearing (DHH) users on mobile devices. However, these tools use pre-trained, generic sound recognition models, which do not meet the diverse needs of DHH users. We introduce ProtoSound, an interactive system for customizing sound recognition models by recording a few examples, thereby enabling personalized and fine-grained categories. ProtoSound is motivated by prior work examining sound awareness needs of DHH people and by a survey we conducted with 472 DHH participants. To evaluate ProtoSound, we characterized performance on two real-world sound datasets, showing significant improvement over state-of-the-art (e.g., +9.7% accuracy on the first dataset). We then deployed ProtoSound's end-user training and real-time recognition through a mobile application and recruited 19 hearing participants who listened to the real-world sounds and rated the accuracy across 56 locations (e.g., homes, restaurants, parks). Results show that ProtoSound personalized the model on-device in real-time and accurately learned sounds across diverse acoustic contexts. We close by discussing open challenges in personalizable sound recognition, including the need for better recording interfaces and algorithmic improvements.
Submitted 22 February, 2022;
originally announced February 2022.