-
Equivalence of flexible stripline and coaxial cables for superconducting qubit control and readout pulses
Authors:
V. Y. Monarkha,
S. Simbierowicz,
M. Borrelli,
R. van Gulik,
N. Drobotun,
D. Kuitenbrouwer,
D. Bouman,
D. Datta,
P. Eskelinen,
E. Mannila,
J. Kaikkonen,
V. Vesterinen,
J. Govenius,
R. E. Lake
Abstract:
We report a comparative study on microwave control lines for a transmon qubit using: (i) flexible stripline transmission lines, and (ii) semi-rigid coaxial cables. During each experiment we performed repeated measurements of the energy relaxation and coherence times of a transmon qubit using one of the wiring configurations. Each measurement run spanned 70 h to 250 h of measurement time, and four separate cooldowns were performed so that each configuration could be tested twice. From these datasets we observe that changing the microwave control lines from coaxial cables to flexible stripline transmission lines does not have a measurable effect on coherence compared to thermally cycling the system or to random coherence fluctuations. Our results open up the possibility of large-scale integration of qubit control lines with integrated components with planar layouts on flexible substrates.
Submitted 15 May, 2024;
originally announced May 2024.
-
Polaris: A Safety-focused LLM Constellation Architecture for Healthcare
Authors:
Subhabrata Mukherjee,
Paul Gamble,
Markel Sanz Ausin,
Neel Kant,
Kriti Aggarwal,
Neha Manjunath,
Debajyoti Datta,
Zhengliang Liu,
Jiayuan Ding,
Sophia Busacca,
Cezanne Bianco,
Swapnil Sharma,
Rae Lasko,
Michelle Voisard,
Sanchay Harneja,
Darya Filippova,
Gerry Meixiong,
Kevin Cha,
Amir Youssefi,
Meyhaa Buvanesh,
Howard Weingram,
Sebastian Bierman-Lytle,
Harpreet Singh Mangat,
Kim Parikh,
Saad Godil
, et al. (1 additional author not shown)
Abstract:
We develop Polaris, the first safety-focused LLM constellation for real-time patient-AI healthcare conversations. Unlike prior LLM works in healthcare focusing on tasks like question answering, our work specifically focuses on long multi-turn voice conversations. Our one-trillion parameter constellation system is composed of several multibillion parameter LLMs as co-operative agents: a stateful primary agent that focuses on driving an engaging conversation and several specialist support agents focused on healthcare tasks performed by nurses to increase safety and reduce hallucinations. We develop a sophisticated training protocol for iterative co-training of the agents that optimizes for diverse objectives. We train our models on proprietary data, clinical care plans, healthcare regulatory documents, medical manuals, and other medical reasoning documents. We align our models to speak like medical professionals, using organic healthcare conversations and simulated ones between patient actors and experienced nurses. This allows our system to express unique capabilities such as rapport building, trust building, empathy and bedside manner. Finally, we present the first comprehensive clinician evaluation of an LLM system for healthcare. We recruited over 1100 U.S. licensed nurses and over 130 U.S. licensed physicians to perform end-to-end conversational evaluations of our system by posing as patients and rating the system on several measures. We demonstrate that Polaris performs on par with human nurses on aggregate across dimensions such as medical safety, clinical readiness, conversational quality, and bedside manner. Additionally, we conduct a challenging task-based evaluation of the individual specialist support agents, where we demonstrate that our LLM agents significantly outperform a much larger general-purpose LLM (GPT-4) as well as one from their own medium-size class (LLaMA-2 70B).
Submitted 20 March, 2024;
originally announced March 2024.
-
Near-ground state cooling in electromechanics using measurement-based feedback and Josephson parametric amplifier
Authors:
Ewa Rej,
Richa Cutting,
Debopam Datta,
Nils Tiencken,
Joonas Govenius,
Visa Vesterinen,
Yulong Liu,
Mika A. Sillanpää
Abstract:
Feedback-based control of nano- and micromechanical resonators can enable the study of macroscopic quantum phenomena and also sensitive force measurements. Here, we demonstrate the feedback cooling of a low-loss and high-stress macroscopic SiN membrane resonator close to its quantum ground state. We use the microwave optomechanical platform, where the resonator is coupled to a microwave cavity. The experiment utilizes a Josephson travelling wave parametric amplifier, which is nearly quantum-limited in added noise, and is important to mitigate resonator heating due to system noise in the feedback loop. We reach a thermal phonon number as low as 1.6, which is limited primarily by microwave-induced heating. We also discuss the sideband asymmetry observed when a weak microwave tone for independent readout is applied in addition to other tones used for the cooling. The asymmetry can be qualitatively attributed to the quantum-mechanical imbalance between emission and absorption. However, we find that the observed asymmetry is only partially due to this quantum effect. In specific situations, the asymmetry is fully dominated by a cavity Kerr effect under multitone irradiation.
Submitted 4 March, 2024;
originally announced March 2024.
-
Exploring the Mechanical Behaviors of 2D Materials in Electrochemical Energy Storage Systems: Present Insights and Future Prospects
Authors:
Dibakar Datta
Abstract:
2D materials (2DM) and their heterostructures (2D + nD, n = 0,1,2,3) hold significant promise for applications in Electrochemical Energy Storage Systems (EESS), such as batteries. 2DM can serve as a van der Waals (vdW) slick interface between conventional active materials (e.g., Silicon) and current collectors, modifying interfacial adhesion and preventing stress-induced fractures. Additionally, 2DM (e.g., MXenes) can replace traditional polymer binders. This arrangement also underscores the critical role of interfacial mechanics between 2DM and active materials. Furthermore, 2DM can be designed to function as an electrode itself. For instance, a porous graphene network has been reported to possess approximately five times the capacity of a traditional graphite anode. Consequently, gaining a comprehensive understanding of the mechanical properties of 2DM in EESS is paramount. However, modeling 2DM in EESS poses significant challenges due to the intricate coupling of mechanics and electrochemistry. For instance, defective graphene tends to favor adatom adsorption (e.g., Li+) during charging. In cases of strong adsorption, adatoms may not readily detach from electrodes during discharging. As a result, in such scenarios, adsorption-desorption (charge-discharge) processes govern the mechanical properties of 2DM when used as binders and current collectors. Regrettably, most existing studies on the mechanical properties of 2DM in EESS have failed to adequately address these critical issues. This perspective paper aims to provide a comprehensive overview of recent progress in the chemo-mechanics of 2DM's mechanical properties. A wide spectrum of multiscale modeling approaches, including atomistic/molecular simulations, continuum modeling, and machine learning, is discussed.
Submitted 10 November, 2023;
originally announced November 2023.
-
MILDSum: A Novel Benchmark Dataset for Multilingual Summarization of Indian Legal Case Judgments
Authors:
Debtanu Datta,
Shubham Soni,
Rajdeep Mukherjee,
Saptarshi Ghosh
Abstract:
Automatic summarization of legal case judgments is a practically important problem that has attracted substantial research efforts in many countries. In the context of the Indian judiciary, there is an additional complexity -- Indian legal case judgments are mostly written in complex English, but a significant portion of India's population lacks command of the English language. Hence, it is crucial to summarize the legal documents in Indian languages to ensure equitable access to justice. While prior research primarily focuses on summarizing legal case judgments in their source languages, this study presents a pioneering effort toward cross-lingual summarization of English legal documents into Hindi, the most frequently spoken Indian language. We construct the first high-quality legal corpus comprising 3,122 case judgments from prominent Indian courts in English, along with their summaries in both English and Hindi, drafted by legal practitioners. We benchmark the performance of several diverse summarization approaches on our corpus and demonstrate the need for further research in cross-lingual summarization in the legal domain.
Submitted 28 October, 2023;
originally announced October 2023.
-
Improving Access to Justice for the Indian Population: A Benchmark for Evaluating Translation of Legal Text to Indian Languages
Authors:
Sayan Mahapatra,
Debtanu Datta,
Shubham Soni,
Adrijit Goswami,
Saptarshi Ghosh
Abstract:
Most legal text in the Indian judiciary is written in complex English due to historical reasons. However, only about 10% of the Indian population is comfortable reading English. Hence legal text needs to be made available in various Indian languages, possibly by translating the available legal text from English. Though there has been a lot of research on translation to and between Indian languages, to our knowledge, there has not been much prior work on such translation in the legal domain. In this work, we construct the first high-quality legal parallel corpus containing aligned text units in English and nine Indian languages, including several low-resource languages. We also benchmark the performance of a wide variety of Machine Translation (MT) systems over this corpus, including commercial MT systems, open-source MT systems and Large Language Models. Through a comprehensive survey of law practitioners, we check how satisfied they are with the translations by some of these MT systems, and how well automatic MT evaluation metrics agree with the opinions of law practitioners.
Submitted 15 October, 2023;
originally announced October 2023.
-
Exploring Thermal Transport in Electrochemical Energy Storage Systems Utilizing Two-Dimensional Materials: Prospects and Hurdles
Authors:
Dibakar Datta,
Eon Soo Lee
Abstract:
Two-dimensional materials and their heterostructures have enormous applications in Electrochemical Energy Storage Systems (EESS) such as batteries. A comprehensive and solid understanding of these materials' thermal transport and mechanism is essential for the practical design of EESS. Experiments have challenges in providing improved control and characterization of complex structures, especially for low-dimensional materials. Theoretical and simulation tools such as first-principles calculations, Boltzmann transport equations, molecular dynamics simulations, lattice dynamics simulation, and non-equilibrium Green's function provide reliable predictions of thermal conductivity and physical insights to understand the underlying thermal transport mechanism in materials. However, doing these calculations requires high computational resources. The development of new materials synthesis technology and fast-growing demand for rapid and accurate prediction of physical properties require novel computational approaches. The machine learning (ML) method provides a promising solution to address such needs. This review details the recent development in atomistic/molecular studies and ML of thermal transport in EESS. The paper also addresses the latest significant experimental advances. However, designing the best low-dimensional materials-based heterostructures is like a multivariate optimization problem. For example, a particular heterostructure may be suitable for thermal transport but can have lower mechanical strength/stability. For bi/multilayer structures, the interlayer distance may influence the thermal transport properties and interlayer strength. Therefore, the last part addresses the future research direction in low-dimensional materials-based heterostructure design for thermal transport in EESS.
Submitted 5 September, 2023;
originally announced October 2023.
-
Electro-Chemo-Mechanical Modeling of Multiscale Active Materials for Next-Generation Energy Storage: Opportunities and Challenges
Authors:
Dibakar Datta
Abstract:
The recent geopolitical crisis resulted in a gas price surge. Although lithium-ion batteries (LIBs) represent the best available rechargeable battery technology, a significant energy and power density gap exists between LIBs and petrol/gasoline. The battery electrodes comprise a mixture of active material particles, conductive carbon, and binder additives deposited onto a current collector. Although this basic design has persisted for decades, the active material particle's desired size scale is debated. Traditionally, microparticles have been used in batteries. Advances in nanotechnology have spurred interest in deploying nanoparticles as active materials. However, despite many efforts in nano, industries still primarily use 'old' microparticles. Most importantly, the battery industry is unlikely to replace microstructures with nanometer-sized analogs. This poses an important question: given that microstructures are irreplaceable, is there a place for nanostructures in battery design? The way forward lies in multiscale active materials: microscale structures with built-in nanoscale features, such as microparticles assembled from nanoscale building blocks or patterned with engineered or natural nanopores. Although experimental strides have been made in developing such materials, computational progress in this domain remains limited and, in some cases, negligible. However, the field holds immense computational potential, presenting a multitude of opportunities. This perspective highlights the existing gaps in modeling multiscale active materials and delineates various open challenges in the realm of electro-chemo-mechanical modeling. By doing so, it aims to inspire computational research within this field and promote synergistic collaborative efforts between computational and experimental researchers.
Submitted 5 September, 2023;
originally announced September 2023.
-
Transferable and Robust Machine Learning Model for Predicting Stability of Si Anodes for Multivalent Cation Batteries
Authors:
Joy Datta,
Dibakar Datta,
Vidushi Sharma
Abstract:
Data-driven methodology has become a key tool in computationally predicting material properties. Currently, these techniques are costly because of the computational requirements for generating sufficient training data for high-precision machine learning models. In this study, we present a Support Vector Regression (SVR)-based machine learning model to predict the stability of silicon (Si) - alkaline metal alloys, with a strong emphasis on the transferability of the model to new silicon alloys with different electronic configurations and structures. We elaborate on the role of the structural descriptor in imparting transferability to the model that is trained on limited data (~750 Si alloys) derived from the Materials Project database. Three popular descriptors, namely X-Ray Diffraction (XRD), Sine Coulomb Matrix (SCM), and Orbital Field Matrix (OFM), are evaluated for representing Si alloys. The material structures are represented by descriptors in the SVR model, coupled with hyperparameter tuning techniques like Grid Search CV and Bayesian Optimization (BO), to find the best performing model for predicting total energy, formation energy and packing fraction of the Si alloy systems. The models are trained on Si alloys with lithium (Li), sodium (Na), potassium (K), magnesium (Mg), calcium (Ca), and aluminum (Al) metals, where Si-Na and Si-Al systems are used as test structures. Our results show that XRD, an experimentally derived characterization of structures, performs most reliably as a descriptor for total energy prediction of new Si alloys. The study demonstrates that by qualitative selection of training data, using hyperparameter tuning methods, and employing appropriate structural descriptors, the data requirements for robust and accurate ML models can be reduced.
Submitted 25 June, 2023;
originally announced June 2023.
-
Effect of Graphene Interface on Potassiation in a Graphene-Selenium Heterostructure Cathode for Potassium-ion Batteries
Authors:
Vidushi Sharma,
Dibakar Datta
Abstract:
Selenium (Se) cathodes are an exciting emerging high energy density storage system for potassium-ion batteries (KIB), where potassiation reactions are less understood. Here, we present an atomic-level investigation of a KxSe cathode enclosed in hexagonal lattices of carbon (C) characteristic of a multilayered graphene matrix and multiwalled carbon nanotubes (MW-CNTs). Microstructural changes directed by the graphene substrate in the KxSe cathode are contrasted with a graphene-free cathode. Graphene's binding affinity for long-chain polyselenides (Se-Se-Se = -2.82 eV and Se-Se = -2.646 eV) and ability to induce reactivity between Se and K are investigated. Furthermore, intercalation voltages for graphene-enclosed KxSe cathode reaction intermediates are calculated with K2Se as the final discharged product. Our results indicate a single-step reaction near a voltage of 1.55 V between K and the Se cathode. Our findings suggest that operating at higher voltages (~2 V) could result in the formation of reaction intermediates where intercalation/deintercalation of K could be a challenge, and therefore cause irreversible capacity losses in the battery. Primary issues are the high binding energy of long-chain polyselenides with graphene that discourage K storage and Se-Se bond dissociation at low K concentrations. A comparison with a graphene-free cathode highlights the substantial changes a van der Waals (vdW) graphene interface can bring to the atomic structure and electrochemistry of the KxSe cathode.
Submitted 31 July, 2023; v1 submitted 13 April, 2023;
originally announced April 2023.
-
Wearable Sensor-based Multimodal Physiological Responses of Socially Anxious Individuals across Social Contexts
Authors:
Emma R. Toner,
Mark Rucker,
Zhiyuan Wang,
Maria A. Larrazabal,
Lihua Cai,
Debajyoti Datta,
Elizabeth Thompson,
Haroon Lone,
Mehdi Boukhechba,
Bethany A. Teachman,
Laura E. Barnes
Abstract:
Correctly identifying an individual's social context from passively worn sensors holds promise for delivering just-in-time adaptive interventions (JITAIs) to treat social anxiety disorder. In this study, we present results using passively collected data from a within-subject experiment that assessed physiological response across different social contexts (i.e., alone vs. with others), social phases (i.e., pre- and post-interaction vs. during an interaction), social interaction sizes (i.e., dyadic vs. group interactions), and levels of social threat (i.e., implicit vs. explicit social evaluation). Participants in the study ($N=46$) reported moderate to severe social anxiety symptoms as assessed by the Social Interaction Anxiety Scale ($\geq$34 out of 80). Univariate paired difference tests, multivariate random forest models, and follow-up cluster analyses were used to explore physiological response patterns across different social and non-social contexts. Our results suggest that social context is more reliably distinguishable than social phase, group size, or level of social threat, but that there is considerable variability in physiological response patterns even among these distinguishable contexts. Implications for real-world context detection and deployment of JITAIs are discussed.
Submitted 3 April, 2023;
originally announced April 2023.
-
Open-tunneled oxides as intercalation host for multivalent ion (Ca and Al) batteries: A DFT study
Authors:
Joy Datta,
Nikhil Koratkar,
Dibakar Datta
Abstract:
Lithium-ion batteries (LIBs) are ubiquitous in everyday applications. However, lithium (Li) is a limited resource on the planet and is therefore not sustainable. As an alternative to lithium, earth-abundant and cheaper multivalent metals such as aluminum (Al) and calcium (Ca) have been actively researched in battery systems. However, suitable intercalation hosts for multivalent-ion batteries are urgently needed. Open-tunneled oxides are a particular category of microparticles distinguished by the presence of integrated one-dimensional channels or nanopores. This work focuses on two promising open-tunnel oxides, namely Niobium Tungsten Oxide (NTO) and Molybdenum Vanadium Oxide (MoVO). We find that the MoVO structure can adsorb greater numbers of multivalent ions than NTO due to its larger surface area and different shapes. The MoVO structure can adsorb Ca, Li, and Al ions with adsorption potentials of around 4 to 5 eV. However, the adsorption potential of the Al ion in hexagonal channels drops to 1.73 eV because of the smaller channel area. The NTO structure has an insertion/adsorption potential of 4.4 eV, 3.4 eV, and 0.9 eV for one Li, Ca, and Al, respectively. In general, the Ca ion is more adsorbable than the Al ion in both MoVO and NTO structures. Bader charge analysis and charge density plots reveal the role of charge transfer and ion size on the insertion of multivalent ions such as Ca and Al into MoVO and NTO systems. Our results provide general guidelines to explore other multivalent ions for battery applications.
Submitted 22 March, 2023;
originally announced March 2023.
-
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
Authors:
BigScience Workshop,
Teven Le Scao,
Angela Fan,
Christopher Akiki,
Ellie Pavlick,
Suzana Ilić,
Daniel Hesslow,
Roman Castagné,
Alexandra Sasha Luccioni,
François Yvon,
Matthias Gallé,
Jonathan Tow,
Alexander M. Rush,
Stella Biderman,
Albert Webson,
Pawan Sasanka Ammanamanchi,
Thomas Wang,
Benoît Sagot,
Niklas Muennighoff,
Albert Villanova del Moral,
Olatunji Ruwase,
Rachel Bawden,
Stas Bekman,
Angelina McMillan-Major
, et al. (369 additional authors not shown)
Abstract:
Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access language model designed and built thanks to a collaboration of hundreds of researchers. BLOOM is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total). We find that BLOOM achieves competitive performance on a wide variety of benchmarks, with stronger results after undergoing multitask prompted finetuning. To facilitate future research and applications using LLMs, we publicly release our models and code under the Responsible AI License.
Submitted 27 June, 2023; v1 submitted 9 November, 2022;
originally announced November 2022.
-
Shape Analysis for Pediatric Upper Body Motor Function Assessment
Authors:
Shashwat Kumar,
Robert Gutierez,
Debajyoti Datta,
Sarah Tolman,
Allison McCrady,
Silvia Blemker,
Rebecca J. Scharf,
Laura Barnes
Abstract:
Neuromuscular disorders, such as Spinal Muscular Atrophy (SMA) and Duchenne Muscular Dystrophy (DMD), cause progressive muscular degeneration and loss of motor function for 1 in 6,000 children. Traditional upper limb motor function assessments do not quantitatively measure patient-performed motions, which makes it difficult to track progress for incremental changes. Assessing motor function in children with neuromuscular disorders is particularly challenging because they can be nervous or excited during experiments, or simply be too young to follow precise instructions. These challenges translate to confounding factors such as performing different parts of the arm curl slower or faster (phase variability), which affects the assessed motion quality. This paper uses curve registration and shape analysis to temporally align trajectories while simultaneously extracting a mean reference shape. Distances from this mean shape are used to assess the quality of motion. The proposed metric is invariant to confounding factors, such as phase variability, while suggesting several clinically relevant insights. First, there are statistically significant differences between functional scores for the control and patient populations ($p = 0.0213 \le 0.05$). Next, several patients in the patient cohort are able to perform motion on par with the healthy cohort and vice versa. Our metric, which is computed based on wearables, is related to the Brooke's score ($p = 0.00063 \le 0.05$), as well as motor function assessments based on dynamometry ($p = 0.0006 \le 0.05$). These results show promise towards ubiquitous motion quality assessment in daily life.
Submitted 10 September, 2022;
originally announced September 2022.
-
Scrutinizing Shipment Records To Thwart Illegal Timber Trade
Authors:
Debanjan Datta,
Sathappan Muthiah,
John Simeone,
Amelia Meadows,
Naren Ramakrishnan
Abstract:
Timber and forest products made from wood, like furniture, are valuable commodities, and like the global trade of many highly-valued natural resources, face challenges of corruption, fraud, and illegal harvesting. These grey and black market activities in the wood and forest products sector are not limited to the countries where the wood was harvested, but extend throughout the global supply chain and have been tied to illicit financial flows, like trade-based money laundering, document fraud, species mislabeling, and other illegal activities. The task of finding such fraudulent activities using trade data, in the absence of ground truth, can be modelled as an unsupervised anomaly detection problem. However, existing approaches suffer from certain shortcomings in their applicability towards large-scale trade data. Trade data is heterogeneous, with both categorical and numerical attributes in a tabular format. The overall challenge lies in the complexity, volume and velocity of data, with a large number of entities and a lack of ground truth labels. To mitigate these, we propose a novel unsupervised anomaly detection method, Contrastive Learning based Heterogeneous Anomaly Detection (CHAD), that is generally applicable for large-scale heterogeneous tabular data. We demonstrate that our model CHAD performs favorably against multiple comparable baselines for public benchmark datasets, and outperforms them in the case of trade data. More importantly, we demonstrate that our approach reduces the assumptions and effort required for hyperparameter tuning, which is a key challenging aspect in an unsupervised training paradigm. Specifically, our overarching objective pertains to detecting suspicious timber shipments and patterns using Bill of Lading trade record data. Detecting anomalous transactions in shipment records can enable further investigation by government agencies and supply chain constituents.
Submitted 31 July, 2022;
originally announced August 2022.
-
BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing
Authors:
Jason Alan Fries,
Leon Weber,
Natasha Seelam,
Gabriel Altay,
Debajyoti Datta,
Samuele Garda,
Myungsun Kang,
Ruisi Su,
Wojciech Kusa,
Samuel Cahyawijaya,
Fabio Barth,
Simon Ott,
Matthias Samwald,
Stephen Bach,
Stella Biderman,
Mario Sänger,
Bo Wang,
Alison Callahan,
Daniel León Periñán,
Théo Gigant,
Patrick Haller,
Jenny Chim,
Jose David Posada,
John Michael Giorgi,
Karthik Rangasai Sivaraman
, et al. (18 additional authors not shown)
Abstract:
Training and evaluating language models increasingly requires the construction of meta-datasets -- diverse collections of curated data with clear provenance. Natural language prompting has recently led to improved zero-shot generalization by transforming existing, supervised datasets into a diversity of novel pretraining tasks, highlighting the benefits of meta-dataset curation. While successful in general-domain text, translating these data-centric approaches to biomedical language modeling remains challenging, as labeled biomedical datasets are significantly underrepresented in popular data hubs. To address this challenge, we introduce BigBIO, a community library of 126+ biomedical NLP datasets, currently covering 12 task categories and 10+ languages. BigBIO facilitates reproducible meta-dataset curation via programmatic access to datasets and their metadata, and is compatible with current platforms for prompt engineering and end-to-end few/zero shot language model evaluation. We discuss our process for task schema harmonization, data auditing, and contribution guidelines, and outline two illustrative use cases: zero-shot evaluation of biomedical prompts and large-scale, multi-task learning. BigBIO is an ongoing community effort and is available at https://github.com/bigscience-workshop/biomedical
Submitted 30 June, 2022;
originally announced June 2022.
-
Framing Algorithmic Recourse for Anomaly Detection
Authors:
Debanjan Datta,
Feng Chen,
Naren Ramakrishnan
Abstract:
The problem of algorithmic recourse has been explored for supervised machine learning models, to provide more interpretable, transparent and robust outcomes from decision support systems. An unexplored area is that of algorithmic recourse for anomaly detection, specifically for tabular data with only discrete feature values. Here the problem is to present a set of counterfactuals that are deemed normal by the underlying anomaly detection model so that applications can utilize this information for explanation purposes or to recommend countermeasures. We present an approach -- Context preserving Algorithmic Recourse for Anomalies in Tabular data (CARAT), that is effective, scalable, and agnostic to the underlying anomaly detection model. CARAT uses a transformer based encoder-decoder model to explain an anomaly by finding features with low likelihood. Subsequently semantically coherent counterfactuals are generated by modifying the highlighted features, using the overall context of features in the anomalous instance(s). Extensive experiments help demonstrate the efficacy of CARAT.
Submitted 28 June, 2022;
originally announced June 2022.
-
Engineering frictional characteristics of MoS2 structure by tuning thickness and morphology- An atomic, electronic structure, and exciton analysis
Authors:
Jatin Kashyap,
Joseph Torsiello,
Yoshiki Kakehi,
Dibakar Datta
Abstract:
We performed atomic and electron dynamics analysis to study the impact of morphological and thickness changes of a MoS2 system on its tribological properties through a diamond tip. We considered 4 cases: variable layers (1-4 layers) and number (2-8 indents), radius (12Å, 16Å, 20Å, 24Å), and pattern of indents (0°, 25°, 30°, 35°, 45°, 60°), resulting in 18 subcases. MD results showed that changing the radius and number of indents was the most effective way to tune the frictional characteristics, and changing the number of layers and the indents' pattern the least effective. Ground state ab-initio study demonstrated that an increase in the number and radius of indents raises the number of stretched bonds. Consequently, the volume covered by the HOMO iso-surface increases, and that of LUMO decreases. That makes higher area/volume available to lose/share electrons, resulting in stronger interlocking between layers and tip. TD-DFT calculation proves the existence of interfacial excitons, resulting in stronger interlocking between the layer's surface and tip despite a contraction in the LUMO iso-surfaces' area/volume. We believe these interlayer excitons result in higher average Z-axis reaction forces (and hence frictional force) for the indents number subcases and lower for indents radius subcases as the number and radius of indents increase.
Submitted 18 June, 2022;
originally announced June 2022.
-
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
Authors:
Aarohi Srivastava,
Abhinav Rastogi,
Abhishek Rao,
Abu Awal Md Shoeb,
Abubakar Abid,
Adam Fisch,
Adam R. Brown,
Adam Santoro,
Aditya Gupta,
Adrià Garriga-Alonso,
Agnieszka Kluska,
Aitor Lewkowycz,
Akshat Agarwal,
Alethea Power,
Alex Ray,
Alex Warstadt,
Alexander W. Kocurek,
Ali Safaya,
Ali Tazarv,
Alice Xiang,
Alicia Parrish,
Allen Nie,
Aman Hussain,
Amanda Askell,
Amanda Dsouza
, et al. (426 additional authors not shown)
Abstract:
Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-future capabilities and limitations of language models. To address this challenge, we introduce the Beyond the Imitation Game benchmark (BIG-bench). BIG-bench currently consists of 204 tasks, contributed by 450 authors across 132 institutions. Task topics are diverse, drawing problems from linguistics, childhood development, math, common-sense reasoning, biology, physics, social bias, software development, and beyond. BIG-bench focuses on tasks that are believed to be beyond the capabilities of current language models. We evaluate the behavior of OpenAI's GPT models, Google-internal dense transformer architectures, and Switch-style sparse transformers on BIG-bench, across model sizes spanning millions to hundreds of billions of parameters. In addition, a team of human expert raters performed all tasks in order to provide a strong baseline. Findings include: model performance and calibration both improve with scale, but are poor in absolute terms (and when compared with rater performance); performance is remarkably similar across model classes, though with benefits from sparsity; tasks that improve gradually and predictably commonly involve a large knowledge or memorization component, whereas tasks that exhibit "breakthrough" behavior at a critical scale often involve multiple steps or components, or brittle metrics; social bias typically increases with scale in settings with ambiguous context, but this can be improved with prompting.
Submitted 12 June, 2023; v1 submitted 9 June, 2022;
originally announced June 2022.
-
Propagating Quantum Microwaves: Towards Applications in Communication and Sensing
Authors:
Mateo Casariego,
Emmanuel Zambrini Cruzeiro,
Stefano Gherardini,
Tasio Gonzalez-Raya,
Rui André,
Gonçalo Frazão,
Giacomo Catto,
Mikko Möttönen,
Debopam Datta,
Klaara Viisanen,
Joonas Govenius,
Mika Prunnila,
Kimmo Tuominen,
Maximilian Reichert,
Michael Renger,
Kirill G. Fedorov,
Frank Deppe,
Harriet van der Vliet,
A. J. Matthews,
Yolanda Fernández,
R. Assouly,
R. Dassonneville,
B. Huard,
Mikel Sanz,
Yasser Omar
Abstract:
The field of propagating quantum microwaves has started to receive considerable attention in the past few years. Motivated at first by the lack of an efficient microwave-to-optical platform that could solve the issue of secure communication between remote superconducting chips, current efforts are starting to reach other areas, from quantum communications to sensing. Here, we attempt to give a state-of-the-art view of the two, pointing at some of the technical and theoretical challenges we need to address, while providing some novel ideas and directions for future research. Hence, the goal of this paper is to provide a bigger picture, and -- we hope -- to inspire new ideas in quantum communications and sensing: from open-air microwave quantum key distribution to direct detection of dark matter, we expect that the recent efforts and results in quantum microwaves will soon attract a wider audience, not only in the academic community, but also in an industrial environment.
Submitted 23 May, 2022;
originally announced May 2022.
-
Developing Potential Energy Surfaces for Graphene-based 2D-3D Interfaces from Modified High Dimensional Neural Networks for Applications in Energy Storage
Authors:
Vidushi Sharma,
Dibakar Datta
Abstract:
Mixed-dimensional heterostructures composed of two-dimensional (2D) and three-dimensional (3D) materials are undisputed next-generation materials for engineered devices due to their changeable properties. The present work computationally investigates the interface between 2D graphene and 3D tin (Sn) systems using the density functional theory (DFT) method. It uses computationally demanding simulation data to develop machine learning (ML) based potential energy surfaces (PES). The approach to developing PES for complex interface systems in light of limited data, and the transferability of such models, are discussed. To develop PES for graphene-tin interface systems, high dimensional neural networks (HDNN) are used that rely on atom-centered symmetry functions to represent structural information. HDNN are modified to train on the total energies of the interface system rather than atomic energies. The performance of the modified HDNN trained on 5789 interface structures of graphene|Sn is tested on new interfaces of the same material pair with varying levels of structural deviation from the training dataset. The root mean squared error (RMSE) for test interfaces falls in the range of 0.01-0.45 eV/atom, depending on the structural deviations from the reference training dataset. By avoiding incorrect decomposition of total energy into atomic energies, the modified HDNN model is shown to obtain higher accuracy and transferability despite a limited dataset. Improved accuracy in the ML-based modeling approach promises a cost-effective means of designing interfaces in heterostructure energy storage systems with higher cycle life and stability.
Submitted 5 June, 2022; v1 submitted 12 March, 2022;
originally announced March 2022.
-
Qubit-compatible substrates with superconducting through-silicon vias
Authors:
K. Grigoras,
N. Yurttagül,
J. -P. Kaikkonen,
E. T. Mannila,
P. Eskelinen,
D. P. Lozano,
H. -X. Li,
M. Rommel,
D. Shiri,
N. Tiencken,
S. Simbierowicz,
A. Ronzani,
J. Hätinen,
D. Datta,
V. Vesterinen,
L. Grönberg,
J. Biznárová,
A. Fadavi Roudsari,
S. Kosen,
A. Osman,
M. Prunnila,
J. Hassel,
J. Bylander,
J. Govenius
Abstract:
We fabricate and characterize superconducting through-silicon vias and electrodes suitable for superconducting quantum processors. We measure internal quality factors of a million for test resonators excited at single-photon levels, on chips with superconducting vias used to stitch ground planes on the front and back sides of the chips. This resonator performance is on par with the state of the art for silicon-based planar solutions, despite the presence of vias. Via stitching of ground planes is an important enabling technology for increasing the physical size of quantum processor chips, and is a first step toward more complex quantum devices with three-dimensional integration.
Submitted 8 November, 2022; v1 submitted 25 January, 2022;
originally announced January 2022.
-
Drug repurposing for SARS-COV-2: A high-throughput molecular docking, molecular dynamics, machine learning, & ab-initio study
Authors:
Jatin Kashyap,
Dibakar Datta
Abstract:
A molecule of dimension 125 nm has caused around 479 Million human infections (80M for the USA) & 6.1 Million human deaths (977,000 for the USA) worldwide and slashed the global economy by US$ 8.5 Trillion over two years. The only other events in recent history that caused comparable human life loss through direct usage (either by (wo)man or nature, respectively) of structure-property relations of 'nano-structures' (either (wo)man-made or nature, respectively) were nuclear bomb attacks of Japanese cities by the USA during World War II and the 1918 Flu Pandemic. This molecule is SARS-CoV-2, which causes a disease known as COVID-19. The high liability cost of the pandemic has incentivized various private, government, and academic entities to work towards finding a cure for these & emerging diseases. As a result, multiple vaccine candidates have been discovered to avoid the infection in the first place. But so far, there has been no success in finding fully effective therapeutic candidates. In this paper, we attempted to provide multiple therapy candidates based upon a sophisticated multi-scale in-silico framework. We have used the following robust framework to screen the ligands: Step-I: high throughput docking, Step-II: molecular dynamics, Step-III: density functional theory analysis. In total, we have analyzed 2.2 Million unique protein binding site/ligand combinations. The proteins were selected based on recent experimental studies. Step-I filtered that number down to 10 ligands/protein based on molecular docking binding energy, further screening down to 2 ligands/protein based on drug-likeness analysis. Additionally, these two ligands/protein were investigated in Step-II with a molecular dynamics based RMSD analysis. It finally suggested three ligands (ZINC1176619532, ZINC517580540, ZINC952855827) attacking different binding sites of the protein (7BV2), which were further analyzed in Step-III.
Submitted 6 April, 2022; v1 submitted 1 January, 2022;
originally announced January 2022.
-
Improving mathematical questioning in teacher training
Authors:
Debajyoti Datta,
Maria Phillips,
James P Bywater,
Jennifer Chiu,
Ginger S. Watson,
Laura E. Barnes,
Donald E Brown
Abstract:
High-fidelity, AI-based simulated classroom systems enable teachers to rehearse effective teaching strategies. However, dialogue-oriented open-ended conversations such as teaching a student about scale factors can be difficult to model. This paper builds a text-based interactive conversational agent to help teachers practice mathematical questioning skills based on the well-known Instructional Quality Assessment. We take a human-centered approach to designing our system, relying on advances in deep learning, uncertainty quantification, and natural language processing while acknowledging the limitations of conversational agents for specific pedagogical needs. Using experts' input directly during the simulation, we demonstrate how conversation success rate and high user satisfaction can be achieved.
Submitted 6 December, 2021; v1 submitted 2 December, 2021;
originally announced December 2021.
-
Evaluation of mathematical questioning strategies using data collected through weak supervision
Authors:
Debajyoti Datta,
Maria Phillips,
James P Bywater,
Jennifer Chiu,
Ginger S. Watson,
Laura E. Barnes,
Donald E Brown
Abstract:
A large body of research demonstrates how teachers' questioning strategies can improve student learning outcomes. However, developing new scenarios is challenging because of the lack of training data for a specific scenario and the costs associated with labeling. This paper presents a high-fidelity, AI-based classroom simulator to help teachers rehearse research-based mathematical questioning skills. Using a human-in-the-loop approach, we collected a high-quality training dataset for a mathematical questioning scenario. Using recent advances in uncertainty quantification, we evaluated our conversational agent for usability and analyzed the practicality of incorporating a human-in-the-loop approach for data collection and system evaluation for a mathematical questioning scenario.
Submitted 2 December, 2021;
originally announced December 2021.
-
Multitask Prompted Training Enables Zero-Shot Task Generalization
Authors:
Victor Sanh,
Albert Webson,
Colin Raffel,
Stephen H. Bach,
Lintang Sutawika,
Zaid Alyafeai,
Antoine Chaffin,
Arnaud Stiegler,
Teven Le Scao,
Arun Raja,
Manan Dey,
M Saiful Bari,
Canwen Xu,
Urmish Thakker,
Shanya Sharma Sharma,
Eliza Szczechla,
Taewoon Kim,
Gunjan Chhablani,
Nihal Nayak,
Debajyoti Datta,
Jonathan Chang,
Mike Tian-Jian Jiang,
Han Wang,
Matteo Manica,
Sheng Shen
, et al. (16 additional authors not shown)
Abstract:
Large language models have recently been shown to attain reasonable zero-shot generalization on a diverse set of tasks (Brown et al., 2020). It has been hypothesized that this is a consequence of implicit multitask learning in language models' pretraining (Radford et al., 2019). Can zero-shot generalization instead be directly induced by explicit multitask learning? To test this question at scale, we develop a system for easily mapping any natural language tasks into a human-readable prompted form. We convert a large set of supervised datasets, each with multiple prompts with diverse wording. These prompted datasets allow for benchmarking the ability of a model to perform completely held-out tasks. We fine-tune a pretrained encoder-decoder model (Raffel et al., 2020; Lester et al., 2021) on this multitask mixture covering a wide variety of tasks. The model attains strong zero-shot performance on several standard datasets, often outperforming models up to 16x its size. Further, our approach attains strong performance on a subset of tasks from the BIG-bench benchmark, outperforming models up to 6x its size. All trained models are available at https://github.com/bigscience-workshop/t-zero and all prompts are available at https://github.com/bigscience-workshop/promptsource.
Submitted 17 March, 2022; v1 submitted 15 October, 2021;
originally announced October 2021.
-
Efficient Computation of Periodic Orbits of Forced Rayleigh Equation in the Framework of Novel Asymptotic Structures
Authors:
Aniruddha Palit,
Dhurjati Prasad Datta,
Santanu Raut
Abstract:
Higher precision efficient computation of period 1 relaxation oscillations of strongly nonlinear and singularly perturbed Rayleigh equations with external periodic forcing is presented. The computations are performed in the context of the conventional renormalization group method (RGM). We demonstrate that although a slightly homotopically modified RGM could generate approximate periodic orbits that agree qualitatively with the exact orbits, the method nevertheless fails to reduce the large quantitative disagreement between the theoretically computed results and the exact numerical orbits. In the second part of the work we present a novel asymptotic analysis incorporating SL(2,R) invariant nonlinear deformation of slower time scales, $t_{n} =\varepsilon^{n}t, \ n\rightarrow\infty, \ \varepsilon<1$, for asymptotic late time $t$, to a nonlinear time $T_{n}=t_{n}\sigma(t_{n})$, where the deformation factor $\sigma(t_{n})>0$ respects some well defined SL(2,R) constraints. Motivations and detailed applications of such nonlinear asymptotic structures are explained in performing very high accuracy ($> 98\%$) computations of relaxation orbits. The existence of an interesting condensation and rarefaction phenomenon in connection with dynamically adjustable scales in the context of a slow-fast dynamical system is explained and verified numerically.
Submitted 21 September, 2021;
originally announced September 2021.
-
Detecting Anomalies Through Contrast in Heterogeneous Data
Authors:
Debanjan Datta,
Sathappan Muthiah,
Naren Ramakrishnan
Abstract:
Detecting anomalies has been a fundamental approach in detecting potentially fraudulent activities. Tasked with detection of illegal timber trade that threatens ecosystems and economies and is associated with other illegal activities, we formulate our problem as one of anomaly detection. Among other challenges, annotations that could assist in building automated systems to detect fraudulent transactions are unavailable for our large-scale trade data with heterogeneous features (categorical and continuous). Modelling the task as unsupervised anomaly detection, we propose a novel model, Contrastive Learning based Heterogeneous Anomaly Detector, to address shortcomings of prior models. Our model uses an asymmetric autoencoder that can effectively handle large arity categorical variables, but avoids assumptions about the structure of data in low-dimensional latent space and is robust to changes to hyper-parameters. The likelihood of data is approximated through an estimator network, which is jointly trained with the autoencoder, using negative sampling. Further, the details and intuition for an effective negative sample generation approach for heterogeneous data are outlined. We provide a qualitative study to showcase the effectiveness of our model in detecting anomalies in timber trade.
Submitted 2 April, 2021;
originally announced April 2021.
-
A Small Survey On Event Detection Using Twitter
Authors:
Debanjan Datta
Abstract:
A small survey on event detection using Twitter. This work first defines the problem statement, and then summarizes and collates the different research works towards solving the problem.
Submitted 30 July, 2022; v1 submitted 8 November, 2020;
originally announced November 2020.
-
Improving Classification through Weak Supervision in Context-specific Conversational Agent Development for Teacher Education
Authors:
Debajyoti Datta,
Maria Phillips,
Jennifer Chiu,
Ginger S. Watson,
James P. Bywater,
Laura Barnes,
Donald Brown
Abstract:
Machine learning techniques applied to the Natural Language Processing (NLP) component of conversational agent development show promising results for improved accuracy and quality of feedback that a conversational agent can provide. Developing an educational scenario specific conversational agent is time consuming, as it requires domain experts to label and annotate noisy data sources such as classroom videos. Previous approaches to modeling annotations have relied on labeling thousands of examples and calculating inter-annotator agreement and majority votes in order to model the necessary scenarios. This method, while proven successful, ignores individual annotator strengths in labeling a data point and under-utilizes examples that do not have a majority vote for labeling. We propose using a multi-task weak supervision method combined with active learning to address these concerns. This approach requires less labeling than traditional methods and shows significant improvements in precision, efficiency, and time requirements over the majority vote method (Ratner 2019). We demonstrate the validity of this method on the Google Jigsaw data set and then propose a scenario to apply this method using the Instructional Quality Assessment (IQA) to define the categories for labeling. We propose using probabilistic modeling of annotator labeling to generate active learning examples to further label the data. Active learning is able to iteratively improve the training performance and accuracy of the original classification model. This approach combines state-of-the-art labeling techniques of weak supervision and active learning to optimize results in the educational domain and could be further used to lessen the data requirements for expanded scenarios within the education domain through transfer learning.
Submitted 23 October, 2020;
originally announced October 2020.
-
Geometry matters: Exploring language examples at the decision boundary
Authors:
Debajyoti Datta,
Shashwat Kumar,
Laura Barnes,
Tom Fletcher
Abstract:
A growing body of recent evidence has highlighted the limitations of natural language processing (NLP) datasets and classifiers. These include the presence of annotation artifacts in datasets, classifiers relying on shallow features like a single word (e.g., if a movie review has the word "romantic", the review tends to be positive), or unnecessary words (e.g., learning a proper noun to classify a movie as positive or negative). The presence of such artifacts has subsequently led to the development of challenging datasets to force the model to generalize better. While a variety of heuristic strategies, such as counterfactual examples and contrast sets, have been proposed, the theoretical justification about what makes these examples difficult for the classifier is often lacking or unclear. In this paper, using tools from information geometry, we propose a theoretical way to quantify the difficulty of an example in NLP. Using our approach, we explore difficult examples for several deep learning architectures. We discover that BERT, CNN and fasttext are all susceptible to word substitutions in high difficulty examples. These classifiers tend to perform poorly on the FIM test set (generated by sampling and perturbing difficult examples), with accuracy dropping below 50%. We replicate our experiments on 5 NLP datasets (YelpReviewPolarity, AGNEWS, SogouNews, YelpReviewFull and Yahoo Answers). On YelpReviewPolarity we observe a correlation coefficient of -0.4 between resilience to perturbations and the difficulty score. Similarly, we observe a correlation of 0.35 between the difficulty score and the empirical success probability of random substitutions. Our approach is simple, architecture agnostic and can be used to study the fragilities of text classification models. All the code used will be made publicly available, including a tool to explore the difficult examples for other datasets.
Submitted 28 October, 2021; v1 submitted 14 October, 2020;
originally announced October 2020.
-
Variation in interface strength of Silicon with surface engineered Ti3C2 MXenes
Authors:
Vidushi Sharma,
Dibakar Datta
Abstract:
Current advancements in battery technologies require electrodes to combine high-performance active materials such as Silicon (Si) with two-dimensional materials such as transition metal carbides (MXenes) for prolonged cycle stability and enhanced electrochemical performance. Moreover, the interface between these materials is central to their successful application. Herein, the interface strength variations between amorphous Si and Ti3C2Tx MXene are determined as the MXene surface functional groups (Tx) are changed, using first-principles calculations. Si is interfaced with three Ti3C2 MXene substrates having surface -OH, mixed -OH and -O, and -F functional groups. Density functional theory (DFT) results reveal that completely hydroxylated Ti3C2 has the highest interface strength of 0.6 J/m2 with amorphous Si. This interface strength value drops as the proportion of surface -O and -F groups increases. Additional analysis of electron redistribution and charge separation across the interface is provided for a complete understanding of the underlying physico-chemical factors affecting the surface chemistry and resultant interface strength values. The presented comprehensive analysis of the interface aims to aid the development of sophisticated MXene-based electrodes through targeted surface engineering.
Submitted 29 November, 2020; v1 submitted 26 September, 2020;
originally announced September 2020.
-
Understanding the Strength of the Selenium -- Graphene Interfaces
Authors:
Vidushi Sharma,
David Mitlin,
Dibakar Datta
Abstract:
We present a comprehensive first-principles Density Functional Theory (DFT) analysis of the interfacial strength and bonding mechanisms between crystalline and amorphous selenium (Se) with graphene (Gr), a promising duo for energy storage applications. Comparative interface analyses are presented on amorphous silicon (Si) with graphene and crystalline Se with aluminum (Al) substrate. The interface strengths of monoclinic Se (0.43 J/m2) and amorphous Si with graphene (0.41 J/m2) are similar in magnitude. While both materials (c-Se, a-Si) are bonded loosely by van der Waals (vdW) forces over graphene, interfacial electron exchange is higher for a-Si/Gr. This is further elaborated by comparing potential energy step and charge transfer (delta q) across the graphene interfaces. The delta q for c-Se/Gr and a-Si/Gr are 0.3119 e-1 and 0.4266 e-1, respectively. However, the interface strength of c-Se on the 3D Al substrate is higher (0.99 J/m2), suggesting stronger adhesion. The amorphous Se with graphene has comparable interface strength (0.34 J/m2), but electron exchange in this system is slightly distinct from monoclinic Se. The electronic characteristics (density of states analysis) and bonding mechanisms are different for monoclinic and amorphous Se with graphene, and they activate graphene via surface charge doping divergently. Our findings highlight the complex electrochemical phenomena in Se interfaced with graphene, which may profoundly differ from their 'free' counterparts.
Submitted 18 September, 2020;
originally announced September 2020.
-
learn2learn: A Library for Meta-Learning Research
Authors:
Sébastien M. R. Arnold,
Praateek Mahajan,
Debajyoti Datta,
Ian Bunner,
Konstantinos Saitas Zarkias
Abstract:
Meta-learning researchers face two fundamental issues in their empirical work: prototyping and reproducibility. Researchers are prone to make mistakes when prototyping new algorithms and tasks because modern meta-learning methods rely on unconventional functionalities of machine learning frameworks. In turn, reproducing existing results becomes a tedious endeavour -- a situation exacerbated by the lack of standardized implementations and benchmarks. As a result, researchers spend inordinate amounts of time on implementing software rather than understanding and developing new ideas.
This manuscript introduces learn2learn, a library for meta-learning research focused on solving those prototyping and reproducibility issues. learn2learn provides low-level routines common across a wide-range of meta-learning techniques (e.g. meta-descent, meta-reinforcement learning, few-shot learning), and builds standardized interfaces to algorithms and benchmarks on top of them. In releasing learn2learn under a free and open source license, we hope to foster a community around standardized software for meta-learning research.
Submitted 27 August, 2020; v1 submitted 27 August, 2020;
originally announced August 2020.
-
NIT-Agartala-NLP-Team at SemEval-2020 Task 8: Building Multimodal Classifiers to tackle Internet Humor
Authors:
Steve Durairaj Swamy,
Shubham Laddha,
Basil Abdussalam,
Debayan Datta,
Anupam Jamatia
Abstract:
The paper describes the systems submitted to SemEval-2020 Task 8: Memotion by the 'NIT-Agartala-NLP-Team'. A dataset of 8879 memes was made available by the task organizers to train and test our models. Our systems include a Logistic Regression baseline, a BiLSTM + Attention-based learner and a transfer learning approach with BERT. For the three sub-tasks A, B and C, we attained ranks 24/33, 11/29 and 15/26, respectively. We highlight our difficulties in harnessing image information as well as some techniques and handcrafted features we employ to overcome these issues. We also discuss various modelling issues and theorize possible solutions and reasons as to why these problems persist.
Submitted 16 May, 2020; v1 submitted 14 May, 2020;
originally announced May 2020.
-
Machine Learning in Materials Modeling -- Fundamentals and the Opportunities in 2D Materials
Authors:
Shreeja Das,
Hansraj Pegu,
Kisor Sahu,
Ameeya Kumar Nayak,
Seeram Ramakrishna,
Dibakar Datta,
Soumya Swayamjyoti
Abstract:
The application of machine learning in materials presents a unique challenge of dealing with scarce and varied materials data - both experimental and theoretical. Nevertheless, several state-of-the-art machine learning models for materials have been successfully developed to predict material properties for various applications such as materials for photovoltaic cells, thermoelectric materials, dielectrics, materials for batteries, fuel cells, etc. The setup of comprehensive materials databases, and openly accessible algorithm frameworks have also spurred the usage of machine learning for solving some of the most pressing problems in materials science. Some such recent implementations are discussed in this book chapter. A multitude of two-dimensional (2D) materials exist with the potential to replace the conventional materials for energy storage and nanodevices. The challenges faced in designing batteries and how machine learning tools can help in screening and narrowing down on the best composition, as well as the synthesis of air-stable 2D materials, are also discussed.
Submitted 13 January, 2020;
originally announced January 2020.
-
Comprehensive understanding of water-driven graphene wrinkle life-cycle towards applications in flexible electronics: A computational study
Authors:
Jatin Kashyap,
Eui-Hyeok Yang,
Dibakar Datta
Abstract:
The presence of wrinkles in Graphene Nanoribbons (GNR) and other two-dimensional (2D) materials significantly alters their mechanical, electronic, and optical properties, which can be either beneficial or detrimental. Experimentally, it has been observed that during the commonly used growth process of GNR, water molecules, sourced from ambient humidity, can diffuse in between the GNR and the substrate. The water diffusion causes wrinkle formation in GNR, which influences its properties. Furthermore, the diffused water eventually dries, altering not only the geometry of Wrinkled Graphene Nanoribbons (WGNR) but also their features. Computational analysis can provide an atomistic-level understanding of these phenomena. Therefore, in this work, Molecular Dynamics (MD) simulations are performed to model the water diffusion and evaporation in between GNR and its substrate, and their effect on wrinkle formation and dynamics. Additionally, Density Functional Theory (DFT)-based analysis is used to characterize the difference in the electronic structure of WGNR caused by the change in wrinkle geometry. Our study reveals that the initially distributed wrinkles tend to coalesce to form a localized wrinkle whose configuration depends on the initial wrinkle geometry and the amount of diffused water. The wrinkle configuration changes upon drying, while it remains static until the complete drying. The movement of the localized wrinkle is a combination of three fundamental modes: bending, buckling, and sliding. The stress analysis reveals that the maximum stress is at the base of the wrinkle, and its magnitude is always below the plasticity limit. The DFT results provide insight into the potential of using the wrinkles to control the direction of electron flow for applications in flexible electronics.
Submitted 2 January, 2020;
originally announced January 2020.
-
Controlled edge dependent stacking of WS2-WS2 Homo- and WS2-WSe2 Hetero-structures: A Computational Study
Authors:
Kamalika Ghatak,
Kyung Nam Kang,
Eui-Hyeok Yang,
Dibakar Datta
Abstract:
Transition Metal Dichalcogenides (TMDs) are among the most studied two-dimensional materials of the last 5-10 years due to their extremely interesting layer-dependent properties. Despite the vast body of research on TMDs, the complex relationship between their electrochemical and physical properties makes them the subject of further research. Our main objective is to provide a better insight into the electronic structure of TMDs. This will help us better understand the stability of the bilayer post-growth homo/hetero products based on the various edge terminations and different stackings of the two layers. In this regard, two Tungsten (W) based non-periodic chalcogenide flakes (sulfides and selenides) were considered. An in-depth analysis of their different edge terminations and stacking arrangements was performed via the Density Functional Theory method using the VASP software. Our findings indicate the preference of chalcogenide (c-) terminated structures over the metal (m-) terminated structures for both homo and hetero layers, and thus strongly suggest the nonexistence of the m-terminated TMD bilayer products.
Submitted 25 November, 2019;
originally announced November 2019.
-
Accelerating Least Squares Imaging Using Deep Learning Techniques
Authors:
Janaki Vamaraju,
Jeremy Vila,
Mauricio Araya-Polo,
Debanjan Datta,
Mohamed Sidahmed,
Mrinal Sen
Abstract:
Wave equation techniques have been an integral part of geophysical imaging workflows to investigate the Earth's subsurface. Least-squares reverse time migration (LSRTM) is a linearized inversion problem that iteratively minimizes a misfit functional as a function of the model perturbation. The success of the inversion largely depends on our ability to handle large systems of equations given the massive computation costs. The size of the system almost exponentially increases with the demand for higher resolution images in complicated subsurface media. We propose an unsupervised deep learning approach that leverages the existing physics-based models and machine learning optimizers to achieve more accurate and cheaper solutions. We compare different optimizers and demonstrate their efficacy in mitigating imaging artifacts. Further, minimizing the Huber loss with mini-batch gradients and Adam optimizer is not only less memory-intensive but is also more robust. Our empirical results on synthetic, densely sampled datasets suggest faster convergence to an accurate LSRTM result than a traditional approach.
Submitted 9 December, 2019; v1 submitted 14 November, 2019;
originally announced November 2019.
-
Turbostratic Orientations, Water Confinement and Ductile-Brittle Fracture in Bi-layer Graphene
Authors:
Nil Dhankecha,
Vidushi Sharma,
Dibakar Datta
Abstract:
Bi-layer graphene (BLG) can be a cheaper and more stable alternative to graphene in several applications. With its mechanical strength being almost equivalent to graphene, BLG also brings advanced electronic and optical properties to the table. Furthermore, entrapment of water in graphene-based nano-channels and devices has been a recent point of interest for several applications ranging from energy to bio-physics. Therefore, it is crucial to study the overall mechanical strength of such structures in order to prevent system failures in future applications. In the present work, Molecular Dynamics simulations have been used to study crack propagation in BLG with different orientations between the layers. A major focus is on analyzing how the angular orientation between the layers affects the horizontal and vertical crack propagation in individual layers of graphene. The study has been extended to BLG with confined water in interfaces. The overall strength of graphene sheets in contact with water has been determined, and prominent regional conditions for crack initiation are pointed out. It was seen that in the presence of water, graphene deviated from its characteristic brittle failure and exhibited a ductile fracture mechanism. The origin of cracks in graphene was located at the region where the density of water dropped near the graphene surface, suggesting that the presence of hydroxyl groups decelerates crack formation and propagation in strained graphene.
Submitted 6 July, 2022; v1 submitted 30 October, 2019;
originally announced October 2019.
-
Indoor Information Retrieval using Lifelog Data
Authors:
Deepanwita Datta
Abstract:
Studying human behaviour through lifelogging has seen an increase in attention from researchers over the past decade. The opportunities that lifelogging offers are based on the fact that a lifelog, as a "black box" of our lives, offers rich contextual information, which has been an Achilles heel of information discovery. While lifelog data has been put to use in various contexts, its application to indoor environments remains unexplored. In this proposal, I plan to design a method that enables us to capture and record indoor lifelog data of a person's life in order to facilitate healthcare systems, emergency response, item tracking, etc. To this end, we aim to build an Indoor Information Retrieval system that can be queried with natural language queries over lifelog data. Judicious use of lifelog data for indoor applications may enable us to solve very fundamental but unavoidable problems of our daily life. Analysis of lifelog data coupled with Information Retrieval is not only a promising research topic; to the best of our knowledge, its indoor application, especially for healthcare and lost-item tracking, would also be an innovative research idea.
Submitted 17 October, 2019;
originally announced October 2019.
-
Novel Excitation of local fractional dynamics
Authors:
Dhurjati Prasad Datta,
Soma Sarkar,
Santanu Raut
Abstract:
The question of a possible excitation and emergence of fractional type dynamics, as a more realistic framework for understanding emergence of complex systems, directly from a conventional integral order dynamics, in the form of a continuous transition or deformation, is of significant interest. Although there has been a lot of activity in nonlinear dynamical systems, fractional or not, the above question appears yet to be addressed systematically in the current literature. The present work may be considered a step forward in this direction. Based on a novel concept of asymptotic duality structure, we present here an extended analytical framework that would provide a scenario for realizing the above stated continuous deformation of integral order dynamics to a local fractional order dynamics on a fractal and fractional space. The related concepts of self dual and strictly dual asymptotics are introduced and their relevance in connection with smooth and nonsmooth deformation of the real line is pointed out. The relationship of the duality structure and renormalization group is examined. The ordinary derivation operator is shown to be invariant under this duality enabled renormalization group transformation, leading thereby to a natural realization of local fractional type derivative in a fractal space. As an application we discuss the linear wave equation in one and two dimensions and show how the underlying integral order wave equation could be deformed and renormalized suitably to yield meaningful results for vibration of a fractal string or wave propagation in a region with fractal boundary.
Submitted 21 September, 2021; v1 submitted 26 March, 2019;
originally announced April 2019.
-
The Inherent Behavior of Graphene Flakes in Water: A Molecular Dynamics Study
Authors:
Priyanka Solanky,
Vidushi Sharma,
Kamalika Ghatak,
Jatin Kashyap,
Dibakar Datta
Abstract:
Graphene-water interaction has been under scrutiny ever since the discovery of graphene and the realization of its exceptional properties. Several computational and experimental reports have tried to look into the interactions involved; however, none of them addresses the issue in its entirety. We have tested the inherent hydrophobic behavior of a small graphene flake in a water droplet by means of MD simulations. The analysis has been extended to multiple graphene flakes in water and their respective size-dependent responses to the water droplet. Graphene retreats from the water droplet to encapsulate it from the surface. This response was highly dependent upon graphene size with respect to water content. Additionally, we report self-assembly of multilayered graphene in water by means of MD simulations, an observation which can be utilized by experimentalists to synthesize such structures in a cost-effective way. To fully comprehend graphene behavior in water, graphene deformation was analyzed in the presence of water molecules. It was noticed that graphene wrinkled to wrap around water molecules and resisted complete failure, which is seen in the case of a sole graphene sheet. Our work will not only address the question of whether graphene is hydrophobic or hydrophilic but also provide insight into the behavior of the graphene surface and its mobility when exposed to water, which can be exploited in numerous applications.
Submitted 18 October, 2018;
originally announced November 2018.
-
Effect of Cobalt Content on the Electrochemical Properties and Structural Stability of NCA Type Cathode Materials
Authors:
Kamalika Ghatak,
Swastik Basu,
Tridip Das,
Hemant Kumar,
Dibakar Datta
Abstract:
At present, the most common type of cathode materials, NCA [Li_(1-x)Ni_(0.80)Co_(0.15)Al_(0.05)O_(2), x = 0 to 1], have a very high concentration of cobalt. Since cobalt is toxic and expensive, the existing design of cathode materials is neither cost-effective nor environmentally benign. We have performed density functional theory (DFT) calculations to investigate electrochemical, electronic, and structural properties of four types of NCA cathode materials with the simultaneous decrease in Co content along with the increase in Ni content. Our results show that even if the cobalt concentration is significantly decreased from 16.70 % (NCA_I) to 4.20 % (NCA_IV), variation in intercalation potential and specific capacity is not significant. For example, in case of 50% Li concentration, the voltage drop is only ~17% while the change in specific capacity is negligible. Moreover, we have also explored the influence of sodium doping in the intercalation site on the electrochemical, electronic, and structural properties. By considering two extreme cases of NCAs (i.e., with highest and lowest Co content: NCA_I and NCA_IV respectively), we have demonstrated the importance of Na doping from the structural and electronic point of view. Our results provide insight into the design of environmentally benign, low-cost cathode materials with reduced cobalt concentration.
Submitted 20 April, 2018;
originally announced April 2018.
-
Amorphous Germanium as a Promising Anode Material for Sodium Ion Batteries: A First Principle Study
Authors:
Vidushi Sharma,
Kamalika Ghatak,
Dibakar Datta
Abstract:
The abundance of Sodium (Na), its low cost, and its low reduction potential provide an inexpensive, safe, and environmentally benign alternative to Lithium Ion Batteries (LIBs). A significant challenge in advancing Sodium Ion Battery (NIB) technologies lies in finding better electrode materials. Experimental investigations have revealed the potential of Germanium (Ge) as a suitable anode material for NIBs. However, a systematic atomistic study is necessary to understand the fundamental aspects of capacity-voltage correlation, microstructural changes of Ge, as well as diffusion kinetics. We, therefore, performed Density Functional Theory (DFT) and Ab Initio Molecular Dynamics (AIMD) simulations to investigate the sodiation-desodiation kinetics in the Germanium-Sodium system (Na64Ge64). We analyzed the intercalation potential and capacity correlation for intermediate equilibrium structures and compared our data with the experimental results. The effect of sodiation on inter-atomic distances within the Na-Ge system is analyzed by means of the Pair Correlation Function (PCF). This provides insight into possible microstructural changes taking place during sodiation of amorphous Ge (a-Ge). We further investigated the diffusivity of sodium in the a-Ge electrode material and analyzed the volume expansion trend for the Na64Ge64 electrode system. Our computational results provide fundamental insight at the atomic scale and can help experimentalists design Ge-based NIBs for real-life applications.
Submitted 1 April, 2018;
originally announced April 2018.
-
Atomistic study of hardening mechanism in Al-Cu nanostructure
Authors:
Satyajit Mojumder,
Tawfiqur Rakib,
Mohammad Motalab,
Dibakar Datta
Abstract:
Nanostructures have immense potential to supplant traditional metallic structures as they show enhanced mechanical properties through strain hardening. In this paper, the effect of grain size on the hardening mechanism of Al-Cu nanostructures is elucidated by molecular dynamics simulation. Al-Cu (50-54% Cu by weight) nanostructures having an average grain size of 4.57 to 7.26 nm are investigated in tensile simulations at different strain rates using an embedded atom method (EAM) potential at temperatures of 50-500 K. It is found that the failure mechanism of the nanostructure is governed by temperature, grain size, and strain rate. At the high temperatures of 300-500 K, the failure strength of the Al-Cu nanostructure increases with decreasing average grain size, following the Hall-Petch relation. Dislocation motions are hindered significantly when the grain size is decreased, which plays a vital role in the hardening of the nanostructure. The failure is always found to initiate at a particular Al grain due to its weak link and propagates through grain boundary (GB) sliding, diffusion, dislocation nucleation and propagation. We also visualize the dislocation density at different grain sizes to show how dislocations affect the material properties at the nanoscale. These results will further aid investigation of the deformation mechanism of nanostructures.
Submitted 23 December, 2017; v1 submitted 26 July, 2017;
originally announced July 2017.
-
Effect of Decreasing Cobalt Content on the Electrochemical Properties and Structural Stability of Li_(1-x)Ni_(y)Co_(z)Al_(0.05)O_(2) Type Cathode Materials
Authors:
Kamalika Ghatak,
Hemant Kumar,
Siva Nadimpalli,
Dibakar Datta
Abstract:
In Lithium ion batteries (LIBs), proper design of cathode materials influences their intercalation behavior, overall cost, structural stability, and impact on the environment. At present, the most common type of cathode material, NCA, has a very high cobalt concentration. Since cobalt is toxic and expensive, the existing design of cathode materials is neither cost-effective nor environmentally benign. However, these immensely important issues have not yet been properly addressed. Therefore, we have performed density functional theory (DFT) calculations to investigate three types of NCA cathode materials: NCA_(Co=0.15), NCA_(Co=0.10), and NCA_(Co=0.05). Our results show that even if the cobalt concentration is significantly decreased from NCA_(Co=0.15) to NCA_(Co=0.05), variation in intercalation potential and specific capacity is negligible. For example, in the case of 50% Li concentration, the voltage drop is ~0.12V while the change in specific capacity is negligible. Moreover, the decrease in cobalt concentration does not influence the structural stability. We have also explored the influence of sodium doping on the electrochemical and structural properties of these three structures. Our results provide insight into the design of environmentally benign, low-cost cathode materials with reduced cobalt concentration.
Submitted 1 April, 2018; v1 submitted 26 April, 2017;
originally announced April 2017.
-
Multimedia Channel Allocation in Cognitive Radio Networks using FDM-FDMA and OFDM-FDMA
Authors:
Ansuman Bhattacharya,
Rabindranath Ghosh,
Koushik Sinha,
Debasish Datta,
Bhabani P. Sinha
Abstract:
In conventional wireless systems, multimedia communication cannot be achieved with the desired Quality of Service unless a contiguous frequency band at least as wide as the required bandwidth is obtained. We propose a novel channel allocation technique for a Cognitive Radio Network that overcomes this limitation by using several non-contiguous channels, each narrower than the required bandwidth, whose total width is at least the required bandwidth. We present algorithms for channel sensing, channel reservation and channel deallocation, along with transmission and reception protocols, in two different implementations based on FDM-FDMA and OFDM-FDMA techniques. Simulation results for both implementations show that the proposed technique outperforms the existing first-fit and best-fit~\cite{b109, b110} allocation techniques in terms of the average number of attempts needed to acquire the necessary number of channels, for all traffic situations ranging from light to extremely heavy traffic. Further, the proposed technique can allocate the required number of channels in less than one second with FDM-FDMA (4.5 seconds with OFDM-FDMA) even at 96% traffic load, while the first-fit and best-fit techniques fail to allocate any channel in such situations.
Submitted 12 March, 2016;
originally announced March 2016.
-
Duality Structure, Asymptotic analysis and Emergent Fractal sets
Authors:
Dhurjati Prasad Datta,
Soma Sarkar
Abstract:
A new, extended nonlinear framework of ordinary real analysis, incorporating a novel concept of duality structure, is presented together with its applications to various nonlinear dynamical problems. The duality structure is an asymptotic property that should affect the late-time asymptotic behaviour of a nonlinear dynamical system in a nontrivial way, leading naturally to signatures generic to a complex system. We argue that the present formalism would offer a natural framework for understanding the abundance of complex systems in natural, biological, financial and related problems. We show that the power-law attenuation of a dispersive, lossy wave equation, conventionally deduced from fractional calculus techniques, could actually arise from the present asymptotic duality structure. Differentiability on a Cantor-type fractal set is also formulated.
Submitted 26 March, 2019; v1 submitted 11 January, 2016;
originally announced February 2016.
-
Comparative Study of Homotopy Analysis and Renormalization Group Methods on Rayleigh and Van der Pol Equations
Authors:
Aniruddha Palit,
Dhurjati Prasad Datta
Abstract:
A comparative study of the Homotopy Analysis method and an improved Renormalization Group method is presented in the context of the Rayleigh and the Van der Pol equations. Efficient approximate formulae for the amplitudes $a(\varepsilon)$ of the limit cycles of both oscillators are derived as functions of the nonlinearity parameter $\varepsilon$. The improvement in the Renormalization Group analysis is achieved by invoking the idea of nonlinear time, which should have significance in a nonlinear system. Good approximate plots of the limit cycles of these oscillators are also presented within this framework.
Submitted 31 July, 2015; v1 submitted 21 May, 2014;
originally announced May 2014.