-
Nemotron-4 340B Technical Report
Authors:
Nvidia,
Bo Adler,
Niket Agarwal,
Ashwath Aithal,
Dong H. Anh,
Pallab Bhattacharya,
Annika Brundyn,
Jared Casper,
Bryan Catanzaro,
Sharon Clay,
Jonathan Cohen,
Sirshak Das,
Ayush Dattagupta,
Olivier Delalleau,
Leon Derczynski,
Yi Dong,
Daniel Egert,
Ellie Evans,
Aleksander Ficek,
Denys Fridman,
Shaona Ghosh,
Boris Ginsburg,
Igor Gitman,
Tomasz Grzegorzek
, et al. (58 additional authors not shown)
Abstract:
We release the Nemotron-4 340B model family, including Nemotron-4-340B-Base, Nemotron-4-340B-Instruct, and Nemotron-4-340B-Reward. Our models are open access under the NVIDIA Open Model License Agreement, a permissive model license that allows distribution, modification, and use of the models and their outputs. These models perform competitively with open access models on a wide range of evaluation benchmarks, and were sized to fit on a single DGX H100 with 8 GPUs when deployed in FP8 precision. We believe that the community can benefit from these models in various research studies and commercial applications, especially for generating synthetic data to train smaller language models. Notably, over 98% of data used in our model alignment process is synthetically generated, showcasing the effectiveness of these models in generating synthetic data. To further support open research and facilitate model development, we are also open-sourcing the synthetic data generation pipeline used in our model alignment process.
Submitted 17 June, 2024;
originally announced June 2024.
-
From traces to measures: Large language models as a tool for psychological measurement from text
Authors:
Joseph J. P. Simons,
Wong Liang Ze,
Prasanta Bhattacharya,
Brandon Siyuan Loh,
Wei Gao
Abstract:
Digital trace data provide potentially valuable resources for understanding human behaviour, but their value has been limited by issues of unclear measurement. The growth of large language models provides an opportunity to address this limitation in the case of text data. Specifically, recognizing cases where their responses are a form of psychological measurement (the use of observable indicators to assess an underlying construct) allows existing measures and accuracy assessment frameworks from psychology to be re-purposed to use with large language models. Based on this, we offer four methodological recommendations for using these models to quantify text features: (1) identify the target of measurement, (2) use multiple prompts, (3) assess internal consistency, and (4) treat evaluation metrics (such as human annotations) as expected correlates rather than direct ground-truth measures. Additionally, we provide a workflow for implementing this approach.
Submitted 12 May, 2024;
originally announced May 2024.
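Illustrative sketch (not from the paper): recommendations (2) and (3) in the abstract above, scoring each text with multiple prompts and then assessing internal consistency, could be checked with a standard reliability coefficient. The example below assumes Cronbach's alpha as that coefficient, which the abstract does not name; the function name `cronbach_alpha` and the input layout (one row per text, one column per prompt) are hypothetical.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a table of scores.

    scores: list of rows, one row per text; each row holds one numeric
    score per prompt (hypothetical layout, assumed for illustration).
    """
    n_prompts = len(scores[0])
    if n_prompts < 2 or len(scores) < 2:
        raise ValueError("need at least two prompts and two texts")
    # Variance of each prompt's scores across texts.
    item_vars = [variance([row[j] for row in scores]) for j in range(n_prompts)]
    # Variance of the per-text total score.
    total_var = variance([sum(row) for row in scores])
    if total_var == 0:
        raise ValueError("total score has zero variance")
    return (n_prompts / (n_prompts - 1)) * (1 - sum(item_vars) / total_var)


# Three prompts that agree perfectly on three texts.
example = [[1, 1, 1], [2, 2, 2], [3, 3, 3]]
alpha = cronbach_alpha(example)
```

Values near 1 would suggest the prompts track the same underlying construct; what threshold counts as acceptable is a judgment the abstract does not specify.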
-
Towards Universal Performance Modeling for Machine Learning Training on Multi-GPU Platforms
Authors:
Zhongyi Lin,
Ning Sun,
Pallab Bhattacharya,
Xizhou Feng,
Louis Feng,
John D. Owens
Abstract:
Characterizing and predicting the training performance of modern machine learning (ML) workloads on compute systems with compute and communication spread between CPUs, GPUs, and network devices is not only the key to optimization and planning but also a complex goal to achieve. The primary challenges include the complexity of synchronization and load balancing between CPUs and GPUs, the variance in input data distribution, and the use of different communication devices and topologies (e.g., NVLink, PCIe, network cards) that connect multiple compute devices, coupled with the desire for flexible training configurations. Built on top of our prior work for single-GPU platforms, we address these challenges and enable multi-GPU performance modeling by incorporating (1) data-distribution-aware performance models for embedding table lookup, and (2) data movement prediction of communication collectives, into our upgraded performance modeling pipeline equipped with inter- and intra-rank synchronization for ML workloads trained on multi-GPU platforms. Beyond accurately predicting the per-iteration training time of DLRM models with random configurations with a geomean error of 5.21% on two multi-GPU platforms, our prediction pipeline generalizes well to other types of ML workloads, such as Transformer-based NLP models, with a geomean error of 3.00%. Moreover, even without actually running ML workloads like DLRMs on the hardware, it is capable of generating insights such as quickly selecting the fastest embedding table sharding configuration (with a success rate of 85%).
Submitted 27 April, 2024; v1 submitted 19 April, 2024;
originally announced April 2024.
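Illustrative sketch (not from the paper): the abstract above reports accuracy as a "geomean error". One plausible reading, assumed here and not confirmed by the abstract, is the geometric mean of the absolute relative errors between predicted and measured per-iteration times. The function name and the sample numbers are hypothetical.

```python
import math


def geomean_relative_error(predicted, measured):
    """Geometric mean of |predicted - measured| / measured.

    Assumed definition for illustration only; the paper may define its
    geomean error differently.
    """
    if len(predicted) != len(measured) or not predicted:
        raise ValueError("inputs must be non-empty and of equal length")
    errors = [abs(p - m) / m for p, m in zip(predicted, measured)]
    if any(e == 0 for e in errors):
        # log(0) is undefined, so an exact prediction needs special handling.
        raise ValueError("geometric mean is undefined for a zero error")
    return math.exp(sum(math.log(e) for e in errors) / len(errors))


# Made-up iteration times in milliseconds: each prediction is off by 10%.
error = geomean_relative_error([110.0, 90.0], [100.0, 100.0])
```

Compared with an arithmetic mean, a geometric mean is less dominated by a few badly mispredicted configurations, which may be why it is a common choice for summarizing error across many workloads.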
-
New infinite families in the stable homotopy groups of spheres
Authors:
Prasit Bhattacharya,
Irina Bobkova,
J. D. Quigley
Abstract:
We identify seven new $192$-periodic infinite families of elements in the $2$-primary stable homotopy groups of spheres. Although their Hurewicz image is trivial for topological modular forms, they remain nontrivial after $\mathrm{T}(2)$- as well as $\mathrm{K}(2)$-localization. We also obtain new information about $2$-torsion and $2$-divisibility of some of the previously known $192$-periodic infinite families in the stable stems.
Submitted 9 May, 2024; v1 submitted 15 April, 2024;
originally announced April 2024.
-
Numerical simulation of charging up, accumulation of space charge and formation of discharges
Authors:
Purba Bhattacharya,
Promita Roy,
Tanay Dey,
Jaydeep Datta,
Prasant K. Rout,
Nayana Majumdar,
Supratik Mukhopadhyay
Abstract:
Aging and stability of gaseous ionization detectors are intricately related to charging up, accumulation of space charge, and formation of discharges. All these phenomena, in turn, depend on the dynamics of charged particles within the device. Because of the large number of particles involved and their complex interactions, the dynamic processes of generation and loss of charged particles, and their transport within the detector volume, are extremely expensive to simulate numerically. In this work, we propose and evaluate possible algorithms and approaches that show some promise in relation to the above-mentioned problems. Several important ionization detectors having parallel plate configurations, such as GEM, Micromegas, RPCs and THGEMs, are considered for this purpose. Information related to primary ionization is obtained from HEED, while all the transport properties are evaluated using MAGBOLTZ. The transport dynamics have been followed using two different approaches. In one, a particle description using the neBEM-Garfield++ combination has been used. For this purpose, the neBEM solver has been significantly improved such that perturbations due to the charged particles present within the device are considered while estimating the electric field. In the other approach, the transport is simulated following a hydrodynamic model using COMSOL, during which the electric field is also provided by COMSOL, where it is easy to set up space charge effects. A comparison between these possible approaches will be presented. The effect of different simulation parameters will also be demonstrated using simple examples.
Submitted 5 March, 2024;
originally announced March 2024.
-
Exploring Large Language Models for Code Explanation
Authors:
Paheli Bhattacharya,
Manojit Chakraborty,
Kartheek N S N Palepu,
Vikas Pandey,
Ishan Dindorkar,
Rakesh Rajpurohit,
Rishabh Gupta
Abstract:
Automating code documentation through explanatory text can prove highly beneficial in code understanding. Large Language Models (LLMs) have made remarkable strides in Natural Language Processing, especially within software engineering tasks such as code generation and code summarization. This study specifically delves into the task of generating natural-language summaries for code snippets, using various LLMs. The findings indicate that Code LLMs outperform their generic counterparts, and zero-shot methods yield superior results when dealing with datasets with dissimilar distributions between training and testing sets.
Submitted 25 October, 2023;
originally announced October 2023.
-
Enhancing Stance Classification with Quantified Moral Foundations
Authors:
Hong Zhang,
Prasanta Bhattacharya,
Wei Gao,
Liang Ze Wong,
Brandon Siyuan Loh,
Joseph J. P. Simons,
Jisun An
Abstract:
This study enhances stance detection on social media by incorporating deeper psychological attributes, specifically individuals' moral foundations. These theoretically-derived dimensions aim to provide a comprehensive profile of an individual's moral concerns which, in recent work, has been linked to behaviour in a range of domains, including society, politics, health, and the environment. In this paper, we investigate how moral foundation dimensions can contribute to predicting an individual's stance on a given target. Specifically, we incorporate moral foundation features extracted from text, along with message semantic features, to classify stances at both message and user levels across a range of targets and models. Our preliminary results suggest that encoding moral foundations can enhance the performance of stance detection tasks and help illuminate the associations between specific moral foundations and online stances on target topics. The results highlight the importance of considering deeper psychological attributes in stance analysis and underscore the role of moral foundations in guiding online social behavior.
Submitted 15 October, 2023;
originally announced October 2023.
-
On the Steenrod module structure of $\mathbb{R}$-motivic Spanier-Whitehead duals
Authors:
Prasit Bhattacharya,
Bertrand J. Guillou,
Ang Li
Abstract:
The $\mathbb{R}$-motivic cohomology of an $\mathbb{R}$-motivic spectrum is a module over the $\mathbb{R}$-motivic Steenrod algebra $\mathcal{A}^{\mathbb{R}}$. In this paper, we describe how to recover the $\mathbb{R}$-motivic cohomology of the Spanier-Whitehead dual $\mathrm{DX}$ of an $\mathbb{R}$-motivic finite complex $\mathrm{X}$, as an $\mathcal{A}^{\mathbb{R}}$-module, given the $\mathcal{A}^{\mathbb{R}}$-module structure on the cohomology of $\mathrm{X}$. As an application, we show that 16 out of 128 different $\mathcal{A}^{\mathbb{R}}$-module structures on $\mathcal{A}^{\mathbb{R}}(1):= \langle \mathrm{Sq}^1, \mathrm{Sq}^2 \rangle$ are self-dual.
Submitted 18 October, 2023; v1 submitted 27 September, 2023;
originally announced September 2023.
-
Estimating Policy Effects in a Social Network with Independent Set Sampling
Authors:
Eugene Ang,
Prasanta Bhattacharya,
Andrew Lim
Abstract:
Evaluating the impact of policy interventions on respondents who are embedded in a social network is often challenging due to the presence of network interference within the treatment groups, as well as between treatment and non-treatment groups throughout the network. In this paper, we propose a modeling strategy that combines existing work on stochastic actor-oriented models (SAOM) with a novel network sampling method based on the identification of independent sets. By assigning respondents from an independent set to the treatment, we are able to block any spillover of the treatment and network influence, thereby allowing us to isolate the direct effect of the treatment from the indirect network-induced effects, in the immediate term. As a result, our method allows for the estimation of both the \textit{direct} as well as the \textit{net effect} of a chosen policy intervention, in the presence of network effects in the population. We perform a comparative simulation analysis to show that our proposed sampling technique leads to distinct direct and net effects of the policy, as well as significant network effects driven by policy-linked homophily. This study highlights the importance of network sampling techniques in improving policy evaluation studies and has the potential to help researchers and policymakers with better planning, designing, and anticipating policy responses in a networked society.
Submitted 25 February, 2024; v1 submitted 25 June, 2023;
originally announced June 2023.
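Illustrative sketch (not from the paper): the abstract above assigns treatment to respondents drawn from an independent set, meaning a set of nodes in which no two are directly connected, so that no treated respondent is tied to another treated respondent. The example below uses a simple greedy heuristic (lowest degree first) to build one maximal independent set. This heuristic is an assumption chosen for brevity; the abstract does not say how the authors identify their sets. The function name and the toy network are hypothetical.

```python
def greedy_independent_set(adjacency):
    """Return a maximal independent set of an undirected graph.

    adjacency: dict mapping each node to the set of its neighbours.
    Nodes are visited in order of increasing degree (ties broken by name),
    a common greedy heuristic rather than the paper's own procedure.
    """
    chosen = set()
    blocked = set()
    for node in sorted(adjacency, key=lambda n: (len(adjacency[n]), n)):
        if node in blocked:
            continue
        chosen.add(node)
        # The node and all of its neighbours can no longer be selected.
        blocked.add(node)
        blocked.update(adjacency[node])
    return chosen


# Toy path network a - b - c - d.
network = {
    "a": {"b"},
    "b": {"a", "c"},
    "c": {"b", "d"},
    "d": {"c"},
}
treated = greedy_independent_set(network)
```

On this path network the treated set is `{"a", "d"}`: neither respondent is tied to the other, so any effect observed on `b` or `c` would arrive only through network ties.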
-
Equivariant orientation of vector bundles over disconnected base spaces
Authors:
Prasit Bhattacharya,
Foling Zou
Abstract:
In this paper, we view the equivariant orientation theory of equivariant vector bundles from the lenses of equivariant Picard spectra. This viewpoint allows us to identify, for a finite group $\mathrm{G}$, a precise condition under which an $\mathrm{R}$-orientation of a $\mathrm{G}$-equivariant vector bundle is encoded by a Thom class. Consequently, we are able to construct a generalization of the first Stiefel$-$Whitney class of a "homogeneous" $\mathrm{G}$-equivariant bundle with respect to an $\mathbb{E}_\infty^{\mathrm{G}}$-ring spectrum $\mathrm{R}$. As an application, we show that the $2$-fold direct sum of any homogeneous bundle is $\mathrm{H}\underline{\mathcal{A}}_{\mathrm{G}}$-orientable, where $\underline{\mathcal{A}}_{\mathrm{G}}$ is the Burnside Mackey functor. We notice that $\mathrm{H}\underline{\mathcal{A}}_{\mathrm{G}}$-orientability is equivalent to $\mathrm{H}\underline{\mathbb{Z}}$-orientability when the order of $\mathrm{G}$ is odd. When the order of $\mathrm{G}$ is even, we show that a $\mathrm{G}$-equivariant analog of the tautological line bundle over $\mathbb{RP}^\infty$ is $\mathrm{H}\underline{\mathbb{Z}}$-orientable but not $\mathrm{H}\underline{\mathcal{A}}_{\mathrm{G}}$-orientable.
Submitted 29 March, 2023; v1 submitted 17 March, 2023;
originally announced March 2023.
-
The structure of the v_2-local algebraic tmf resolution
Authors:
Mark Behrens,
Prasit Bhattacharya,
Dominic Culver
Abstract:
We give a complete description of the E_1-term of the v_2-local as well as g-local algebraic tmf resolution.
Submitted 26 January, 2023;
originally announced January 2023.
-
FlexShard: Flexible Sharding for Industry-Scale Sequence Recommendation Models
Authors:
Geet Sethi,
Pallab Bhattacharya,
Dhruv Choudhary,
Carole-Jean Wu,
Christos Kozyrakis
Abstract:
Sequence-based deep learning recommendation models (DLRMs) are an emerging class of DLRMs showing great improvements over their prior sum-pooling-based counterparts at capturing users' long-term interests. These improvements come at immense system cost, however, with sequence-based DLRMs requiring substantial amounts of data to be dynamically materialized and communicated by each accelerator during a single iteration. To address this rapidly growing bottleneck, we present FlexShard, a new tiered sequence embedding table sharding algorithm which operates at a per-row granularity by exploiting the insight that not every row is equal. Through precise replication of embedding rows based on their underlying probability distribution, along with the introduction of a new sharding strategy adapted to the heterogeneous, skewed performance of real-world cluster network topologies, FlexShard is able to significantly reduce communication demand while using no additional memory compared to the prior state-of-the-art. When evaluated on production-scale sequence DLRMs, FlexShard was able to reduce overall global all-to-all communication traffic by over 85%, resulting in end-to-end training communication latency improvements of almost 6x over the prior state-of-the-art approach.
Submitted 7 January, 2023;
originally announced January 2023.
-
What You Like: Generating Explainable Topical Recommendations for Twitter Using Social Annotations
Authors:
Parantapa Bhattacharya,
Saptarshi Ghosh,
Muhammad Bilal Zafar,
Soumya K. Ghosh,
Niloy Ganguly
Abstract:
With over 500 million tweets posted per day on Twitter, it is difficult for users to discover interesting content in the deluge of uninteresting posts. In this work, we present a novel, explainable, topical recommendation system that utilizes social annotations to help Twitter users discover tweets on topics of their interest. A major challenge in using traditional rating-dependent recommendation systems, like collaborative filtering and content-based systems, in high-volume social networks is that, due to attention scarcity, most items do not get any ratings. Additionally, the fact that most Twitter users are passive consumers, with 44% of users never tweeting, makes it very difficult to use user ratings for generating recommendations. Further, a key challenge in developing recommendation systems is that in many cases users reject relevant recommendations if they are totally unfamiliar with the recommended item. Providing a suitable explanation for why the item is recommended significantly improves the acceptability of the recommendation. By virtue of being a topical recommendation system, our method is able to present simple topical explanations for the generated recommendations. Comparisons with state-of-the-art matrix-factorization-based collaborative filtering, content-based, and social recommendations demonstrate the efficacy of the proposed approach.
Submitted 23 December, 2022;
originally announced December 2022.
-
Analyzing Regrettable Communications on Twitter: Characterizing Deleted Tweets and Their Authors
Authors:
Parantapa Bhattacharya,
Saptarshi Ghosh,
Niloy Ganguly
Abstract:
Over 500 million tweets are posted on Twitter each day, of which about 11% are deleted by the users who posted them. This phenomenon of widespread deletion of tweets leads to a number of questions: What kind of content makes users want to delete it later? Are all users equally active in deleting their tweets, or are users of certain predispositions more likely to post regrettable tweets and delete them later? In this paper, we provide a detailed characterization of tweets posted and then later deleted by their authors. We collected tweets from over 200 thousand Twitter users during a period of four weeks. Our characterization shows significant personality differences between users who delete their tweets and those who do not. We find that users who delete their tweets are more likely to be extroverted and neurotic while being less conscientious. Also, we find that deleted tweets, while containing less information and being less conversational, contain significant indications of regrettable content. Since users of online communication do not have instant social cues (like a listener's body language) to gauge the impact of their words, they are often delayed in employing repair strategies. Finally, we build a classifier which takes textual, contextual, and user features to predict whether a tweet will be deleted. The classifier achieves an F1-score of 0.78, and the precision increases when we consider response features of the tweets.
Submitted 23 December, 2022;
originally announced December 2022.
-
Task Preferences across Languages on Community Question Answering Platforms
Authors:
Sebastin Santy,
Prasanta Bhattacharya,
Rishabh Mehrotra
Abstract:
With the steady emergence of community question answering (CQA) platforms like Quora, StackExchange, and WikiHow, users now have unprecedented access to information on various kinds of queries and tasks. Moreover, the rapid proliferation and localization of these platforms spanning geographic and linguistic boundaries offer a unique opportunity to study the task requirements and preferences of users in different socio-linguistic groups. In this study, we implement an entity-embedding model trained on a large longitudinal dataset of multi-lingual and task-oriented question-answer pairs to uncover and quantify the (i) prevalence and distribution of various online tasks across linguistic communities, and (ii) emerging and receding trends in task popularity over time in these communities. Our results show that there exists substantial variance in task preference as well as popularity trends across linguistic communities on the platform. Findings from this study will help Q&A platforms better curate and personalize content for non-English users, while also offering valuable insights to businesses looking to target non-English speaking communities online.
Submitted 18 December, 2022;
originally announced December 2022.
-
Numerical simulation of the response of single gap timing RPCs with the space charge effects and Garfield++
Authors:
Tanay Dey,
Purba Bhattacharya,
Supratik Mukhopadhyay,
Nayana Majumdar,
Abhishek Seal,
Subhasis Chattopadhyay
Abstract:
In this article, we report the simulated response of timing RPCs of different gas gaps. A 3D Monte Carlo code was developed and integrated with Garfield++ to simulate the avalanche processes with space charge effects, which allows the actual charge and timing spectra to be obtained. The results of this study are presented with examples of timing RPCs with gas gaps of 0.02 cm and 0.03 cm.
Submitted 10 December, 2022;
originally announced December 2022.
-
Parallelization of Garfield++ and neBEM to simulate space charge effects in RPCs
Authors:
Tanay Dey,
Purba Bhattacharya,
Supratik Mukhopadhyay,
Nayana Majumdar,
Abhishek Seal,
Subhasis Chattopadhyay
Abstract:
Numerical simulation of avalanches, saturated avalanches, and streamers can help us understand the detector physics of Resistive Plate Chambers (RPCs). 3D Monte Carlo simulation of an avalanche inside an RPC, and of the transition from avalanche to saturated avalanche to streamer, may help the search for the optimum voltage and alternative gas mixtures. This task is dauntingly resource-hungry, especially when space charge effects become important, which often coincides with important regimes of operation of these devices. By modifying the electric field inside the RPC dynamically, the space charge plays a crucial role in determining the response of the detector. In this work, a numerical model has been proposed to calculate the dynamic space-charge field inside an RPC, and it has been implemented in the Garfield++ framework. By modeling space charge as a large number of line charges and using the OpenMP multithreading technique to calculate the electric field, drift line, electron gain, and space charge field, it has been possible to keep time consumption within reasonable limits. For this purpose, a new class, pAvalancheMC, has been introduced in Garfield++. The calculations have been successfully verified against those from existing solvers, and an example is provided to show the performance of pAvalancheMC. Moreover, the details of the transition of an avalanche into a saturated avalanche have been discussed. The induced charge distribution is calculated for a timing RPC, and the results are verified against experiment.
Submitted 5 October, 2023; v1 submitted 11 November, 2022;
originally announced November 2022.
-
Technology Pipeline for Large Scale Cross-Lingual Dubbing of Lecture Videos into Multiple Indian Languages
Authors:
Anusha Prakash,
Arun Kumar,
Ashish Seth,
Bhagyashree Mukherjee,
Ishika Gupta,
Jom Kuriakose,
Jordan Fernandes,
K V Vikram,
Mano Ranjith Kumar M,
Metilda Sagaya Mary,
Mohammad Wajahat,
Mohana N,
Mudit Batra,
Navina K,
Nihal John George,
Nithya Ravi,
Pruthwik Mishra,
Sudhanshu Srivastava,
Vasista Sai Lodagala,
Vandan Mujadia,
Kada Sai Venkata Vineeth,
Vrunda Sukhadia,
Dipti Sharma,
Hema Murthy,
Pushpak Bhattacharya
, et al. (2 additional authors not shown)
Abstract:
Cross-lingual dubbing of lecture videos requires the transcription of the original audio, correction and removal of disfluencies, domain term discovery, text-to-text translation into the target language, chunking of text using target language rhythm, and text-to-speech synthesis followed by isochronous lipsyncing to the original video. This task becomes challenging when the source and target languages belong to different language families, resulting in differences in generated audio duration. This is further compounded by the original speaker's rhythm, especially for extempore speech. This paper describes the challenges in regenerating English lecture videos in Indian languages semi-automatically. A prototype is developed for dubbing lectures into 9 Indian languages. A mean-opinion-score (MOS) is obtained for two languages, Hindi and Tamil, on two different courses. The output video is compared with the original video in terms of MOS (1-5) and lip synchronisation with scores of 4.09 and 3.74, respectively. Human effort is also reduced by 75%.
Submitted 1 November, 2022;
originally announced November 2022.
-
Design and studies of thick Gas Electron Multipliers fabricated in India
Authors:
Promita Roy,
Purba Bhattacharya,
Vishal Kumar,
Supratik Mukhopadhyay,
Nayana Majumdar,
Sandip Sarkar
Abstract:
THick Gas Electron Multipliers (THGEMs) are robust and high gain Micro Pattern Gaseous Detectors which are economically manufactured by standard drilling and etching of thin printed circuit boards. In this paper, we present our recent simulation as well as experimental studies on THGEMs which have been fabricated in India using local expertise. Two types of THGEMs have been fabricated; one set has holes without any external rim and another set has holes with rims. These detectors have been characterized using argon-carbon dioxide and argon-isobutane gas mixtures. Electron transmission, effective gain, energy resolution and optimized working range studies have been presented for both the sets of THGEMs.
Submitted 17 December, 2022; v1 submitted 19 October, 2022;
originally announced October 2022.
-
Deepfake Text Detection: Limitations and Opportunities
Authors:
Jiameng Pu,
Zain Sarwar,
Sifat Muhammad Abdullah,
Abdullah Rehman,
Yoonjin Kim,
Parantapa Bhattacharya,
Mobin Javed,
Bimal Viswanath
Abstract:
Recent advances in generative models for language have enabled the creation of convincing synthetic text or deepfake text. Prior work has demonstrated the potential for misuse of deepfake text to mislead content consumers. Therefore, deepfake text detection, the task of discriminating between human and machine-generated text, is becoming increasingly critical. Several defenses have been proposed for deepfake text detection. However, we lack a thorough understanding of their real-world applicability. In this paper, we collect deepfake text from 4 online services powered by Transformer-based tools to evaluate the generalization ability of the defenses on content in the wild. We develop several low-cost adversarial attacks, and investigate the robustness of existing defenses against an adaptive attacker. We find that many defenses show significant degradation in performance under our evaluation scenarios compared to their original claimed performance. Our evaluation shows that tapping into the semantic information in the text content is a promising approach for improving the robustness and generalization performance of deepfake text detection schemes.
Submitted 17 October, 2022;
originally announced October 2022.
-
Legal Case Document Summarization: Extractive and Abstractive Methods and their Evaluation
Authors:
Abhay Shukla,
Paheli Bhattacharya,
Soham Poddar,
Rajdeep Mukherjee,
Kripabandhu Ghosh,
Pawan Goyal,
Saptarshi Ghosh
Abstract:
Summarization of legal case judgement documents is a challenging problem in Legal NLP. However, not many analyses exist on how different families of summarization models (e.g., extractive vs. abstractive) perform when applied to legal case documents. This question is particularly important since many recent transformer-based abstractive summarization models have restrictions on the number of input tokens, and legal documents are known to be very long. Also, it is an open question how best to evaluate legal case document summarization systems. In this paper, we carry out extensive experiments with several extractive and abstractive summarization methods (both supervised and unsupervised) over three legal summarization datasets that we have developed. Our analyses, which include evaluation by law practitioners, lead to several interesting insights on legal summarization in particular and long document summarization in general.
Submitted 14 October, 2022;
originally announced October 2022.
-
Reduction in turbulence-induced non-linear dynamic vibration using tuned liquid damper (TLD)
Authors:
Ananya Majumdar,
Biplab Ranjan Adhikary,
Partha Bhattacharya
Abstract:
In the present research work, an attempt is made to develop a coupled non-linear turbulence-structure-damper model in a finite volume-finite difference (FV-FD) framework. A tuned liquid damper (TLD) is used as the additional damping system along with inherent structural damping. Real-time simulation of a flow-excited bridge box girder or chimney section and the vibration reduction using a TLD can be performed using the developed model. The turbulent flow field around a structure is modeled using an OpenFOAM transient PISO solver, and the time-varying drag force is calculated. This force perturbs the structure, causing sloshing in the attached TLD, which is modeled using a shallow-depth approximation and damps the flow-induced vibration of the structure. The structural motion with and without the attached TLD is modeled with the FD-based Newmark-Beta method using in-house MATLAB codes. The TLD is tuned to the vortex-shedding frequency of the low-Reynolds-number flows, and it is found to reduce the structural excitation significantly. On the other hand, the high-Reynolds-number turbulent flow exhibits a broadband excitation, for which tuning the TLD to a few frequencies obtained through investigations yields a good reduction in vibration.
Submitted 19 October, 2023; v1 submitted 2 October, 2022;
originally announced October 2022.
-
Legal Case Document Similarity: You Need Both Network and Text
Authors:
Paheli Bhattacharya,
Kripabandhu Ghosh,
Arindam Pal,
Saptarshi Ghosh
Abstract:
Estimating the similarity between two legal case documents is an important and challenging problem, having various downstream applications such as prior-case retrieval and citation recommendation. There are two broad approaches for the task -- citation network-based and text-based. Prior citation network-based approaches consider citations only to prior-cases (also called precedents) (PCNet). This approach misses important signals inherent in Statutes (written laws of a jurisdiction). In this work, we propose Hier-SPCNet that augments PCNet with a heterogeneous network of Statutes. We incorporate domain knowledge for legal document similarity into Hier-SPCNet, thereby obtaining state-of-the-art results for network-based legal document similarity. Both textual and network similarity provide important signals for legal case similarity; but till now, only trivial attempts have been made to unify the two signals. In this work, we apply several methods for combining textual and network information for estimating legal case similarity. We perform extensive experiments over legal case documents from the Indian judiciary, where the gold standard similarity between document-pairs is judged by law experts from two reputed Law institutes in India. Our experiments establish that our proposed network-based methods significantly improve the correlation with domain experts' opinion when compared to the existing methods for network-based legal document similarity. Our best-performing combination method (that combines network-based and text-based similarity) improves the correlation with domain experts' opinion by 11.8% over the best text-based method and 20.6% over the best network-based method. We also establish that our best-performing method can be used to recommend / retrieve citable and similar cases for a source (query) case, which are well appreciated by legal experts.
Submitted 26 September, 2022;
originally announced September 2022.
-
Sensitivity mapping of TBL wall-pressure spectra with CFD turbulence models for wind tunnel test result prediction
Authors:
Biplab Ranjan Adhikary,
Ananya Majumdar,
Subhadeep Sarkar,
Partha Bhattacharya
Abstract:
In the present work, an attempt is made to map the sensitivity of the existing zero pressure gradient (ZPG) turbulent boundary layer (TBL) wall-pressure spectrum models with different TBL parameters, and eventually, with different Reynolds Averaged Navier Stokes (RANS) turbulence models, simulated in OpenFOAM and ANSYS Fluent solvers. This study will help future researchers to choose a particular RANS turbulence model vis-à-vis a particular wall-spectrum model in order to obtain a reasonably accurate wind tunnel result predicting capability. First, the best-predicting pressure spectrum models are selected by comparing them with wind tunnel test data. Next, considering the experimental TBL parameters as benchmarks, errors in RANS-produced data are estimated. Furthermore, wall-pressure spectra are calculated following semi-empirical spectrum models using TBL parameter feed obtained from experiments and computational fluid dynamics (CFD) simulations. Finally, sensitivity mapping is performed between spectrum models and the RANS models, with different normalized wall-normal distances (y+).
Submitted 24 September, 2022;
originally announced September 2022.
-
Study of space charge phenomena in GEM-based detectors
Authors:
Promita Roy,
Prasant Kumar Rout,
Jaydeep Datta,
Purba Bhattacharya,
Supratik Mukhopadhyay,
Nayana Majumdar,
Sandip Sarkar
Abstract:
Space charge accumulation within GEM holes is one of the vital phenomena that affect many of the key working parameters of the detector. This accumulation is found to be significantly affected by the initial primary charge configurations and applied GEM voltages, since they determine charge sharing and the subsequent evolution of detector response. In this work, we have studied the effects of space charge phenomena on different parameters for single GEM detectors using a hybrid numerical model.
Submitted 21 September, 2022;
originally announced September 2022.
-
TBL-induced energy transmission into a double wall backed enclosure system computed in a cloud-based Python-FE environment
Authors:
Biplab Ranjan Adhikary,
Atanu Sahu,
Partha Bhattacharya
Abstract:
We propose a fully coupled numerical model to predict turbulent boundary layer (TBL) induced energy transmission behavior for a double-wall backed enclosure system in a finite element (FE) framework computed in a cloud-based Python environment. The Goody single-point wall-pressure spectrum and the Corcos spatial correlation function are used to generate the TBL cross-power spectra. Mindlin's first-order shear deformation model is considered for the panels, and a fully coupled TBL-structure-acoustic model is developed using the FE approach to predict the acoustic power level inside the enclosure for variable gap distance between the panels. The model is developed to capture the contribution of orthotropic lamina sequence, frequency-dependent structural damping, and stiffening orientation in predicting the energy transmission into a double-wall backed enclosure. Thus, a new numerical model is presented that gives designers more precise energy transmission quantification with greater flexibility in terms of the number of panel leaves, geometry, and boundary conditions of the enclosure system, backed by a double wall made of isotropic or orthotropic laminates.
Submitted 17 September, 2022;
originally announced September 2022.
-
A coupled FE-BE approach for vibro-acoustic response prediction of laminated composite panels due to turbulent boundary layer excitation involving Cholesky decomposition
Authors:
Biplab Ranjan Adhikary,
Atanu Sahu,
Partha Bhattacharya
Abstract:
An original numerical framework is developed in the present research work in order to estimate the free-field sound radiation from baffled structural panels subjected to turbulent boundary layer-induced excitation. A semi-analytical method is used to estimate the TBL wall pressure spectrum, which is decomposed using the Cholesky technique to obtain random wall pressure in the frequency domain. Structural panels are modeled using the finite element technique, and a coupled finite element-boundary element modeling technique is developed to estimate the sound power level radiating into the free field. Results are obtained for laminated composite structural panels with various fiber orientations and significant findings are discussed. The developed technique has the potential to be further extended for complex structures in terms of geometry, material properties, and boundary conditions. The complete numerical toolbox, developed in an in-house MATLAB environment, enables the prediction of turbulent structure acoustic coupled behavior at an early design stage.
Submitted 15 September, 2022;
originally announced September 2022.
-
Test-Beam and Simulation Studies Towards RPWELL-based DHCAL
Authors:
Dan Shaked-Renous,
Fernando Domingues Amaro,
Purba Bhattacharya,
Amos Breskin,
Maximilien Chefdeville,
Cyril Drancourt,
Theo Geralis,
Yannis Karyotakis,
Luca Moleri,
Andrea Tesi,
Maxim Titov,
Joao Veloso,
Guillaum Vouters,
Shikma Bressler
Abstract:
Digital Hadronic Calorimeters (DHCAL) were suggested for future colliders as part of the particle-flow concept. Though studied mainly with Resistive Plate Chambers (RPC), studies focusing on Micro-Pattern Gaseous Detector (MPGD)-based sampling elements have shown their potential advantages: they can be operated with environmentally friendly gases and reach similar detection efficiency at lower average pad-multiplicity. We summarize here the experimental test-beam results of a small-size DHCAL prototype, incorporating six Micromegas (MM) and two Resistive-Plate WELL (RPWELL) sampling elements, interlaced with steel-absorber plates. It was investigated with a 2-6 GeV pion beam at the CERN/PS beam facility. The data permitted validating a GEANT4 simulation framework of a DHCAL, and evaluating the expected pion energy resolution of a full-scale RPWELL-based calorimeter. The pion energy resolution of $\frac{\sigma}{E[GeV]}=\frac{50.8\%}{\sqrt{E[GeV]}} \oplus 10.3\%$ expected with the RPWELL concept is competitive with that of glass RPC and MM sampling techniques.
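As a rough illustration of how the quoted resolution formula can be evaluated, the sketch below assumes that $\oplus$ denotes addition in quadrature (the usual calorimetry convention; the abstract does not spell this out). The function name and its parameter names are hypothetical and are not taken from the paper; only the coefficients 50.8% and 10.3% and the 2-6 GeV range come from the abstract.

```python
import math


def pion_energy_resolution(energy_gev, stochastic=0.508, constant=0.103):
    """Return sigma/E, assuming the two terms add in quadrature.

    stochastic: coefficient of the 1/sqrt(E) term (50.8% in the abstract).
    constant:   energy-independent term (10.3% in the abstract).
    """
    if energy_gev <= 0:
        raise ValueError("energy must be positive")
    stochastic_term = stochastic / math.sqrt(energy_gev)
    # math.hypot(a, b) computes sqrt(a**2 + b**2), i.e. the quadrature sum.
    return math.hypot(stochastic_term, constant)


if __name__ == "__main__":
    # Energies chosen inside the 2-6 GeV beam range quoted in the abstract.
    for energy in (2.0, 4.0, 6.0):
        resolution = pion_energy_resolution(energy)
        print(f"E = {energy:.0f} GeV: sigma/E = {resolution:.1%}")
```

Under this assumption the stochastic term shrinks with energy while the constant term sets a floor, so the relative resolution improves as the beam energy rises.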
Submitted 7 October, 2022; v1 submitted 26 August, 2022;
originally announced August 2022.
-
A coupled FE-RRM-based numerical model for analysis of energy transmission loss through stiffened double-wall panel due to TBL excitation
Authors:
Biplab Ranjan Adhikary,
Atanu Sahu,
Partha Bhattacharya
Abstract:
We propose a fully coupled numerical model to predict energy transmission through a turbulent boundary layer (TBL) excited stiffened double-leaf flexible aircraft panel using a finite element (FE) framework. The Mindlin first-order shear deformation model is adopted for the panels, and a TBL-structure-acoustic coupling model is developed using a finite element-radiation resistance matrix (FE-RRM) approach to predict the transmission loss (TL) through double-leaf panels with variable thickness and stiffener orientation. The model is also able to capture the contribution of orthotropic lamina sequence and frequency-dependent structural damping in predicting the TL. Thus, a new numerical model is proposed that gives designers greater flexibility in terms of the number of panel leaves and the boundary and stiffening conditions of the aircraft panel-cavity-panel system, made of isotropic or orthotropic laminates.
Submitted 23 August, 2022;
originally announced August 2022.
-
TPP: Transparent Page Placement for CXL-Enabled Tiered-Memory
Authors:
Hasan Al Maruf,
Hao Wang,
Abhishek Dhanotia,
Johannes Weiner,
Niket Agarwal,
Pallab Bhattacharya,
Chris Petersen,
Mosharaf Chowdhury,
Shobhit Kanaujia,
Prakash Chauhan
Abstract:
The increasing demand for memory in hyperscale applications has led to memory becoming a large portion of the overall datacenter spend. The emergence of coherent interfaces like CXL enables main memory expansion and offers an efficient solution to this problem. In such systems, the main memory can constitute different memory technologies with varied characteristics. In this paper, we characterize memory usage patterns of a wide range of datacenter applications across the server fleet of Meta. We, therefore, demonstrate the opportunities to offload colder pages to slower memory tiers for these applications. Without efficient memory management, however, such systems can significantly degrade performance.
We propose a novel OS-level application-transparent page placement mechanism (TPP) for CXL-enabled memory. TPP employs a lightweight mechanism to identify and place hot/cold pages to appropriate memory tiers. It enables a proactive page demotion from local memory to CXL-Memory. This technique ensures a memory headroom for new page allocations that are often related to request processing and tend to be short-lived and hot. At the same time, TPP can promptly promote performance-critical hot pages trapped in the slow CXL-Memory to the fast local memory, while minimizing both sampling overhead and unnecessary migrations. TPP works transparently without any application-specific knowledge and can be deployed globally as a kernel release.
We evaluate TPP in the production server fleet with early samples of new x86 CPUs with CXL 1.1 support. TPP makes a tiered memory system perform on par with an ideal baseline (<1% gap) that has all the memory in the local tier. It is 18% better than today's Linux, and 5-17% better than existing solutions including NUMA Balancing and AutoTiering. Most of the TPP patches have been merged in the Linux v5.18 release.
Submitted 28 May, 2023; v1 submitted 6 June, 2022;
originally announced June 2022.
-
Double-hit separation and dE/dx resolution of a time projection chamber with GEM readout
Authors:
Yumi Aoki,
David Attié,
Ties Behnke,
Alain Bellerive,
Oleg Bezshyyko,
Deb Bhattacharya Sankar,
Purba Bhattacharya,
Sudeb Bhattacharya,
Yue Chang,
Paul Colas,
Gilles De Lentdecker,
Klaus Dehmelt,
Klaus Desch,
Ralf Diener,
Madhu Dixit,
Ulrich Einhaus,
Oleksiy Fedorchuk,
Ivor Fleck,
Keisuke Fujii,
Takahiro Fusayasu,
Serguei Ganjour,
Philippe Gros,
Peter Hayman,
Katsumasa Ikematsu,
Leif Jönsson
, et al. (46 additional authors not shown)
Abstract:
A time projection chamber (TPC) with micropattern gaseous detector (MPGD) readout is investigated as main tracking device of the International Large Detector (ILD) concept at the planned International Linear Collider (ILC). A prototype TPC equipped with a triple gas electron multiplier (GEM) readout has been built and operated in an electron test beam. The TPC was placed in a 1 T solenoidal field at the DESY II Test Beam Facility, which provides an electron beam up to 6 GeV/c. The performance of the readout modules, in particular the spatial point resolution, is determined and compared to earlier tests. New studies are presented with first results on the separation of close-by tracks and the capability of the system to measure the specific energy loss dE/dx. This is complemented by a simulation study on the optimization of the readout granularity to improve particle identification by dE/dx.
Submitted 25 November, 2022; v1 submitted 24 May, 2022;
originally announced May 2022.
-
A Tour of Visualization Techniques for Computer Vision Datasets
Authors:
Bilal Alsallakh,
Pamela Bhattacharya,
Vanessa Feng,
Narine Kokhlikyan,
Orion Reblitz-Richardson,
Rahul Rajan,
David Yan
Abstract:
We survey a number of data visualization techniques for analyzing Computer Vision (CV) datasets. These techniques help us understand properties and latent patterns in such data, by applying dataset-level analysis. We present various examples of how such analysis helps predict the potential impact of the dataset properties on CV models and informs appropriate mitigation of their shortcomings. Finally, we explore avenues for further visualization techniques of different modalities of CV datasets as well as ones that are tailored to support specific CV tasks and analysis needs.
Submitted 18 April, 2022;
originally announced April 2022.
-
Effect of hole geometry on charge sharing and other parameters in GEM-based detectors
Authors:
Promita Roy,
Purba Bhattacharya,
Prasant Kumar Rout,
Supratik Mukhopadhyay,
Nayana Majumdar,
Sandip Sarkar
Abstract:
Gas Electron Multipliers (GEM) are among the more prominent Micro-Pattern Gaseous Detectors (MPGDs) and are widely used in high energy particle physics experiments and various related applications. Adoption of different production techniques leads to holes of varying geometries in GEM foils. Since the response of a GEM-based detector is closely related to the hole geometry through the influence of the latter on charge sharing and transport through GEM foils, attempts have been made to relate hole configurations to different figures of merit of a detector. Numerical simulations have been performed to study the effects of hole geometry on important parameters such as charge sharing, collection efficiency, extraction efficiency, gain, and the possibility of transition from avalanche to streamer modes for single, double and triple layer GEM detectors. The numerical estimates have been compared to available experimental data. The comparisons, although not always in agreement, are found to be generally encouraging.
Submitted 5 March, 2024; v1 submitted 23 December, 2021;
originally announced December 2021.
-
Incorporating Domain Knowledge for Extractive Summarization of Legal Case Documents
Authors:
Paheli Bhattacharya,
Soham Poddar,
Koustav Rudra,
Kripabandhu Ghosh,
Saptarshi Ghosh
Abstract:
Automatic summarization of legal case documents is an important and practical challenge. Apart from many domain-independent text summarization algorithms that can be used for this purpose, several algorithms have been developed specifically for summarizing legal case documents. However, most of the existing algorithms do not systematically incorporate domain knowledge that specifies what information should ideally be present in a legal case document summary. To address this gap, we propose an unsupervised summarization algorithm DELSumm which is designed to systematically incorporate guidelines from legal experts into an optimization setup. We conduct detailed experiments over case documents from the Indian Supreme Court. The experiments show that our proposed unsupervised method outperforms several strong baselines in terms of ROUGE scores, including both general summarization algorithms and legal-specific ones. In fact, though our proposed algorithm is unsupervised, it outperforms several supervised summarization models that are trained over thousands of document-summary pairs.
Submitted 30 June, 2021;
originally announced June 2021.
-
On realizations of the subalgebra $A^R(1)$ of the $R$-motivic Steenrod Algebra
Authors:
Prasit Bhattacharya,
Bertrand J. Guillou,
Ang Li
Abstract:
In this paper, we show that the finite subalgebra $\mathcal{A}^{\mathbb{R}}(1)$, generated by $\mathrm{Sq}^1$ and $\mathrm{Sq}^2$, of the $\mathbb{R}$-motivic Steenrod algebra $\mathcal{A}^{\mathbb{R}}$ can be given $128$ different $\mathcal{A}^{\mathbb{R}}$-module structures. We also show that all of these $\mathcal{A}^{\mathbb{R}}$-modules can be realized as the cohomology of a $2$-local finite $\mathbb{R}$-motivic spectrum. The realization results are obtained using an $\mathbb{R}$-motivic analogue of the Toda realization theorem. We notice that each realization of $\mathcal{A}^{\mathbb{R}}(1)$ can be expressed as a cofiber of an $\mathbb{R}$-motivic $v_1$-self-map. The $\mathrm{C}_2$-equivariant analogue of the above results then follows because of the Betti realization functor. We identify a relationship between the $\mathrm{RO}(\mathrm{C}_2)$-graded Steenrod operations on a $\mathrm{C}_2$-equivariant space and the classical Steenrod operations on both its underlying space and its fixed-points. This technique is then used to identify the geometric fixed-point spectra of the $\mathrm{C}_2$-equivariant realizations of $\mathcal{A}^{\mathrm{C}_2}(1)$. We find another application of the $\mathbb{R}$-motivic Toda realization theorem: we produce an $\mathbb{R}$-motivic, and consequently a $\mathrm{C}_2$-equivariant, analogue of the Bhattacharya-Egger spectrum $\mathcal{Z}$, which could be of independent interest.
Submitted 11 July, 2021; v1 submitted 20 June, 2021;
originally announced June 2021.
-
A Discussion on Building Practical NLP Leaderboards: The Case of Machine Translation
Authors:
Sebastin Santy,
Prasanta Bhattacharya
Abstract:
Recent advances in AI and ML applications have benefited from rapid progress in NLP research. Leaderboards have emerged as a popular mechanism to track and accelerate progress in NLP through competitive model development. While this has increased interest and participation, the over-reliance on single, accuracy-based metrics has shifted focus from other important metrics that might be equally pertinent to consider in real-world contexts. In this paper, we offer a preliminary discussion of the risks associated with focusing exclusively on accuracy metrics and draw on recent discussions to highlight prescriptive suggestions on how to develop more practical and effective leaderboards that can better reflect the real-world utility of models.
Submitted 30 December, 2022; v1 submitted 11 June, 2021;
originally announced June 2021.
-
Modeling Photocurrent Spectra of In$_{0.91}$Ga$_{0.09}$N/In$_{0.4}$Ga$_{0.6}$N Disk-in-Wire Photodiode on Silicon for $1.3$ $μ$m $-$ $1.55$ $μ$m Operation
Authors:
Fu-Chen Hsiao,
Arnab Hazari,
Pallab Bhattacharya,
Yia-Chung Chang,
John M. Dallesasse
Abstract:
This work reports comprehensive theoretical modeling of photocurrent spectra generated by an In$_{0.91}$Ga$_{0.09}$N/In$_{0.4}$Ga$_{0.6}$N disk-in-wire photodiode. The strain distribution is calculated by the valence-force-field (VFF) model, while a realistic band structure of the InN/InGaN heterostructure is incorporated using an eight-band effective bond-orbital model (EBOM) with spin-orbit coupling neglected. The electrostatic potential is obtained from a self-consistent calculation employing the non-equilibrium Green's function (NEGF) method. With the strain distribution and band profile determined, a multi-band transfer-matrix method (TMM) is used to calculate the tunneling coefficients of optically-pumped carriers in the absorbing region. The photocurrent spectra contributed by both single-photon absorption (SPA) and two-photon absorption (TPA) are calculated. The absorption coefficient is weighted by the carrier tunneling rate and the photon density-of-state (DOS) in the optical cavity formed in the nanowire region to produce the photocurrent. The calculated photocurrent spectra are in good agreement with experimental data, while physical mechanisms for the observed prominent peaks are identified and investigated.
Submitted 11 May, 2021;
originally announced May 2021.
-
Software-Hardware Co-design for Fast and Scalable Training of Deep Learning Recommendation Models
Authors:
Dheevatsa Mudigere,
Yuchen Hao,
Jianyu Huang,
Zhihao Jia,
Andrew Tulloch,
Srinivas Sridharan,
Xing Liu,
Mustafa Ozdal,
Jade Nie,
Jongsoo Park,
Liang Luo,
Jie Amy Yang,
Leon Gao,
Dmytro Ivchenko,
Aarti Basant,
Yuxi Hu,
Jiyan Yang,
Ehsan K. Ardestani,
Xiaodong Wang,
Rakesh Komuravelli,
Ching-Hsiang Chu,
Serhat Yilmaz,
Huayu Li,
Jiyuan Qian,
Zhuobo Feng
, et al. (28 additional authors not shown)
Abstract:
Deep learning recommendation models (DLRMs) are used across many business-critical services at Facebook and are the single largest AI application in terms of infrastructure demand in its data-centers. In this paper we discuss the SW/HW co-designed solution for high-performance distributed training of large-scale DLRMs. We introduce a high-performance scalable software stack based on PyTorch and pair it with the new evolution of Zion platform, namely ZionEX. We demonstrate the capability to train very large DLRMs with up to 12 trillion parameters and show that we can attain 40X speedup in terms of time to solution over previous systems. We achieve this by (i) designing the ZionEX platform with a dedicated scale-out network, provisioned with high bandwidth, optimal topology and efficient transport; (ii) implementing an optimized PyTorch-based training stack supporting both model and data parallelism; (iii) developing sharding algorithms capable of hierarchical partitioning of the embedding tables along row and column dimensions and load balancing them across multiple workers; (iv) adding high-performance core operators while retaining flexibility to support optimizers with fully deterministic updates; and (v) leveraging reduced precision communications, multi-level memory hierarchy (HBM+DDR+SSD) and pipelining. Furthermore, we develop and briefly comment on distributed data ingestion and other supporting services that are required for the robust and efficient end-to-end training in production environments.
Submitted 26 February, 2023; v1 submitted 11 April, 2021;
originally announced April 2021.
-
Circulant association schemes on triples
Authors:
Prabir Bhattacharya,
Cheryl E Praeger
Abstract:
Association Schemes and coherent configurations (and the related Bose-Mesner algebra and coherent algebras) are well known in combinatorics with many applications. In the 1990s, Mesner and Bhattacharya introduced a three-dimensional generalisation of association schemes which they called an association scheme on triples (AST) and constructed examples of several families of ASTs. Many of their examples used 2-transitive permutation groups: the non-trivial ternary relations of the ASTs were sets of ordered triples of pairwise distinct points of the underlying set left invariant by the group; and the given permutation group was a subgroup of automorphisms of the AST. In this paper, we consider ASTs that do not necessarily admit 2-transitive groups as automorphism groups but instead a transitive cyclic subgroup of the symmetric group acts as automorphisms. Such ASTs are called circulant ASTs and the corresponding ternary relations are called circulant relations. We give a complete characterisation of circulant ASTs in terms of AST-regular partitions of the underlying set. We also show that a special type of circulant, that we call a thin circulant, plays a key role in describing the structure of circulant ASTs. We outline several open questions.
Submitted 7 April, 2021;
originally announced April 2021.
-
Jekyll: Attacking Medical Image Diagnostics using Deep Generative Models
Authors:
Neal Mangaokar,
Jiameng Pu,
Parantapa Bhattacharya,
Chandan K. Reddy,
Bimal Viswanath
Abstract:
Advances in deep neural networks (DNNs) have shown tremendous promise in the medical domain. However, the deep learning tools that are helping the domain can also be used against it. Given the prevalence of fraud in the healthcare domain, it is important to consider the adversarial use of DNNs in manipulating sensitive data that is crucial to patient healthcare. In this work, we present the design and implementation of a DNN-based image translation attack on biomedical imagery. More specifically, we propose Jekyll, a neural style transfer framework that takes as input a biomedical image of a patient and translates it to a new image that indicates an attacker-chosen disease condition. The potential for fraudulent claims based on such generated 'fake' medical images is significant, and we demonstrate successful attacks on both X-rays and retinal fundus image modalities. We show that these attacks manage to mislead both medical professionals and algorithmic detection schemes. Lastly, we also investigate defensive measures based on machine learning to detect images generated by Jekyll.
Submitted 5 April, 2021;
originally announced April 2021.
-
Numerical estimation of discharge probability in GEM-based detectors
Authors:
Prasant Kumar Rout,
R. Kanishka,
Jaydeep Datta,
Promita Roy,
Purba Bhattacharya,
Supratik Mukhopadhyay,
Nayana Majumdar,
Sandip Sarkar
Abstract:
Discharge probability in GEM-based gaseous detectors has been numerically estimated using an axisymmetric hydrodynamic model. Initial primary charge configurations in the drift region, obtained using Heed and Geant4, are found to have a significant effect on the subsequent evolution of detector response. Simulation of energy resolution has been performed to establish the capability of the hydrodynamic model to capture the statistical nature of the experimental situation. Finally, single and triple GEM configurations exposed to alpha sources have been simulated to estimate discharge probability, which has been compared with available experimental data. Despite the simplifying and drastic assumptions in the numerical model, the comparisons are encouraging.
Submitted 13 August, 2021; v1 submitted 23 March, 2021;
originally announced March 2021.
-
Deepfake Videos in the Wild: Analysis and Detection
Authors:
Jiameng Pu,
Neal Mangaokar,
Lauren Kelly,
Parantapa Bhattacharya,
Kavya Sundaram,
Mobin Javed,
Bolun Wang,
Bimal Viswanath
Abstract:
AI-manipulated videos, commonly known as deepfakes, are an emerging problem. Recently, researchers in academia and industry have contributed several (self-created) benchmark deepfake datasets, and deepfake detection algorithms. However, little effort has gone towards understanding deepfake videos in the wild, leading to a limited understanding of the real-world applicability of research contributions in this space. Even if detection schemes are shown to perform well on existing datasets, it is unclear how well the methods generalize to real-world deepfakes. To bridge this gap in knowledge, we make the following contributions: First, we collect and present the largest dataset of deepfake videos in the wild, containing 1,869 videos from YouTube and Bilibili, and extract over 4.8M frames of content. Second, we present a comprehensive analysis of the growth patterns, popularity, creators, manipulation strategies, and production methods of deepfake content in the real world. Third, we systematically evaluate existing defenses using our new dataset, and observe that they are not ready for deployment in the real world. Fourth, we explore the potential for transfer learning schemes and competition-winning techniques to improve defenses.
Submitted 10 March, 2021; v1 submitted 6 March, 2021;
originally announced March 2021.
-
Single Electron Spectra in RPWELL-based detectors
Authors:
Purba Bhattacharya,
Andrea Tesi,
Dan Shaked-Renous,
Luca Moleri,
Amos Breskin,
Shikma Bressler
Abstract:
Single-electron avalanche distributions in gaseous multipliers affect their efficient detection and that of single UV-photons. In this work, we investigated the shape of single-photo-electron spectra in single- and double-stage Resistive Plate WELL (RPWELL) detector configurations, operated in $\mathrm{Ne/CH_{4}}$ and $\mathrm{Ar/CH_{4}}$. Discharge-free operation was reached over a broad dynamic range, with charge gains of $10^{4}$-$10^{6}$. Compared to the usual exponential ones, the observed Polya-like charge spectra pave the way towards higher single-electron detection efficiencies. The latter were evaluated here, using experimental data combined with numerical simulations. The effects of the gas mixtures, electric field configuration and detector geometry on the Polya spectra and their related "$θ$" parameter are presented.
Submitted 21 June, 2021; v1 submitted 5 January, 2021;
originally announced January 2021.
-
To Schedule or not to Schedule: Extracting Task Specific Temporal Entities and Associated Negation Constraints
Authors:
Barun Patra,
Chala Fufa,
Pamela Bhattacharya,
Charles Lee
Abstract:
State-of-the-art research for date-time entity extraction from text is task agnostic. Consequently, while the methods proposed in the literature perform well for generic date-time extraction from texts, they don't fare as well on task-specific date-time entity extraction where only a subset of the date-time entities present in the text are pertinent to solving the task. Furthermore, some tasks require identifying negation constraints associated with the date-time entities to correctly reason over time. We showcase a novel model for extracting task-specific date-time entities along with their negation constraints. We show the efficacy of our method on the task of date-time understanding in the context of scheduling meetings for an email-based digital AI scheduling assistant. Our method achieves an absolute gain of 19% f-score points compared to baseline methods in detecting the date-time entities relevant to scheduling meetings and a 4% improvement over baseline methods for detecting negation constraints over date-time entities.
Submitted 15 November, 2020;
originally announced December 2020.
-
Fast simulation of avalanche and streamer in GEM detector using hydrodynamic approach
Authors:
Prasant Kumar Rout,
Jaydeep Datta,
Promita Roy,
Purba Bhattacharya,
Supratik Mukhopadhyay,
Nayana Majumdar,
Sandip Sarkar
Abstract:
A fast, hydrodynamic numerical model has been developed on the COMSOL Multiphysics platform to simulate the evolution and dynamics of charged particles in gaseous ionization detectors based on the Gaseous Electron Multipliers (GEM). Effects of using two-dimensional (2D), 2D axisymmetric and three-dimensional (3D) models of the detectors have been analyzed to choose the optimum configuration. The chosen model has been used to follow the entire operating regime of single, double and triple GEM detectors, including avalanche and streamer mode operations. The accumulation of space charge, its contribution towards the distortion of the applied electric field and production of streamers have been investigated in fair detail using the optimized model.
Submitted 15 December, 2020; v1 submitted 5 November, 2020;
originally announced November 2020.
-
An $\mathbb{R}$-motivic $v_{1}$-self-map of periodicity $1$
Authors:
Prasit Bhattacharya,
Bertrand Guillou,
Ang Li
Abstract:
We consider a nontrivial action of $\mathrm{C}_2$ on the type $1$ spectrum $\mathcal{Y} := \mathrm{M}_2(1) \wedge \mathrm{C}(η)$, which is well-known for admitting a $1$-periodic $v_1$-self-map. The resultant finite $\mathrm{C}_2$-equivariant spectrum $\mathcal{Y}^{\mathrm{C}_2}$ can also be viewed as the complex points of a finite $\mathbb{R}$-motivic spectrum $\mathcal{Y}^\mathbb{R}$. In this paper, we show that one of the $1$-periodic $v_1$-self-maps of $\mathcal{Y}$ can be lifted to a self-map of $\mathcal{Y}^{\mathrm{C}_2}$ as well as $\mathcal{Y}^{\mathbb{R}}$. Further, the cofiber of the self-map of $\mathcal{Y}^{\mathbb{R}}$ is a realization of the subalgebra $\mathcal{A}^\mathbb{R}(1)$ of the $\mathbb{R}$-motivic Steenrod algebra. We also show that the $\mathrm{C}_2$-equivariant self-map is nilpotent on the geometric fixed-points of $\mathcal{Y}^{\mathrm{C}_2}$.
Submitted 2 November, 2020; v1 submitted 12 August, 2020;
originally announced August 2020.
-
Hier-SPCNet: A Legal Statute Hierarchy-based Heterogeneous Network for Computing Legal Case Document Similarity
Authors:
Paheli Bhattacharya,
Kripabandhu Ghosh,
Arindam Pal,
Saptarshi Ghosh
Abstract:
Computing similarity between two legal case documents is an important and challenging task in Legal IR, for which text-based and network-based measures have been proposed in literature. All prior network-based similarity methods considered a precedent citation network among case documents only (PCNet). However, this approach misses an important source of legal knowledge -- the hierarchy of legal statutes that are applicable in a given legal jurisdiction (e.g., country). We propose to augment the PCNet with the hierarchy of legal statutes, to form a heterogeneous network Hier-SPCNet, having citation links between case documents and statutes, as well as citation and hierarchy links among the statutes. Experiments over a set of Indian Supreme Court case documents show that our proposed heterogeneous network enables significantly better document similarity estimation, as compared to existing approaches using PCNet. We also show that the proposed network-based method can complement text-based measures for better estimation of legal document similarity.
Submitted 7 July, 2020;
originally announced July 2020.
-
Electron transparency of a Micromegas mesh
Authors:
K. Nikolopoulos,
P. Bhattacharya,
V. Chernyatin,
R. Veenhof
Abstract:
Measurements of the electron transparency of a Micromegas mesh are compared to simulations. The flux conservation argument is shown to lead to inaccurate estimates of the transparency; the importance of accurate geometric modelling of the mesh is discussed; and the effect of the dipole moment of the mesh is demonstrated. This study provides a validation of the microscopic simulation methods specifically developed for micropattern devices where the characteristic dimensions are of the same order of magnitude as the electron mean free path in the gas.
Submitted 3 May, 2020;
originally announced May 2020.
-
Charge sharing in single and double GEMs
Authors:
Promita Roy,
Purba Bhattacharya,
Supratik Mukhopadhyay,
Nayana Majumdar
Abstract:
The Gas Electron Multiplier (GEM) has become a widely used technology for high-rate particle physics experiments like COMPASS and LHCb, and is being used as the readout system for the upcoming upgrades of other experiments such as the ALICE TPC. Radiation hardness, ageing resistance and stability against discharges are the main criteria for long-term operation of such detectors in high-rate experiments. In particular, discharge is a serious issue as it may cause irreversible damage to the detector as well as the readout electronics. The charge density inside the amplification region is the limiting factor for detector stability against discharges. By using multiple devices, and thus sharing the electron multiplication among different stages, the maximum sustainable gain can be increased by several orders of magnitude. A common explanation for this is connected to the transverse electron diffusion, which widens the electron cloud and reduces the charge density in the last multiplier. However, this has not been verified yet. In our work, we use the Garfield simulation framework as a tool to extract the information related to the transverse size of the propagating electron cloud and thus to estimate the charge density in the GEM holes for multiple stages. For a given gas mixture, we will present the initial results of charge sharing using single and double GEM detectors under different electric field configurations and its effect on other measurable detector parameters such as single point position resolution.
Submitted 27 May, 2020; v1 submitted 28 April, 2020;
originally announced April 2020.
-
Exploring the contextual factors affecting multimodal emotion recognition in videos
Authors:
Prasanta Bhattacharya,
Raj Kumar Gupta,
Yinping Yang
Abstract:
Emotional expressions form a key part of user behavior on today's digital platforms. While multimodal emotion recognition techniques are gaining research attention, there is a lack of deeper understanding of how visual and non-visual features can be used to better recognize emotions in certain contexts, but not others. This study analyzes the interplay between the effects of multimodal emotion features derived from facial expressions, tone and text in conjunction with two key contextual factors: i) gender of the speaker, and ii) duration of the emotional episode. Using a large public dataset of 2,176 manually annotated YouTube videos, we found that while multimodal features consistently outperformed bimodal and unimodal features, their performance varied significantly across different emotions, gender and duration contexts. Multimodal features performed particularly better for male speakers in recognizing most emotions. Furthermore, multimodal features performed particularly better for shorter than for longer videos in recognizing neutral and happiness, but not sadness and anger. These findings offer new insights towards the development of more context-aware emotion recognition and empathetic systems.
Submitted 30 June, 2021; v1 submitted 28 April, 2020;
originally announced April 2020.