-
Turkish Delights: a Dataset on Turkish Euphemisms
Authors:
Hasan Can Biyik,
Patrick Lee,
Anna Feldman
Abstract:
Euphemisms are a form of figurative language relatively understudied in natural language processing. This research extends the current computational work on potentially euphemistic terms (PETs) to Turkish. We introduce the Turkish PET dataset, the first available of its kind in the field. By creating a list of euphemisms in Turkish, collecting example contexts, and annotating them, we provide both…
▽ More
Euphemisms are a form of figurative language relatively understudied in natural language processing. This research extends the current computational work on potentially euphemistic terms (PETs) to Turkish. We introduce the Turkish PET dataset, the first available of its kind in the field. By creating a list of euphemisms in Turkish, collecting example contexts, and annotating them, we provide both euphemistic and non-euphemistic examples of PETs in Turkish. We describe the dataset and methodologies, and also experiment with transformer-based models on Turkish euphemism detection by using our dataset for binary classification. We compare performances across models using F1, accuracy, and precision as evaluation metrics.
△ Less
Submitted 17 July, 2024;
originally announced July 2024.
-
Quantum enhanced distributed phase sensing with a truncated SU(1,1) interferometer
Authors:
Seongjin Hong,
Matthew A. Feldman,
Claire E. Marvinney,
Donghwa Lee,
Changhyoup Lee,
Michael T. Febbraro,
Alberto M. Marino,
Raphael C. Pooser
Abstract:
In recent years, distributed quantum sensing has gained interest for a range of applications requiring networks of sensors, from global-scale clock synchronization to high energy physics. In particular, a network of entangled sensors can improve not only the sensitivity beyond the shot noise limit, but also enable a Heisenberg scaling with the number of sensors. Here, using bright entangled twin b…
▽ More
In recent years, distributed quantum sensing has gained interest for a range of applications requiring networks of sensors, from global-scale clock synchronization to high energy physics. In particular, a network of entangled sensors can improve not only the sensitivity beyond the shot noise limit, but also enable a Heisenberg scaling with the number of sensors. Here, using bright entangled twin beams, we theoretically and experimentally demonstrate the detection of a linear combination of two distributed phases beyond the shot noise limit with a truncated SU(1,1) interferometer. We experimentally demonstrate a quantum noise reduction of 1.7 dB and a classical 3 dB signal-to-noise ratio improvement over the separable sensing approach involving two truncated SU(1,1) interferometers. Additionally, we theoretically extend the use of a truncated SU(1,1) interferometer to a multi-phase-distributed sensing scheme that leverages entanglement as a resource to achieve a quantum improvement in the scaling with the number of sensors in the network. Our results pave the way for developing quantum enhanced sensor networks that can achieve an entanglement-enhanced sensitivity.
△ Less
Submitted 25 March, 2024;
originally announced March 2024.
-
FeatUp: A Model-Agnostic Framework for Features at Any Resolution
Authors:
Stephanie Fu,
Mark Hamilton,
Laura Brandt,
Axel Feldman,
Zhoutong Zhang,
William T. Freeman
Abstract:
Deep features are a cornerstone of computer vision research, capturing image semantics and enabling the community to solve downstream tasks even in the zero- or few-shot regime. However, these features often lack the spatial resolution to directly perform dense prediction tasks like segmentation and depth prediction because models aggressively pool information over large areas. In this work, we in…
▽ More
Deep features are a cornerstone of computer vision research, capturing image semantics and enabling the community to solve downstream tasks even in the zero- or few-shot regime. However, these features often lack the spatial resolution to directly perform dense prediction tasks like segmentation and depth prediction because models aggressively pool information over large areas. In this work, we introduce FeatUp, a task- and model-agnostic framework to restore lost spatial information in deep features. We introduce two variants of FeatUp: one that guides features with high-resolution signal in a single forward pass, and one that fits an implicit model to a single image to reconstruct features at any resolution. Both approaches use a multi-view consistency loss with deep analogies to NeRFs. Our features retain their original semantics and can be swapped into existing applications to yield resolution and performance gains even without re-training. We show that FeatUp significantly outperforms other feature upsampling and image super-resolution approaches in class activation map generation, transfer learning for segmentation and depth prediction, and end-to-end training for semantic segmentation.
△ Less
Submitted 1 April, 2024; v1 submitted 15 March, 2024;
originally announced March 2024.
-
Evaluating Embeddings for One-Shot Classification of Doctor-AI Consultations
Authors:
Olumide Ebenezer Ojo,
Olaronke Oluwayemisi Adebanji,
Alexander Gelbukh,
Hiram Calvo,
Anna Feldman
Abstract:
Effective communication between healthcare providers and patients is crucial to providing high-quality patient care. In this work, we investigate how Doctor-written and AI-generated texts in healthcare consultations can be classified using state-of-the-art embeddings and one-shot classification systems. By analyzing embeddings such as bag-of-words, character n-grams, Word2Vec, GloVe, fastText, and…
▽ More
Effective communication between healthcare providers and patients is crucial to providing high-quality patient care. In this work, we investigate how Doctor-written and AI-generated texts in healthcare consultations can be classified using state-of-the-art embeddings and one-shot classification systems. By analyzing embeddings such as bag-of-words, character n-grams, Word2Vec, GloVe, fastText, and GPT2 embeddings, we examine how well our one-shot classification systems capture semantic information within medical consultations. Results show that the embeddings are capable of capturing semantic features from text in a reliable and adaptable manner. Overall, Word2Vec, GloVe and Character n-grams embeddings performed well, indicating their suitability for modeling targeted to this task. GPT2 embedding also shows notable performance, indicating its suitability for models tailored to this task as well. Our machine learning architectures significantly improved the quality of health conversations when training data are scarce, improving communication between patients and healthcare providers.
△ Less
Submitted 6 February, 2024;
originally announced February 2024.
-
MEDs for PETs: Multilingual Euphemism Disambiguation for Potentially Euphemistic Terms
Authors:
Patrick Lee,
Alain Chirino Trujillo,
Diana Cuevas Plancarte,
Olumide Ebenezer Ojo,
Xinyi Liu,
Iyanuoluwa Shode,
Yuan Zhao,
Jing Peng,
Anna Feldman
Abstract:
This study investigates the computational processing of euphemisms, a universal linguistic phenomenon, across multiple languages. We train a multilingual transformer model (XLM-RoBERTa) to disambiguate potentially euphemistic terms (PETs) in multilingual and cross-lingual settings. In line with current trends, we demonstrate that zero-shot learning across languages takes place. We also show cases…
▽ More
This study investigates the computational processing of euphemisms, a universal linguistic phenomenon, across multiple languages. We train a multilingual transformer model (XLM-RoBERTa) to disambiguate potentially euphemistic terms (PETs) in multilingual and cross-lingual settings. In line with current trends, we demonstrate that zero-shot learning across languages takes place. We also show cases where multilingual models perform better on the task compared to monolingual models by a statistically significant margin, indicating that multilingual data presents additional opportunities for models to learn about cross-lingual, computational properties of euphemisms. In a follow-up analysis, we focus on universal euphemistic "categories" such as death and bodily functions among others. We test to see whether cross-lingual data of the same domain is more important than within-language data of other domains to further understand the nature of the cross-lingual transfer.
△ Less
Submitted 25 January, 2024;
originally announced January 2024.
-
MedAI Dialog Corpus (MEDIC): Zero-Shot Classification of Doctor and AI Responses in Health Consultations
Authors:
Olumide E. Ojo,
Olaronke O. Adebanji,
Alexander Gelbukh,
Hiram Calvo,
Anna Feldman
Abstract:
Zero-shot classification enables text to be classified into classes not seen during training. In this study, we examine the efficacy of zero-shot learning models in classifying healthcare consultation responses from Doctors and AI systems. The models evaluated include BART, BERT, XLM, XLM-R and DistilBERT. The models were tested on three different datasets based on a binary and multi-label analysi…
▽ More
Zero-shot classification enables text to be classified into classes not seen during training. In this study, we examine the efficacy of zero-shot learning models in classifying healthcare consultation responses from Doctors and AI systems. The models evaluated include BART, BERT, XLM, XLM-R and DistilBERT. The models were tested on three different datasets based on a binary and multi-label analysis to identify the origins of text in health consultations without any prior corpus training. According to our findings, the zero-shot language models show a good understanding of language generally, but has limitations when trying to classify doctor and AI responses to healthcare consultations. This research provides a foundation for future research in the field of medical text classification by informing the development of more accurate methods of classifying text written by Doctors and AI systems in health consultations.
△ Less
Submitted 12 January, 2024; v1 submitted 19 October, 2023;
originally announced October 2023.
-
Legend at ArAIEval Shared Task: Persuasion Technique Detection using a Language-Agnostic Text Representation Model
Authors:
Olumide E. Ojo,
Olaronke O. Adebanji,
Hiram Calvo,
Damian O. Dieke,
Olumuyiwa E. Ojo,
Seye E. Akinsanya,
Tolulope O. Abiola,
Anna Feldman
Abstract:
In this paper, we share our best performing submission to the Arabic AI Tasks Evaluation Challenge (ArAIEval) at ArabicNLP 2023. Our focus was on Task 1, which involves identifying persuasion techniques in excerpts from tweets and news articles. The persuasion technique in Arabic texts was detected using a training loop with XLM-RoBERTa, a language-agnostic text representation model. This approach…
▽ More
In this paper, we share our best performing submission to the Arabic AI Tasks Evaluation Challenge (ArAIEval) at ArabicNLP 2023. Our focus was on Task 1, which involves identifying persuasion techniques in excerpts from tweets and news articles. The persuasion technique in Arabic texts was detected using a training loop with XLM-RoBERTa, a language-agnostic text representation model. This approach proved to be potent, leveraging fine-tuning of a multilingual language model. In our evaluation of the test set, we achieved a micro F1 score of 0.64 for subtask A of the competition.
△ Less
Submitted 14 October, 2023;
originally announced October 2023.
-
Guarantees on Robot System Performance Using Stochastic Simulation Rollouts
Authors:
Joseph A. Vincent,
Aaron O. Feldman,
Mac Schwager
Abstract:
We provide finite-sample performance guarantees for control policies executed on stochastic robotic systems. Given an open- or closed-loop policy and a finite set of trajectory rollouts under the policy, we bound the expected value, value-at-risk, and conditional-value-at-risk of the trajectory cost, and the probability of failure in a sparse cost setting. The bounds hold, with user-specified prob…
▽ More
We provide finite-sample performance guarantees for control policies executed on stochastic robotic systems. Given an open- or closed-loop policy and a finite set of trajectory rollouts under the policy, we bound the expected value, value-at-risk, and conditional-value-at-risk of the trajectory cost, and the probability of failure in a sparse cost setting. The bounds hold, with user-specified probability, for any policy synthesis technique and can be seen as a post-design safety certification. Generating the bounds only requires sampling simulation rollouts, without assumptions on the distribution or complexity of the underlying stochastic system. We adapt these bounds to also give a constraint satisfaction test to verify safety of the robot system. We provide a thorough analysis of the bound sensitivity to sim-to-real distribution shifts and provide results for constructing robust bounds that can tolerate some specified amount of distribution shift. Furthermore, we extend our method to apply when selecting the best policy from a set of candidates, requiring a multi-hypothesis correction. We show the statistical validity of our bounds in the Ant, Half-cheetah, and Swimmer MuJoCo environments and demonstrate our constraint satisfaction test with the Ant. Finally, using the 20 degree-of-freedom MuJoCo Shadow Hand, we show the necessity of the multi-hypothesis correction.
△ Less
Submitted 13 June, 2024; v1 submitted 19 September, 2023;
originally announced September 2023.
-
Jais and Jais-chat: Arabic-Centric Foundation and Instruction-Tuned Open Generative Large Language Models
Authors:
Neha Sengupta,
Sunil Kumar Sahu,
Bokang Jia,
Satheesh Katipomu,
Haonan Li,
Fajri Koto,
William Marshall,
Gurpreet Gosal,
Cynthia Liu,
Zhiming Chen,
Osama Mohammed Afzal,
Samta Kamboj,
Onkar Pandit,
Rahul Pal,
Lalit Pradhan,
Zain Muhammad Mujahid,
Massa Baali,
Xudong Han,
Sondos Mahmoud Bsharat,
Alham Fikri Aji,
Zhiqiang Shen,
Zhengzhong Liu,
Natalia Vassilieva,
Joel Hestness,
Andy Hock
, et al. (7 additional authors not shown)
Abstract:
We introduce Jais and Jais-chat, new state-of-the-art Arabic-centric foundation and instruction-tuned open generative large language models (LLMs). The models are based on the GPT-3 decoder-only architecture and are pretrained on a mixture of Arabic and English texts, including source code in various programming languages. With 13 billion parameters, they demonstrate better knowledge and reasoning…
▽ More
We introduce Jais and Jais-chat, new state-of-the-art Arabic-centric foundation and instruction-tuned open generative large language models (LLMs). The models are based on the GPT-3 decoder-only architecture and are pretrained on a mixture of Arabic and English texts, including source code in various programming languages. With 13 billion parameters, they demonstrate better knowledge and reasoning capabilities in Arabic than any existing open Arabic and multilingual models by a sizable margin, based on extensive evaluation. Moreover, the models are competitive in English compared to English-centric open models of similar size, despite being trained on much less English data. We provide a detailed description of the training, the tuning, the safety alignment, and the evaluation of the models. We release two open versions of the model -- the foundation Jais model, and an instruction-tuned Jais-chat variant -- with the aim of promoting research on Arabic LLMs. Available at https://huggingface.co/inception-mbzuai/jais-13b-chat
△ Less
Submitted 29 September, 2023; v1 submitted 30 August, 2023;
originally announced August 2023.
-
FEED PETs: Further Experimentation and Expansion on the Disambiguation of Potentially Euphemistic Terms
Authors:
Patrick Lee,
Iyanuoluwa Shode,
Alain Chirino Trujillo,
Yuan Zhao,
Olumide Ebenezer Ojo,
Diana Cuevas Plancarte,
Anna Feldman,
Jing Peng
Abstract:
Transformers have been shown to work well for the task of English euphemism disambiguation, in which a potentially euphemistic term (PET) is classified as euphemistic or non-euphemistic in a particular context. In this study, we expand on the task in two ways. First, we annotate PETs for vagueness, a linguistic property associated with euphemisms, and find that transformers are generally better at…
▽ More
Transformers have been shown to work well for the task of English euphemism disambiguation, in which a potentially euphemistic term (PET) is classified as euphemistic or non-euphemistic in a particular context. In this study, we expand on the task in two ways. First, we annotate PETs for vagueness, a linguistic property associated with euphemisms, and find that transformers are generally better at classifying vague PETs, suggesting linguistic differences in the data that impact performance. Second, we present novel euphemism corpora in three different languages: Yoruba, Spanish, and Mandarin Chinese. We perform euphemism disambiguation experiments in each language using multilingual transformer models mBERT and XLM-RoBERTa, establishing preliminary results from which to launch future work.
△ Less
Submitted 6 June, 2023; v1 submitted 31 May, 2023;
originally announced June 2023.
-
NollySenti: Leveraging Transfer Learning and Machine Translation for Nigerian Movie Sentiment Classification
Authors:
Iyanuoluwa Shode,
David Ifeoluwa Adelani,
Jing Peng,
Anna Feldman
Abstract:
Africa has over 2000 indigenous languages but they are under-represented in NLP research due to lack of datasets. In recent years, there have been progress in developing labeled corpora for African languages. However, they are often available in a single domain and may not generalize to other domains. In this paper, we focus on the task of sentiment classification for cross domain adaptation. We c…
▽ More
Africa has over 2000 indigenous languages but they are under-represented in NLP research due to lack of datasets. In recent years, there have been progress in developing labeled corpora for African languages. However, they are often available in a single domain and may not generalize to other domains. In this paper, we focus on the task of sentiment classification for cross domain adaptation. We create a new dataset, NollySenti - based on the Nollywood movie reviews for five languages widely spoken in Nigeria (English, Hausa, Igbo, Nigerian-Pidgin, and Yoruba. We provide an extensive empirical evaluation using classical machine learning methods and pre-trained language models. Leveraging transfer learning, we compare the performance of cross-domain adaptation from Twitter domain, and cross-lingual adaptation from English language. Our evaluation shows that transfer from English in the same target domain leads to more than 5% improvement in accuracy compared to transfer from Twitter in the same language. To further mitigate the domain difference, we leverage machine translation (MT) from English to other Nigerian languages, which leads to a further improvement of 7% over cross-lingual evaluation. While MT to low-resource languages are often of low quality, through human evaluation, we show that most of the translated sentences preserve the sentiment of the original English reviews.
△ Less
Submitted 22 August, 2023; v1 submitted 18 May, 2023;
originally announced May 2023.
-
Analyzing the Large-Scale Bulk Flow using CosmicFlows4: Increasing Tension with the Standard Cosmological Model
Authors:
Richard Watkins,
Trey Allen,
Collin James Bradford,
Albert Ramon Jr.,
Alexandra Walker,
Hume A. Feldman,
Rachel Cionitti,
Yara Al-Shorman,
Ehsan Kourkchi,
R. Brent Tully
Abstract:
We present an estimate of the bulk flow in a volume of radii $150-200h^{-1}$Mpc using the minimum variance (MV) method with data from the CosmicFlows-4 (CF4) catalog. The addition of new data in the CF4 has resulted in an increase in the estimate of the bulk flow in a sphere of radius $150h^{-1}$Mpc relative to the CosmicFlows-3 (CF3). This bulk flow has less than a $0.03\%$ chance of occurring in…
▽ More
We present an estimate of the bulk flow in a volume of radii $150-200h^{-1}$Mpc using the minimum variance (MV) method with data from the CosmicFlows-4 (CF4) catalog. The addition of new data in the CF4 has resulted in an increase in the estimate of the bulk flow in a sphere of radius $150h^{-1}$Mpc relative to the CosmicFlows-3 (CF3). This bulk flow has less than a $0.03\%$ chance of occurring in the Standard Cosmological Model ($Λ$CDM) with cosmic microwave background derived parameters. Given that the CF4 is deeper than the CF3, we were able to use the CF4 to accurately estimate the bulk flow on scales of $200h^{-1}$Mpc (equivalent to 266 Mpc for Hubble constant $H_o=75$ km/s/Mpc) for the first time. This bulk flow is in even greater tension with the Standard Model, having less than $0.003\%$ probability of occurring. To estimate the bulk flow accurately, we introduce a novel method to calculate distances and velocities from distance moduli that is unbiased and accurate at all distances. Our results are completely independent of the value of $H_o$.
△ Less
Submitted 3 February, 2023;
originally announced February 2023.
-
Combining quantum noise reduction resources: a practical approach
Authors:
Sohitri Ghosh,
Matthew A. Feldman,
Seongjin Hong,
Claire Marvinney,
Raphael Pooser,
Jacob M. Taylor
Abstract:
Optomechanical sensors are capable of transducing external perturbations to resolvable optical signals. A particular regime of interest is that of high-bandwidth force detection, where an impulse is delivered to the system over a short period of time. Exceedingly sensitive impulse detection has been proposed to observe very weak signals like those for long range interactions with dark matter requi…
▽ More
Optomechanical sensors are capable of transducing external perturbations to resolvable optical signals. A particular regime of interest is that of high-bandwidth force detection, where an impulse is delivered to the system over a short period of time. Exceedingly sensitive impulse detection has been proposed to observe very weak signals like those for long range interactions with dark matter requiring much higher sensitivities than current sensors can provide. Quantum resources to go beyond the standard quantum limit of noise in these sensors include squeezing of the light used to transduce the signal, backaction evasion by measuring the optimum quadrature, and quantum nondemolition (QND) measurements which reduce backaction directly. However, it has been extremely difficult to determine a scheme where all these quantum resources contribute to noise reduction thereby exceeding the benefit of using only one quantum resource alone. We provide the theoretical limits to noise reduction while combining quantum enhanced readout techniques such as squeezing and QND measurements for these optomechanical sensors. We demonstrate that backaction evasion through QND techniques dramatically reduces the technical challenges presented when using squeezed light for broadband force detection, paving the way for combining multiple quantum noise reduction techniques for enhanced sensitivity in the context of impulse metrology.
△ Less
Submitted 25 November, 2022;
originally announced November 2022.
-
A Report on the Euphemisms Detection Shared Task
Authors:
Patrick Lee,
Anna Feldman,
Jing Peng
Abstract:
This paper presents The Shared Task on Euphemism Detection for the Third Workshop on Figurative Language Processing (FigLang 2022) held in conjunction with EMNLP 2022. Participants were invited to investigate the euphemism detection task: given input text, identify whether it contains a euphemism. The input data is a corpus of sentences containing potentially euphemistic terms (PETs) collected fro…
▽ More
This paper presents The Shared Task on Euphemism Detection for the Third Workshop on Figurative Language Processing (FigLang 2022) held in conjunction with EMNLP 2022. Participants were invited to investigate the euphemism detection task: given input text, identify whether it contains a euphemism. The input data is a corpus of sentences containing potentially euphemistic terms (PETs) collected from the GloWbE corpus (Davies and Fuchs, 2015), and are human-annotated as containing either a euphemistic or literal usage of a PET. In this paper, we present the results and analyze the common themes, methods and findings of the participating teams
△ Less
Submitted 3 December, 2022; v1 submitted 23 November, 2022;
originally announced November 2022.
-
A Quantum Algorithm for Computing All Diagnoses of a Switching Circuit
Authors:
Alexander Feldman,
Johan de Kleer,
Ion Matei
Abstract:
Faults are stochastic by nature while most man-made systems, and especially computers, work deterministically. This necessitates the linking of probability theory with mathematical logics, automata, and switching circuit theory. This paper provides such a connecting via quantum information theory which is an intuitive approach as quantum physics obeys probability laws. In this paper we provide a n…
▽ More
Faults are stochastic by nature while most man-made systems, and especially computers, work deterministically. This necessitates the linking of probability theory with mathematical logics, automata, and switching circuit theory. This paper provides such a connecting via quantum information theory which is an intuitive approach as quantum physics obeys probability laws. In this paper we provide a novel approach for computing diagnosis of switching circuits with gate-based quantum computers. The approach is based on the idea of putting the qubits representing faults in superposition and compute all, often exponentially many, diagnoses simultaneously. We empirically compare the quantum algorithm for diagnostics to an approach based on SAT and model-counting. For a benchmark of combinational circuits we establish an error of less than one percent in estimating the true probability of faults.
△ Less
Submitted 8 September, 2022;
originally announced September 2022.
-
Searching for PETs: Using Distributional and Sentiment-Based Methods to Find Potentially Euphemistic Terms
Authors:
Patrick Lee,
Martha Gavidia,
Anna Feldman,
Jing Peng
Abstract:
This paper presents a linguistically driven proof of concept for finding potentially euphemistic terms, or PETs. Acknowledging that PETs tend to be commonly used expressions for a certain range of sensitive topics, we make use of distributional similarities to select and filter phrase candidates from a sentence and rank them using a set of simple sentiment-based metrics. We present the results of…
▽ More
This paper presents a linguistically driven proof of concept for finding potentially euphemistic terms, or PETs. Acknowledging that PETs tend to be commonly used expressions for a certain range of sensitive topics, we make use of distributional similarities to select and filter phrase candidates from a sentence and rank them using a set of simple sentiment-based metrics. We present the results of our approach tested on a corpus of sentences containing euphemisms, demonstrating its efficacy for detecting single and multi-word PETs from a broad range of topics. We also discuss future potential for sentiment-based methods on this task.
△ Less
Submitted 20 May, 2022;
originally announced May 2022.
-
CATs are Fuzzy PETs: A Corpus and Analysis of Potentially Euphemistic Terms
Authors:
Martha Gavidia,
Patrick Lee,
Anna Feldman,
Jing Peng
Abstract:
Euphemisms have not received much attention in natural language processing, despite being an important element of polite and figurative language. Euphemisms prove to be a difficult topic, not only because they are subject to language change, but also because humans may not agree on what is a euphemism and what is not. Nevertheless, the first step to tackling the issue is to collect and analyze exa…
▽ More
Euphemisms have not received much attention in natural language processing, despite being an important element of polite and figurative language. Euphemisms prove to be a difficult topic, not only because they are subject to language change, but also because humans may not agree on what is a euphemism and what is not. Nevertheless, the first step to tackling the issue is to collect and analyze examples of euphemisms. We present a corpus of potentially euphemistic terms (PETs) along with example texts from the GloWbE corpus. Additionally, we present a subcorpus of texts where these PETs are not being used euphemistically, which may be useful for future applications. We also discuss the results of multiple analyses run on the corpus. Firstly, we find that sentiment analysis on the euphemistic texts supports that PETs generally decrease negative and offensive sentiment. Secondly, we observe cases of disagreement in an annotation task, where humans are asked to label PETs as euphemistic or not in a subset of our corpus text examples. We attribute the disagreement to a variety of potential reasons, including if the PET was a commonly accepted term (CAT).
△ Less
Submitted 5 May, 2022;
originally announced May 2022.
-
yosm: A new yoruba sentiment corpus for movie reviews
Authors:
Iyanuoluwa Shode,
David Ifeoluwa Adelani,
Anna Feldman
Abstract:
A movie that is thoroughly enjoyed and recommended by an individual might be hated by another. One characteristic of humans is the ability to have feelings which could be positive or negative. To automatically classify and study human feelings, an aspect of natural language processing, sentiment analysis and opinion mining were designed to understand human feelings regarding several issues which c…
▽ More
A movie that is thoroughly enjoyed and recommended by an individual might be hated by another. One characteristic of humans is the ability to have feelings which could be positive or negative. To automatically classify and study human feelings, an aspect of natural language processing, sentiment analysis and opinion mining were designed to understand human feelings regarding several issues which could affect a product, a social media platforms, government, or societal discussions or even movies. Several works on sentiment analysis have been done on high resource languages while low resources languages like Yoruba have been sidelined. Due to the scarcity of datasets and linguistic architectures that will suit low resource languages, African languages "low resource languages" have been ignored and not fully explored. For this reason, our attention is placed on Yoruba to explore sentiment analysis on reviews of Nigerian movies. The data comprised 1500 movie reviews that were sourced from IMDB, Rotten Tomatoes, Letterboxd, Cinemapointer and Nollyrated. We develop sentiment classification models using the state-of-the-art pre-trained language models like mBERT and AfriBERTa to classify the movie reviews.
△ Less
Submitted 20 April, 2022;
originally announced April 2022.
-
Snowmass 2021 White Paper: The Windchime Project
Authors:
The Windchime Collaboration,
Alaina Attanasio,
Sunil A. Bhave,
Carlos Blanco,
Daniel Carney,
Marcel Demarteau,
Bahaa Elshimy,
Michael Febbraro,
Matthew A. Feldman,
Sohitri Ghosh,
Abby Hickin,
Seongjin Hong,
Rafael F. Lang,
Benjamin Lawrie,
Shengchao Li,
Zhen Liu,
Juan P. A. Maldonado,
Claire Marvinney,
Hein Zay Yar Oo,
Yun-Yi Pai,
Raphael Pooser,
Juehang Qin,
Tobias J. Sparmann,
Jacob M. Taylor,
Hao Tian
, et al. (1 additional authors not shown)
Abstract:
The absence of clear signals from particle dark matter in direct detection experiments motivates new approaches in disparate regions of viable parameter space. In this Snowmass white paper, we outline the Windchime project, a program to build a large array of quantum-enhanced mechanical sensors. The ultimate aim is to build a detector capable of searching for Planck mass-scale dark matter purely t…
▽ More
The absence of clear signals from particle dark matter in direct detection experiments motivates new approaches in disparate regions of viable parameter space. In this Snowmass white paper, we outline the Windchime project, a program to build a large array of quantum-enhanced mechanical sensors. The ultimate aim is to build a detector capable of searching for Planck mass-scale dark matter purely through its gravitational coupling to ordinary matter. In the shorter term, we aim to search for a number of other physics targets, especially some ultralight dark matter candidates. Here, we discuss the basic design, open R&D challenges and opportunities, current experimental efforts, and both short- and long-term physics targets of the Windchime project.
△ Less
Submitted 14 March, 2022;
originally announced March 2022.
-
Findings of the NLP4IF-2021 Shared Tasks on Fighting the COVID-19 Infodemic and Censorship Detection
Authors:
Shaden Shaar,
Firoj Alam,
Giovanni Da San Martino,
Alex Nikolov,
Wajdi Zaghouani,
Preslav Nakov,
Anna Feldman
Abstract:
We present the results and the main findings of the NLP4IF-2021 shared tasks. Task 1 focused on fighting the COVID-19 infodemic in social media, and it was offered in Arabic, Bulgarian, and English. Given a tweet, it asked to predict whether that tweet contains a verifiable claim, and if so, whether it is likely to be false, is of general interest, is likely to be harmful, and is worthy of manual…
▽ More
We present the results and the main findings of the NLP4IF-2021 shared tasks. Task 1 focused on fighting the COVID-19 infodemic in social media, and it was offered in Arabic, Bulgarian, and English. Given a tweet, it asked to predict whether that tweet contains a verifiable claim, and if so, whether it is likely to be false, is of general interest, is likely to be harmful, and is worthy of manual fact-checking; also, whether it is harmful to society, and whether it requires the attention of policy makers. Task~2 focused on censorship detection, and was offered in Chinese. A total of ten teams submitted systems for task 1, and one team participated in task 2; nine teams also submitted a system description paper. Here, we present the tasks, analyze the results, and discuss the system submissions and the methods they used. Most submissions achieved sizable improvements over several baselines, and the best systems used pre-trained Transformers and ensembles. The data, the scorers and the leaderboards for the tasks are available at http://gitlab.com/NLP4IF/nlp4if-2021.
△ Less
Submitted 23 September, 2021;
originally announced September 2021.
-
Improved Methods for Estimating Peculiar Velocity Correlation Functions Using Volume Weighting
Authors:
Yuyu Wang,
Sarah Peery,
Hume A. Feldman,
Richard Watkins
Abstract:
We present an improved method for calculating the parallel and perpendicular velocity correlation functions directly from peculiar velocity surveys using weighted maximum-likelihood estimators. A central feature of the new method is the use of position-dependent weighting scheme that reduces the influence of nearby galaxies, which are typically overrepresented relative to the more distant galaxies…
▽ More
We present an improved method for calculating the parallel and perpendicular velocity correlation functions directly from peculiar velocity surveys using weighted maximum-likelihood estimators. A central feature of the new method is the use of position-dependent weighting scheme that reduces the influence of nearby galaxies, which are typically overrepresented relative to the more distant galaxies in most surveys. We demonstrate that the correlation functions calculated this way are less susceptible to biases due to our particular location in the Universe, and thus are more easily comparable to linear theory and between surveys. Our results suggest that the parallel velocity correlation function is a promising cosmological probe, given that it provides a better approximation of a Gaussian distribution than other velocity correlation functions and that its bias is more easily minimized by weighting. Though the position weighted parallel velocity correlation function increases the statistical uncertainty, it decreases the cosmic variance and is expected to provide more stable and tighter cosmological parameter constraints than other correlation methods in conjunction with more precise velocity surveys in the future.
△ Less
Submitted 18 August, 2021;
originally announced August 2021.
-
Magnetostriction of α-RuCl3 flakes in the zigzag phase
Authors:
Yun-Yi Pai,
Claire E. Marvinney,
Matthew A. Feldman,
Brian Lerner,
Yoong Sheng Phang,
Kai Xiao,
Jiaqiang Yan,
Liangbo Liang,
Matthew Brahlek,
Benjamin J. Lawrie
Abstract:
Motivated by the possibility of an intermediate U(1) quantum spin liquid phase in out-of-plane magnetic fields and enhanced magnetic fluctuations in exfoliated α-RuCl3 flakes, we study magneto-Raman spectra of exfoliated multilayer α-RuCl3 in out-of-plane magnetic fields of -6 T to 6 T at temperatures of 670 mK - 4 K. While the literature currently suggests that bulk α-RuCl3 is in an antiferromagn…
▽ More
Motivated by the possibility of an intermediate U(1) quantum spin liquid phase in out-of-plane magnetic fields and enhanced magnetic fluctuations in exfoliated α-RuCl3 flakes, we study magneto-Raman spectra of exfoliated multilayer α-RuCl3 in out-of-plane magnetic fields of -6 T to 6 T at temperatures of 670 mK - 4 K. While the literature currently suggests that bulk α-RuCl3 is in an antiferromagnetic zigzag phase with R3bar symmetry at low temperature, we do not observe R3bar symmetry in exfoliated α-RuCl3 at low temperatures. While we saw no magnetic field driven transitions, the Raman modes exhibit unexpected stochastic shifts in response to applied magnetic field that are above the uncertainties inferred from Bayesian analysis. These stochastic shifts are consistent with the emergence of magnetostrictive interactions in exfoliated α-RuCl3.
△ Less
Submitted 2 May, 2021;
originally announced May 2021.
-
Multifunctional Superconducting Nanowire Quantum Sensors
Authors:
Benjamin J Lawrie,
Claire E. Marvinney,
Yun-Yi Pai,
Matthew A. Feldman,
Jie Zhang,
Aaron J. Miller,
Chengyun Hua,
Eugene Dumitrescu,
Gábor B. Halász
Abstract:
Superconducting nanowire single photon detectors (SNSPDs) offer high-quantum-efficiency and low-dark-count-rate single photon detection. In a growing number of cases, large magnetic fields are being incorporated into quantum microscopes, nanophotonic devices, and sensors for nuclear and high-energy physics that rely on SNSPDs, but superconducting devices generally operate poorly in large magnetic…
▽ More
Superconducting nanowire single photon detectors (SNSPDs) offer high-quantum-efficiency and low-dark-count-rate single photon detection. In a growing number of cases, large magnetic fields are being incorporated into quantum microscopes, nanophotonic devices, and sensors for nuclear and high-energy physics that rely on SNSPDs, but superconducting devices generally operate poorly in large magnetic fields. Here, we demonstrate robust performance of amorphous SNSPDs in magnetic fields of up to $\pm 6$ T with a negligible dark count rate and unchanged quantum efficiency at typical bias currents. Critically, we also show that in the electrothermal oscillation regime, the SNSPD can be used as a magnetometer with sensitivity of better than 100 $\mathrm{μT/\sqrt{Hz}}$ and as a thermometer with sensitivity of 20 $\mathrm{μK/\sqrt{Hz}}$ at 1 K. Thus, a single photon detector integrated into a quantum device can be used as a multifunctional quantum sensor capable of describing the temperature and magnetic field on-chip simply by varying the bias current to change the operating modality from single photon detection to thermometry or magnetometry.
△ Less
Submitted 17 March, 2021;
originally announced March 2021.
-
Peculiar Velocity Estimation from Kinetic SZ Effect using Deep Neural Networks
Authors:
Yuyu Wang,
Nesar Ramachandra,
Edgar M. Salazar-Canizales,
Hume A. Feldman,
Richard Watkins,
Klaus Dolag
Abstract:
The Sunyaev-Zel'dolvich (SZ) effect is expected to be instrumental in measuring velocities of distant clusters in near future telescope surveys. We simplify the calculation of peculiar velocities of galaxy clusters using deep learning frameworks trained on numerical simulations to avoid the estimation of the optical depth. The image of distorted photon backgrounds are generated for idealized obser…
▽ More
The Sunyaev-Zel'dolvich (SZ) effect is expected to be instrumental in measuring velocities of distant clusters in near future telescope surveys. We simplify the calculation of peculiar velocities of galaxy clusters using deep learning frameworks trained on numerical simulations to avoid the estimation of the optical depth. The image of distorted photon backgrounds are generated for idealized observations using one of the largest cosmological hydrodynamical simulations, the Magneticum simulations. The model is tested to be capable peculiar velocities from future kinetic SZ observations under different noise conditions. The deep learning algorithm displays robustness in estimating peculiar velocities from kinetic SZ effect by an improvement in accuracy of about 17% compared to the analytical approach.
△ Less
Submitted 8 October, 2020;
originally announced October 2020.
-
Unsupervised Region-based Anomaly Detection in Brain MRI with Adversarial Image Inpainting
Authors:
Bao Nguyen,
Adam Feldman,
Sarath Bethapudi,
Andrew Jennings,
Chris G. Willcocks
Abstract:
Medical segmentation is performed to determine the bounds of regions of interest (ROI) prior to surgery. By allowing the study of growth, structure, and behaviour of the ROI in the planning phase, critical information can be obtained, increasing the likelihood of a successful operation. Usually, segmentations are performed manually or via machine learning methods trained on manual annotations. In…
▽ More
Medical segmentation is performed to determine the bounds of regions of interest (ROI) prior to surgery. By allowing the study of growth, structure, and behaviour of the ROI in the planning phase, critical information can be obtained, increasing the likelihood of a successful operation. Usually, segmentations are performed manually or via machine learning methods trained on manual annotations. In contrast, this paper proposes a fully automatic, unsupervised inpainting-based brain tumour segmentation system for T1-weighted MRI. First, a deep convolutional neural network (DCNN) is trained to reconstruct missing healthy brain regions. Then, upon application, anomalous regions are determined by identifying areas of highest reconstruction loss. Finally, superpixel segmentation is performed to segment those regions. We show the proposed system is able to segment various sized and abstract tumours and achieves a mean and standard deviation Dice score of 0.771 and 0.176, respectively.
△ Less
Submitted 5 October, 2020;
originally announced October 2020.
-
On the Evolution of Subjective Experience
Authors:
Jerome A. Feldman
Abstract:
Subjective Experience (SE) is part of the ancient mind-body problem, which continues to be one of deepest mysteries of science. Despite major advances in many fields, there is still no plausible causal link between SE and its realization in the body. The core issue is the incompatibility of objective (3rd person) public science with subjective (1st person) private experience. Any scientific approa…
▽ More
Subjective Experience (SE) is part of the ancient mind-body problem, which continues to be one of deepest mysteries of science. Despite major advances in many fields, there is still no plausible causal link between SE and its realization in the body. The core issue is the incompatibility of objective (3rd person) public science with subjective (1st person) private experience. Any scientific approach to SE assumes that it arose from extended evolutionary processes and that examining evolutionary history should help us understand it. While the core mystery remains, converging evidence from theoretical, experimental, and computational studies yields strong constraints on SE and some suggestions for further research. All animals confront many of the same fitness challenges. They all need some kind of internal model to relate their life goals and actionable sensed information to action. We understand the evolution of the bodily aspects of human perception and emotion, but not the SE. The first evolutionary evidence for SE appears in vertebrates and much of its neural substrate and simulation mechanism is preserved in mammals and humans. People exhibit the same phenomena, but there are remaining mysteries of everyday experience that are demonstrably incompatible with current neuroscience. In spite of this limitation, there is considerable progress on understanding the role of SE in the success of prostheses.
△ Less
Submitted 25 March, 2022; v1 submitted 18 August, 2020;
originally announced August 2020.
-
Photochromism in a Hexagonal Boron Nitride Single Photon Emitter
Authors:
Matthew A. Feldman,
Claire E. Marvinney,
Alexander A. Puretzky,
Benjamin J. Lawrie
Abstract:
Solid-state single-photon emitters (SPEs) such as the bright, stable, room-temperature defects within hexagonal boron nitride (hBN) are of increasing interest for quantum information science applications. To date, the atomic and electronic origins of SPEs within hBN are not well understood, and no studies have reported photochromism or explored cross-correlations between hBN SPEs. Here, we combine…
▽ More
Solid-state single-photon emitters (SPEs) such as the bright, stable, room-temperature defects within hexagonal boron nitride (hBN) are of increasing interest for quantum information science applications. To date, the atomic and electronic origins of SPEs within hBN are not well understood, and no studies have reported photochromism or explored cross-correlations between hBN SPEs. Here, we combine irradiation-time dependent measures of quantum efficiency and microphotoluminescence ($μ$PL) spectroscopy with two-color Hanbury Brown-Twiss interferometry to enable an investigation of the electronic structure of hBN defects. We identify photochromism in a hBN SPE that exhibits cross-correlations and correlated quantum efficiencies between the emission of its two zero-phonon lines.
△ Less
Submitted 15 September, 2020; v1 submitted 12 August, 2020;
originally announced August 2020.
-
Injectors in $π$-separable groups
Authors:
M. Arroyo-Jordá,
P. Arroyo-Jordá,
R. Dark,
A. D. Feldman,
M. D. Pérez-Ramos
Abstract:
Let $π$ be a set of primes. We show that $π$-separable groups have a conjugacy class of $\mathfrak F$-injectors for suitable Fitting classes $\mathfrak F$, which coincide with the usual ones when specializing to soluble groups.
Let $π$ be a set of primes. We show that $π$-separable groups have a conjugacy class of $\mathfrak F$-injectors for suitable Fitting classes $\mathfrak F$, which coincide with the usual ones when specializing to soluble groups.
△ Less
Submitted 3 August, 2020;
originally announced August 2020.
-
Extension of Carter subgroups in $π$-separable groups
Authors:
M. Arroyo-Jordá,
P. Arroyo-Jordá,
R. Dark,
A. D. Feldman,
M. D. Pérez-Ramos
Abstract:
Let $π$ be a set of primes. We show that $π$-separable groups have a conjugacy class of subgroups which specialize to Carter subgroups, i.e. self-normalizing nilpotent subgroups, or equivalently, nilpotent projectors, when specializing to soluble groups.
Let $π$ be a set of primes. We show that $π$-separable groups have a conjugacy class of subgroups which specialize to Carter subgroups, i.e. self-normalizing nilpotent subgroups, or equivalently, nilpotent projectors, when specializing to soluble groups.
△ Less
Submitted 3 August, 2020;
originally announced August 2020.
-
On a Gravity Dual to Flavored Topological Quantum Mechanics
Authors:
Andrey Feldman
Abstract:
In this paper, we propose a generalization of the $\mathrm{AdS_2/CFT_1}$ correspondence constructed by Mezei, Pufu and Wang in \cite{MezeiPufuWang}, which is the duality between 2d Yang-Mills theory with higher derivatives in the $\mathrm{AdS}_2$ background, and 1d topological quantum mechanics of two adjoint and two fundamental $\mathrm{U}(N)$ fields, governing certain protected sector of operato…
▽ More
In this paper, we propose a generalization of the $\mathrm{AdS_2/CFT_1}$ correspondence constructed by Mezei, Pufu and Wang in \cite{MezeiPufuWang}, which is the duality between 2d Yang-Mills theory with higher derivatives in the $\mathrm{AdS}_2$ background, and 1d topological quantum mechanics of two adjoint and two fundamental $\mathrm{U}(N)$ fields, governing certain protected sector of operators in 3d ABJM theory at the Chern-Simons level $k = 1$. We construct a holographic dual to a flavored generalization of the 1d quantum mechanics considered in \cite{MezeiPufuWang}, which arises as the effective field theory living on the intersection of stacks of $N$ D2-branes and $k$ D6-branes in the $Ω$-background in Type IIA string theory, and describes the dynamics of the protected sector of operators in $\mathcal{N} = 4$ theory with $k$ fundamental hypermultiplets, having a holographic description as M-theory in the $\mathrm{AdS}_4 \times \mathrm{S}^7/\mathbb{Z}_k$ background. We compute the structure constants of the bulk theory gauge group, and construct a map between the observables of the boundary theory and the fields of the bulk theory.
△ Less
Submitted 25 May, 2020;
originally announced May 2020.
-
Hybrid modeling: Applications in real-time diagnosis
Authors:
Ion Matei,
Johan de Kleer,
Alexander Feldman,
Rahul Rai,
Souma Chowdhury
Abstract:
Reduced-order models that accurately abstract high fidelity models and enable faster simulation is vital for real-time, model-based diagnosis applications. In this paper, we outline a novel hybrid modeling approach that combines machine learning inspired models and physics-based models to generate reduced-order models from high fidelity models. We are using such models for real-time diagnosis appl…
▽ More
Reduced-order models that accurately abstract high fidelity models and enable faster simulation is vital for real-time, model-based diagnosis applications. In this paper, we outline a novel hybrid modeling approach that combines machine learning inspired models and physics-based models to generate reduced-order models from high fidelity models. We are using such models for real-time diagnosis applications. Specifically, we have developed machine learning inspired representations to generate reduced order component models that preserve, in part, the physical interpretation of the original high fidelity component models. To ensure the accuracy, scalability and numerical stability of the learning algorithms when training the reduced-order models we use optimization platforms featuring automatic differentiation. Training data is generated by simulating the high-fidelity model. We showcase our approach in the context of fault diagnosis of a rail switch system. Three new model abstractions whose complexities are two orders of magnitude smaller than the complexity of the high fidelity model, both in the number of equations and simulation time are shown. The numerical experiments and results demonstrate the efficacy of the proposed hybrid modeling approach.
△ Less
Submitted 3 March, 2020;
originally announced March 2020.
-
Linguistic Fingerprints of Internet Censorship: the Case of SinaWeibo
Authors:
Kei Yin Ng,
Anna Feldman,
Jing Peng
Abstract:
This paper studies how the linguistic components of blogposts collected from Sina Weibo, a Chinese microblogging platform, might affect the blogposts' likelihood of being censored. Our results go along with King et al. (2013)'s Collective Action Potential (CAP) theory, which states that a blogpost's potential of causing riot or assembly in real life is the key determinant of it getting censored. A…
▽ More
This paper studies how the linguistic components of blogposts collected from Sina Weibo, a Chinese microblogging platform, might affect the blogposts' likelihood of being censored. Our results go along with King et al. (2013)'s Collective Action Potential (CAP) theory, which states that a blogpost's potential of causing riot or assembly in real life is the key determinant of it getting censored. Although there is not a definitive measure of this construct, the linguistic features that we identify as discriminatory go along with the CAP theory. We build a classifier that significantly outperforms non-expert humans in predicting whether a blogpost will be censored. The crowdsourcing results suggest that while humans tend to see censored blogposts as more controversial and more likely to trigger action in real life than the uncensored counterparts, they in general cannot make a better guess than our model when it comes to `reading the mind' of the censors in deciding whether a blogpost should be censored. We do not claim that censorship is only determined by the linguistic features. There are many other factors contributing to censorship decisions. The focus of the present paper is on the linguistic form of blogposts. Our work suggests that it is possible to use linguistic properties of social media posts to automatically predict if they are going to be censored.
△ Less
Submitted 23 January, 2020;
originally announced January 2020.
-
Contemplating electromagnetic phenomena in lived experience through somatic meditation
Authors:
Zosia Krusberg,
Andrew Feldman,
Elam Coalson
Abstract:
One of the objectives of the undergraduate physics curriculum is for students to become aware of the connections between formal physical principles and personal experience. However, research has shown that awareness of connections between the abstract and the experiential tends to deteriorate, sometimes significantly, following instruction in undergraduate physics courses. Although this phenomenon…
▽ More
One of the objectives of the undergraduate physics curriculum is for students to become aware of the connections between formal physical principles and personal experience. However, research has shown that awareness of connections between the abstract and the experiential tends to deteriorate, sometimes significantly, following instruction in undergraduate physics courses. Although this phenomenon has been discussed extensively in the literature, few pedagogical interventions have been designed or implemented to address this particular weakness in undergraduate physics instruction. In this work, we show that a contemplative practice consisting of a somatic meditation and a contemplation deepens students' awareness of the connections between formal physical principles and personal experience by deliberately drawing their attention to electromagnetic phenomena in their surroundings. In this process, students also note interdisciplinary connections between electrodynamic principles and chemical and biological systems. We also find that the contemplative practice awakens an intrinsic motivation to understand electromagnetic theory, as well as an appreciation for the somatic, affective, and cognitive benefits of a contemplative practice.
△ Less
Submitted 4 June, 2022; v1 submitted 23 January, 2020;
originally announced January 2020.
-
Longitudinal structural connectomic and rich-club analysis in adolescent mTBI reveals persistent, distributed brain alterations acutely through to one year post-injury
Authors:
Ai Wern Chung,
Rebekah Mannix,
Henry A. Feldman,
P. Ellen Grant,
Kiho Im
Abstract:
The diffused nature of mild traumatic brain injury (mTBI) impacts brain white-matter pathways with potentially long-term consequences, even after initial symptoms have resolved. To understand post-mTBI recovery in adolescents, longitudinal studies are needed to determine the interplay between highly individualised recovery trajectories and ongoing development. To capture the distributed nature of…
▽ More
The diffused nature of mild traumatic brain injury (mTBI) impacts brain white-matter pathways with potentially long-term consequences, even after initial symptoms have resolved. To understand post-mTBI recovery in adolescents, longitudinal studies are needed to determine the interplay between highly individualised recovery trajectories and ongoing development. To capture the distributed nature of mTBI and recovery, we employ connectomes to probe the brain's structural organisation. We present a diffusion MRI study on adolescent mTBI subjects scanned one day, two weeks and one year after injury with controls. Longitudinal global network changes over time suggests an altered and more 'diffuse' network topology post-injury (specifically lower transitivity and global efficiency). Stratifying the connectome by its back-bone, known as the 'rich-club', these network changes were driven by the 'peripheral' local subnetwork by way of increased network density, fractional anisotropy and decreased diffusivities. This increased structural integrity of the local subnetwork may be to compensate for an injured network, or it may be robust to mTBI and is exhibiting a normal developmental trend. The rich-club also revealed lower diffusivities over time with controls, potentially indicative of longer-term structural ramifications. Our results show evolving, diffuse alterations in adolescent mTBI connectomes beginning acutely and continuing to one year.
△ Less
Submitted 17 September, 2019;
originally announced September 2019.
-
Little String Theories on Curved Manifolds
Authors:
Ofer Aharony,
Mikhail Evtikhiev,
Andrey Feldman
Abstract:
In this paper, we study the 6d Little String Theory (LST) (the decoupled theory on the worldvolume of $N$ NS5-branes) on curved manifolds, by using its holographic duality to Type II string theory in asymptotically linear dilaton backgrounds. We focus on backgrounds with a large number of Killing vectors (namely, products of maximally symmetric spaces), without requiring supersymmetry (we do not t…
▽ More
In this paper, we study the 6d Little String Theory (LST) (the decoupled theory on the worldvolume of $N$ NS5-branes) on curved manifolds, by using its holographic duality to Type II string theory in asymptotically linear dilaton backgrounds. We focus on backgrounds with a large number of Killing vectors (namely, products of maximally symmetric spaces), without requiring supersymmetry (we do not turn on any background fields except the metric). LST is non-local so it is not obvious which spaces it can be defined on; we show that holography implies that the theory cannot be put on negatively curved spaces, but only on spaces with zero or positive curvature. For example, one cannot put LST on a product of an anti-de Sitter space times another space, without turning on extra background fields. On spaces with positive curvature, such as $S^6$, $\mathbb{R}^2\times S^4$, $S^3\times S^3$, etc., we typically find (for large $N$) dual holographic backgrounds which are weakly coupled and weakly curved everywhere, so that they can be well-described by Type II supergravity. In some cases more than one smooth solution exists for LST on the same space, and they all contribute to the partition function. We also study the thermodynamical properties of LST compactified on spheres, finding the leading correction to the Hagedorn behavior of the spectrum, which is different on curved space than on flat space. We discuss the holographic renormalization procedure, which must be implemented in order to get a finite free energy for the LST; we do not know how to implement it for general spaces, but we can (and we do) implement it for the theory compactified on $S^4$.
△ Less
Submitted 28 August, 2019; v1 submitted 7 August, 2019;
originally announced August 2019.
-
Design Space Exploration as Quantified Satisfaction
Authors:
Alexander Feldman,
Johan de Kleer,
Ion Matei
Abstract:
We present novel algorithms for design and design space exploration. The designs discovered by these algorithms are compositions of function types specified in component libraries. Our algorithms reduce the design problem to quantified satisfiability and use advanced solvers to find solutions that represent useful systems.
The algorithms we present in this paper are sound and complete and are gu…
▽ More
We present novel algorithms for design and design space exploration. The designs discovered by these algorithms are compositions of function types specified in component libraries. Our algorithms reduce the design problem to quantified satisfiability and use advanced solvers to find solutions that represent useful systems.
The algorithms we present in this paper are sound and complete and are guaranteed to discover correct designs of optimal size, if they exist. We apply our method to the design of Boolean systems and discover new and more optimal classical digital and quantum circuits for common arithmetic functions such as addition and multiplication.
The performance of our algorithms is evaluated through extensive experimentation. We created a benchmark consisting of specifications of scalable synthetic digital circuits and real-world mirochips. We have generated multiple circuits functionally equivalent to the ones in the benchmark. The quantified satisfiability method shows more than four orders of magnitude speed-up, compared to a generate and test method that enumerates all non-isomorphic circuit topologies.
Our approach generalizes circuit optimization. It uses arbitrary component libraries and has applications to areas such as digital circuit design, diagnostics, abductive reasoning, test vector generation, and combinatorial optimization.
△ Less
Submitted 31 January, 2021; v1 submitted 6 May, 2019;
originally announced May 2019.
-
Segmentation of the Prostatic Gland and the Intraprostatic Lesions on Multiparametic MRI Using Mask-RCNN
Authors:
Zhenzhen Dai,
Eric Carver,
Chang Liu,
Joon Lee,
Aharon Feldman,
Weiwei Zong,
Milan Pantelic,
Mohamed Elshaikh,
Ning Wen
Abstract:
Prostate cancer (PCa) is the most common cancer in men in the United States. Multiparametic magnetic resonance imaging (mp-MRI) has been explored by many researchers to targeted prostate biopsies and radiation therapy. However, assessment on mp-MRI can be subjective, development of computer-aided diagnosis systems to automatically delineate the prostate gland and the intraprostratic lesions (ILs)…
▽ More
Prostate cancer (PCa) is the most common cancer in men in the United States. Multiparametic magnetic resonance imaging (mp-MRI) has been explored by many researchers to targeted prostate biopsies and radiation therapy. However, assessment on mp-MRI can be subjective, development of computer-aided diagnosis systems to automatically delineate the prostate gland and the intraprostratic lesions (ILs) becomes important to facilitate with radiologists in clinical practice. In this paper, we first study the implementation of the Mask-RCNN model to segment the prostate and ILs. We trained and evaluated models on 120 patients from two different cohorts of patients. We also used 2D U-Net and 3D U-Net as benchmarks to segment the prostate and compared the model's performance. The contour variability of ILs using the algorithm was also benchmarked against the interobserver variability between two different radiation oncologists on 19 patients. Our results indicate that the Mask-RCNN model is able to reach state-of-art performance in the prostate segmentation and outperforms several competitive baselines in ILs segmentation.
△ Less
Submitted 4 April, 2019;
originally announced April 2019.
-
A Deep Dive into Understanding Tumor Foci Classification using Multiparametric MRI Based on Convolutional Neural Network
Authors:
Weiwei Zong,
Joon Lee,
Chang Liu,
Eric Carver,
Aharon Feldman,
Branislava Janic,
Mohamed Elshaikh,
Milan Pantelic,
David Hearshen,
Indrin Chetty,
Benjamin Movsas,
Ning Wen
Abstract:
Deep learning models have had a great success in disease classifications using large data pools of skin cancer images or lung X-rays. However, data scarcity has been the roadblock of applying deep learning models directly on prostate multiparametric MRI (mpMRI). Although model interpretation has been heavily studied for natural images for the past few years, there has been a lack of interpretation…
▽ More
Deep learning models have had a great success in disease classifications using large data pools of skin cancer images or lung X-rays. However, data scarcity has been the roadblock of applying deep learning models directly on prostate multiparametric MRI (mpMRI). Although model interpretation has been heavily studied for natural images for the past few years, there has been a lack of interpretation of deep learning models trained on medical images. This work designs a customized workflow for the small and imbalanced data set of prostate mpMRI where features were extracted from a deep learning model and then analyzed by a traditional machine learning classifier. In addition, this work contributes to revealing how deep learning models interpret mpMRI for prostate cancer patients stratification.
△ Less
Submitted 14 May, 2020; v1 submitted 28 March, 2019;
originally announced March 2019.
-
A String Dual for Partially Topological Chern-Simons-Matter Theories
Authors:
Ofer Aharony,
Andrey Feldman,
Masazumi Honda
Abstract:
We consider a string dual of a partially topological $U(N)$ Chern-Simons-matter (PTCSM) theory recently introduced by Aganagic, Costello, McNamara and Vafa. In this theory, fundamental matter fields are coupled to the Chern-Simons theory in a way that depends only on a transverse holomorphic structure on a manifold; they are not fully dynamical, but the theory is also not fully topological. One de…
▽ More
We consider a string dual of a partially topological $U(N)$ Chern-Simons-matter (PTCSM) theory recently introduced by Aganagic, Costello, McNamara and Vafa. In this theory, fundamental matter fields are coupled to the Chern-Simons theory in a way that depends only on a transverse holomorphic structure on a manifold; they are not fully dynamical, but the theory is also not fully topological. One description of this theory arises from topological strings on the deformed conifold $T^* S^3$ with $N$ Lagrangian 3-branes and additional coisotropic `flavor' 5-branes. Applying the idea of the Gopakumar-Vafa duality to this setup, we suggest that this has a dual description as a topological string on the resolved conifold ${\cal O} \left( - 1 \right) \oplus {\cal O} \left( - 1 \right) \rightarrow \mathbb{CP}^1$, in the presence of coisotropic 5-branes. We test this duality by computing the annulus amplitude on the deformed conifold and the disc amplitude on the resolved conifold via equivariant localization, and we find an agreement between the two. We find a small discrepancy between the topological string results and the large $N$ limit of the partition function of the PTCSM theory arising from the deformed conifold, computed via field theory localization by a method proposed by Aganagic et al. We discuss possible origins of the mismatch.
△ Less
Submitted 13 June, 2019; v1 submitted 15 March, 2019;
originally announced March 2019.
-
Easily Interpretable Bulk Flows: Continuing Tension with the Standard Cosmological Model
Authors:
Sarah Peery,
Richard Watkins,
Hume A. Feldman
Abstract:
We present an improved Minimal Variance (MV) method for using a radial peculiar velocity sample to estimate the average of the three-dimensional velocity field over a spherical volume, which leads to an easily interpretable bulk flow measurement. The only assumption required is that the velocity field is irrotational. The resulting bulk flow estimate is particularly insensitive to smaller scale fl…
▽ More
We present an improved Minimal Variance (MV) method for using a radial peculiar velocity sample to estimate the average of the three-dimensional velocity field over a spherical volume, which leads to an easily interpretable bulk flow measurement. The only assumption required is that the velocity field is irrotational. The resulting bulk flow estimate is particularly insensitive to smaller scale flows. We also introduce a new constraint into the MV method that ensures that bulk flow estimates are independent of the value of the Hubble constant $H_o$; this is important given the tension between the locally measured $H_o$ and that obtained from the cosmic background radiation observations. We apply our method to the \textit{CosmicFlows-3} catalogue and find that, while the bulk flows for shallower spheres are consistent with the standard cosmological model, there is some tension between the bulk flow in a spherical volume with radius $150$\hmpc\ and its expectations; we find only a $\sim 2\%$ chance of obtaining a bulk flow as large or larger in the standard cosmological model with \textit{Planck} parameters
△ Less
Submitted 23 August, 2018;
originally announced August 2018.
-
The Peculiar Velocity Correlation Function
Authors:
Yuyu Wang,
Christopher Rooney,
Hume A. Feldman,
Richard Watkins
Abstract:
We present an analysis of the two-point peculiar velocity correlation function using data from the CosmicFlows catalogues. The Millennium and MultiDark Planck 2 N-body simulations are used to estimate cosmic variance and uncertainties due to measurement errors. We compare the velocity correlation function to expectations from linear theory to constrain cosmological parameters. Using the maximum li…
▽ More
We present an analysis of the two-point peculiar velocity correlation function using data from the CosmicFlows catalogues. The Millennium and MultiDark Planck 2 N-body simulations are used to estimate cosmic variance and uncertainties due to measurement errors. We compare the velocity correlation function to expectations from linear theory to constrain cosmological parameters. Using the maximum likelihood method, we find values of $Ω_m= 0.315^{+0.205}_{-0.135}$ and $σ_8=0.92^{+0.440}_{-0.295}$, consistent with the Planck and Wilkinson Microwave Anisotropy Probe CMB derived estimates. However, we find that the cosmic variance of the correlation function is large and non-Gaussian distributed, making the peculiar velocity correlation function less than ideal as a probe of large-scale structure.
△ Less
Submitted 22 August, 2018;
originally announced August 2018.
-
Linguistic Characteristics of Censorable Language on SinaWeibo
Authors:
Kei Yin Ng,
Anna Feldman,
Jing Peng,
Chris Leberknight
Abstract:
This paper investigates censorship from a linguistic perspective. We collect a corpus of censored and uncensored posts on a number of topics, build a classifier that predicts censorship decisions independent of discussion topics. Our investigation reveals that the strongest linguistic indicator of censored content of our corpus is its readability.
This paper investigates censorship from a linguistic perspective. We collect a corpus of censored and uncensored posts on a number of topics, build a classifier that predicts censorship decisions independent of discussion topics. Our investigation reveals that the strongest linguistic indicator of censored content of our corpus is its readability.
△ Less
Submitted 10 July, 2018;
originally announced July 2018.
-
Classifying Idiomatic and Literal Expressions Using Topic Models and Intensity of Emotions
Authors:
Jing Peng,
Anna Feldman,
Ekaterina Vylomova
Abstract:
We describe an algorithm for automatic classification of idiomatic and literal expressions. Our starting point is that words in a given text segment, such as a paragraph, that are highranking representatives of a common topic of discussion are less likely to be a part of an idiomatic expression. Our additional hypothesis is that contexts in which idioms occur, typically, are more affective and the…
▽ More
We describe an algorithm for automatic classification of idiomatic and literal expressions. Our starting point is that words in a given text segment, such as a paragraph, that are highranking representatives of a common topic of discussion are less likely to be a part of an idiomatic expression. Our additional hypothesis is that contexts in which idioms occur, typically, are more affective and therefore, we incorporate a simple analysis of the intensity of the emotions expressed by the contexts. We investigate the bag of words topic representation of one to three paragraphs containing an expression that should be classified as idiomatic or literal (a target phrase). We extract topics from paragraphs containing idioms and from paragraphs containing literals using an unsupervised clustering method, Latent Dirichlet Allocation (LDA) (Blei et al., 2003). Since idiomatic expressions exhibit the property of non-compositionality, we assume that they usually present different semantics than the words used in the local topic. We treat idioms as semantic outliers, and the identification of a semantic shift as outlier detection. Thus, this topic representation allows us to differentiate idioms from literals using local semantic contexts. Our results are encouraging.
△ Less
Submitted 27 February, 2018;
originally announced February 2018.
-
Kondo-Resonance Mediated Metal-Insulator Transition in GaAs Embedded with Erbium Arsenide Quantum Dots
Authors:
W-D. Zhang,
E. R. Brown,
A. D. Feldman,
T. E. Harvey,
R. P. Mirin
Abstract:
We report anomalous critical transport behavior in a GaAs structure containing a dense array of ErAs quantum dots. The structure displays a voltage (electric field)-controlled insulator-to-metal transition and strong hysteresis in the Kondo-like current-vs-temperature characteristic, with critical temperatures as high as 50 K. We attribute this behavior to a strong distributed Kondo resonance betw…
▽ More
We report anomalous critical transport behavior in a GaAs structure containing a dense array of ErAs quantum dots. The structure displays a voltage (electric field)-controlled insulator-to-metal transition and strong hysteresis in the Kondo-like current-vs-temperature characteristic, with critical temperatures as high as 50 K. We attribute this behavior to a strong distributed Kondo resonance between the quantum dots after the Coulomb blockade of the array is lifted. This is consistent with a high sensitivity of the phase transition to a small external magnetic field that we have observed in the Voigt configuration, and a phenomenological model based on the RKKY interaction within a quantum dot and the cooperative Kondo-resonance amongst quantum dots.
△ Less
Submitted 3 November, 2017;
originally announced November 2017.
-
Colossal photon bunching in quasiparticle-mediated nanodiamond cathodoluminescence
Authors:
Matthew A. Feldman,
Eugene F. Dumitrescu,
Denzel Bridges,
Matthew F. Chisholm,
Roderick B. Davidson,
Philip G. Evans,
Jordan A. Hachtel,
Anming Hu,
Raphael C. Pooser,
Richard F. Haglund,
Benjamin J. Lawrie
Abstract:
Nanoscale control over the second-order photon correlation function $g^{(2)}(τ)$ is critical to emerging research in nonlinear nanophotonics and integrated quantum information science. Here we report on quasiparticle control of photon bunching with $g^{(2)}(0)>45$ in the cathodoluminescence of nanodiamond nitrogen vacancy (NV$^0$) centers excited by a converged electron beam in an aberration-corre…
▽ More
Nanoscale control over the second-order photon correlation function $g^{(2)}(τ)$ is critical to emerging research in nonlinear nanophotonics and integrated quantum information science. Here we report on quasiparticle control of photon bunching with $g^{(2)}(0)>45$ in the cathodoluminescence of nanodiamond nitrogen vacancy (NV$^0$) centers excited by a converged electron beam in an aberration-corrected scanning transmission electron microscope. Plasmon-mediated NV$^0$ cathodoluminescence exhibits a 16-fold increase in luminescence intensity correlated with a three fold reduction in photon bunching compared with that of uncoupled NV$^0$ centers. This effect is ascribed to the excitation of single temporally uncorrelated NV$^0$ centers by single surface plasmon polaritons. Spectrally resolved Hanbury Brown--Twiss interferometry is employed to demonstrate that the bunching is mediated by the NV$^0$ phonon sidebands, while no observable bunching is detected at the zero-phonon line. The data are consistent with fast phonon-mediated recombination dynamics, a conclusion substantiated by agreement between Bayesian regression and Monte Carlo models of superthermal NV$^0$ luminescence.
△ Less
Submitted 16 February, 2018; v1 submitted 17 October, 2017;
originally announced October 2017.
-
Readiness of Quantum Optimization Machines for Industrial Applications
Authors:
Alejandro Perdomo-Ortiz,
Alexander Feldman,
Asier Ozaeta,
Sergei V. Isakov,
Zheng Zhu,
Bryan O'Gorman,
Helmut G. Katzgraber,
Alexander Diedrich,
Hartmut Neven,
Johan de Kleer,
Brad Lackey,
Rupak Biswas
Abstract:
There have been multiple attempts to demonstrate that quantum annealing and, in particular, quantum annealing on quantum annealing machines, has the potential to outperform current classical optimization algorithms implemented on CMOS technologies. The benchmarking of these devices has been controversial. Initially, random spin-glass problems were used, however, these were quickly shown to be not…
▽ More
There have been multiple attempts to demonstrate that quantum annealing and, in particular, quantum annealing on quantum annealing machines, has the potential to outperform current classical optimization algorithms implemented on CMOS technologies. The benchmarking of these devices has been controversial. Initially, random spin-glass problems were used, however, these were quickly shown to be not well suited to detect any quantum speedup. Subsequently, benchmarking shifted to carefully crafted synthetic problems designed to highlight the quantum nature of the hardware while (often) ensuring that classical optimization techniques do not perform well on them. Even worse, to date a true sign of improved scaling with the number of problem variables remains elusive when compared to classical optimization techniques. Here, we analyze the readiness of quantum annealing machines for real-world application problems. These are typically not random and have an underlying structure that is hard to capture in synthetic benchmarks, thus posing unexpected challenges for optimization techniques, both classical and quantum alike. We present a comprehensive computational scaling analysis of fault diagnosis in digital circuits, considering architectures beyond D-wave quantum annealers. We find that the instances generated from real data in multiplier circuits are harder than other representative random spin-glass benchmarks with a comparable number of variables. Although our results show that transverse-field quantum annealing is outperformed by state-of-the-art classical optimization algorithms, these benchmark instances are hard and small in the size of the input, therefore representing the first industrial application ideally suited for testing near-term quantum annealers and other quantum algorithmic strategies for optimization problems.
△ Less
Submitted 2 July, 2019; v1 submitted 31 August, 2017;
originally announced August 2017.
-
Optical amplitude and phase modulation dynamics at the single-photon level in a quantum dot ridge waveguide
Authors:
Galan Moody,
Corey McDonald,
Ari Feldman,
Todd Harvey,
Richard P. Mirin,
Kevin L. Silverman
Abstract:
The amplitude and phase of a material's nonlinear optical response provide insight into the underlying electronic dynamics that determine its optical properties. Phase-sensitive nonlinear spectroscopy techniques are widely implemented to explore these dynamics through demodulation of the complex optical signal field into its quadrature components; however, complete reconstruction of the optical re…
▽ More
The amplitude and phase of a material's nonlinear optical response provide insight into the underlying electronic dynamics that determine its optical properties. Phase-sensitive nonlinear spectroscopy techniques are widely implemented to explore these dynamics through demodulation of the complex optical signal field into its quadrature components; however, complete reconstruction of the optical response requires measuring both the amplitude and phase of each quadrature, which is often lost in standard detection methods. Here, we implement a heterodyne-detection scheme to fully reconstruct the amplitude and phase response of spectral hole-burning from InAs/GaAs charged quantum dots. We observe an ultra-narrow absorption profile and a corresponding dispersive lineshape of the phase, which reflect the nanosecond optical coherence time of the charged exciton transition. Simultaneously, the measurements are sensitive to electron spin relaxation dynamics on a millisecond timescale, as this manifests as a magnetic-field dependent delay of the amplitude and phase modulation. Appreciable amplitude modulation depth and nonlinear phase shift up to 0.09$\timesπ$ radians (16$°$) are demonstrated, providing new possibilities for quadrature modulation at faint photon levels with several independent control parameters, including photon number, modulation frequency, detuning, and externally applied fields.
△ Less
Submitted 26 August, 2016;
originally announced August 2016.
-
Gravitational potential wells and the cosmic bulk flow
Authors:
Abhinav Kumar,
Yuyu Wang,
Hume A. Feldman,
Richard Watkins
Abstract:
The bulk flow is a volume average of the peculiar velocities and a useful probe of the mass distribution on large scales. The gravitational instability model views the bulk flow as a potential flow that obeys a Maxwellian Distribution. We use two N-body simulations, the LasDamas Carmen and the Horizon Run, to calculate the bulk flows of various sized volumes in the simulation boxes. Once we have t…
▽ More
The bulk flow is a volume average of the peculiar velocities and a useful probe of the mass distribution on large scales. The gravitational instability model views the bulk flow as a potential flow that obeys a Maxwellian Distribution. We use two N-body simulations, the LasDamas Carmen and the Horizon Run, to calculate the bulk flows of various sized volumes in the simulation boxes. Once we have the bulk flow velocities as a function of scale, we investigate the mass and gravitational potential distribution around the volume. We found that matter densities can be asymmetrical and difficult to detect in real surveys, however, the gravitational potential and its gradient may provide better tools to investigate the underlying matter distribution. This study shows that bulk flows are indeed potential flows and thus provides information on the flow sources. We also show that bulk flow magnitudes follow a Maxwellian distribution on scales $>10\ h^{-1}$Mpc.
△ Less
Submitted 29 December, 2015;
originally announced December 2015.
-
Electronic Coherence Control in a Charged Quantum Dot
Authors:
Galan Moody,
Corey McDonald,
Ari Feldman,
Todd Harvey,
Richard P. Mirin,
Kevin L. Silverman
Abstract:
Minimizing decoherence due to coupling of a quantum system to its fluctuating environment is at the forefront of quantum information science and photonics research. Nature sets the ultimate limit, however, given by the strength of the system's coupling to the electromagnetic field. Here, we establish the ability to electronically control this coupling and $\textit{enhance}$ the coherence time of a…
▽ More
Minimizing decoherence due to coupling of a quantum system to its fluctuating environment is at the forefront of quantum information science and photonics research. Nature sets the ultimate limit, however, given by the strength of the system's coupling to the electromagnetic field. Here, we establish the ability to electronically control this coupling and $\textit{enhance}$ the coherence time of a quantum dot excitonic state. Coherence control is demonstrated on the positively charged exciton transition (an electron Coulomb-bound with two holes) in quantum dots embedded in a photonic waveguide by manipulating the electron and hole wavefunctions through an applied lateral electric field. With increasing field up to 15 kV cm$^{-1}$, the coherence time increases by a factor of two from $\sim1.4$ ns to $\sim2.7$ ns. Numerical calculations reveal that longer coherence arises from the separation of charge carriers by up to $\sim6$ nm, which leads to a $30\%$ weaker transition dipole moment. The ability to electrostatically control the coherence time and transition dipole moment opens new avenues for quantum communication and novel coupling schemes between distant qubits.
△ Less
Submitted 19 October, 2015;
originally announced October 2015.
-
An atomic clock with $1\times 10^{-18}$ room-temperature blackbody Stark uncertainty
Authors:
K. Beloy,
N. Hinkley,
N. B. Phillips,
J. A. Sherman,
M. Schioppo,
J. Lehman,
A. Feldman,
L. M. Hanssen,
C. W. Oates,
A. D. Ludlow
Abstract:
The Stark shift due to blackbody radiation (BBR) is the key factor limiting the performance of many atomic frequency standards, with the BBR environment inside the clock apparatus being difficult to characterize at a high level of precision. Here we demonstrate an in-vacuum radiation shield that furnishes a uniform, well-characterized BBR environment for the atoms in an ytterbium optical lattice c…
▽ More
The Stark shift due to blackbody radiation (BBR) is the key factor limiting the performance of many atomic frequency standards, with the BBR environment inside the clock apparatus being difficult to characterize at a high level of precision. Here we demonstrate an in-vacuum radiation shield that furnishes a uniform, well-characterized BBR environment for the atoms in an ytterbium optical lattice clock. Operated at room temperature, this shield enables specification of the BBR environment to a corresponding fractional clock uncertainty contribution of $5.5 \times 10^{-19}$. Combined with uncertainty in the atomic response, the total uncertainty of the BBR Stark shift is now $1\times10^{-18}$. Further operation of the shield at elevated temperatures enables a direct measure of the BBR shift temperature dependence and demonstrates consistency between our evaluated BBR environment and the expected atomic response.
△ Less
Submitted 2 December, 2014;
originally announced December 2014.