Search | arXiv e-print repository

Resonator-mediated quantum gate between distant charge qubits

Authors: Florian Kayatz, Jonas Mielke, Guido Burkard

Abstract: Strong charge-photon coupling allows the coherent coupling of a charge qubit, realized by a single charge carrier (either an electron or a hole) in a double quantum dot, to photons of a microwave resonator. Here, we theoretically demonstrate that, in the dispersive regime, the photons can mediate both an $i$SWAP gate as well as a $\sqrt{i\mathrm{SWAP}}$ gate between two distant charge qubits. We p… ▽ More Strong charge-photon coupling allows the coherent coupling of a charge qubit, realized by a single charge carrier (either an electron or a hole) in a double quantum dot, to photons of a microwave resonator. Here, we theoretically demonstrate that, in the dispersive regime, the photons can mediate both an $i$SWAP gate as well as a $\sqrt{i\mathrm{SWAP}}$ gate between two distant charge qubits. We provide a thorough discussion of the impact of the dominant noise sources, resonator damping and charge qubit dephasing on the average gate fidelity. Assuming a state-of-the art resonator decay rate and charge qubit dephasing rate, the predicted average gate fidelities are below 90%. However, an increase of the charge qubit dephasing rate by one order of magnitude is conjectured to result in gate fidelities surpassing 95%. △ Less

Submitted 10 June, 2024; originally announced June 2024.

Comments: 12+7 pages, 3 figures

arXiv:2405.03607 [pdf]

doi 10.1103/PhysRevD.110.024037

Observability of spin precession in the presence of a black-hole remnant kick

Authors: Angela Borchers, Frank Ohme, Jannik Mielke, Shrobana Ghosh

Abstract: Remnants of binary black-hole mergers can gain significant recoil or kick velocities when the binaries are asymmetric. The kick is the consequence of anisotropic emission of gravitational waves, which may leave a characteristic imprint in the observed signal. So far, only one gravitational-wave event supports a non-zero kick velocity: GW200129_065458. This signal is also the first to show evidence… ▽ More Remnants of binary black-hole mergers can gain significant recoil or kick velocities when the binaries are asymmetric. The kick is the consequence of anisotropic emission of gravitational waves, which may leave a characteristic imprint in the observed signal. So far, only one gravitational-wave event supports a non-zero kick velocity: GW200129_065458. This signal is also the first to show evidence for spin-precession. For most other gravitational-wave observations, spin orientations are poorly constrained as this would require large signal-to-noise ratios, unequal mass ratios or inclined systems. Here we investigate whether the imprint of the kick can help to extract more information about the spins. We perform an injection and recovery study comparing binary black-hole signals with significantly different kick magnitudes, but the same spin magnitudes and spin tilts. To exclude the impact of higher signal harmonics in parameter estimation, we focus on equal-mass binaries that are oriented face-on. We generate signals with PhenomXO4a, which includes mode asymmetries. These asymmetries are the main cause for the kick in precessing binaries. For comparison with an equivalent model without asymmetries, we repeat the same injections with PhenomXPHM. We find that signals with large kicks necessarily include large asymmetries, and these give more structure to the signal, leading to more informative measurements of the spins and mass ratio. Our results also complement previous findings that argued precession in equal-mass, face-on or face-away binaries is nearly impossible to identify. In contrast, we find that in the presence of a remnant kick, even those signals become more informative and allow determining precession with signal-to-noise ratios observable already by current gravitational-wave detectors. △ Less

Submitted 17 July, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

Comments: 24 pages, 22 figures

Report number: LIGO-P2400159

Journal ref: Phys. Rev. D 110, 024037 (2024)

arXiv:2312.15493 [pdf, other]

doi 10.1103/PhysRevB.110.035304

Measurement-Based Entanglement of Semiconductor Spin Qubits

Authors: Remy L. Delva, Jonas Mielke, Guido Burkard, Jason R. Petta

Abstract: Measurement-based entanglement is a method for entangling quantum systems through the state projection that accompanies a parity measurement. We derive a stochastic master equation describing measurement-based entanglement of a pair of silicon double-dot flopping-mode spin qubits, develop numerical simulations to model this process, and explore what modifications could enable an experimental imple… ▽ More Measurement-based entanglement is a method for entangling quantum systems through the state projection that accompanies a parity measurement. We derive a stochastic master equation describing measurement-based entanglement of a pair of silicon double-dot flopping-mode spin qubits, develop numerical simulations to model this process, and explore what modifications could enable an experimental implementation of such a protocol. With device parameters corresponding to current qubit and cavity designs, we predict an entanglement fidelity $F_e \approx$ 61%. By increasing the cavity outcoupling rate by a factor of ten, we are able to obtain a simulated $F_e \approx$ 81% while maintaining a yield of 33%. △ Less

Submitted 24 December, 2023; originally announced December 2023.

Journal ref: Phys. Rev. B 110, 035304 (2024)

arXiv:2309.06776 [pdf, other]

Fast adiabatic transport of single laser-cooled $^9$Be$^+$ ions in a cryogenic Penning trap stack

Authors: T. Meiners, J. -A. Coenders, J. Mielke, M. Niemann, J. M. Cornejo, S. Ulmer, C. Ospelkaus

Abstract: High precision mass and $g$-factor measurements in Penning traps have enabled groundbreaking tests of fundamental physics. The most advanced setups use multi-trap methods, which employ transport of particles between specialized trap zones. Present developments focused on the implementation of sympathetic laser cooling will enable significantly shorter duty cycles and better accuracies in many of t… ▽ More High precision mass and $g$-factor measurements in Penning traps have enabled groundbreaking tests of fundamental physics. The most advanced setups use multi-trap methods, which employ transport of particles between specialized trap zones. Present developments focused on the implementation of sympathetic laser cooling will enable significantly shorter duty cycles and better accuracies in many of these scenarios. To take full advantage of these increased capabilities, we implement fast adiabatic transport concepts developed in the context of trapped-ion quantum information processing in a cryogenic Penning trap system. We show adiabatic transport of a single $^9\mathrm{Be}^+$ ion initially cooled to 2 mK over a 2.2 cm distance within 15 ms and with less than 10\,mK energy gain at a peak velocity of 3 m/s. These results represent an important step towards the implementation of quantum logic spectroscopy in the \ppbar system. Applying these developments to other multi-trap systems has the potential to considerably increase the data-sampling rate in these experiments. △ Less

Submitted 13 September, 2023; originally announced September 2023.

Comments: 15 pages, 7 figures

arXiv:2211.05100 [pdf, other]

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Authors: BigScience Workshop, :, Teven Le Scao, Angela Fan, Christopher Akiki, Ellie Pavlick, Suzana Ilić, Daniel Hesslow, Roman Castagné, Alexandra Sasha Luccioni, François Yvon, Matthias Gallé, Jonathan Tow, Alexander M. Rush, Stella Biderman, Albert Webson, Pawan Sasanka Ammanamanchi, Thomas Wang, Benoît Sagot, Niklas Muennighoff, Albert Villanova del Moral, Olatunji Ruwase, Rachel Bawden, Stas Bekman, Angelina McMillan-Major , et al. (369 additional authors not shown)

Abstract: Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access… ▽ More Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access language model designed and built thanks to a collaboration of hundreds of researchers. BLOOM is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total). We find that BLOOM achieves competitive performance on a wide variety of benchmarks, with stronger results after undergoing multitask prompted finetuning. To facilitate future research and applications using LLMs, we publicly release our models and code under the Responsible AI License. △ Less

Submitted 27 June, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

arXiv:2209.10026 [pdf, other]

doi 10.1103/PhysRevB.107.155302

Dispersive cavity-mediated quantum gate between driven dot-donor nuclear spins

Authors: Jonas Mielke, Guido Burkard

Abstract: Nuclear spins show exceptionally long coherence times but the underlying good isolation from their environment is a challenge when it comes to controlling nuclear spin qubits. A particular difficulty, not only for nuclear spin qubits, is the realization of two-qubit gates between distant qubits. Recently, strong coupling between an electron spin and microwave resonator photons as well as a microwa… ▽ More Nuclear spins show exceptionally long coherence times but the underlying good isolation from their environment is a challenge when it comes to controlling nuclear spin qubits. A particular difficulty, not only for nuclear spin qubits, is the realization of two-qubit gates between distant qubits. Recently, strong coupling between an electron spin and microwave resonator photons as well as a microwave resonator mediated coupling between two electron spins both in the resonant and the dispersive regime have been reported and, thus, a microwave resonator mediated electron spin two qubit gate seems to be in reach. Inspired by these findings, we theoretically investigate the interaction of a microwave resonator with a hybrid quantum dot-donor (QDD) system consisting of a gate defined Si QD and a laterally displaced $^{31}$P phosphorous donor atom implanted in the Si host material. We find that driving the QDD system allows to compensate the frequency mismatch between the donor nuclear spin splitting in the MHz regime and typical superconducting resonator frequencies in the GHz regime, and also enables an effective nuclear spin-photon coupling. While we expect this coupling to be weak, we predict that coupling the nuclear spins of two distant QDD systems dispersively to the microwave resonator allows the implementation of a resonator mediated nuclear spin two-qubit $\sqrt{i\mathrm{SWAP}}$ gate with a gate fidelity approaching $90\%$. △ Less

Submitted 10 April, 2023; v1 submitted 20 September, 2022; originally announced September 2022.

Comments: 27 pages, 13 figures

Journal ref: Phys. Rev. B 107, 155302 (2023)

arXiv:2205.03608 [pdf, other]

UniMorph 4.0: Universal Morphology

Authors: Khuyagbaatar Batsuren, Omer Goldman, Salam Khalifa, Nizar Habash, Witold Kieraś, Gábor Bella, Brian Leonard, Garrett Nicolai, Kyle Gorman, Yustinus Ghanggo Ate, Maria Ryskina, Sabrina J. Mielke, Elena Budianskaya, Charbel El-Khaissi, Tiago Pimentel, Michael Gasser, William Lane, Mohit Raj, Matt Coler, Jaime Rafael Montoya Samame, Delio Siticonatzi Camaiteri, Benoît Sagot, Esaú Zumaeta Rojas, Didier López Francis, Arturo Oncevay , et al. (71 additional authors not shown)

Abstract: The Universal Morphology (UniMorph) project is a collaborative effort providing broad-coverage instantiated normalized morphological inflection tables for hundreds of diverse world languages. The project comprises two major thrusts: a language-independent feature schema for rich morphological annotation and a type-level resource of annotated data in diverse languages realizing that schema. This pa… ▽ More The Universal Morphology (UniMorph) project is a collaborative effort providing broad-coverage instantiated normalized morphological inflection tables for hundreds of diverse world languages. The project comprises two major thrusts: a language-independent feature schema for rich morphological annotation and a type-level resource of annotated data in diverse languages realizing that schema. This paper presents the expansions and improvements made on several fronts over the last couple of years (since McCarthy et al. (2020)). Collaborative efforts by numerous linguists have added 67 new languages, including 30 endangered languages. We have implemented several improvements to the extraction pipeline to tackle some issues, e.g. missing gender and macron information. We have also amended the schema to use a hierarchical structure that is needed for morphological phenomena like multiple-argument agreement and case stacking, while adding some missing morphological features to make the schema more inclusive. In light of the last UniMorph release, we also augmented the database with morpheme segmentation for 16 languages. Lastly, this new release makes a push towards inclusion of derivational morphology in UniMorph by enriching the data and annotation schema with instances representing derivational processes from MorphyNet. △ Less

Submitted 19 June, 2022; v1 submitted 7 May, 2022; originally announced May 2022.

Comments: LREC 2022; The first two authors made equal contributions

arXiv:2112.10508 [pdf, other]

Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP

Authors: Sabrina J. Mielke, Zaid Alyafeai, Elizabeth Salesky, Colin Raffel, Manan Dey, Matthias Gallé, Arun Raja, Chenglei Si, Wilson Y. Lee, Benoît Sagot, Samson Tan

Abstract: What are the units of text that we want to model? From bytes to multi-word expressions, text can be analyzed and generated at many granularities. Until recently, most natural language processing (NLP) models operated over words, treating those as discrete and atomic tokens, but starting with byte-pair encoding (BPE), subword-based approaches have become dominant in many areas, enabling small vocab… ▽ More What are the units of text that we want to model? From bytes to multi-word expressions, text can be analyzed and generated at many granularities. Until recently, most natural language processing (NLP) models operated over words, treating those as discrete and atomic tokens, but starting with byte-pair encoding (BPE), subword-based approaches have become dominant in many areas, enabling small vocabularies while still allowing for fast inference. Is the end of the road character-level model or byte-level processing? In this survey, we connect several lines of work from the pre-neural and neural era, by showing how hybrid approaches of words and characters as well as subword-based approaches based on learned segmentation have been proposed and evaluated. We conclude that there is and likely will never be a silver bullet singular solution for all applications and that thinking seriously about tokenization remains important for many applications. △ Less

Submitted 20 December, 2021; originally announced December 2021.

Comments: 15 page preprint

arXiv:2107.08642 [pdf, ps, other]

doi 10.7566/JPSCP.18.011006

Towards Quantum Logic Inspired Cooling and Detection for Single (Anti-)Protons

Authors: T. Meiners, M. Niemann, A. -G. Paschke, J. Mielke, A. Idel, M. Borchert, K. Voges, A. Bautista-Salvador, S. Ulmer, C. Ospelkaus

Abstract: We discuss laser-based and quantum logic inspired cooling and detection methods amenable to single (anti-)protons. These would be applicable e.g. in a g-factor based test of CPT invariance as currently pursued within the BASE collaboration. Towards this end, we explore sympathetic cooling of single (anti-)protons with atomic ions as suggested by Heinzen and Wineland (1990). We discuss laser-based and quantum logic inspired cooling and detection methods amenable to single (anti-)protons. These would be applicable e.g. in a g-factor based test of CPT invariance as currently pursued within the BASE collaboration. Towards this end, we explore sympathetic cooling of single (anti-)protons with atomic ions as suggested by Heinzen and Wineland (1990). △ Less

Submitted 19 July, 2021; originally announced July 2021.

Comments: Presented at LEAP 2016 in Kanazawa, Japan

Journal ref: JPS Conf. Proc. 18, 011006 (2017)

arXiv:2107.08449 [pdf, ps, other]

doi 10.1007/s10751-018-1502-6

Towards Sympathetic Cooling of Single (Anti-)Protons

Authors: T. Meiners, M. Niemann, J. Mielke, M. Borchert, N. Pulido, J. M. Cornejo, S. Ulmer, C. Ospelkaus

Abstract: We present methods to manipulate and detect the motional state and the spin state of a single antiproton or proton which are currently under development within the BASE (Baryon Antibaryon Symmetry Experiment) collaboration. These methods include sympathetic laser cooling of a single (anti-)proton using a co-trapped atomic ion as well as quantum logic spectroscopy with the two particles and could b… ▽ More We present methods to manipulate and detect the motional state and the spin state of a single antiproton or proton which are currently under development within the BASE (Baryon Antibaryon Symmetry Experiment) collaboration. These methods include sympathetic laser cooling of a single (anti-)proton using a co-trapped atomic ion as well as quantum logic spectroscopy with the two particles and could be implemented within the collaboration for state preparation and state readout in the antiproton $g$-factor measurement experiment at CERN. In our project, these techniques shall be applied using a single $^9\text{Be}^+$ ion as the atomic ion in a Penning trap system at a magnetic field of 5 T. As an intermediate step, a controlled interaction of two beryllium ions in a double-well potential as well as sympathetic cooling of one ion by the other shall be demonstrated. △ Less

Submitted 18 July, 2021; originally announced July 2021.

Comments: Proceedings of the 13th International Conference on Low Energy Antiproton Physics (LEAP 2018) Paris, France, 12-16 March 2018

Journal ref: Hyperfine Interactions volume 239, Article number: 26 (2018)

arXiv:2107.08435 [pdf, other]

doi 10.1142/9789813148505_0022

Towards Sympathetic Laser Cooling and Detection of Single (Anti-)Proton

Authors: T. Meiners, M. Niemann, A. -G. Paschke, M. Borchert, A. Idel, J. Mielke, K. Voges, A. Bautista-Salvador, R. Lehnert, S. Ulmer, C. Ospelkaus

Abstract: Current experimental efforts to test the fundamental CPT symmetry with single (anti-)protons are progressing at a rapid pace but are hurt by the nonzero temperature of particles and the difficulty of spin state detection. We describe a laser-based and quantum logic inspired approach to single (anti-)proton cooling and state detection. Current experimental efforts to test the fundamental CPT symmetry with single (anti-)protons are progressing at a rapid pace but are hurt by the nonzero temperature of particles and the difficulty of spin state detection. We describe a laser-based and quantum logic inspired approach to single (anti-)proton cooling and state detection. △ Less

Submitted 18 July, 2021; originally announced July 2021.

Comments: Presented at the Seventh Meeting on CPT and Lorentz Symmetry, Bloomington, Indiana, June 20-24, 2016

Journal ref: World Scientific, Singapore, Proceedings of the Seventh Meeting on CPT and Lorentz Symmetry (2017)

arXiv:2107.08433 [pdf, other]

doi 10.1142/9789811213984_0029

Cryogenic Penning-Trap Apparatus for Precision Experiments with Sympathetically Cooled (anti)protons

Authors: M. Niemann, T. Meiners, J. Mielke, N. Pulido, J. Schaper, M. J. Borchert, J. M. Cornejo, A. -G. Paschke, G. Zarantonello, H. Hahn, T. Lang, C. Manzoni, M. Marangoni, G. Cerullo, U. Morgner, J. -A. Fenske, A. Bautista-Salvador, R. Lehnert, S. Ulmer, C. Ospelkaus

Abstract: Current precision experiments with single (anti)protons to test CPT symmetry progress at a rapid pace, but are complicated by the need to cool particles to sub-thermal energies. We describe a cryogenic Penning-trap setup for $^9$Be$^+$ ions designed to allow coupling of single (anti)protons to laser-cooled atomic ions for sympathetic cooling and quantum logic spectroscopy. We report on trapping an… ▽ More Current precision experiments with single (anti)protons to test CPT symmetry progress at a rapid pace, but are complicated by the need to cool particles to sub-thermal energies. We describe a cryogenic Penning-trap setup for $^9$Be$^+$ ions designed to allow coupling of single (anti)protons to laser-cooled atomic ions for sympathetic cooling and quantum logic spectroscopy. We report on trapping and laser cooling of clouds and single $^9$Be$^+$ ions. We discuss prospects for a microfabricated trap to allow coupling of single (anti)protons to laser-cooled $^9$Be$^+$ ions for sympathetic laser cooling to sub-mK temperatures on ms time scales. △ Less

Submitted 18 July, 2021; originally announced July 2021.

Comments: Presented at the Eighth Meeting on CPT and Lorentz Symmetry, Bloomington, Indiana, May 12-16, 2019

Journal ref: World Scientific, Singapore, Proceedings of the Eighth Meeting on CPT and Lorentz Symmetry (2020)

arXiv:2106.13532 [pdf, other]

doi 10.1088/1361-6455/ac319d

139 GHz UV phase-locked Raman laser system for thermometry and sideband cooling of $^9$Be$^+$ ions in a Penning trap

Authors: Johannes Mielke, Julian Pick, Julia A. Coenders, Teresa Meiners, Malte Niemann, Juan M. Cornejo, Stefan Ulmer, Christian Ospelkaus

Abstract: We demonstrate phase locking of two ultraviolet laser sources by modulating a fundamental infrared laser with 4th-order sidebands using an electro-optic modulator and phase locking of one sideband to a second fundamental infrared laser. Subsequent sum frequency generation and second harmonic generation successfully translates the frequency offset to the ultraviolet domain. The phase lock at 139 GH… ▽ More We demonstrate phase locking of two ultraviolet laser sources by modulating a fundamental infrared laser with 4th-order sidebands using an electro-optic modulator and phase locking of one sideband to a second fundamental infrared laser. Subsequent sum frequency generation and second harmonic generation successfully translates the frequency offset to the ultraviolet domain. The phase lock at 139 GHz is confirmed through stimulated Raman transitions for thermometry of $^9$Be$^+$ ions confined in a cryogenic Penning trap. This technique might be used for sideband cooling of single $^9$Be$^+$ ions as well as sympathetic cooling schemes and quantum logic based measurements in Penning traps in the future. △ Less

Submitted 18 October, 2021; v1 submitted 25 June, 2021; originally announced June 2021.

Comments: 7 figures, accepted for publication in J. Phys. B

Journal ref: J. Phys. B: At. Mol. Opt. Phys. 54, 195402 (2021)

arXiv:2106.06252 [pdf, other]

doi 10.1088/1367-2630/ac136e

Quantum logic inspired techniques for spacetime-symmetry tests with (anti-)protons

Authors: Juan M. Cornejo, Ralf Lehnert, Malte Niemann, Johannes Mielke, Teresa Meiners, Amado Bautista-Salvador, Marius Schulte, Diana Nitzschke, Matthias J. Borchert, Klemens Hammerer, Stefan Ulmer, Christian Ospelkaus

Abstract: Cosmological observations as well as theoretical approaches to physics beyond the Standard Model provide strong motivations for experimental tests of fundamental symmetries, such as CPT invariance. In this context, the availability of cold baryonic antimatter at CERN has opened an avenue for ultrahigh-precision comparisons of protons and antiprotons in Penning traps. This work discusses an experim… ▽ More Cosmological observations as well as theoretical approaches to physics beyond the Standard Model provide strong motivations for experimental tests of fundamental symmetries, such as CPT invariance. In this context, the availability of cold baryonic antimatter at CERN has opened an avenue for ultrahigh-precision comparisons of protons and antiprotons in Penning traps. This work discusses an experimental method inspired by quantum logic techniques that will improve particle localization and readout speed in such experiments. The method allows for sympathetic cooling of the (anti-)proton to its quantum-mechanical ground state as well as the readout of its spin alignment, replacing the commonly used continuous Stern-Gerlach effect. Both of these features are achieved through coupling to a laser-cooled `logic' ion co-trapped in a double-well potential. This technique will boost the measurement sampling rate and will thus provide results with lower statistical uncertainty, contributing to stringent searches for time dependent variations in the data. Such measurements ultimately yield extremely high sensitivities to CPT violating coefficients acting on baryons in the Standard-Model Extension, will allow the exploration of previously unmeasured types of symmetry violations, and will enable antimatter-based axion-like dark matter searches with improved mass resolution. △ Less

Submitted 13 July, 2021; v1 submitted 11 June, 2021; originally announced June 2021.

Comments: Accepted for publication in New Journal of Physics

arXiv:2106.03895 [pdf, other]

SIGTYP 2021 Shared Task: Robust Spoken Language Identification

Authors: Elizabeth Salesky, Badr M. Abdullah, Sabrina J. Mielke, Elena Klyachko, Oleg Serikov, Edoardo Ponti, Ritesh Kumar, Ryan Cotterell, Ekaterina Vylomova

Abstract: While language identification is a fundamental speech and language processing task, for many languages and language families it remains a challenging task. For many low-resource and endangered languages this is in part due to resource availability: where larger datasets exist, they may be single-speaker or have different domains than desired application scenarios, demanding a need for domain and s… ▽ More While language identification is a fundamental speech and language processing task, for many languages and language families it remains a challenging task. For many low-resource and endangered languages this is in part due to resource availability: where larger datasets exist, they may be single-speaker or have different domains than desired application scenarios, demanding a need for domain and speaker-invariant language identification systems. This year's shared task on robust spoken language identification sought to investigate just this scenario: systems were to be trained on largely single-speaker speech from one domain, but evaluated on data in other domains recorded from speakers under different recording circumstances, mimicking realistic low-resource scenarios. We see that domain and speaker mismatch proves very challenging for current methods which can perform above 95% accuracy in-domain, which domain adaptation can address to some degree, but that these conditions merit further investigation to make spoken language identification accessible in many scenarios. △ Less

Submitted 7 June, 2021; originally announced June 2021.

Comments: The first three authors contributed equally

arXiv:2012.14983 [pdf, other]

Reducing conversational agents' overconfidence through linguistic calibration

Authors: Sabrina J. Mielke, Arthur Szlam, Emily Dinan, Y-Lan Boureau

Abstract: While improving neural dialogue agents' factual accuracy is the object of much research, another important aspect of communication, less studied in the setting of neural dialogue, is transparency about ignorance. In this work, we analyze to what extent state-of-the-art chit-chat models are linguistically calibrated in the sense that their verbalized expression of doubt (or confidence) matches the… ▽ More While improving neural dialogue agents' factual accuracy is the object of much research, another important aspect of communication, less studied in the setting of neural dialogue, is transparency about ignorance. In this work, we analyze to what extent state-of-the-art chit-chat models are linguistically calibrated in the sense that their verbalized expression of doubt (or confidence) matches the likelihood that the model's responses are factually incorrect (or correct). We find that these models are poorly calibrated, yet we show that likelihood of correctness can accurately be predicted. By incorporating such metacognitive features into the training of a controllable generation model, we obtain a dialogue agent with greatly improved linguistic calibration. While improving neural dialogue agents' factual accuracy is the object of much research, another important aspect of communication, less studied in the setting of neural dialogue, is transparency about ignorance. In this work, we analyze to what extent state-of-the-art chit-chat models are linguistically calibrated in the sense that their verbalized expression of doubt (or confidence) matches the likelihood that the model's responses are factually incorrect (or correct). We find that these models are poorly calibrated, yet we show that likelihood of correctness can accurately be predicted. By incorporating such metacognitive features into the training of a controllable generation model, we obtain a dialogue agent with greatly improved linguistic calibration. △ Less

Submitted 26 June, 2022; v1 submitted 29 December, 2020; originally announced December 2020.

Comments: Accepted in TACL, to be presented at NAACL 2022

arXiv:2012.01322 [pdf, other]

doi 10.1103/PRXQuantum.2.020347

Nuclear spin readout in a cavity-coupled hybrid quantum dot-donor system

Authors: Jonas Mielke, Jason R. Petta, Guido Burkard

Abstract: Nuclear spins show long coherence times and are well isolated from the environment, which are properties making them promising for quantum information applications. Here, we present a method for nuclear spin readout by probing the transmission of a microwave resonator. We consider a single electron in a silicon quantum dot-donor device interacting with a microwave resonator via the electric dipole… ▽ More Nuclear spins show long coherence times and are well isolated from the environment, which are properties making them promising for quantum information applications. Here, we present a method for nuclear spin readout by probing the transmission of a microwave resonator. We consider a single electron in a silicon quantum dot-donor device interacting with a microwave resonator via the electric dipole coupling and subjected to a homogeneous magnetic field and a transverse magnetic field gradient. In our scenario, the electron spin interacts with a $^{31}\mathrm{P}$ defect nuclear spin via the hyperfine interaction. We theoretically investigate the influence of the P nuclear spin state on the microwave transmission through the cavity and show that nuclear spin readout is feasible with current state-of-the-art devices. Moreover, we identify optimal readout points with strong signal contrast to facilitate the experimental implementation of nuclear spin readout. Furthermore, we investigate the potential for achieving coherent excitation exchange between a nuclear spin qubit and cavity photons. △ Less

Submitted 17 May, 2021; v1 submitted 2 December, 2020; originally announced December 2020.

Comments: 8+11 pages, 6+4 figures, v2: substantial revision and focus on dot-donor system

Journal ref: PRX Quantum 2, 020347 (2021)

arXiv:2010.08246 [pdf, other]

SIGTYP 2020 Shared Task: Prediction of Typological Features

Authors: Johannes Bjerva, Elizabeth Salesky, Sabrina J. Mielke, Aditi Chaudhary, Giuseppe G. A. Celano, Edoardo M. Ponti, Ekaterina Vylomova, Ryan Cotterell, Isabelle Augenstein

Abstract: Typological knowledge bases (KBs) such as WALS (Dryer and Haspelmath, 2013) contain information about linguistic properties of the world's languages. They have been shown to be useful for downstream applications, including cross-lingual transfer learning and linguistic probing. A major drawback hampering broader adoption of typological KBs is that they are sparsely populated, in the sense that mos… ▽ More Typological knowledge bases (KBs) such as WALS (Dryer and Haspelmath, 2013) contain information about linguistic properties of the world's languages. They have been shown to be useful for downstream applications, including cross-lingual transfer learning and linguistic probing. A major drawback hampering broader adoption of typological KBs is that they are sparsely populated, in the sense that most languages only have annotations for some features, and skewed, in that few features have wide coverage. As typological features often correlate with one another, it is possible to predict them and thus automatically populate typological KBs, which is also the focus of this shared task. Overall, the task attracted 8 submissions from 5 teams, out of which the most successful methods make use of such feature correlations. However, our error analysis reveals that even the strongest submitted systems struggle with predicting feature values for languages where few features are known. △ Less

Submitted 26 October, 2020; v1 submitted 16 October, 2020; originally announced October 2020.

Comments: SigTyp 2020 Shared Task Description Paper @ EMNLP 2020

arXiv:2007.01176 [pdf]

Processing South Asian Languages Written in the Latin Script: the Dakshina Dataset

Authors: Brian Roark, Lawrence Wolf-Sonkin, Christo Kirov, Sabrina J. Mielke, Cibu Johny, Isin Demirsahin, Keith Hall

Abstract: This paper describes the Dakshina dataset, a new resource consisting of text in both the Latin and native scripts for 12 South Asian languages. The dataset includes, for each language: 1) native script Wikipedia text; 2) a romanization lexicon; and 3) full sentence parallel data in both a native script of the language and the basic Latin alphabet. We document the methods used for preparation and s… ▽ More This paper describes the Dakshina dataset, a new resource consisting of text in both the Latin and native scripts for 12 South Asian languages. The dataset includes, for each language: 1) native script Wikipedia text; 2) a romanization lexicon; and 3) full sentence parallel data in both a native script of the language and the basic Latin alphabet. We document the methods used for preparation and selection of the Wikipedia text in each language; collection of attested romanizations for sampled lexicons; and manual romanization of held-out sentences from the native script collections. We additionally provide baseline results on several tasks made possible by the dataset, including single word transliteration, full sentence transliteration, and language modeling of native script and romanized text. Keywords: romanization, transliteration, South Asian languages △ Less

Submitted 2 July, 2020; originally announced July 2020.

Comments: Published at LREC 2020

arXiv:2006.11572 [pdf, other]

SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection

Authors: Ekaterina Vylomova, Jennifer White, Elizabeth Salesky, Sabrina J. Mielke, Shijie Wu, Edoardo Ponti, Rowan Hall Maudslay, Ran Zmigrod, Josef Valvoda, Svetlana Toldova, Francis Tyers, Elena Klyachko, Ilya Yegorov, Natalia Krizhanovsky, Paula Czarnowska, Irene Nikkarinen, Andrew Krizhanovsky, Tiago Pimentel, Lucas Torroba Hennigen, Christo Kirov, Garrett Nicolai, Adina Williams, Antonios Anastasopoulos, Hilaria Cruz, Eleanor Chodroff , et al. (3 additional authors not shown)

Abstract: A broad goal in natural language processing (NLP) is to develop a system that has the capacity to process any natural language. Most systems, however, are developed using data from just one language such as English. The SIGMORPHON 2020 shared task on morphological reinflection aims to investigate systems' ability to generalize across typologically distinct languages, many of which are low resource… ▽ More A broad goal in natural language processing (NLP) is to develop a system that has the capacity to process any natural language. Most systems, however, are developed using data from just one language such as English. The SIGMORPHON 2020 shared task on morphological reinflection aims to investigate systems' ability to generalize across typologically distinct languages, many of which are low resource. Systems were developed using data from 45 languages and just 5 language families, fine-tuned with data from an additional 45 languages and 10 language families (13 in total), and evaluated on all 90 languages. A total of 22 systems (19 neural) from 10 teams were submitted to the task. All four winning systems were neural (two monolingual transformers and two massively multilingual RNN-based models with gated attention). Most teams demonstrate utility of data hallucination and augmentation, ensembles, and multilingual training for low-resource languages. Non-neural learners and manually designed grammars showed competitive and even superior performance on some languages (such as Ingrian, Tajik, Tagalog, Zarma, Lingala), especially with very limited data. Some language families (Afro-Asiatic, Niger-Congo, Turkic) were relatively easy for most systems and achieved over 90% mean accuracy while others were more challenging. △ Less

Submitted 14 July, 2020; v1 submitted 20 June, 2020; originally announced June 2020.

Comments: 39 pages, SIGMORPHON

arXiv:2005.02354 [pdf, other]

It's Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information

Authors: Emanuele Bugliarello, Sabrina J. Mielke, Antonios Anastasopoulos, Ryan Cotterell, Naoaki Okazaki

Abstract: The performance of neural machine translation systems is commonly evaluated in terms of BLEU. However, due to its reliance on target language properties and generation, the BLEU metric does not allow an assessment of which translation directions are more difficult to model. In this paper, we propose cross-mutual information (XMI): an asymmetric information-theoretic metric of machine translation d… ▽ More The performance of neural machine translation systems is commonly evaluated in terms of BLEU. However, due to its reliance on target language properties and generation, the BLEU metric does not allow an assessment of which translation directions are more difficult to model. In this paper, we propose cross-mutual information (XMI): an asymmetric information-theoretic metric of machine translation difficulty that exploits the probabilistic nature of most neural machine translation models. XMI allows us to better evaluate the difficulty of translating text into the target language while controlling for the difficulty of the target-side generation component independent of the translation task. We then present the first systematic and controlled study of cross-lingual translation difficulties using modern neural translation systems. Code for replicating our experiments is available online at https://github.com/e-bug/nmt-difficulty. △ Less

Submitted 17 May, 2020; v1 submitted 5 May, 2020; originally announced May 2020.

Comments: Accepted at ACL 2020

arXiv:2004.14914 [pdf, other]

Tired of Topic Models? Clusters of Pretrained Word Embeddings Make for Fast and Good Topics too!

Authors: Suzanna Sia, Ayush Dalmia, Sabrina J. Mielke

Abstract: Topic models are a useful analysis tool to uncover the underlying themes within document collections. The dominant approach is to use probabilistic topic models that posit a generative story, but in this paper we propose an alternative way to obtain topics: clustering pre-trained word embeddings while incorporating document information for weighted clustering and reranking top words. We provide be… ▽ More Topic models are a useful analysis tool to uncover the underlying themes within document collections. The dominant approach is to use probabilistic topic models that posit a generative story, but in this paper we propose an alternative way to obtain topics: clustering pre-trained word embeddings while incorporating document information for weighted clustering and reranking top words. We provide benchmarks for the combination of different word embeddings and clustering algorithms, and analyse their performance under dimensionality reduction with PCA. The best performing combination for our approach performs as well as classical topic models, but with lower runtime and computational complexity. △ Less

Submitted 6 October, 2020; v1 submitted 30 April, 2020; originally announced April 2020.

Comments: Published as a short paper at EMNLP 2020

arXiv:1910.11493 [pdf, ps, other]

doi 10.18653/v1/W19-4226

The SIGMORPHON 2019 Shared Task: Morphological Analysis in Context and Cross-Lingual Transfer for Inflection

Authors: Arya D. McCarthy, Ekaterina Vylomova, Shijie Wu, Chaitanya Malaviya, Lawrence Wolf-Sonkin, Garrett Nicolai, Christo Kirov, Miikka Silfverberg, Sabrina J. Mielke, Jeffrey Heinz, Ryan Cotterell, Mans Hulden

Abstract: The SIGMORPHON 2019 shared task on cross-lingual transfer and contextual analysis in morphology examined transfer learning of inflection between 100 language pairs, as well as contextual lemmatization and morphosyntactic description in 66 languages. The first task evolves past years' inflection tasks by examining transfer of morphological inflection knowledge from a high-resource language to a low… ▽ More The SIGMORPHON 2019 shared task on cross-lingual transfer and contextual analysis in morphology examined transfer learning of inflection between 100 language pairs, as well as contextual lemmatization and morphosyntactic description in 66 languages. The first task evolves past years' inflection tasks by examining transfer of morphological inflection knowledge from a high-resource language to a low-resource language. This year also presents a new second challenge on lemmatization and morphological feature analysis in context. All submissions featured a neural component and built on either this year's strong baselines or highly ranked systems from previous years' shared tasks. Every participating team improved in accuracy over the baselines for the inflection task (though not Levenshtein distance), and every team in the contextual analysis task improved on both state-of-the-art neural and non-neural baselines. △ Less

Submitted 25 February, 2020; v1 submitted 24 October, 2019; originally announced October 2019.

Comments: Presented at SIGMORPHON 2019

Journal ref: Proceedings of the 16th Workshop on Computational Research in Phonetics, Phonology, and Morphology (2019) 229-244

arXiv:1906.09249 [pdf, other]

doi 10.1088/1361-6501/ab5722

Cryogenic $^9$Be$^+$ Penning trap for precision measurements with (anti-)protons

Authors: Malte Niemann, Teresa Meiners, Johannes Mielke, Matthias Joachim Borchert, Juan Manuel Cornejo, Stefan Ulmer, Christian Ospelkaus

Abstract: Cooling and detection schemes using laser cooling and methods of quantum logic can contribute to high precision CPT symmetry tests in the baryonic sector. This work introduces an experiment to sympathetically cool protons and antiprotons using the Coulomb interaction with a $^9$Be$^+$ ion trapped in a nearby but separate potential well. We have designed and set up an apparatus to show such couplin… ▽ More Cooling and detection schemes using laser cooling and methods of quantum logic can contribute to high precision CPT symmetry tests in the baryonic sector. This work introduces an experiment to sympathetically cool protons and antiprotons using the Coulomb interaction with a $^9$Be$^+$ ion trapped in a nearby but separate potential well. We have designed and set up an apparatus to show such coupling between two identical ions for the first time in a Penning trap. In this paper, we present evidence for successful loading and Doppler cooling of clouds and single ions. Our coupling scheme has applications in a range of high-precision measurements in Penning traps and has the potential to substantially improve motional control in these experiments. △ Less

Submitted 24 June, 2019; v1 submitted 21 June, 2019; originally announced June 2019.

Comments: Corrected typos

arXiv:1906.04726 [pdf, other]

What Kind of Language Is Hard to Language-Model?

Authors: Sabrina J. Mielke, Ryan Cotterell, Kyle Gorman, Brian Roark, Jason Eisner

Abstract: How language-agnostic are current state-of-the-art NLP tools? Are there some types of language that are easier to model with current methods? In prior work (Cotterell et al., 2018) we attempted to address this question for language modeling, and observed that recurrent neural network language models do not perform equally well over all the high-resource European languages found in the Europarl cor… ▽ More How language-agnostic are current state-of-the-art NLP tools? Are there some types of language that are easier to model with current methods? In prior work (Cotterell et al., 2018) we attempted to address this question for language modeling, and observed that recurrent neural network language models do not perform equally well over all the high-resource European languages found in the Europarl corpus. We speculated that inflectional morphology may be the primary culprit for the discrepancy. In this paper, we extend these earlier experiments to cover 69 languages from 13 language families using a multilingual Bible corpus. Methodologically, we introduce a new paired-sample multiplicative mixed-effects model to obtain language difficulty coefficients from at-least-pairwise parallel corpora. In other words, the model is aware of inter-sentence variation and can handle missing data. Exploiting this model, we show that "translationese" is not any easier to model than natively written language in a fair comparison. Trying to answer the question of what features difficult languages have in common, we try and fail to reproduce our earlier (Cotterell et al., 2018) observation about morphological complexity and instead reveal far simpler statistics of the data that seem to drive complexity in a much larger sample. △ Less

Submitted 25 February, 2020; v1 submitted 11 June, 2019; originally announced June 2019.

Comments: Published at ACL 2019

arXiv:1906.04571 [pdf, other]

Counterfactual Data Augmentation for Mitigating Gender Stereotypes in Languages with Rich Morphology

Authors: Ran Zmigrod, Sabrina J. Mielke, Hanna Wallach, Ryan Cotterell

Abstract: Gender stereotypes are manifest in most of the world's languages and are consequently propagated or amplified by NLP systems. Although research has focused on mitigating gender stereotypes in English, the approaches that are commonly employed produce ungrammatical sentences in morphologically rich languages. We present a novel approach for converting between masculine-inflected and feminine-inflec… ▽ More Gender stereotypes are manifest in most of the world's languages and are consequently propagated or amplified by NLP systems. Although research has focused on mitigating gender stereotypes in English, the approaches that are commonly employed produce ungrammatical sentences in morphologically rich languages. We present a novel approach for converting between masculine-inflected and feminine-inflected sentences in such languages. For Spanish and Hebrew, our approach achieves F1 scores of 82% and 73% at the level of tags and accuracies of 90% and 87% at the level of forms. By evaluating our approach using four different languages, we show that, on average, it reduces gender stereotyping by a factor of 2.5 without any sacrifice to grammaticality. △ Less

Submitted 27 May, 2020; v1 submitted 11 June, 2019; originally announced June 2019.

Comments: ACL 2019

arXiv:1905.03981 [pdf]

Confidence intervals with maximal average power

Authors: Christian Bartels, Johanna Mielke, Ekkehard Glimm

Abstract: We propose a frequentist testing procedure that maintains a defined coverage and is optimal in the sense that it gives maximal power to detect deviations from a null hypothesis when the alternative to the null hypothesis is sampled from a pre-specified distribution (the prior distribution). Selecting a prior distribution allows to tune the decision rule. This leads to an increased power, if the tr… ▽ More We propose a frequentist testing procedure that maintains a defined coverage and is optimal in the sense that it gives maximal power to detect deviations from a null hypothesis when the alternative to the null hypothesis is sampled from a pre-specified distribution (the prior distribution). Selecting a prior distribution allows to tune the decision rule. This leads to an increased power, if the true data generating distribution happens to be compatible with the prior. It comes at the cost of losing power, if the data generating distribution or the observed data are incompatible with the prior. We illustrate the proposed approach for a binomial experiment, which is sufficiently simple such that the decision sets can be illustrated in figures, which should facilitate an intuitive understanding. The potential beyond the simple example will be discussed: the approach is generic in that the test is defined based on the likelihood function and the prior only. It is comparatively simple to implement and efficient to execute, since it does not rely on Minimax optimization. Conceptually it is interesting to note that for constructing the testing procedure the Bayesian posterior probability distribution is used. △ Less

Submitted 5 July, 2020; v1 submitted 10 May, 2019; originally announced May 2019.

arXiv:1810.11101 [pdf, other]

UniMorph 2.0: Universal Morphology

Authors: Christo Kirov, Ryan Cotterell, John Sylak-Glassman, Géraldine Walther, Ekaterina Vylomova, Patrick Xia, Manaal Faruqui, Sabrina J. Mielke, Arya D. McCarthy, Sandra Kübler, David Yarowsky, Jason Eisner, Mans Hulden

Abstract: The Universal Morphology UniMorph project is a collaborative effort to improve how NLP handles complex morphology across the world's languages. The project releases annotated morphological data using a universal tagset, the UniMorph schema. Each inflected form is associated with a lemma, which typically carries its underlying lexical meaning, and a bundle of morphological features from our schema.… ▽ More The Universal Morphology UniMorph project is a collaborative effort to improve how NLP handles complex morphology across the world's languages. The project releases annotated morphological data using a universal tagset, the UniMorph schema. Each inflected form is associated with a lemma, which typically carries its underlying lexical meaning, and a bundle of morphological features from our schema. Additional supporting data and tools are also released on a per-language basis when available. UniMorph is based at the Center for Language and Speech Processing (CLSP) at Johns Hopkins University in Baltimore, Maryland and is sponsored by the DARPA LORELEI program. This paper details advances made to the collection, annotation, and dissemination of project resources since the initial UniMorph release described at LREC 2016. lexical resources} } △ Less

Submitted 25 February, 2020; v1 submitted 25 October, 2018; originally announced October 2018.

Comments: LREC 2018

arXiv:1810.07125 [pdf, other]

The CoNLL--SIGMORPHON 2018 Shared Task: Universal Morphological Reinflection

Authors: Ryan Cotterell, Christo Kirov, John Sylak-Glassman, Géraldine Walther, Ekaterina Vylomova, Arya D. McCarthy, Katharina Kann, Sabrina J. Mielke, Garrett Nicolai, Miikka Silfverberg, David Yarowsky, Jason Eisner, Mans Hulden

Abstract: The CoNLL--SIGMORPHON 2018 shared task on supervised learning of morphological generation featured data sets from 103 typologically diverse languages. Apart from extending the number of languages involved in earlier supervised tasks of generating inflected forms, this year the shared task also featured a new second task which asked participants to inflect words in sentential context, similar to a… ▽ More The CoNLL--SIGMORPHON 2018 shared task on supervised learning of morphological generation featured data sets from 103 typologically diverse languages. Apart from extending the number of languages involved in earlier supervised tasks of generating inflected forms, this year the shared task also featured a new second task which asked participants to inflect words in sentential context, similar to a cloze task. This second task featured seven languages. Task 1 received 27 submissions and task 2 received 6 submissions. Both tasks featured a low, medium, and high data condition. Nearly all submissions featured a neural component and built on highly-ranked systems from the earlier 2017 shared task. In the inflection task (task 1), 41 of the 52 languages present in last year's inflection task showed improvement by the best systems in the low-resource setting. The cloze task (task 2) proved to be difficult, and few submissions managed to consistently improve upon both a simple neural baseline system and a lemma-repeating baseline. △ Less

Submitted 25 February, 2020; v1 submitted 16 October, 2018; originally announced October 2018.

Comments: CoNLL 2018. arXiv admin note: text overlap with arXiv:1706.09031

arXiv:1806.03746 [pdf, other]

A Structured Variational Autoencoder for Contextual Morphological Inflection

Authors: Lawrence Wolf-Sonkin, Jason Naradowsky, Sabrina J. Mielke, Ryan Cotterell

Abstract: Statistical morphological inflectors are typically trained on fully supervised, type-level data. One remaining open research question is the following: How can we effectively exploit raw, token-level data to improve their performance? To this end, we introduce a novel generative latent-variable model for the semi-supervised learning of inflection generation. To enable posterior inference over the… ▽ More Statistical morphological inflectors are typically trained on fully supervised, type-level data. One remaining open research question is the following: How can we effectively exploit raw, token-level data to improve their performance? To this end, we introduce a novel generative latent-variable model for the semi-supervised learning of inflection generation. To enable posterior inference over the latent variables, we derive an efficient variational inference procedure based on the wake-sleep algorithm. We experiment on 23 languages, using the Universal Dependencies corpora in a simulated low-resource setting, and find improvements of over 10% absolute accuracy in some cases. △ Less

Submitted 25 February, 2020; v1 submitted 10 June, 2018; originally announced June 2018.

Comments: Published at ACL 2018

arXiv:1806.03743 [pdf, other]

Are All Languages Equally Hard to Language-Model?

Authors: Ryan Cotterell, Sabrina J. Mielke, Jason Eisner, Brian Roark

Abstract: For general modeling methods applied to diverse languages, a natural question is: how well should we expect our models to work on languages with differing typological profiles? In this work, we develop an evaluation framework for fair cross-linguistic comparison of language models, using translated text so that all models are asked to predict approximately the same information. We then conduct a s… ▽ More For general modeling methods applied to diverse languages, a natural question is: how well should we expect our models to work on languages with differing typological profiles? In this work, we develop an evaluation framework for fair cross-linguistic comparison of language models, using translated text so that all models are asked to predict approximately the same information. We then conduct a study on 21 languages, demonstrating that in some languages, the textual expression of the information is harder to predict with both $n$-gram and LSTM language models. We show complex inflectional morphology to be a cause of performance differences among languages. △ Less

Submitted 25 February, 2020; v1 submitted 10 June, 2018; originally announced June 2018.

Comments: Published at NAACL 2018

arXiv:1806.03740 [pdf, other]

Unsupervised Disambiguation of Syncretism in Inflected Lexicons

Authors: Ryan Cotterell, Christo Kirov, Sabrina J. Mielke, Jason Eisner

Abstract: Lexical ambiguity makes it difficult to compute various useful statistics of a corpus. A given word form might represent any of several morphological feature bundles. One can, however, use unsupervised learning (as in EM) to fit a model that probabilistically disambiguates word forms. We present such an approach, which employs a neural network to smoothly model a prior distribution over feature bu… ▽ More Lexical ambiguity makes it difficult to compute various useful statistics of a corpus. A given word form might represent any of several morphological feature bundles. One can, however, use unsupervised learning (as in EM) to fit a model that probabilistically disambiguates word forms. We present such an approach, which employs a neural network to smoothly model a prior distribution over feature bundles (even rare ones). Although this basic model does not consider a token's context, that very property allows it to operate on a simple list of unigram type counts, partitioning each count among different analyses of that unigram. We discuss evaluation metrics for this novel task and report results on 5 languages. △ Less

Submitted 25 February, 2020; v1 submitted 10 June, 2018; originally announced June 2018.

Comments: Published at NAACL 2018

arXiv:1804.08205 [pdf, other]

Spell Once, Summon Anywhere: A Two-Level Open-Vocabulary Language Model

Authors: Sabrina J. Mielke, Jason Eisner

Abstract: We show how the spellings of known words can help us deal with unknown words in open-vocabulary NLP tasks. The method we propose can be used to extend any closed-vocabulary generative model, but in this paper we specifically consider the case of neural language modeling. Our Bayesian generative story combines a standard RNN language model (generating the word tokens in each sentence) with an RNN-b… ▽ More We show how the spellings of known words can help us deal with unknown words in open-vocabulary NLP tasks. The method we propose can be used to extend any closed-vocabulary generative model, but in this paper we specifically consider the case of neural language modeling. Our Bayesian generative story combines a standard RNN language model (generating the word tokens in each sentence) with an RNN-based spelling model (generating the letters in each word type). These two RNNs respectively capture sentence structure and word structure, and are kept separate as in linguistics. By invoking the second RNN to generate spellings for novel words in context, we obtain an open-vocabulary language model. For known words, embeddings are naturally inferred by combining evidence from type spelling and token context. Comparing to baselines (including a novel strong baseline), we beat previous work and establish state-of-the-art results on multiple datasets. △ Less

Submitted 25 February, 2020; v1 submitted 22 April, 2018; originally announced April 2018.

Comments: Accepted for publication at AAAI 2019

arXiv:1709.07188 [pdf, other]

doi 10.1063/1.5005515

A highly stable monolithic enhancement cavity for SHG generation in the UV

Authors: S. Hannig, J. Mielke, J. A. Fenske, M. Misera, N. Beev, C. Ospelkaus, P. O. Schmidt

Abstract: We present a highly stable bow-tie power enhancement cavity for critical second-harmonic generation into the UV using a Brewster-cut $β$-BaB$_2$O$_4$ (BBO) nonlinear crystal. The cavity geometry is suitable for all UV wavelengths reachable with BBO and can be modified to accommodate anti-reflection coated crystals, extending its applicability to the entire wavelength range accessible with non-line… ▽ More We present a highly stable bow-tie power enhancement cavity for critical second-harmonic generation into the UV using a Brewster-cut $β$-BaB$_2$O$_4$ (BBO) nonlinear crystal. The cavity geometry is suitable for all UV wavelengths reachable with BBO and can be modified to accommodate anti-reflection coated crystals, extending its applicability to the entire wavelength range accessible with non-linear frequency conversion. The cavity is length-stabilized using a fast general purpose digital PI controller based on the open source STEMlab 125-14 (formerly Red Pitaya) system acting on a mirror mounted on a fast piezo actuator. We observe $130\,\mathrm{h}$ uninterrupted operation without decay in output power at $313\,\mathrm{nm}$. The robustness of the system has been confirmed by exposing it to accelerations of up to $1\,\mathrm{g}$ with less than $10\%$ in-lock output power variations. Furthermore, the cavity can withstand 30~minutes of acceleration exposure at a level of $3\,\mathrm{g}_\mathrm{rms}$ without substantial change in SHG output power, demonstrating that the design is suitable for transportable setups. △ Less

Submitted 21 September, 2017; originally announced September 2017.

Showing 1–34 of 34 results for author: Mielke, J