Search | arXiv e-print repository

arXiv:2407.20914 [pdf, ps, other]

doi 10.1109/LSP.2024.3436669

An Efficient Convex-Hull Relaxation Based Algorithm for Multi-User Discrete Passive Beamforming

Authors: Wenhai Lai, Zheyu Wu, Yi Feng, Kaiming Shen, Ya-Feng Liu

Abstract: Intelligent reflecting surface (IRS) is an emerging technology to enhance spatial multiplexing in wireless networks. This letter considers the discrete passive beamforming design for IRS in order to maximize the minimum signal-to-interference-plus-noise ratio (SINR) among multiple users in an IRS-assisted downlink network. The main design difficulty lies in the discrete phase-shift constraint. Dif… ▽ More Intelligent reflecting surface (IRS) is an emerging technology to enhance spatial multiplexing in wireless networks. This letter considers the discrete passive beamforming design for IRS in order to maximize the minimum signal-to-interference-plus-noise ratio (SINR) among multiple users in an IRS-assisted downlink network. The main design difficulty lies in the discrete phase-shift constraint. Differing from most existing works, this letter advocates a convex-hull relaxation of the discrete constraints which leads to a continuous reformulated problem equivalent to the original discrete problem. This letter further proposes an efficient alternating projection/proximal gradient descent and ascent algorithm for solving the reformulated problem. Simulation results show that the proposed algorithm outperforms the state-of-the-art methods significantly. △ Less

Submitted 30 July, 2024; originally announced July 2024.

Comments: 5 pages

Journal ref: IEEE Signal Processing Letters 2024

arXiv:2407.12648 [pdf, ps, other]

Blind Beamforming for Coverage Enhancement with Intelligent Reflecting Surface

Authors: Fan Xu, Jiawei Yao, Wenhai Lai, Kaiming Shen, Xin Li, Xin Chen, Zhi-Quan Luo

Abstract: Conventional policy for configuring an intelligent reflecting surface (IRS) typically requires channel state information (CSI), thus incurring substantial overhead costs and facing incompatibility with the current network protocols. This paper proposes a blind beamforming strategy in the absence of CSI, aiming to boost the minimum signal-to-noise ratio (SNR) among all the receiver positions, namel… ▽ More Conventional policy for configuring an intelligent reflecting surface (IRS) typically requires channel state information (CSI), thus incurring substantial overhead costs and facing incompatibility with the current network protocols. This paper proposes a blind beamforming strategy in the absence of CSI, aiming to boost the minimum signal-to-noise ratio (SNR) among all the receiver positions, namely the coverage enhancement. Although some existing works already consider the IRS-assisted coverage enhancement without CSI, they assume certain position-channel models through which the channels can be recovered from the geographic locations. In contrast, our approach solely relies on the received signal power data, not assuming any position-channel model. We examine the achievability and converse of the proposed blind beamforming method. If the IRS has $N$ reflective elements and there are $U$ receiver positions, then our method guarantees the minimum SNR of $Ω(N^2/U)$ -- which is fairly close to the upper bound $O(N+N^2\sqrt{\ln (NU)}/\sqrt[4]{U})$. Aside from the simulation results, we justify the practical use of blind beamforming in a field test at 2.6 GHz. According to the real-world experiment, the proposed blind beamforming method boosts the minimum SNR across seven random positions in a conference room by 18.22 dB, while the position-based method yields a boost of 12.08 dB. △ Less

Submitted 17 July, 2024; originally announced July 2024.

Comments: 17 pages

arXiv:2407.12496 [pdf, other]

Towards real-world applications of levitated optomechanics

Authors: Yuanbin Jin, Kunhong Shen, Peng Ju, Tongcang Li

Abstract: Levitated optomechanics, a rapidly expanding field that employs light to monitor and manipulate the mechanical motion of levitated objects, is increasingly relevant across physics, engineering, and other fields. This technique, which involves levitating micro- and nano-scale objects in a vacuum where they exhibit high-quality motion, provides an essential platform for precision measurements. Noted… ▽ More Levitated optomechanics, a rapidly expanding field that employs light to monitor and manipulate the mechanical motion of levitated objects, is increasingly relevant across physics, engineering, and other fields. This technique, which involves levitating micro- and nano-scale objects in a vacuum where they exhibit high-quality motion, provides an essential platform for precision measurements. Noted for their ultra-high sensitivity, levitated particles hold potential for a wide range of real-world applications. This perspective article briefly introduces the principle of optical levitation and the dynamics of levitated particles. It then reviews the emerging applications of levitated particles in ultrasensitive force and torque measurements, acceleration and rotation sensing, electric and magnetic field detection, scanning probe microscopy, localized vacuum pressure gauging, acoustic transduction, and chemical and biological sensing. Moreover, we discuss the present challenges and explore opportunities to minimize and integrate levitation systems for broader applications. We also briefly review optomechanics with ion traps and magnetic traps which can levitate particles in high vacuum without laser heating. △ Less

Submitted 17 July, 2024; originally announced July 2024.

Comments: Perspective article, 20 pages, 12 figures

arXiv:2407.12258 [pdf, other]

Facial Affect Recognition based on Multi Architecture Encoder and Feature Fusion for the ABAW7 Challenge

Authors: Kang Shen, Xuxiong Liu, Boyan Wang, Jun Yao, Xin Liu, Yujie Guan, Yu Wang, Gengchen Li, Xiao Sun

Abstract: In this paper, we present our approach to addressing the challenges of the 7th ABAW competition. The competition comprises three sub-challenges: Valence Arousal (VA) estimation, Expression (Expr) classification, and Action Unit (AU) detection. To tackle these challenges, we employ state-of-the-art models to extract powerful visual features. Subsequently, a Transformer Encoder is utilized to integr… ▽ More In this paper, we present our approach to addressing the challenges of the 7th ABAW competition. The competition comprises three sub-challenges: Valence Arousal (VA) estimation, Expression (Expr) classification, and Action Unit (AU) detection. To tackle these challenges, we employ state-of-the-art models to extract powerful visual features. Subsequently, a Transformer Encoder is utilized to integrate these features for the VA, Expr, and AU sub-challenges. To mitigate the impact of varying feature dimensions, we introduce an affine module to align the features to a common dimension. Overall, our results significantly outperform the baselines. △ Less

Submitted 26 July, 2024; v1 submitted 16 July, 2024; originally announced July 2024.

arXiv:2407.12257 [pdf, other]

Compound Expression Recognition via Multi Model Ensemble for the ABAW7 Challenge

Authors: Xuxiong Liu, Kang Shen, Jun Yao, Boyan Wang, Minrui Liu, Liuwei An, Zishun Cui, Weijie Feng, Xiao Sun

Abstract: Compound Expression Recognition (CER) is vital for effective interpersonal interactions. Human emotional expressions are inherently complex due to the presence of compound expressions, requiring the consideration of both local and global facial cues for accurate judgment. In this paper, we propose an ensemble learning-based solution to address this complexity. Our approach involves training three… ▽ More Compound Expression Recognition (CER) is vital for effective interpersonal interactions. Human emotional expressions are inherently complex due to the presence of compound expressions, requiring the consideration of both local and global facial cues for accurate judgment. In this paper, we propose an ensemble learning-based solution to address this complexity. Our approach involves training three distinct expression classification models using convolutional networks, Vision Transformers, and multiscale local attention networks. By employing late fusion for model ensemble, we combine the outputs of these models to predict the final results. Our method demonstrates high accuracy on the RAF-DB datasets and is capable of recognizing expressions in certain portions of the C-EXPR-DB through zero-shot learning. △ Less

Submitted 26 July, 2024; v1 submitted 16 July, 2024; originally announced July 2024.

Comments: arXiv admin note: text overlap with arXiv:2403.12572 by other authors

arXiv:2407.10714 [pdf, other]

SEMINAR: Search Enhanced Multi-modal Interest Network and Approximate Retrieval for Lifelong Sequential Recommendation

Authors: Kaiming Shen, Xichen Ding, Zixiang Zheng, Yuqi Gong, Qianqian Li, Zhongyi Liu, Guannan Zhang

Abstract: The modeling of users' behaviors is crucial in modern recommendation systems. A lot of research focuses on modeling users' lifelong sequences, which can be extremely long and sometimes exceed thousands of items. These models use the target item to search for the most relevant items from the historical sequence. However, training lifelong sequences in click through rate (CTR) prediction or personal… ▽ More The modeling of users' behaviors is crucial in modern recommendation systems. A lot of research focuses on modeling users' lifelong sequences, which can be extremely long and sometimes exceed thousands of items. These models use the target item to search for the most relevant items from the historical sequence. However, training lifelong sequences in click through rate (CTR) prediction or personalized search ranking (PSR) is extremely difficult due to the insufficient learning problem of ID embedding, especially when the IDs in the lifelong sequence features do not exist in the samples of training dataset. Additionally, existing target attention mechanisms struggle to learn the multi-modal representations of items in the sequence well. The distribution of multi-modal embedding (text, image and attributes) output of user's interacted items are not properly aligned and there exist divergence across modalities. We also observe that users' search query sequences and item browsing sequences can fully depict users' intents and benefit from each other. To address these challenges, we propose a unified lifelong multi-modal sequence model called SEMINAR-Search Enhanced Multi-Modal Interest Network and Approximate Retrieval. Specifically, a network called Pretraining Search Unit (PSU) learns the lifelong sequences of multi-modal query-item pairs in a pretraining-finetuning manner with multiple objectives: multi-modal alignment, next query-item pair prediction, query-item relevance prediction, etc. After pretraining, the downstream model restores the pretrained embedding as initialization and finetunes the network. To accelerate the online retrieval speed of multi-modal embedding, we propose a multi-modal codebook-based product quantization strategy to approximate the exact attention calculati △ Less

Submitted 15 July, 2024; originally announced July 2024.

Comments: 9 pages,code released

arXiv:2407.09982 [pdf]

Artificial intelligence and machine learning applications for cultured meat

Authors: Michael E. Todhunter, Sheikh Jubair, Ruchika Verma, Rikard Saqe, Kevin Shen, Breanna Duffy

Abstract: Cultured meat has the potential to provide a complementary meat industry with reduced environmental, ethical, and health impacts. However, major technological challenges remain which require time- and resource-intensive research and development efforts. Machine learning has the potential to accelerate cultured meat technology by streamlining experiments, predicting optimal results, and reducing ex… ▽ More Cultured meat has the potential to provide a complementary meat industry with reduced environmental, ethical, and health impacts. However, major technological challenges remain which require time- and resource-intensive research and development efforts. Machine learning has the potential to accelerate cultured meat technology by streamlining experiments, predicting optimal results, and reducing experimentation time and resources. However, the use of machine learning in cultured meat is in its infancy. This review covers the work available to date on the use of machine learning in cultured meat and explores future possibilities. We address four major areas of cultured meat research and development: establishing cell lines, cell culture media design, microscopy and image analysis, and bioprocessing and food processing optimization. This review aims to provide the foundation necessary for both cultured meat and machine learning scientists to identify research opportunities at the intersection between cultured meat and machine learning. △ Less

Submitted 30 April, 2024; originally announced July 2024.

Comments: 23 pages (43 pages with references), 4 figures. The first two listed authors share first authorship; they and the last listed author contributed equally to this work

arXiv:2407.05100 [pdf, other]

doi 10.1109/TPAMI.2024.3425222

Ask Questions with Double Hints: Visual Question Generation with Answer-awareness and Region-reference

Authors: Kai Shen, Lingfei Wu, Siliang Tang, Fangli Xu, Bo Long, Yueting Zhuang, Jian Pei

Abstract: The visual question generation (VQG) task aims to generate human-like questions from an image and potentially other side information (e.g. answer type). Previous works on VQG fall in two aspects: i) They suffer from one image to many questions mapping problem, which leads to the failure of generating referential and meaningful questions from an image. ii) They fail to model complex implicit relati… ▽ More The visual question generation (VQG) task aims to generate human-like questions from an image and potentially other side information (e.g. answer type). Previous works on VQG fall in two aspects: i) They suffer from one image to many questions mapping problem, which leads to the failure of generating referential and meaningful questions from an image. ii) They fail to model complex implicit relations among the visual objects in an image and also overlook potential interactions between the side information and image. To address these limitations, we first propose a novel learning paradigm to generate visual questions with answer-awareness and region-reference. Concretely, we aim to ask the right visual questions with Double Hints - textual answers and visual regions of interests, which could effectively mitigate the existing one-to-many mapping issue. Particularly, we develop a simple methodology to self-learn the visual hints without introducing any additional human annotations. Furthermore, to capture these sophisticated relationships, we propose a new double-hints guided Graph-to-Sequence learning framework, which first models them as a dynamic graph and learns the implicit topology end-to-end, and then utilizes a graph-to-sequence model to generate the questions with double hints. Experimental results demonstrate the priority of our proposed method. △ Less

Submitted 6 July, 2024; originally announced July 2024.

Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence 2024

arXiv:2407.03424 [pdf, other]

Supernova Shocks Cannot Explain the Inflated State of Hypervelocity Runaways from White Dwarf Binaries

Authors: Aakash Bhat, Evan B. Bauer, Rüdiger Pakmor, Ken J. Shen, Ilaria Caiazzo, Abinaya Swaruba Rajamuthukumar, Kareem El-Badry, Wolfgang E. Kerzendorf

Abstract: Recent observations have found a growing number of hypervelocity stars with speeds of $\approx 1500-2500\,$km\,s$^{-1}$ which could have only been produced through thermonuclear supernovae in white dwarf binaries. Most of the observed hypervelocity runaways in this class display a surprising inflated structure: their current radii are roughly an order of magnitude greater than they would have been… ▽ More Recent observations have found a growing number of hypervelocity stars with speeds of $\approx 1500-2500\,$km\,s$^{-1}$ which could have only been produced through thermonuclear supernovae in white dwarf binaries. Most of the observed hypervelocity runaways in this class display a surprising inflated structure: their current radii are roughly an order of magnitude greater than they would have been as white dwarfs filling their Roche lobe. While many simulations exist studying the dynamical phase leading to supernova detonation in these systems, no detailed calculations of the long-term structure of the runaways have yet been performed. We use an existing \textsc{Arepo} hydrodynamical simulation of a supernova in a white dwarf binary as a starting point for the evolution of these stars with the 1 dimensional stellar evolution code MESA. We show that the supernova shock is not enough to inflate the white dwarf over timescales longer than a few thousand years, significantly shorter than the $10^{5-6}$ year lifetimes inferred for observed hypervelocity runaways. Despite experiencing a shock from a supernova less than $\approx 0.02\,R_\odot$ away, our models do not experience significant interior heating, and all contract back to radii around $0.01\,R_\odot$ within about $10^4$\,years. Explaining the observed inflated states requires either an additional source of significant heating or some other physics that is not yet accounted for in the subsequent evolution. △ Less

Submitted 3 July, 2024; originally announced July 2024.

Comments: Submitted to A\&A. 15 pages, 17 figures

arXiv:2406.14809 [pdf, other]

Gas permeability, diffusivity, and solubility in polymers: Simulation-experiment data fusion and multi-task machine learning

Authors: Brandon K. Phan, Kuan-Hsuan Shen, Rishi Gurnani, Huan Tran, Ryan Lively, Rampi Ramprasad

Abstract: Machine learning (ML) models for predicting gas permeability through polymers have traditionally relied on experimental data. While these models exhibit robustness within familiar chemical domains, reliability wanes when applied to new spaces. To address this challenge, we present a multi-tiered multi-task learning framework empowered with advanced machine-crafted polymer fingerprinting algorithms… ▽ More Machine learning (ML) models for predicting gas permeability through polymers have traditionally relied on experimental data. While these models exhibit robustness within familiar chemical domains, reliability wanes when applied to new spaces. To address this challenge, we present a multi-tiered multi-task learning framework empowered with advanced machine-crafted polymer fingerprinting algorithms and data fusion techniques. This framework combines scarce "high-fidelity" experimental data with abundant diverse "low-fidelity" simulation or synthetic data, resulting in predictive models that display a high level of generalizability across novel chemical spaces. Additionally, this multi-task scheme capitalizes on known physics and interrelated properties, such as gas diffusivity and solubility, both of which are closely tied to permeability. By amalgamating high-throughput generated simulation data with available experimental data for gas permeability, diffusivity, and solubility for various gases, we construct multi-task deep learning models. These models can simultaneously predict all three properties for all gases under consideration. With markedly enhanced predictive accuracy, particularly compared to traditional models reliant solely on experimental data for a singular property. This strategy underscores the potential of coupling high-throughput classical simulations with data fusion methodologies to yield state-of-the-art property predictors, especially when experimental data for targeted properties is scarce. △ Less

Submitted 20 June, 2024; originally announced June 2024.

Comments: Submitted to npj Computational Materials

arXiv:2406.10910 [pdf, ps, other]

Fast Fractional Programming for Multi-Cell Integrated Sensing and Communications

Authors: Yannan Chen, Yi Feng, Xiaoyang Li, Licheng Zhao, Kaiming Shen

Abstract: This paper concerns the coordinate multi-cell beamforming design for integrated sensing and communications (ISAC). In particular, we assume that each base station (BS) has massive antennas. The optimization objective is to maximize a weighted sum of the data rates (for communications) and the Fisher information (for sensing). We first show that the conventional beamforming method for the multiple-… ▽ More This paper concerns the coordinate multi-cell beamforming design for integrated sensing and communications (ISAC). In particular, we assume that each base station (BS) has massive antennas. The optimization objective is to maximize a weighted sum of the data rates (for communications) and the Fisher information (for sensing). We first show that the conventional beamforming method for the multiple-input multiple-output (MIMO) transmission, i.e., the weighted minimum mean square error (WMMSE) algorithm, has a natural extension to the ISAC problem scenario from a fractional programming (FP) perspective. However, the extended WMMSE algorithm requires computing the $N\times N$ matrix inverse extensively, where $N$ is proportional to the antenna array size, so the algorithm becomes quite costly when antennas are massively deployed. To address this issue, we develop a nonhomogeneous bound and use it in conjunction with the FP technique to solve the ISAC beamforming problem without the need to invert any large matrices. It is further shown that the resulting new FP algorithm has an intimate connection with gradient projection, based on which we can accelerate the convergence via Nesterov's gradient extrapolation. △ Less

Submitted 16 June, 2024; originally announced June 2024.

arXiv:2406.07119 [pdf, other]

T2S-GPT: Dynamic Vector Quantization for Autoregressive Sign Language Production from Text

Authors: Aoxiong Yin, Haoyuan Li, Kai Shen, Siliang Tang, Yueting Zhuang

Abstract: In this work, we propose a two-stage sign language production (SLP) paradigm that first encodes sign language sequences into discrete codes and then autoregressively generates sign language from text based on the learned codebook. However, existing vector quantization (VQ) methods are fixed-length encodings, overlooking the uneven information density in sign language, which leads to under-encoding… ▽ More In this work, we propose a two-stage sign language production (SLP) paradigm that first encodes sign language sequences into discrete codes and then autoregressively generates sign language from text based on the learned codebook. However, existing vector quantization (VQ) methods are fixed-length encodings, overlooking the uneven information density in sign language, which leads to under-encoding of important regions and over-encoding of unimportant regions. To address this issue, we propose a novel dynamic vector quantization (DVA-VAE) model that can dynamically adjust the encoding length based on the information density in sign language to achieve accurate and compact encoding. Then, a GPT-like model learns to generate code sequences and their corresponding durations from spoken language text. Extensive experiments conducted on the PHOENIX14T dataset demonstrate the effectiveness of our proposed method. To promote sign language research, we propose a new large German sign language dataset, PHOENIX-News, which contains 486 hours of sign language videos, audio, and transcription texts.Experimental analysis on PHOENIX-News shows that the performance of our model can be further improved by increasing the size of the training data. Our project homepage is https://t2sgpt-demo.yinaoxiong.cn. △ Less

Submitted 11 June, 2024; originally announced June 2024.

Comments: Accepted by ACL 2024

arXiv:2405.19417 [pdf, other]

Almost All Carbon/Oxygen White Dwarfs Can Support Double Detonations

Authors: Ken J. Shen, Samuel J. Boos, Dean M. Townsley

Abstract: Double detonations of sub-Chandrasekhar-mass white dwarfs (WDs) in unstably mass-transferring double WD binaries have become a leading contender to explain most, if not all, Type Ia supernovae. However, past theoretical studies of the explosion process have assumed relatively ad hoc initial conditions for the helium shells in which the double detonations begin. In this work, we construct realistic… ▽ More Double detonations of sub-Chandrasekhar-mass white dwarfs (WDs) in unstably mass-transferring double WD binaries have become a leading contender to explain most, if not all, Type Ia supernovae. However, past theoretical studies of the explosion process have assumed relatively ad hoc initial conditions for the helium shells in which the double detonations begin. In this work, we construct realistic C/O WDs to use as the starting points for multidimensional double detonation simulations. We supplement these with simplified one-dimensional detonation calculations to gain a physical understanding of the conditions under which shell detonations can propagate successfully. We find that C/O WDs <= 1.0 Msol, which make up the majority of C/O WDs, are born with structures that can support double detonations. More massive C/O WDs require ~1e-3 Msol of accretion before detonations can successfully propagate in their shells, but such accretion may be common in the double WD binaries that host massive WDs. Our findings strongly suggest that if the direct impact accretion stream reaches high enough temperatures and densities during mass transfer from one WD to another, the accreting WD will undergo a double detonation. Furthermore, if the companion is also a C/O WD <= 1.0 Msol, it will undergo its own double detonation when impacted by the ejecta from the first explosion. Exceptions to this outcome may explain the newly discovered class of hypervelocity supernova survivors. △ Less

Submitted 29 May, 2024; originally announced May 2024.

Comments: Submitted

arXiv:2405.16096 [pdf, other]

doi 10.1109/TII.2024.3366221

MINet: Multi-scale Interactive Network for Real-time Salient Object Detection of Strip Steel Surface Defects

Authors: Kunye Shen, Xiaofei Zhou, Zhi Liu

Abstract: The automated surface defect detection is a fundamental task in industrial production, and the existing saliencybased works overcome the challenging scenes and give promising detection results. However, the cutting-edge efforts often suffer from large parameter size, heavy computational cost, and slow inference speed, which heavily limits the practical applications. To this end, we devise a multi-… ▽ More The automated surface defect detection is a fundamental task in industrial production, and the existing saliencybased works overcome the challenging scenes and give promising detection results. However, the cutting-edge efforts often suffer from large parameter size, heavy computational cost, and slow inference speed, which heavily limits the practical applications. To this end, we devise a multi-scale interactive (MI) module, which employs depthwise convolution (DWConv) and pointwise convolution (PWConv) to independently extract and interactively fuse features of different scales, respectively. Particularly, the MI module can provide satisfactory characterization for defect regions with fewer parameters. Embarking on this module, we propose a lightweight Multi-scale Interactive Network (MINet) to conduct real-time salient object detection of strip steel surface defects. Comprehensive experimental results on SD-Saliency-900 dataset, which contains three kinds of strip steel surface defect detection images (i.e., inclusion, patches, and scratches), demonstrate that the proposed MINet presents comparable detection accuracy with the state-of-the-art methods while running at a GPU speed of 721FPS and a CPU speed of 6.3FPS for 368*368 images with only 0.28M parameters. The code is available at https://github.com/Kunye-Shen/MINet. △ Less

Submitted 25 May, 2024; originally announced May 2024.

Comments: accepted by IEEE Transactions on Industrial Informatics

arXiv:2405.15185 [pdf, other]

An Evaluation of Estimative Uncertainty in Large Language Models

Authors: Zhisheng Tang, Ke Shen, Mayank Kejriwal

Abstract: Words of estimative probability (WEPs), such as ''maybe'' or ''probably not'' are ubiquitous in natural language for communicating estimative uncertainty, compared with direct statements involving numerical probability. Human estimative uncertainty, and its calibration with numerical estimates, has long been an area of study -- including by intelligence agencies like the CIA. This study compares e… ▽ More Words of estimative probability (WEPs), such as ''maybe'' or ''probably not'' are ubiquitous in natural language for communicating estimative uncertainty, compared with direct statements involving numerical probability. Human estimative uncertainty, and its calibration with numerical estimates, has long been an area of study -- including by intelligence agencies like the CIA. This study compares estimative uncertainty in commonly used large language models (LLMs) like GPT-4 and ERNIE-4 to that of humans, and to each other. Here we show that LLMs like GPT-3.5 and GPT-4 align with human estimates for some, but not all, WEPs presented in English. Divergence is also observed when the LLM is presented with gendered roles and Chinese contexts. Further study shows that an advanced LLM like GPT-4 can consistently map between statistical and estimative uncertainty, but a significant performance gap remains. The results contribute to a growing body of research on human-LLM alignment. △ Less

Submitted 23 May, 2024; originally announced May 2024.

arXiv:2404.19142 [pdf, other]

Purcell enhanced optical refrigeration

Authors: Peng Ju, Stefan Püschel, Kunhong Shen, Yuanbin Jin, Hiroki Tanaka, Tongcang Li

Abstract: Optical refrigeration of solids with anti-Stokes fluorescence has been widely explored as a vibration-free cryogenic cooling technology. A minimum temperature of 87 K has been demonstrated with rare-earth ion doped crystals using optical refrigeration. However, the depletion of the upper-lying energy levels in the ground state manifold hinders further cooling to below liquid nitrogen (LN$_2$) temp… ▽ More Optical refrigeration of solids with anti-Stokes fluorescence has been widely explored as a vibration-free cryogenic cooling technology. A minimum temperature of 87 K has been demonstrated with rare-earth ion doped crystals using optical refrigeration. However, the depletion of the upper-lying energy levels in the ground state manifold hinders further cooling to below liquid nitrogen (LN$_2$) temperatures, confining its applications. In this work, we introduce a Purcell enhanced optical refrigeration method to circumvent this limitation. This approach enhances the emission of high energy photons by coupling to a nearby nanocavity, blue shifting the mean emission wavelength. Such Purcell enhanced emission facilitates cooling starting from a lower energy level in the ground state manifold, which exhibits a higher occupation below LN$_2$ temperatures. Using our experimentally measured optical coefficients, our theoretical analysis predicts a minimum achievable temperature of 38 K for a Yb$^{3+}$:YLiF$_{4}$ nanocrystal near a cavity under realistic conditions. The proposed method is applicable to other rare-earth ion doped materials and semiconductors, and will have applications in creating superconducting and other quantum devices with solid-state cooling. △ Less

Submitted 29 April, 2024; originally announced April 2024.

Comments: 6 pages

arXiv:2404.16581 [pdf, other]

AudioScenic: Audio-Driven Video Scene Editing

Authors: Kaixin Shen, Ruijie Quan, Linchao Zhu, Jun Xiao, Yi Yang

Abstract: Audio-driven visual scene editing endeavors to manipulate the visual background while leaving the foreground content unchanged, according to the given audio signals. Unlike current efforts focusing primarily on image editing, audio-driven video scene editing has not been extensively addressed. In this paper, we introduce AudioScenic, an audio-driven framework designed for video scene editing. Audi… ▽ More Audio-driven visual scene editing endeavors to manipulate the visual background while leaving the foreground content unchanged, according to the given audio signals. Unlike current efforts focusing primarily on image editing, audio-driven video scene editing has not been extensively addressed. In this paper, we introduce AudioScenic, an audio-driven framework designed for video scene editing. AudioScenic integrates audio semantics into the visual scene through a temporal-aware audio semantic injection process. As our focus is on background editing, we further introduce a SceneMasker module, which maintains the integrity of the foreground content during the editing process. AudioScenic exploits the inherent properties of audio, namely, audio magnitude and frequency, to guide the editing process, aiming to control the temporal dynamics and enhance the temporal consistency. First, we present an audio Magnitude Modulator module that adjusts the temporal dynamics of the scene in response to changes in audio magnitude, enhancing the visual dynamics. Second, the audio Frequency Fuser module is designed to ensure temporal consistency by aligning the frequency of the audio with the dynamics of the video scenes, thus improving the overall temporal coherence of the edited videos. These integrated features enable AudioScenic to not only enhance visual diversity but also maintain temporal consistency throughout the video. We present a new metric named temporal score for more comprehensive validation of temporal consistency. We demonstrate substantial advancements of AudioScenic over competing methods on DAVIS and Audioset datasets. △ Less

Submitted 25 April, 2024; originally announced April 2024.

arXiv:2404.16579 [pdf, other]

Neural Interaction Energy for Multi-Agent Trajectory Prediction

Authors: Kaixin Shen, Ruijie Quan, Linchao Zhu, Jun Xiao, Yi Yang

Abstract: Maintaining temporal stability is crucial in multi-agent trajectory prediction. Insufficient regularization to uphold this stability often results in fluctuations in kinematic states, leading to inconsistent predictions and the amplification of errors. In this study, we introduce a framework called Multi-Agent Trajectory prediction via neural interaction Energy (MATE). This framework assesses the… ▽ More Maintaining temporal stability is crucial in multi-agent trajectory prediction. Insufficient regularization to uphold this stability often results in fluctuations in kinematic states, leading to inconsistent predictions and the amplification of errors. In this study, we introduce a framework called Multi-Agent Trajectory prediction via neural interaction Energy (MATE). This framework assesses the interactive motion of agents by employing neural interaction energy, which captures the dynamics of interactions and illustrates their influence on the future trajectories of agents. To bolster temporal stability, we introduce two constraints: inter-agent interaction constraint and intra-agent motion constraint. These constraints work together to ensure temporal stability at both the system and agent levels, effectively mitigating prediction fluctuations inherent in multi-agent systems. Comparative evaluations against previous methods on four diverse datasets highlight the superior prediction accuracy and generalization capabilities of our model. △ Less

Submitted 25 April, 2024; originally announced April 2024.

arXiv:2404.03204 [pdf, other]

RALL-E: Robust Codec Language Modeling with Chain-of-Thought Prompting for Text-to-Speech Synthesis

Authors: Detai Xin, Xu Tan, Kai Shen, Zeqian Ju, Dongchao Yang, Yuancheng Wang, Shinnosuke Takamichi, Hiroshi Saruwatari, Shujie Liu, Jinyu Li, Sheng Zhao

Abstract: We present RALL-E, a robust language modeling method for text-to-speech (TTS) synthesis. While previous work based on large language models (LLMs) shows impressive performance on zero-shot TTS, such methods often suffer from poor robustness, such as unstable prosody (weird pitch and rhythm/duration) and a high word error rate (WER), due to the autoregressive prediction style of language models. Th… ▽ More We present RALL-E, a robust language modeling method for text-to-speech (TTS) synthesis. While previous work based on large language models (LLMs) shows impressive performance on zero-shot TTS, such methods often suffer from poor robustness, such as unstable prosody (weird pitch and rhythm/duration) and a high word error rate (WER), due to the autoregressive prediction style of language models. The core idea behind RALL-E is chain-of-thought (CoT) prompting, which decomposes the task into simpler steps to enhance the robustness of LLM-based TTS. To accomplish this idea, RALL-E first predicts prosody features (pitch and duration) of the input text and uses them as intermediate conditions to predict speech tokens in a CoT style. Second, RALL-E utilizes the predicted duration prompt to guide the computing of self-attention weights in Transformer to enforce the model to focus on the corresponding phonemes and prosody features when predicting speech tokens. Results of comprehensive objective and subjective evaluations demonstrate that, compared to a powerful baseline method VALL-E, RALL-E significantly improves the WER of zero-shot TTS from $5.6\%$ (without reranking) and $1.7\%$ (with reranking) to $2.5\%$ and $1.0\%$, respectively. Furthermore, we demonstrate that RALL-E correctly synthesizes sentences that are hard for VALL-E and reduces the error rate from $68\%$ to $4\%$. △ Less

Submitted 19 May, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

arXiv:2404.01359 [pdf]

Parallel Proportional Fusion of Spiking Quantum Neural Network for Optimizing Image Classification

Authors: Zuyu Xu, Kang Shen, Pengnian Cai, Tao Yang, Yuanming Hu, Shixian Chen, Yunlai Zhu, Zuheng Wu, Yuehua Dai, Jun Wang, Fei Yang

Abstract: The recent emergence of the hybrid quantum-classical neural network (HQCNN) architecture has garnered considerable attention due to the potential advantages associated with integrating quantum principles to enhance various facets of machine learning algorithms and computations. However, the current investigated serial structure of HQCNN, wherein information sequentially passes from one network to… ▽ More The recent emergence of the hybrid quantum-classical neural network (HQCNN) architecture has garnered considerable attention due to the potential advantages associated with integrating quantum principles to enhance various facets of machine learning algorithms and computations. However, the current investigated serial structure of HQCNN, wherein information sequentially passes from one network to another, often imposes limitations on the trainability and expressivity of the network. In this study, we introduce a novel architecture termed Parallel Proportional Fusion of Quantum and Spiking Neural Networks (PPF-QSNN). The dataset information is simultaneously fed into both the spiking neural network and the variational quantum circuits, with the outputs amalgamated in proportion to their individual contributions. We systematically assess the impact of diverse PPF-QSNN parameters on network performance for image classification, aiming to identify the optimal configuration. Numerical results on the MNIST dataset unequivocally illustrate that our proposed PPF-QSNN outperforms both the existing spiking neural network and the serial quantum neural network across metrics such as accuracy, loss, and robustness. This study introduces a novel and effective amalgamation approach for HQCNN, thereby laying the groundwork for the advancement and application of quantum advantage in artificial intelligent computations. △ Less

Submitted 1 April, 2024; originally announced April 2024.

arXiv:2403.17152 [pdf, other]

Local magnetic response of superconducting Sr$\mathrm{_2}$RuO$\mathrm{_4}$ thin films and rings

Authors: G. M. Ferguson, Hari P. Nair, Nathaniel J. Schreiber, Ludi Miao, Kyle M. Shen, Darrell G. Schlom, Katja C. Nowack

Abstract: We conduct local magnetic measurements on superconducting thin-film samples of Sr$\mathrm{_2}$RuO$\mathrm{_4}$ using scanning Superconducting Quantum Interference Device (SQUID) susceptometry. From the diamagnetic response, we extract the magnetic penetration depth, $λ$, which exhibits a quadratic temperature dependence at low temperatures. Although a quadratic dependence in high-purity bulk sampl… ▽ More We conduct local magnetic measurements on superconducting thin-film samples of Sr$\mathrm{_2}$RuO$\mathrm{_4}$ using scanning Superconducting Quantum Interference Device (SQUID) susceptometry. From the diamagnetic response, we extract the magnetic penetration depth, $λ$, which exhibits a quadratic temperature dependence at low temperatures. Although a quadratic dependence in high-purity bulk samples has been attributed to non-local electrodynamics, our analysis suggests that in our thin-film samples the presence of scattering is the origin of the quadratic dependence. While we observe micron-scale variations in the diamagnetic response and superconducting transition temperature, the form of the temperature dependence of $λ$ is independent of position. Finally, we characterize flux trapping in superconducting rings lithographically fabricated from the thin films, paving the way to systematic device-based tests of the superconducting order parameter in Sr$\mathrm{_2}$RuO$\mathrm{_4}$. △ Less

Submitted 25 March, 2024; originally announced March 2024.

arXiv:2403.08286 [pdf, ps, other]

Optical-Cavity Manipulation Strategies of Conical Intersections Mediated Singlet Fission Systems

Authors: Kewei Sun, Maxim Gelin, Kaijun Shen, Yang Zhao

Abstract: We offer a theoretical perspective on simulation and engineering of polaritonic conical-intersection-driven singlet-fission (SF) materials. Using rubrene as an example and applying the numerically accurate Davydov-Ansatz methodology, we derive dynamic and spectroscopic responses of the system and demonstrate key mechanisms capable of SF manipulation, viz. cavity-induced enhancement/weakening/suppr… ▽ More We offer a theoretical perspective on simulation and engineering of polaritonic conical-intersection-driven singlet-fission (SF) materials. Using rubrene as an example and applying the numerically accurate Davydov-Ansatz methodology, we derive dynamic and spectroscopic responses of the system and demonstrate key mechanisms capable of SF manipulation, viz. cavity-induced enhancement/weakening/suppression of SF, population localization on the singlet state via engineering of the cavity-mode excitation, polaron/polariton decoupling, collective enhancement of SF. We outline unsolved problems and challenges in the field, and share our views on the development of the future lines of research. We emphasize the significance of careful modeling of cascades of polaritonic conical intersections in high excitation manifolds and envisage that collective geometric phase effects may remarkably affect the SF dynamics and yield. We argue that microscopic interpretation of the main regulatory mechanisms of the polaritonic conical-intersection-driven SF can substantially deepen our understanding of this process, thereby providing novel ideas and solutions for improving conversion efficiency in photovoltaics. △ Less

Submitted 13 March, 2024; originally announced March 2024.

Comments: 14 pages, 6 figures

arXiv:2403.06051 [pdf, other]

Observation of non-contact Casimir friction

Authors: Zhujing Xu, Peng Ju, Kunhong Shen, Yuanbin Jin, Zubin Jacob, Tongcang Li

Abstract: Quantum mechanics predicts the occurrence of random electromagnetic field fluctuations, or virtual photons, in vacuum. The exchange of virtual photons between two bodies in relative motion could lead to non-contact quantum vacuum friction or Casimir friction. Despite its theoretical significance, the non-contact Casimir frictional force has not been observed and its theoretical predictions have va… ▽ More Quantum mechanics predicts the occurrence of random electromagnetic field fluctuations, or virtual photons, in vacuum. The exchange of virtual photons between two bodies in relative motion could lead to non-contact quantum vacuum friction or Casimir friction. Despite its theoretical significance, the non-contact Casimir frictional force has not been observed and its theoretical predictions have varied widely. In this work, we report the first measurement of the non-contact Casimir frictional force between two moving bodies. By employing two mechanical oscillators with resonant frequencies far lower than those in Lorentz models of electrons in dielectric materials, we have amplified the Casimir frictional force at low relative velocities by several orders of magnitude. We directly measure the non-contact Casimir frictional force between the two oscillators and show its linear dependence on velocity, proving the dissipative nature of Casimir friction. This advancement marks a pivotal contribution to the field of dissipative quantum electrodynamics and enhances our understanding of friction at the nanoscale. △ Less

Submitted 9 March, 2024; originally announced March 2024.

arXiv:2403.03100 [pdf, other]

NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models

Authors: Zeqian Ju, Yuancheng Wang, Kai Shen, Xu Tan, Detai Xin, Dongchao Yang, Yanqing Liu, Yichong Leng, Kaitao Song, Siliang Tang, Zhizheng Wu, Tao Qin, Xiang-Yang Li, Wei Ye, Shikun Zhang, Jiang Bian, Lei He, Jinyu Li, Sheng Zhao

Abstract: While recent large-scale text-to-speech (TTS) models have achieved significant progress, they still fall short in speech quality, similarity, and prosody. Considering speech intricately encompasses various attributes (e.g., content, prosody, timbre, and acoustic details) that pose significant challenges for generation, a natural idea is to factorize speech into individual subspaces representing di… ▽ More While recent large-scale text-to-speech (TTS) models have achieved significant progress, they still fall short in speech quality, similarity, and prosody. Considering speech intricately encompasses various attributes (e.g., content, prosody, timbre, and acoustic details) that pose significant challenges for generation, a natural idea is to factorize speech into individual subspaces representing different attributes and generate them individually. Motivated by it, we propose NaturalSpeech 3, a TTS system with novel factorized diffusion models to generate natural speech in a zero-shot way. Specifically, 1) we design a neural codec with factorized vector quantization (FVQ) to disentangle speech waveform into subspaces of content, prosody, timbre, and acoustic details; 2) we propose a factorized diffusion model to generate attributes in each subspace following its corresponding prompt. With this factorization design, NaturalSpeech 3 can effectively and efficiently model intricate speech with disentangled subspaces in a divide-and-conquer way. Experiments show that NaturalSpeech 3 outperforms the state-of-the-art TTS systems on quality, similarity, prosody, and intelligibility, and achieves on-par quality with human recordings. Furthermore, we achieve better performance by scaling to 1B parameters and 200K hours of training data. △ Less

Submitted 23 April, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

Comments: Achieving human-level quality and naturalness on multi-speaker datasets (e.g., LibriSpeech) in a zero-shot way

arXiv:2403.02705 [pdf, other]

Impact of (magneto-)thermoelectric effect on diffusion of conserved charges in hot and dense hadronic matter

Authors: He-Xia Zhang, Ke-Ming Shen, Yu-Xin Xiao, Ben-Wei Zhang

Abstract: We investigate the thermoelectric effect, which describes the generation of an electric field induced by temperature and conserved charge chemical potential gradients, in the hot and dense hadronic matter created in heavy-ion collisions. Utilizing the Boltzmann kinetic theory within the repulsive mean-field hadron resonance gas model, we evaluate both the diffusion thermopower matrix and diffusion… ▽ More We investigate the thermoelectric effect, which describes the generation of an electric field induced by temperature and conserved charge chemical potential gradients, in the hot and dense hadronic matter created in heavy-ion collisions. Utilizing the Boltzmann kinetic theory within the repulsive mean-field hadron resonance gas model, we evaluate both the diffusion thermopower matrix and diffusion coefficient matrix for the baryon number ($B$), electric charge ($Q$), and strangeness ($S$). The Landau-Lifshitz choice for the rest frame of the fluid is enforced in the derivation. We find that the thermoelectric effect hinders the diffusion processes of multiple conserved charges, particularly reducing the coupling between electric charge and baryon number (strangeness) in baryon (strangeness) diffusion. Given that the repulsive mean-field interactions between hadrons have a significant effect on the diffusion thermopower matrix and diffusion coefficient matrix in the baryon-rich region, we extend the investigation to include the impact of magnetic fields, analyzing the magneto-thermoelectric effect on both the diffusion coefficient matrix and the Hall-like diffusion coefficient matrix. The sensitivities of the magnetic field-dependent diffusion thermopower matrix and magneto-thermoelectric modified diffusion coefficient matrix to the choices of various transverse conditions are also studied. △ Less

Submitted 25 July, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

Comments: 21 pages, 12 figures, Version accepted by Phys. Rev. D

arXiv:2403.02405 [pdf, other]

Classification of the Fashion-MNIST Dataset on a Quantum Computer

Authors: Kevin Shen, Bernhard Jobst, Elvira Shishenina, Frank Pollmann

Abstract: The potential impact of quantum machine learning algorithms on industrial applications remains an exciting open question. Conventional methods for encoding classical data into quantum computers are not only too costly for a potential quantum advantage in the algorithms but also severely limit the scale of feasible experiments on current hardware. Therefore, recent works, despite claiming the near-… ▽ More The potential impact of quantum machine learning algorithms on industrial applications remains an exciting open question. Conventional methods for encoding classical data into quantum computers are not only too costly for a potential quantum advantage in the algorithms but also severely limit the scale of feasible experiments on current hardware. Therefore, recent works, despite claiming the near-term suitability of their algorithms, do not provide experimental benchmarking on standard machine learning datasets. We attempt to solve the data encoding problem by improving a recently proposed variational algorithm [1] that approximately prepares the encoded data, using asymptotically shallow circuits that fit the native gate set and topology of currently available quantum computers. We apply the improved algorithm to encode the Fashion-MNIST dataset [2], which can be directly used in future empirical studies of quantum machine learning algorithms. We deploy simple quantum variational classifiers trained on the encoded dataset on a current quantum computer ibmq-kolkata [3] and achieve moderate accuracies, providing a proof of concept for the near-term usability of our data encoding method. △ Less

Submitted 4 March, 2024; originally announced March 2024.

Comments: (15 pages, 11 figures)

arXiv:2402.19454 [pdf, ps, other]

Controllable suppression of the unconventional superconductivity in bulk and thin-film Sr$_{2}$RuO$_{4}$ via high-energy electron irradiation

Authors: Jacob P. Ruf, Hilary M. L. Noad, Romain Grasset, Ludi Miao, Elina Zhakina, Philippa H. McGuinness, Hari P. Nair, Nathaniel J. Schreiber, Naoki Kikugawa, Dmitry Sokolov, Marcin Konczykowski, Darrell G. Schlom, Kyle M. Shen, Andrew P. Mackenzie

Abstract: In bulk Sr$_{2}$RuO$_{4}$, the strong sensitivity of the superconducting transition temperature $T_{\text{c}}$ to nonmagnetic impurities provides robust evidence for a superconducting order parameter that changes sign around the Fermi surface. In superconducting epitaxial thin-film Sr$_{2}$RuO$_{4}$, the relationship between $T_{\text{c}}$ and the residual resistivity $ρ_0$, which in bulk samples… ▽ More In bulk Sr$_{2}$RuO$_{4}$, the strong sensitivity of the superconducting transition temperature $T_{\text{c}}$ to nonmagnetic impurities provides robust evidence for a superconducting order parameter that changes sign around the Fermi surface. In superconducting epitaxial thin-film Sr$_{2}$RuO$_{4}$, the relationship between $T_{\text{c}}$ and the residual resistivity $ρ_0$, which in bulk samples is taken to be a proxy for the low-temperature elastic scattering rate, is far less clear. Using high-energy electron irradiation to controllably introduce point disorder into bulk single-crystal and thin-film Sr$_{2}$RuO$_{4}$, we show that $T_{\text{c}}$ is suppressed in both systems at nearly identical rates. This suggests that part of $ρ_0$ in films comes from defects that do not contribute to superconducting pairbreaking, and establishes a quantitative link between the superconductivity of bulk and thin-film samples. △ Less

Submitted 29 February, 2024; originally announced February 2024.

arXiv:2402.13960 [pdf, other]

Evaluating Ground State Energies of Chemical Systems with Low-Depth Quantum Circuits and High Accuracy

Authors: Shuo Sun, Chandan Kumar, Kevin Shen, Elvira Shishenina, Christian B. Mendl

Abstract: Solving electronic structure problems is considered one of the most promising applications of quantum computing. However, due to limitations imposed by the coherence time of qubits in the Noisy Intermediate Scale Quantum (NISQ) era or the capabilities of early fault-tolerant quantum devices, it is vital to design algorithms with low-depth circuits. In this work, we develop an enhanced Variational… ▽ More Solving electronic structure problems is considered one of the most promising applications of quantum computing. However, due to limitations imposed by the coherence time of qubits in the Noisy Intermediate Scale Quantum (NISQ) era or the capabilities of early fault-tolerant quantum devices, it is vital to design algorithms with low-depth circuits. In this work, we develop an enhanced Variational Quantum Eigensolver (VQE) ansatz based on the Qubit Coupled Cluster (QCC) approach, which demands optimization over only $n$ parameters rather than the usual $n+2m$ parameters, where $n$ represents the number of Pauli string time evolution gates $e^{-itP}$, and $m$ is the number of qubits involved. We evaluate the ground state energies of $\mathrm{O_3}$, $\mathrm{Li_4}$, and $\mathrm{Cr_2}$, using CAS(2,2), (4,4) and (6,6) respectively in conjunction with our enhanced QCC ansatz, UCCSD (Unitary Coupled Cluster Single Double) ansatz, and canonical CCSD method as the active space solver, and compare with CASCI results. Finally, we assess our enhanced QCC ansatz on two distinct quantum hardware, IBM Kolkata and Quantinuum H1-1. △ Less

Submitted 21 February, 2024; originally announced February 2024.

Comments: 10 pages, 6 figures

arXiv:2402.13435 [pdf, other]

Learning to Retrieve for Job Matching

Authors: Jianqiang Shen, Yuchin Juan, Shaobo Zhang, Ping Liu, Wen Pu, Sriram Vasudevan, Qingquan Song, Fedor Borisyuk, Kay Qianqi Shen, Haichao Wei, Yunxiang Ren, Yeou S. Chiou, Sicong Kuang, Yuan Yin, Ben Zheng, Muchen Wu, Shaghayegh Gharghabi, Xiaoqing Wang, Huichao Xue, Qi Guo, Daniel Hewlett, Luke Simon, Liangjie Hong, Wenjing Zhang

Abstract: Web-scale search systems typically tackle the scalability challenge with a two-step paradigm: retrieval and ranking. The retrieval step, also known as candidate selection, often involves extracting standardized entities, creating an inverted index, and performing term matching for retrieval. Such traditional methods require manual and time-consuming development of query models. In this paper, we d… ▽ More Web-scale search systems typically tackle the scalability challenge with a two-step paradigm: retrieval and ranking. The retrieval step, also known as candidate selection, often involves extracting standardized entities, creating an inverted index, and performing term matching for retrieval. Such traditional methods require manual and time-consuming development of query models. In this paper, we discuss applying learning-to-retrieve technology to enhance LinkedIns job search and recommendation systems. In the realm of promoted jobs, the key objective is to improve the quality of applicants, thereby delivering value to recruiter customers. To achieve this, we leverage confirmed hire data to construct a graph that evaluates a seeker's qualification for a job, and utilize learned links for retrieval. Our learned model is easy to explain, debug, and adjust. On the other hand, the focus for organic jobs is to optimize seeker engagement. We accomplished this by training embeddings for personalized retrieval, fortified by a set of rules derived from the categorization of member feedback. In addition to a solution based on a conventional inverted index, we developed an on-GPU solution capable of supporting both KNN and term matching efficiently. △ Less

Submitted 20 February, 2024; originally announced February 2024.

arXiv:2402.13430 [pdf, other]

LinkSAGE: Optimizing Job Matching Using Graph Neural Networks

Authors: Ping Liu, Haichao Wei, Xiaochen Hou, Jianqiang Shen, Shihai He, Kay Qianqi Shen, Zhujun Chen, Fedor Borisyuk, Daniel Hewlett, Liang Wu, Srikant Veeraraghavan, Alex Tsun, Chengming Jiang, Wenjing Zhang

Abstract: We present LinkSAGE, an innovative framework that integrates Graph Neural Networks (GNNs) into large-scale personalized job matching systems, designed to address the complex dynamics of LinkedIns extensive professional network. Our approach capitalizes on a novel job marketplace graph, the largest and most intricate of its kind in industry, with billions of nodes and edges. This graph is not merel… ▽ More We present LinkSAGE, an innovative framework that integrates Graph Neural Networks (GNNs) into large-scale personalized job matching systems, designed to address the complex dynamics of LinkedIns extensive professional network. Our approach capitalizes on a novel job marketplace graph, the largest and most intricate of its kind in industry, with billions of nodes and edges. This graph is not merely extensive but also richly detailed, encompassing member and job nodes along with key attributes, thus creating an expansive and interwoven network. A key innovation in LinkSAGE is its training and serving methodology, which effectively combines inductive graph learning on a heterogeneous, evolving graph with an encoder-decoder GNN model. This methodology decouples the training of the GNN model from that of existing Deep Neural Nets (DNN) models, eliminating the need for frequent GNN retraining while maintaining up-to-date graph signals in near realtime, allowing for the effective integration of GNN insights through transfer learning. The subsequent nearline inference system serves the GNN encoder within a real-world setting, significantly reducing online latency and obviating the need for costly real-time GNN infrastructure. Validated across multiple online A/B tests in diverse product scenarios, LinkSAGE demonstrates marked improvements in member engagement, relevance matching, and member retention, confirming its generalizability and practical impact. △ Less

Submitted 20 February, 2024; originally announced February 2024.

arXiv:2402.12635 [pdf, other]

User Feedback-Informed Interface Design for Flow Management Data and Services (FMDS)

Authors: Sinan Abdulhak, Anthony Carvette, Kate Shen, Robert Goldman, Bill Tuck, Max Z. Li

Abstract: The transition to a microservices-based Flow Management Data and Services (FMDS) architecture from the existing Traffic Flow Management System (TFMS) is a critical enabler of the vision for an Information-Centric National Airspace System (NAS). The need to design a user-centric interface for FMDS is a key technical gap, as this interface connects NAS data and services to the traffic management spe… ▽ More The transition to a microservices-based Flow Management Data and Services (FMDS) architecture from the existing Traffic Flow Management System (TFMS) is a critical enabler of the vision for an Information-Centric National Airspace System (NAS). The need to design a user-centric interface for FMDS is a key technical gap, as this interface connects NAS data and services to the traffic management specialists within all stakeholder groups (e.g., FAA, airlines). We provide a research-driven approach towards designing such a graphical user interface (GUI) for FMDS. Major goals include unifying the more than 50 disparate traffic management services currently hosted on TFMS, as well as streamlining the process of evaluating, modeling, and monitoring Traffic Management Initiatives (TMIs). Motivated by this, we iteratively designed a GUI leveraging human factors engineering and user experience design principles, as well as user interviews. Through user testing and interviews, we identify workflow benefits of our GUI (e.g., reduction in task completion time), along with next steps for developing a live prototype. △ Less

Submitted 19 February, 2024; originally announced February 2024.

Comments: 8 pages, 8 figures

arXiv:2402.02718 [pdf, other]

Denoising Time Cycle Modeling for Recommendation

Authors: Sicong Xie, Qunwei Li, Weidi Xu, Kaiming Shen, Shaohu Chen, Wenliang Zhong

Abstract: Recently, modeling temporal patterns of user-item interactions have attracted much attention in recommender systems. We argue that existing methods ignore the variety of temporal patterns of user behaviors. We define the subset of user behaviors that are irrelevant to the target item as noises, which limits the performance of target-related time cycle modeling and affect the recommendation perform… ▽ More Recently, modeling temporal patterns of user-item interactions have attracted much attention in recommender systems. We argue that existing methods ignore the variety of temporal patterns of user behaviors. We define the subset of user behaviors that are irrelevant to the target item as noises, which limits the performance of target-related time cycle modeling and affect the recommendation performance. In this paper, we propose Denoising Time Cycle Modeling (DiCycle), a novel approach to denoise user behaviors and select the subset of user behaviors that are highly related to the target item. DiCycle is able to explicitly model diverse time cycle patterns for recommendation. Extensive experiments are conducted on both public benchmarks and a real-world dataset, demonstrating the superior performance of DiCycle over the state-of-the-art recommendation methods. △ Less

Submitted 4 February, 2024; originally announced February 2024.

arXiv:2401.11184 [pdf, ps, other]

doi 10.1021/acs.jpclett.3c03298

Finite-Temperature Hole-Magnon Dynamics in an Antiferromagnet

Authors: Kaijun Shen, Kewei Sun, Maxim F. Gelin, Yang Zhao

Abstract: Employing the numerically accurate multiple Davydov Ansatz in combination with the thermo-field dynamics approach, we delve into interplay of the finite-temperature dynamics of holes and magnons in an antiferromagnet, which allows for scrutinizing previous predictions from self-consistent Born approximation while offering, for the first time, accurate finite-temperature computation of detailed mag… ▽ More Employing the numerically accurate multiple Davydov Ansatz in combination with the thermo-field dynamics approach, we delve into interplay of the finite-temperature dynamics of holes and magnons in an antiferromagnet, which allows for scrutinizing previous predictions from self-consistent Born approximation while offering, for the first time, accurate finite-temperature computation of detailed magnon dynamics as a response and a facilitator to the hole motion. The study also uncovers pronounced temperature dependence of the magnon and hole populations, pointing to the feasibility of potential thermal manipulation and control of hole dynamics. Our methodology can be applied not only to the calculation of steady-state angular-resolved photoemission spectra, but also to the simulation of femtosecond terahertz pump-probe and other nonlinear signals for the characterization of antiferromagnetic materials. △ Less

Submitted 20 January, 2024; originally announced January 2024.

Comments: 28 pages, 5 figures

Journal ref: J. Phys. Chem. Lett. 15 (2024), 447-453

arXiv:2401.08011 [pdf, other]

Type Ia Supernovae Can Arise from the Detonations of Both Stars in a Double Degenerate Binary

Authors: Samuel J. Boos, Dean M. Townsley, Ken J. Shen

Abstract: The precise origin of Type Ia supernovae (SNe Ia) is unknown despite their value to numerous areas in astronomy. While it is a long-standing consensus that they arise from an explosion of a carbon/oxygen white dwarf, the exact progenitor configurations and explosion mechanisms that lead to SNe Ia are still debated. One popular theory is the double detonation in which a helium layer, accreted from… ▽ More The precise origin of Type Ia supernovae (SNe Ia) is unknown despite their value to numerous areas in astronomy. While it is a long-standing consensus that they arise from an explosion of a carbon/oxygen white dwarf, the exact progenitor configurations and explosion mechanisms that lead to SNe Ia are still debated. One popular theory is the double detonation in which a helium layer, accreted from a binary companion, detonates on the surface of the primary star, leading to a converging shock-induced detonation of the underlying core. It has recently been seen in simulations that a helium-rich degenerate companion may undergo its own explosion triggered by the impact from the ejecta of the primary star. We show 2D simulations that approximate a white dwarf undergoing a double detonation which triggers the explosion of the degenerate companion, leading to either a triple or quadruple detonation. We also present the first multi-dimensional radiative transfer results from the triple and quadruple detonation scenario. We find that within a range of mass configurations of the degenerate binary, the synthetic light curves and spectra of these events match observations as well as theoretical models of isolated double detonations do. Notably, double and quadruple detonations that are spectrally similar and reach the same peak brightnesses have drastically different ejection masses and produce different amounts of Si- and Fe-group elements. Further understanding of this scenario is needed in order to determine if at least some observed SNe Ia actually originate from two stars exploding. △ Less

Submitted 8 July, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

Comments: 22 pages, 21 figures, 3 tables. Accepted for publication in ApJ

arXiv:2401.07828 [pdf]

Transient Magnetoelastic Coupling in CrSBr

Authors: Youn Jue Bae, Taketo Handa, Yanan Dai, Jue Wang, Huicong Liu, Allen Scheie, Daniel G. Chica, Michael E. Ziebel, Andrew D. Kent, Xiaodong Xu, Ka Shen, Xavier Roy, Xiaoyang Zhu

Abstract: Recent research has revealed remarkable properties of the two-dimensional (2D) van der Waals layered crystal CrSBr, which is both a semiconductor and an A-type antiferromagnet. Here we show the role of strong magnetoelastic coupling in the generation and propagation of coherent magnons in CrSBr. Time and spatially resolved magneto-optical Kerr effect (tr-MOKE) microscopy reveals two time-varying t… ▽ More Recent research has revealed remarkable properties of the two-dimensional (2D) van der Waals layered crystal CrSBr, which is both a semiconductor and an A-type antiferromagnet. Here we show the role of strong magnetoelastic coupling in the generation and propagation of coherent magnons in CrSBr. Time and spatially resolved magneto-optical Kerr effect (tr-MOKE) microscopy reveals two time-varying transient strain fields induced by out-of-plane transverse and in-plane longitudinal lattice displacements. These transient strain fields launch coherent wavepackets of magnons, optical and acoustic at 24.6 GHz and 33.4 GHz, respectively. These findings suggest mechanisms for controlling and manipulating coherent magnons from distinct magnetoelastic couplings in this 2D van der Waals magnetic semiconductor. △ Less

Submitted 15 January, 2024; originally announced January 2024.

Comments: 12 pages, 4 figures, SI

arXiv:2401.07507 [pdf, ps, other]

Effects of plasma nonuniformity on toroidal Alfvén eigenmode nonlinear decay

Authors: Zhiwen Cheng, Kexun Shen, Zhiyong Qiu

Abstract: The parametric decay of toroidal Alfvén eigenmode (TAE) in nonuniform plasmas is investigated using nonlinear gyrokinetic equation. It is found that, the plasma nonuniformity not only significantly enhances the nonlinear coupling cross-section, but also qualitatively modifies the decay process. Specifically, the condition for spontaneous decay becomes the toroidal mode number of the sideband TAE b… ▽ More The parametric decay of toroidal Alfvén eigenmode (TAE) in nonuniform plasmas is investigated using nonlinear gyrokinetic equation. It is found that, the plasma nonuniformity not only significantly enhances the nonlinear coupling cross-section, but also qualitatively modifies the decay process. Specifically, the condition for spontaneous decay becomes the toroidal mode number of the sideband TAE being higher than that of the pump TAE, instead of the frequency of the sideband TAE being lower than the pump TAE in uniform plasmas. The consequences on TAE saturation and energetic particle transport are also discussed. △ Less

Submitted 15 January, 2024; originally announced January 2024.

Report number: NF-106807

arXiv:2401.07480 [pdf, other]

Exploring the photoproduction of $ρ$ and $φ$ in hadronic heavy-ion collisions

Authors: Kaifeng Shen, Xin Wu, Zebo Tang, Wangmei Zha

Abstract: Significant enhancements of J/$ψ$ production have been observed by various experiments at RHIC and LHC for very low transverse momenta in peripheral heavy-ion collisions, which has ignited a surge of investigations into photon-induced processes in hadronic heavy-ion collisions (HHICs). Within this wave of research enthusiasm, the search for more photon induced products in HHICs becomes paramount.… ▽ More Significant enhancements of J/$ψ$ production have been observed by various experiments at RHIC and LHC for very low transverse momenta in peripheral heavy-ion collisions, which has ignited a surge of investigations into photon-induced processes in hadronic heavy-ion collisions (HHICs). Within this wave of research enthusiasm, the search for more photon induced products in HHICs becomes paramount. In this paper, we perform the calculation of the $ρ$ and $φ$ production resulting from photon-nucleus interactions in HHICs, which are crucial probes for studying the properties of Quark-Gluon Plasma (QGP) in HHICs. Our study reveals that, in comparison to hadronic production, the photon-induced production of $ρ$ and $φ$ does not reach the same level of significance as that observed in J/$ψ$ production. Nevertheless, it remains substantial, especially in peripheral collisions, holding great promise for experimental verification in the imminent future. △ Less

Submitted 15 January, 2024; originally announced January 2024.

arXiv:2401.07129 [pdf, ps, other]

doi 10.1063/5.0197304

Synthesis of thin film infinite-layer nickelates by atomic hydrogen reduction: clarifying the role of the capping layer

Authors: Christopher T. Parzyck, Vivek Anil, Yi Wu, Berit H. Goodge, Matthew Roddy, Lena F. Kourkoutis, Darrell G. Schlom, Kyle M. Shen

Abstract: We present an integrated procedure for the synthesis of infinite-layer nickelates using molecular-beam epitaxy with gas-phase reduction by atomic hydrogen. We first discuss challenges in the growth and characterization of perovskite NdNiO$_3$/SrTiO$_3$, arising from post growth crack formation in stoichiometric films. We then detail a procedure for fully reducing NdNiO$_3$ films to the infinite-la… ▽ More We present an integrated procedure for the synthesis of infinite-layer nickelates using molecular-beam epitaxy with gas-phase reduction by atomic hydrogen. We first discuss challenges in the growth and characterization of perovskite NdNiO$_3$/SrTiO$_3$, arising from post growth crack formation in stoichiometric films. We then detail a procedure for fully reducing NdNiO$_3$ films to the infinite-layer phase, NdNiO$_2$, using atomic hydrogen; the resulting films display excellent structural quality, smooth surfaces, and lower residual resistivities than films reduced by other methods. We utilize the in situ nature of this technique to investigate of the role that SrTiO$_3$ capping layers play in the reduction process, illustrating their importance in preventing the formation of secondary phases at the exposed nickelate surface. A comparative bulk- and surface-sensitive study indicates formation of a polycrystalline crust on the film surface serves to limit the reduction process. △ Less

Submitted 13 January, 2024; originally announced January 2024.

Comments: Main text: 12 pages, 7 figures. Supplemental Materials: 11 pages, 11 figures

arXiv:2401.02711 [pdf, ps, other]

Resonant Decay of Kinetic Alfvén Waves and Implication on Spectral Cascading

Authors: Kexun Shen, Zhiwen Cheng, Zhiyong Qiu

Abstract: A general equation describing the resonant nonlinear mode-coupling among kinetic Alfvén waves (KAWs) is derived using nonlinear gyrokinetic theory, which can be applied to study the potentially strong spectral energy transfer of KAWs. As a first application, the parametric decay of a pump KAW into two sideband KAWs are studied, with particular emphasis on the cascading in perpendicular wavenumber.… ▽ More A general equation describing the resonant nonlinear mode-coupling among kinetic Alfvén waves (KAWs) is derived using nonlinear gyrokinetic theory, which can be applied to study the potentially strong spectral energy transfer of KAWs. As a first application, the parametric decay of a pump KAW into two sideband KAWs are studied, with particular emphasis on the cascading in perpendicular wavenumber. It is found that, for the "co-propagating" cases with all three KAWs propagating in the same direction along the equilibrium magnetic field line, it exhibits a dual cascading character in the perpendicular wavenumber space; while for the "counter-propagating" cases with one sideband propagating in the opposite direction with respect to the pump wave, it instead, can exhibit both dual and inverse cascading behaviors. The implications on SAW instability nonlinear saturation and charged particle transport in fusion plasmas is also discussed. △ Less

Submitted 5 January, 2024; originally announced January 2024.

arXiv:2312.16918 [pdf, other]

Intelligent Surfaces Empowered Wireless Network: Recent Advances and The Road to 6G

Authors: Qingqing Wu, Beixiong Zheng, Changsheng You, Lipeng Zhu, Kaiming Shen, Xiaodan Shao, Weidong Mei, Boya Di, Hongliang Zhang, Ertugrul Basar, Lingyang Song, Marco Di Renzo, Zhi-Quan Luo, Rui Zhang

Abstract: Intelligent surfaces (ISs) have emerged as a key technology to empower a wide range of appealing applications for wireless networks, due to their low cost, high energy efficiency, flexibility of deployment and capability of constructing favorable wireless channels/radio environments. Moreover, the recent advent of several new IS architectures further expanded their electromagnetic functionalities… ▽ More Intelligent surfaces (ISs) have emerged as a key technology to empower a wide range of appealing applications for wireless networks, due to their low cost, high energy efficiency, flexibility of deployment and capability of constructing favorable wireless channels/radio environments. Moreover, the recent advent of several new IS architectures further expanded their electromagnetic functionalities from passive reflection to active amplification, simultaneous reflection and refraction, as well as holographic beamforming. However, the research on ISs is still in rapid progress and there have been recent technological advances in ISs and their emerging applications that are worthy of a timely review. Thus, we provide in this paper a comprehensive survey on the recent development and advances of ISs aided wireless networks. Specifically, we start with an overview on the anticipated use cases of ISs in future wireless networks such as 6G, followed by a summary of the recent standardization activities related to ISs. Then, the main design issues of the commonly adopted reflection-based IS and their state-of-the-art solutions are presented in detail, including reflection optimization, deployment, signal modulation, wireless sensing, and integrated sensing and communications. Finally, recent progress and new challenges in advanced IS architectures are discussed to inspire futrue research. △ Less

Submitted 24 March, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

arXiv:2312.16134 [pdf, other]

Does PML exponentially absorb outgoing waves scattering from a periodic surface?

Authors: Wangtao Lu, Kuanrong Shen, Ruming Zhang

Abstract: The PML method is well-known for its exponential convergence rate and easy implementation for scattering problems with unbounded domains. For rough-surface scattering problems, authors in [5] proved that the PML method converges at most algebraically in the physical domain. However, the authors also asked a question whether exponential convergence still holds for compact subsets. In [25], one of o… ▽ More The PML method is well-known for its exponential convergence rate and easy implementation for scattering problems with unbounded domains. For rough-surface scattering problems, authors in [5] proved that the PML method converges at most algebraically in the physical domain. However, the authors also asked a question whether exponential convergence still holds for compact subsets. In [25], one of our authors proved the exponential convergence for periodic surfaces via the Floquet-Bloch transform when the wavenumber is positive and not a half integer; when the wavenumber is a positive half integer, a nearly fourth-order convergence rate was shown in [26]. The extension of this method to locally perturbed cases is not straightforward, since the domain is no longer periodic thus the Floquet-Bloch transform doesn't work, especially when the domain topology is changed. Moreover, the exact decay rate when the wavenumber is a half integer remains unclear. The purpose of this paper is to address these two significant issues. For the first topic, the main idea is to reduce the problem by the DtN map on an artificial curve, then the convergence rate of the PML is obtained from the investigation of the DtN map. It shows exactly the same convergence rate as in the unperturbed case. Second, to illustrate the convergence rate when the wavenumber is a half integer, we design a specific periodic structure for which the PML converges at the fourth-order, showing that the algebraic convergence rate is sharp. We adopt a previously developed high-accuracy PML-BIE solver to exhibit this unexpected phenomenon. △ Less

Submitted 26 December, 2023; originally announced December 2023.

arXiv:2312.13502 [pdf, other]

Energy Relaxation and dynamics in the correlated metal Sr$_2$RuO$_4$ via THz two-dimensional coherent spectroscopy

Authors: David Barbalas, Ralph Romero III, Dipanjan Chaudhuri, Fahad Mahmood, Hari P. Nair, Nathaniel J. Schreiber, Darrel G. Schlom, K. M. Shen, N. P. Armitage

Abstract: Separating out the contributions of different scattering channels in strongly interacting metals is crucial in identifying the mechanisms that govern their properties. While momentum or current relaxation rates can be readily probed via \textit{dc} resistivity or optical/THz spectroscopy, distinguishing different kinds of inelastic scattering can be more challenging. Using nonlinear THz 2D coheren… ▽ More Separating out the contributions of different scattering channels in strongly interacting metals is crucial in identifying the mechanisms that govern their properties. While momentum or current relaxation rates can be readily probed via \textit{dc} resistivity or optical/THz spectroscopy, distinguishing different kinds of inelastic scattering can be more challenging. Using nonlinear THz 2D coherent spectroscopy, we measure the rates of energy relaxation after THz excitation in the strongly interacting Fermi liquid, Sr$_2$RuO$_4$. Energy relaxation is a bound on the total scattering and specifically a measure of contributions to the electron self-energy that arise from {\it inelastic} coupling to a bath. We observe two distinct energy relaxation channels: a fast process that we interpret as energy loss to the phonon system and a much slower relaxation that we interpret as arising from a non-equilibrium phonon effects and subsequent heat loss through diffusion. Interestingly, even the faster energy relaxation rate is at least an order of magnitude slower than the overall momentum relaxation rate, consistent with strong electron interactions and the dominance of energy-conserving umklapp or interband electron-electron scattering in momentum relaxation. The slowest energy relaxation rate decays on a sub-GHz scale, consistent with the relaxation dynamics of non-equilibrium phonons. Our observations reveal the versatility of nonlinear THz spectroscopy to measure the energy relaxation dynamics in correlated metals. Our work also highlights the need for improved theoretical understanding of such processes in interacting metals. △ Less

Submitted 20 December, 2023; originally announced December 2023.

Comments: 6 pages, 3 figures. SM included

arXiv:2312.05726 [pdf, ps, other]

Accelerating Quadratic Transform and WMMSE

Authors: Kaiming Shen, Ziping Zhao, Yannan Chen, Zepeng Zhang, Hei Victor Cheng

Abstract: Fractional programming (FP) arises in various communications and signal processing problems because several key quantities in the field are fractionally structured, e.g., the Cramér-Rao bound, the Fisher information, and the signal-to-interference-plus-noise ratio (SINR). A recently proposed method called the quadratic transform has been applied to the FP problems extensively. The main contributio… ▽ More Fractional programming (FP) arises in various communications and signal processing problems because several key quantities in the field are fractionally structured, e.g., the Cramér-Rao bound, the Fisher information, and the signal-to-interference-plus-noise ratio (SINR). A recently proposed method called the quadratic transform has been applied to the FP problems extensively. The main contributions of the present paper are two-fold. First, we investigate how fast the quadratic transform converges. To the best of our knowledge, this is the first work that analyzes the convergence rate for the quadratic transform as well as its special case the weighted minimum mean square error (WMMSE) algorithm. Second, we accelerate the existing quadratic transform via a novel use of Nesterov's extrapolation scheme [1]. Specifically, by generalizing the minorization-maximization (MM) approach in [2], we establish a nontrivial connection between the quadratic transform and the gradient projection, thereby further incorporating the gradient extrapolation into the quadratic transform to make it converge more rapidly. Moreover, the paper showcases the practical use of the accelerated quadratic transform with two frontier wireless applications: integrated sensing and communications (ISAC) and massive multiple-input multiple-output (MIMO). △ Less

Submitted 28 May, 2024; v1 submitted 9 December, 2023; originally announced December 2023.

Comments: 15 pages

Journal ref: IEEE Journal on Selected Areas in Communications 2024

arXiv:2311.17619 [pdf]

doi 10.1002/adom.202303028

Spatially-coded Fourier ptychography: flexible and detachable coded thin films for quantitative phase imaging with uniform phase transfer characteristics

Authors: Ruihai Wang, Liming Yang, Yujin Lee, Kevin Sun, Kuangyu Shen, Qianhao Zhao, Tianbo Wang, Xincheng Zhang, Jiayi Liu, Pengming Song, Guoan Zheng

Abstract: Fourier ptychography (FP) is an enabling imaging technique that produces high-resolution complex-valued images with extended field coverages. However, when FP images a phase object with any specific spatial frequency, the captured images contain only constant values, rendering the recovery of the corresponding linear phase ramp impossible. This challenge is not unique to FP but also affects other… ▽ More Fourier ptychography (FP) is an enabling imaging technique that produces high-resolution complex-valued images with extended field coverages. However, when FP images a phase object with any specific spatial frequency, the captured images contain only constant values, rendering the recovery of the corresponding linear phase ramp impossible. This challenge is not unique to FP but also affects other common microscopy techniques -- a rather counterintuitive outcome given their widespread use in phase imaging. The underlying issue originates from the non-uniform phase transfer characteristic inherent in microscope systems, which impedes the conversion of object wavefields into discernible intensity variations. To address this challenge, we present spatially-coded Fourier ptychography (scFP), a new method that synergizes FP with spatial-domain coded detection for true quantitative phase imaging. In scFP, a flexible and detachable coded thin film is attached atop the image sensor in a regular FP setup. The spatial modulation of this thin film ensures a uniform phase response across the entire synthetic bandwidth. It improves reconstruction quality and corrects refractive index underestimation issues prevalent in conventional FP and related tomographic implementations. The inclusion of the coded thin film further adds a new dimension of measurement diversity in the spatial domain. The development of scFP is expected to catalyse new research directions and applications for phase imaging, emphasizing the need for true quantitative accuracy with uniform frequency response. △ Less

Submitted 29 November, 2023; originally announced November 2023.

arXiv:2311.16553 [pdf, other]

doi 10.1038/s41467-024-49714-y

Magnon interactions in a moderately correlated Mott insulator

Authors: Qisi Wang, S. Mustafi, E. Fogh, N. Astrakhantsev, Z. He, I. Biało, Ying Chan, L. Martinelli, M. Horio, O. Ivashko, N. E. Shaik, K. von Arx, Y. Sassa, E. Paris, M. H. Fischer, Y. Tseng, N. B. Christensen, A. Galdi, D. G. Schlom, K. M. Shen, T. Schmitt, H. M. Rønnow, J. Chang

Abstract: Quantum fluctuations in low-dimensional systems and near quantum phase transitions have significant influences on material properties. Yet, it is difficult to experimentally gauge the strength and importance of quantum fluctuations. Here we provide a resonant inelastic x-ray scattering study of magnon excitations in Mott insulating cuprates. From the thin film of SrCuO$_2$, single- and bi-magnon d… ▽ More Quantum fluctuations in low-dimensional systems and near quantum phase transitions have significant influences on material properties. Yet, it is difficult to experimentally gauge the strength and importance of quantum fluctuations. Here we provide a resonant inelastic x-ray scattering study of magnon excitations in Mott insulating cuprates. From the thin film of SrCuO$_2$, single- and bi-magnon dispersions are derived. Using an effective Heisenberg Hamiltonian generated from the Hubbard model, we show that the single-magnon dispersion is only described satisfactorily when including significant quantum corrections stemming from magnon-magnon interactions. Comparative results on La$_2$CuO$_4$ indicate that quantum fluctuations are much stronger in SrCuO$_2$ suggesting closer proximity to a magnetic quantum critical point. Monte Carlo calculations reveal that other magnetic orders may compete with the antiferromagnetic Néel order as the ground state. Our results indicate that SrCuO$_2$ - due to strong quantum fluctuations - is a unique starting point for the exploration of novel magnetic ground states. △ Less

Submitted 26 June, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

Journal ref: Nature Communications 15, 5348 (2024)

arXiv:2311.11842 [pdf]

Spontaneous supercrystal formation during a strain-engineered metal-insulator transition

Authors: O. Yu. Gorobtsov, L. Miao, Z. Shao, Y. Tan, N. I. Schnitzer, B. H. Goodge, J. Ruf, D. Weinstock, M. Cherukara, M. V. Holt, H. Nair, L. -Q. Chen, L. F. Kourkoutis, D. G. Schlom, K. M. Shen, A. Singer

Abstract: Mott metal-insulator transitions possess electronic, magnetic, and structural degrees of freedom promising next generation energy-efficient electronics. We report a previously unknown, hierarchically ordered state during a Mott transition and demonstrate correlated switching of functional electronic properties. We elucidate in-situ formation of an intrinsic supercrystal in a Ca2RuO4 thin film. Mac… ▽ More Mott metal-insulator transitions possess electronic, magnetic, and structural degrees of freedom promising next generation energy-efficient electronics. We report a previously unknown, hierarchically ordered state during a Mott transition and demonstrate correlated switching of functional electronic properties. We elucidate in-situ formation of an intrinsic supercrystal in a Ca2RuO4 thin film. Machine learning-assisted X-ray nanodiffraction together with electron microscopy reveal multi-scale periodic domain formation at and below the film transition temperature (TFilm ~ 200-250 K) and a separate anisotropic spatial structure at and above TFilm. Local resistivity measurements imply an intrinsic coupling of the supercrystal orientation to the material's anisotropic conductivity. Our findings add an additional degree of complexity to the physical understanding of Mott transitions, opening opportunities for designing materials with tunable electronic properties. △ Less

Submitted 20 November, 2023; originally announced November 2023.

arXiv:2311.11557 [pdf, other]

Replay-enhanced Continual Reinforcement Learning

Authors: Tiantian Zhang, Kevin Zehua Shen, Zichuan Lin, Bo Yuan, Xueqian Wang, Xiu Li, Deheng Ye

Abstract: Replaying past experiences has proven to be a highly effective approach for averting catastrophic forgetting in supervised continual learning. However, some crucial factors are still largely ignored, making it vulnerable to serious failure, when used as a solution to forgetting in continual reinforcement learning, even in the context of perfect memory where all data of previous tasks are accessibl… ▽ More Replaying past experiences has proven to be a highly effective approach for averting catastrophic forgetting in supervised continual learning. However, some crucial factors are still largely ignored, making it vulnerable to serious failure, when used as a solution to forgetting in continual reinforcement learning, even in the context of perfect memory where all data of previous tasks are accessible in the current task. On the one hand, since most reinforcement learning algorithms are not invariant to the reward scale, the previously well-learned tasks (with high rewards) may appear to be more salient to the current learning process than the current task (with small initial rewards). This causes the agent to concentrate on those salient tasks at the expense of generality on the current task. On the other hand, offline learning on replayed tasks while learning a new task may induce a distributional shift between the dataset and the learned policy on old tasks, resulting in forgetting. In this paper, we introduce RECALL, a replay-enhanced method that greatly improves the plasticity of existing replay-based methods on new tasks while effectively avoiding the recurrence of catastrophic forgetting in continual reinforcement learning. RECALL leverages adaptive normalization on approximate targets and policy distillation on old tasks to enhance generality and stability, respectively. Extensive experiments on the Continual World benchmark show that RECALL performs significantly better than purely perfect memory replay, and achieves comparable or better overall performance against state-of-the-art continual learning methods. △ Less

Submitted 20 November, 2023; originally announced November 2023.

Comments: Accepted by Transactions on Machine Learning Research 2023

arXiv:2311.07666 [pdf, other]

Efficient MPS representations and quantum circuits from the Fourier modes of classical image data

Authors: Bernhard Jobst, Kevin Shen, Carlos A. Riofrío, Elvira Shishenina, Frank Pollmann

Abstract: Machine learning tasks are an exciting application for quantum computers, as it has been proven that they can learn certain problems more efficiently than classical ones. Applying quantum machine learning algorithms to classical data can have many important applications, as qubits allow for dealing with exponentially more data than classical bits. However, preparing the corresponding quantum state… ▽ More Machine learning tasks are an exciting application for quantum computers, as it has been proven that they can learn certain problems more efficiently than classical ones. Applying quantum machine learning algorithms to classical data can have many important applications, as qubits allow for dealing with exponentially more data than classical bits. However, preparing the corresponding quantum states usually requires an exponential number of gates and therefore may ruin any potential quantum speedups. Here, we show that classical data with a sufficiently quickly decaying Fourier spectrum after being mapped to a quantum state can be well-approximated by states with a small Schmidt rank (i.e., matrix product states) and we derive explicit error bounds. These approximated states can, in turn, be prepared on a quantum computer with a linear number of nearest-neighbor two-qubit gates. We confirm our results numerically on a set of $1024\times1024$-pixel images taken from the 'Imagenette' dataset. Additionally, we consider different variational circuit ansätze and demonstrate numerically that one-dimensional sequential circuits achieve the same compression quality as more powerful ansätze. △ Less

Submitted 1 December, 2023; v1 submitted 13 November, 2023; originally announced November 2023.

Comments: 15 pages, 9 figures (+ 9 pages appendix); minor corrections

arXiv:2311.04546 [pdf, ps, other]

Discerning and Enhancing the Weighted Sum-Rate Maximization Algorithms in Communications

Authors: Zepeng Zhang, Ziping Zhao, Kaiming Shen, Daniel P. Palomar, Wei Yu

Abstract: Weighted sum-rate (WSR) maximization plays a critical role in communication system design. This paper examines three optimization methods for WSR maximization, which ensure convergence to stationary points: two block coordinate ascent (BCA) algorithms, namely, weighted sum-minimum mean-square error (WMMSE) and WSR maximization via fractional programming (WSR-FP), along with a minorization-maximiza… ▽ More Weighted sum-rate (WSR) maximization plays a critical role in communication system design. This paper examines three optimization methods for WSR maximization, which ensure convergence to stationary points: two block coordinate ascent (BCA) algorithms, namely, weighted sum-minimum mean-square error (WMMSE) and WSR maximization via fractional programming (WSR-FP), along with a minorization-maximization (MM) algorithm, WSR maximization via MM (WSR-MM). Our contributions are threefold. Firstly, we delineate the exact relationships among WMMSE, WSR-FP, and WSR-MM, which, despite their extensive use in the literature, lack a comprehensive comparative study. By probing the theoretical underpinnings linking the BCA and MM algorithmic frameworks, we reveal the direct correlations between the equivalent transformation techniques, essential to the development of WMMSE and WSR-FP, and the surrogate functions pivotal to WSR-MM. Secondly, we propose a novel algorithm, WSR-MM+, harnessing the flexibility of selecting surrogate functions in MM framework. By circumventing the repeated matrix inversions in the search for optimal Lagrange multipliers in existing algorithms, WSR-MM+ significantly reduces the computational load per iteration and accelerates convergence. Thirdly, we reconceptualize WSR-MM+ within the BCA framework, introducing a new equivalent transform, which gives rise to an enhanced version of WSR-FP, named as WSR-FP+. We further demonstrate that WSR-MM+ can be construed as the basic gradient projection method. This perspective yields a deeper understanding into its computational intricacies. Numerical simulations corroborate the connections between WMMSE, WSR-FP, and WSR-MM and confirm the efficacy of the proposed WSR-MM+ and WSR-FP+ algorithms. △ Less

Submitted 8 November, 2023; originally announced November 2023.

arXiv:2311.04309 [pdf, other]

Evidence for saturated and disrupted magnetic braking from samples of detached close binaries with M and K dwarfs

Authors: Diogo Belloni, Matthias R. Schreiber, Maxwell Moe, Kareem El-Badry, Ken J. Shen

Abstract: Context. Recent observations of close detached eclipsing M and K dwarf binaries have provided substantial support for magnetic saturation when stars rotate sufficiently fast, leading to a magnetic braking (MB) torque proportional to the spin of the star. Aims. We investigated here how strong MB torques need to be to reproduce the observationally-inferred relative numbers of white dwarf plus M dw… ▽ More Context. Recent observations of close detached eclipsing M and K dwarf binaries have provided substantial support for magnetic saturation when stars rotate sufficiently fast, leading to a magnetic braking (MB) torque proportional to the spin of the star. Aims. We investigated here how strong MB torques need to be to reproduce the observationally-inferred relative numbers of white dwarf plus M dwarf post-common-envelope binaries under the assumption of magnetic saturation. Methods. We carried out binary population simulations with the BSE code adopting empirically-derived inter-correlated main-sequence binary distributions as initial binary populations and compared the simulation outcomes with observations. Results. We found that the dearth of extreme mass ratio binaries in the inter-correlated initial distributions is key to reproduce the large fraction of post-common-envelope binaries hosting low-mass M dwarfs (${\sim0.1-0.2}$ M$_\odot$). In addition, orbital angular momentum loss rates due to MB should be high for M dwarfs with radiative cores and orders of magnitude smaller for fully convective stars to explain the observed dramatic change of the fraction of short-period binaries at the fully convective boundary. Conclusions. We conclude that saturated but disrupted, that is, dropping drastically at the fully convective boundary, MB can explain the observations of both close main-sequence binaries containing M and K dwarfs and post-common-envelope binaries. Whether a similar prescription can explain the spin down rates of single stars and of binaries containing more massive stars needs to be tested. △ Less

Submitted 7 November, 2023; originally announced November 2023.

Comments: Accepted for publication in A&A

Showing 1–50 of 401 results for author: Shen, K