Showing 1–50 of 192 results for author: Sha, F

  1. arXiv:2406.19907  [pdf, ps, other]

    math.AP

    Global well-posedness of inhomogeneous Navier-Stokes equations with bounded density

    Authors: Tiantian Hao, Feng Shao, Dongyi Wei, Zhifei Zhang

    Abstract: In this paper, we solve Lions' open problem: the uniqueness of weak solutions for the 2-D inhomogeneous Navier-Stokes equations (INS). We first prove the global existence of weak solutions to 2-D (INS) with bounded initial density and initial velocity in $L^2(\mathbb R^2)$. Moreover, if the initial density is bounded away from zero, then our weak solution equals Lions' weak solution, whic… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: 23 pages

  2. arXiv:2406.16269  [pdf, other]

    hep-ph hep-ex

    Displaced Heavy Neutral Lepton from New Higgs Doublet

    Authors: Fa-Xin Yang, Feng-Lan Shao, Zhi-Long Han, Yi Jin, Honglei Li

    Abstract: Heavy neutral leptons $N$ are introduced to explain the tiny neutrino masses via the seesaw mechanism. For a suitably small mixing parameter $V_{\ell N}$, the heavy neutral leptons $N$ become long-lived, which leads to the displaced vertex signature at colliders. In this paper, we consider the displaced heavy neutral lepton from the neutrinophilic Higgs doublet $Φ_ν$ decay. The new Higgs doublet with… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: 24 pages, 11 figures

  3. arXiv:2406.07984  [pdf, ps, other]

    math.AP

    On the density patch problem for the 2-D inhomogeneous Navier-Stokes equations

    Authors: Tiantian Hao, Feng Shao, Dongyi Wei, Zhifei Zhang

    Abstract: In this paper, we first construct a class of global strong solutions for the 2-D inhomogeneous Navier-Stokes equations under the very general assumption that the initial density is only bounded and the initial velocity is in $H^1(\mathbb{R}^2)$. With suitable assumptions on the initial density, which include the case of density patch and vacuum bubbles, we prove that Lions' weak solution is the sam… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 23 pages

  4. arXiv:2405.19674  [pdf, ps, other]

    math.AP

    On blow-up for the supercritical defocusing nonlinear wave equation

    Authors: Feng Shao, Dongyi Wei, Zhifei Zhang

    Abstract: In this paper, we consider the defocusing nonlinear wave equation $-\partial_t^2u+Δu=|u|^{p-1}u$ in $\mathbb R\times \mathbb R^d$. Building on our companion work (Self-similar imploding solutions of the relativistic Euler equations), we prove that for $d=4, p\geq 29$ and $d\geq 5, p\geq 17$, there exists a smooth complex-valued solution that blows up in finite time.

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 56 pages

  5. arXiv:2404.08216  [pdf, other]

    physics.plasm-ph

    Role of nonlocal heat transport on the laser ablative Rayleigh-Taylor instability

    Authors: Z. H. Chen, X. H. Yang, G. B. Zhang, Y. Y. Ma, R. Yan, H. Xu, Z. M. Sheng, F. Q. Shao, J. Zhang

    Abstract: Ablative Rayleigh-Taylor instability (ARTI) and nonlocal heat transport are the critical problems in laser-driven inertial confinement fusion, while their coupling with each other is not completely understood yet. Here the ARTI in the presence of nonlocal heat transport is studied self-consistently for the first time theoretically and by using radiation hydrodynamic simulations. It is found that t… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: 8 pages, 5 figures

  6. arXiv:2403.11471  [pdf, other]

    math.AP math-ph

    Self-similar imploding solutions of the relativistic Euler equations

    Authors: Feng Shao, Dongyi Wei, Zhifei Zhang

    Abstract: Motivated by the recent breakthrough on smooth imploding solutions of compressible Euler, we construct self-similar smooth imploding solutions of isentropic relativistic Euler equations with isothermal equation of state $p=\frac1\ell\varrho$ for all $\ell>1$ in physical space dimension $d=2,3$ and for $\ell>1$ close to 1 in higher dimensions. This work is a crucial step toward solving the lon… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: 68 pages, 2 figures

  7. arXiv:2402.04467  [pdf, other]

    cs.LG math.DS

    DySLIM: Dynamics Stable Learning by Invariant Measure for Chaotic Systems

    Authors: Yair Schiff, Zhong Yi Wan, Jeffrey B. Parker, Stephan Hoyer, Volodymyr Kuleshov, Fei Sha, Leonardo Zepeda-Núñez

    Abstract: Learning dynamics from dissipative chaotic systems is notoriously difficult due to their inherent instability, as formalized by their positive Lyapunov exponents, which exponentially amplify errors in the learned dynamics. However, many of these systems exhibit ergodicity and an attractor: a compact and highly complex manifold, to which trajectories converge in finite-time, that supports an invari… ▽ More

    Submitted 5 June, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: ICML 2024; Code to reproduce our experiments is available at https://github.com/google-research/swirl-dynamics/tree/main/swirl_dynamics/projects/ergodic

  8. arXiv:2401.14687  [pdf, other]

    hep-ph hep-ex

    Heavy Neutral Leptons in Gauged $U(1)_{L_μ-L_τ}$ at Muon Collider

    Authors: Ru-Yi He, Jia-Qi Huang, Jin-Yuan Xu, Fa-Xin Yang, Zhi-Long Han, Feng-Lan Shao

    Abstract: Heavy neutral leptons $N$ are the most appealing candidates to generate the tiny neutrino masses. In this paper, we study the signature of heavy neutral leptons in gauged $U(1)_{L_μ-L_τ}$ at a muon collider. Charged under the $U(1)_{L_μ-L_τ}$ symmetry, the heavy neutral leptons can be pair produced via the new gauge boson $Z'$ at muon collider as $μ^+μ^-\to Z^{\prime *}\to NN$ and… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: 20 pages, 8 figures, 2 tables

  9. arXiv:2311.15559  [pdf, ps, other]

    math.AG math.DS

    Bigness of tangent bundles and dynamical rigidity of Fano manifolds of Picard number 1 (with an appendix by Jie Liu)

    Authors: Feng Shao, Guolei Zhong

    Abstract: Let $f\colon X\to Y$ be a surjective morphism of Fano manifolds of Picard number 1 whose VMRTs at a general point are not dual defective. Suppose that the tangent bundle $T_X$ is big. We show that $f$ is an isomorphism unless $Y$ is a projective space. As applications, we study the bigness of the tangent bundles of complete intersections, del Pezzo manifolds, and Mukai manifolds, as well as their… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: 21 pages with an appendix by Jie Liu, comments are welcome!

    MSC Class: 14J40; 14J45

  10. arXiv:2311.14784  [pdf, other]

    astro-ph.IM astro-ph.SR

    Characterization and Correction of the Scattering Background Produced by Dust on the Objective Lens of the Lijiang 10-cm Coronagraph

    Authors: Feiyang Sha, Yu Liu, Xuefei Zhang, Tengfei Song

    Abstract: Scattered light from the objective lens, directly exposed to the intense sunlight, is a dominant source of stray light in internally occulted coronagraphs. The variable stray light, such as the scatter from dust on the objective lens, can produce varying scattering backgrounds in coronal images, significantly impacting image quality and data analysis. Using data acquired by the Lijiang 10-cm Coron… ▽ More

    Submitted 24 November, 2023; originally announced November 2023.

    Comments: 18 pages, 14 figures

  11. arXiv:2311.07085  [pdf, other]

    cond-mat.mes-hall

    Engineering 2D material exciton lineshape with graphene/h-BN encapsulation

    Authors: Steffi Y. Woo, Fuhui Shao, Ashish Arora, Robert Schneider, Nianjheng Wu, Andrew J. Mayne, Ching-Hwa Ho, Mauro Och, Cecilia Mattevi, Antoine Reserbat-Plantey, Alvaro Moreno, Hanan Herzig Sheinfux, Kenji Watanabe, Takashi Taniguchi, Steffen Michaelis de Vasconcellos, Frank H. L. Koppens, Zhichuan Niu, Odile Stéphan, Mathieu Kociak, F. Javier García de Abajo, Rudolf Bratschitsch, Andrea Konečná, Luiz H. G. Tizei

    Abstract: Control over the optical properties of atomically thin two-dimensional (2D) layers, including those of transition metal dichalcogenides (TMDs), is needed for future optoelectronic applications. Remarkable advances have been achieved through alloying, chemical and electrical doping, and applied strain. However, the integration of TMDs with other 2D materials in van der Waals heterostructures (vdWHs… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

  12. NODLINK: An Online System for Fine-Grained APT Attack Detection and Investigation

    Authors: Shaofei Li, Feng Dong, Xusheng Xiao, Haoyu Wang, Fei Shao, Jiedong Chen, Yao Guo, Xiangqun Chen, Ding Li

    Abstract: Advanced Persistent Threats (APT) attacks have plagued modern enterprises, causing significant financial losses. To counter these attacks, researchers propose techniques that capture the complex and stealthy scenarios of APT attacks by using provenance graphs to model system entities and their dependencies. Particularly, to accelerate attack detection and reduce financial losses, online provenance… ▽ More

    Submitted 4 November, 2023; originally announced November 2023.

    Comments: The final version of this paper is going to appear in the Conference on Network and Distributed System Security Symposium (NDSS'24), 26 Feb - 1 Mar 2024, San Diego, California

  13. arXiv:2311.00445  [pdf, other]

    cs.CL cs.AI cs.LG

    A Systematic Comparison of Syllogistic Reasoning in Humans and Language Models

    Authors: Tiwalayo Eisape, MH Tessler, Ishita Dasgupta, Fei Sha, Sjoerd van Steenkiste, Tal Linzen

    Abstract: A central component of rational behavior is logical inference: the process of determining which conclusions follow from a set of premises. Psychologists have documented several ways in which humans' inferences deviate from the rules of logic. Do language models, which are trained on text generated by humans, replicate such human biases, or are they able to overcome them? Focusing on the case of sy… ▽ More

    Submitted 11 April, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: NAACL 2024

  14. arXiv:2310.19956  [pdf, other]

    cs.CL

    The Impact of Depth on Compositional Generalization in Transformer Language Models

    Authors: Jackson Petty, Sjoerd van Steenkiste, Ishita Dasgupta, Fei Sha, Dan Garrette, Tal Linzen

    Abstract: To process novel sentences, language models (LMs) must generalize compositionally -- combine familiar elements in new ways. What aspects of a model's structure promote compositional generalization? Focusing on transformers, we test the hypothesis, motivated by theoretical and empirical work, that deeper transformers generalize more compositionally. Simply adding layers increases the total number o… ▽ More

    Submitted 10 April, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: Accepted to NAACL 2024

  15. arXiv:2309.16296  [pdf, ps, other]

    nucl-th hep-ex hep-ph

    Production properties of deuterons, helions and tritons via an analytical nucleon coalescence method in Pb-Pb collisions at $\sqrt{s_{NN}}=2.76$ TeV

    Authors: Rui-Qin Wang, Yan-Hao Li, Jun Song, Feng-Lan Shao

    Abstract: We improve a nucleon coalescence model to include the coordinate-momentum correlation in nucleon joint distributions, and apply it to Pb-Pb collisions at $\sqrt{s_{NN}}=2.76$ TeV to study production properties of deuterons ($d$), helions ($^3$He) and tritons ($t$). We give formulas of the coalescence factors $B_2$ and $B_3$, and naturally explain their behaviors as functions of the collision centr… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: 12 pages, 8 figures, 1 table

  16. arXiv:2308.15560  [pdf, other]

    physics.ao-ph cs.AI

    WeatherBench 2: A benchmark for the next generation of data-driven global weather models

    Authors: Stephan Rasp, Stephan Hoyer, Alexander Merose, Ian Langmore, Peter Battaglia, Tyler Russel, Alvaro Sanchez-Gonzalez, Vivian Yang, Rob Carver, Shreya Agrawal, Matthew Chantry, Zied Ben Bouallegue, Peter Dueben, Carla Bromberg, Jared Sisk, Luke Barrington, Aaron Bell, Fei Sha

    Abstract: WeatherBench 2 is an update to the global, medium-range (1-14 day) weather forecasting benchmark proposed by Rasp et al. (2020), designed with the aim to accelerate progress in data-driven weather modeling. WeatherBench 2 consists of an open-source evaluation framework, publicly available training, ground truth and baseline data as well as a continuously updated website with the latest metrics and… ▽ More

    Submitted 26 January, 2024; v1 submitted 29 August, 2023; originally announced August 2023.

  17. arXiv:2308.12588  [pdf, other]

    hep-ph

    Sterile Neutrino Portal Dark Matter from Semi-Production

    Authors: Ang Liu, Feng-Lan Shao, Zhi-Long Han, Yi Jin, Honglei Li

    Abstract: In this paper, we study the feeble sterile neutrino portal dark matter under the $Z_3$ symmetry. The dark sector consists of one fermion singlet $χ$ and one scalar singlet $φ$, which transform as $χ\to e^{i2π/3}χ, φ\to e^{i2π/3}φ$ under the $Z_3$ symmetry. Regarding fermion singlet $χ$ as the dark matter candidate, the new interaction terms $y_χφ\bar{χ^c}χ$ and $μφ^3/2$ could induce various new p… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: 24 pages, 9 figures

  18. arXiv:2307.09972  [pdf, other]

    cs.CV

    Fine-grained Text-Video Retrieval with Frozen Image Encoders

    Authors: Zuozhuo Dai, Fangtao Shao, Qingkun Su, Zilong Dong, Siyu Zhu

    Abstract: State-of-the-art text-video retrieval (TVR) methods typically utilize CLIP and cosine similarity for efficient retrieval. Meanwhile, cross attention methods, which employ a transformer decoder to compute attention between each text query and all frames in a video, offer a more comprehensive interaction between text and videos. However, these methods lack important fine-grained spatial information… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

  19. arXiv:2306.14066  [pdf, other]

    cs.LG physics.ao-ph

    SEEDS: Emulation of Weather Forecast Ensembles with Diffusion Models

    Authors: Lizao Li, Rob Carver, Ignacio Lopez-Gomez, Fei Sha, John Anderson

    Abstract: Uncertainty quantification is crucial to decision-making. A prominent example is probabilistic forecasting in numerical weather prediction. The dominant approach to representing uncertainty in weather forecasting is to generate an ensemble of forecasts. This is done by running many physics-based simulations under different conditions, which is a computationally costly process. We propose to amorti… ▽ More

    Submitted 8 October, 2023; v1 submitted 24 June, 2023; originally announced June 2023.

    Comments: fixed a mistake of the previous version; the paper has not been submitted to neurips 2023

  20. arXiv:2306.09224  [pdf, other]

    cs.CV

    Encyclopedic VQA: Visual questions about detailed properties of fine-grained categories

    Authors: Thomas Mensink, Jasper Uijlings, Lluis Castrejon, Arushi Goel, Felipe Cadar, Howard Zhou, Fei Sha, André Araujo, Vittorio Ferrari

    Abstract: We propose Encyclopedic-VQA, a large scale visual question answering (VQA) dataset featuring visual questions about detailed properties of fine-grained categories and instances. It contains 221k unique question+answer pairs each matched with (up to) 5 images, resulting in a total of 1M VQA samples. Moreover, our dataset comes with a controlled knowledge base derived from Wikipedia, marking the evi… ▽ More

    Submitted 24 July, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: ICCV'23

  21. arXiv:2306.07526  [pdf, other]

    cs.LG cs.AI

    User-defined Event Sampling and Uncertainty Quantification in Diffusion Models for Physical Dynamical Systems

    Authors: Marc Finzi, Anudhyan Boral, Andrew Gordon Wilson, Fei Sha, Leonardo Zepeda-Núñez

    Abstract: Diffusion models are a class of probabilistic generative models that have been widely used as a prior for image processing tasks like text conditional generation and inpainting. We demonstrate that these models can be adapted to make predictions and provide uncertainty quantification for chaotic dynamical systems. In these applications, diffusion models can implicitly represent knowledge about out… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: ICML 2023 Conference

  22. arXiv:2306.01174  [pdf, other]

    cs.LG math.NA

    Neural Ideal Large Eddy Simulation: Modeling Turbulence with Neural Stochastic Differential Equations

    Authors: Anudhyan Boral, Zhong Yi Wan, Leonardo Zepeda-Núñez, James Lottes, Qing Wang, Yi-fan Chen, John Roberts Anderson, Fei Sha

    Abstract: We introduce a data-driven learning framework that assimilates two powerful ideas: ideal large eddy simulation (LES) from turbulence closure modeling and neural stochastic differential equations (SDE) for stochastic modeling. The ideal LES models the LES flow by treating each full-order trajectory as a random realization of the underlying dynamics, as such, the effect of small-scales is marginaliz… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: 18 pages

  23. arXiv:2305.15618  [pdf, other]

    cs.LG physics.app-ph

    Debias Coarsely, Sample Conditionally: Statistical Downscaling through Optimal Transport and Probabilistic Diffusion Models

    Authors: Zhong Yi Wan, Ricardo Baptista, Yi-fan Chen, John Anderson, Anudhyan Boral, Fei Sha, Leonardo Zepeda-Núñez

    Abstract: We introduce a two-stage probabilistic framework for statistical downscaling using unpaired data. Statistical downscaling seeks a probabilistic map to transform low-resolution data from a biased coarse-grained numerical scheme to high-resolution data that is consistent with a high-fidelity scheme. Our framework tackles the problem by composing two transformations: (i) a debiasing step via an optim… ▽ More

    Submitted 30 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023 (spotlight)

  24. arXiv:2305.15434  [pdf]

    physics.plasm-ph physics.optics

    Hybrid Optimization of Laser-Driven Fusion Targets and Laser Profiles

    Authors: Z. Li, Z. Q. Zhao, X. H. Yang, G. B. Zhang, Y. Y. Ma, H. Xu, F. Y. Wu, F. Q. Shao, J. Zhang

    Abstract: Quasi-isentropic compression is an effective method to achieve high-density and high-temperature implosion in laser-driven inertial confinement fusion (ICF). However, it requires precise matching between the laser profile and the target structure. Designing the optimal laser profile and the corresponding target for ICF is a challenge due to the large number of parameters involved. In this paper, w… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

  25. arXiv:2305.15354  [pdf, other]

    cs.CV

    Counterfactual Co-occurring Learning for Bias Mitigation in Weakly-supervised Object Localization

    Authors: Feifei Shao, Yawei Luo, Lei Chen, Ping Liu, Wei Yang, Yi Yang, Jun Xiao

    Abstract: Contemporary weakly-supervised object localization (WSOL) methods have primarily focused on addressing the challenge of localizing the most discriminative region while largely overlooking the relatively less explored issue of biased activation -- incorrectly spotlighting co-occurring background with the foreground feature. In this paper, we conduct a thorough causal analysis to investigate the ori… ▽ More

    Submitted 9 March, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: 10 pages, 6 figures, 8 tables

  26. arXiv:2305.06594  [pdf, other]

    cs.SD cs.CV cs.LG cs.MM eess.AS

    V2Meow: Meowing to the Visual Beat via Video-to-Music Generation

    Authors: Kun Su, Judith Yue Li, Qingqing Huang, Dima Kuzmin, Joonseok Lee, Chris Donahue, Fei Sha, Aren Jansen, Yu Wang, Mauro Verzetti, Timo I. Denk

    Abstract: Video-to-music generation demands both a temporally localized high-quality listening experience and globally aligned video-acoustic signatures. While recent music generation models excel at the former through advanced audio codecs, the exploration of video-acoustic signatures has been confined to specific visual scenarios. In contrast, our research confronts the challenge of learning globally alig… ▽ More

    Submitted 22 February, 2024; v1 submitted 11 May, 2023; originally announced May 2023.

    Comments: accepted at AAAI 2024, music samples available at https://tinyurl.com/v2meow

  27. arXiv:2305.05182  [pdf, other]

    math.AP

    Self-similar algebraic spiral solution of 2-D incompressible Euler equations

    Authors: Feng Shao, Dongyi Wei, Zhifei Zhang

    Abstract: In this paper, we prove the existence of self-similar algebraic spiral solutions for 2-D incompressible Euler equations for the initial vorticity of the form $|y|^{-\frac1μ}\ \mathringω(θ)$ with $μ>\frac12$ and $\mathringω\in L^1(\mathbb T)$ satisfying $m$-fold symmetry ($m\geq 2$) and a dominant condition. As an important application, we prove the existence of weak solution when $\mathringω$ is a… ▽ More

    Submitted 1 June, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: 60 pages, 1 figure

  28. arXiv:2304.00434  [pdf, ps, other]

    hep-ph

    Transverse momentum and multiplicity dependence of $Λ_{c}^{+}/D^{0}$ ratio in $pp$ collisions at $\sqrt{s}=13$ TeV

    Authors: Jun Song, Hai-hong Li, Feng-lan Shao

    Abstract: We apply an equal-velocity quark combination model to study the $Λ_{c}^{+}/D^{0}$ ratio in the range $p_{T}\lesssim10$ GeV/c in $pp$ collisions at $\sqrt{s}=13$ TeV. We decompose the ratio into four parts which are related to quark numbers, light-flavor quark $p_{T}$ spectrum, charm quark $p_{T}$ spectrum, momentum correlation between light and charm quarks, respectively. Their influence on… ▽ More

    Submitted 1 April, 2023; originally announced April 2023.

    Comments: 13 pages, 5 figures

  29. arXiv:2302.07546  [pdf, ps, other]

    nucl-th hep-ph

    Production of Strange and Charm Hadrons in Pb+Pb Collisions at $\sqrt{s_{NN}}=$ 5.02 TeV

    Authors: Wen-bin Chang, Rui-qin Wang, Jun Song, Feng-lan Shao, Qun Wang, Zuo-tang Liang

    Abstract: Using a quark combination model with the equal-velocity combination approximation, we study the production of hadrons with strangeness and charm flavor quantum numbers in Pb+Pb collisions at $\sqrt{s_{NN}}=$5.02 TeV. We present analytical expressions and numerical results for these hadrons' transverse momentum spectra and yield ratios. Our numerical results agree well with the experimental data av… ▽ More

    Submitted 15 February, 2023; originally announced February 2023.

    Comments: 12 pages, 9 figures

    Journal ref: Symmetry 2023,15(2),400

  30. arXiv:2302.06009  [pdf, other]

    cs.LG cs.CV

    Policy-Induced Self-Supervision Improves Representation Finetuning in Visual RL

    Authors: Sébastien M. R. Arnold, Fei Sha

    Abstract: We study how to transfer representations pretrained on source tasks to target tasks in visual percept based RL. We analyze two popular approaches: freezing or finetuning the pretrained representations. Empirical studies on a set of popular tasks reveal several properties of pretrained representations. First, finetuning is required even when pretrained representations perfectly capture the informat… ▽ More

    Submitted 12 February, 2023; originally announced February 2023.

  31. arXiv:2301.10448  [pdf, other]

    cs.CL cs.AI cs.LG

    Pre-computed memory or on-the-fly encoding? A hybrid approach to retrieval augmentation makes the most of your compute

    Authors: Michiel de Jong, Yury Zemlyanskiy, Nicholas FitzGerald, Joshua Ainslie, Sumit Sanghai, Fei Sha, William Cohen

    Abstract: Retrieval-augmented language models such as Fusion-in-Decoder are powerful, setting the state of the art on a variety of knowledge-intensive tasks. However, they are also expensive, due to the need to encode a large number of retrieved passages. Some work avoids this cost by pre-encoding a text corpus into a memory and retrieving dense representations directly. However, pre-encoding memory incurs… ▽ More

    Submitted 2 June, 2023; v1 submitted 25 January, 2023; originally announced January 2023.

    Comments: ICML 2023

  32. arXiv:2301.10391  [pdf, other]

    cs.LG physics.comp-ph

    Evolve Smoothly, Fit Consistently: Learning Smooth Latent Dynamics For Advection-Dominated Systems

    Authors: Zhong Yi Wan, Leonardo Zepeda-Núñez, Anudhyan Boral, Fei Sha

    Abstract: We present a data-driven, space-time continuous framework to learn surrogate models for complex physical systems described by advection-dominated partial differential equations. Those systems have slow-decaying Kolmogorov n-width that hinders standard methods, including reduced order modeling, from producing high-fidelity simulations at low cost. In this work, we construct hypernetwork-based laten… ▽ More

    Submitted 6 February, 2023; v1 submitted 24 January, 2023; originally announced January 2023.

    Comments: 25 pages, 9 figures

  33. arXiv:2301.09416  [pdf, other]

    cs.CV

    Towards Robust Video Instance Segmentation with Temporal-Aware Transformer

    Authors: Zhenghao Zhang, Fangtao Shao, Zuozhuo Dai, Siyu Zhu

    Abstract: Most existing transformer based video instance segmentation methods extract per frame features independently, hence it is challenging to solve the appearance deformation problem. In this paper, we observe the temporal information is important as well and we propose TAFormer to aggregate spatio-temporal features both in transformer encoder and decoder. Specifically, in transformer encoder, we propo… ▽ More

    Submitted 20 January, 2023; originally announced January 2023.

  34. arXiv:2301.01060  [pdf, other]

    cs.CV

    Knowledge-guided Causal Intervention for Weakly-supervised Object Localization

    Authors: Feifei Shao, Yawei Luo, Fei Gao, Yi Yang, Jun Xiao

    Abstract: Previous weakly-supervised object localization (WSOL) methods aim to expand activation map discriminative areas to cover the whole objects, yet neglect two inherent challenges when relying solely on image-level labels. First, the "entangled context" issue arises from object-context co-occurrence (e.g., fish and water), making the model inspection hard to distinguish object boundaries clearly. Sec… ▽ More

    Submitted 12 March, 2024; v1 submitted 3 January, 2023; originally announced January 2023.

    Comments: 13 pages, 7 figures, 7 tables

  35. arXiv:2212.10043  [pdf, other]

    hep-ph hep-ex

    Shinning Light on Sterile Neutrino Portal Dark Matter from Cosmology and Collider

    Authors: Ang Liu, Feng-Lan Shao, Zhi-Long Han, Yi Jin, Honglei Li

    Abstract: Provided the dark sector consisted of a dark scalar $φ$ and a dark fermion $χ$ under an exact $Z_2$ symmetry, the sterile neutrino $N$ can act as the messenger between the dark sector and standard model via the Yukawa coupling $λ_{ds} \barχφN$. In this paper, we focus on the specific scenario $m_N>m_φ+m_χ$ with $χ$ being a FIMP dark matter. The decay width of dark scalar $φ$ is doubly suppressed b… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

    Comments: 18 pages, 8 figures

  36. arXiv:2212.08153  [pdf, other]

    cs.CL cs.AI cs.LG

    FiDO: Fusion-in-Decoder optimized for stronger performance and faster inference

    Authors: Michiel de Jong, Yury Zemlyanskiy, Joshua Ainslie, Nicholas FitzGerald, Sumit Sanghai, Fei Sha, William Cohen

    Abstract: Fusion-in-Decoder (FiD) is a powerful retrieval-augmented language model that sets the state-of-the-art on many knowledge-intensive NLP tasks. However, the architecture used for FiD was chosen by making minimal modifications to a standard T5 model, which our analysis shows to be highly suboptimal for a retrieval-augmented model. In particular, FiD allocates the bulk of FLOPs to the encoder, while… ▽ More

    Submitted 2 June, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

    Comments: ACL Findings 2023

  37. arXiv:2212.05141  [pdf]

    physics.flu-dyn physics.comp-ph

    A High-resolution Large-eddy Simulation Framework for Wildland Fire Predictions using TensorFlow

    Authors: Qing Wang, Matthias Ihme, Rod R. Linn, Yi-Fan Chen, Vivian Yang, Fei Sha, Craig Clements, Jenna S. McDanold, John Anderson

    Abstract: As the impact of wildfires has become increasingly more severe over the last decades, there is continued pressure for improvements in our ability to predict wildland fire behavior over a wide range of conditions. One approach towards this goal is through coupled fire/atmosphere modeling tools. While significant progress has been made on advancing their physical fidelity, existing modeling tools ha… ▽ More

    Submitted 12 July, 2023; v1 submitted 9 December, 2022; originally announced December 2022.

    Comments: 10 figures, 3 tables, 4844 words

  38. Boundedness of finite morphisms onto Fano manifolds with large Fano index

    Authors: Feng Shao, Guolei Zhong

    Abstract: Let $f:Y\to X$ be a finite morphism between Fano manifolds $Y$ and $X$ such that the Fano index of $X$ is greater than 1. On the one hand, when both $X$ and $Y$ are fourfolds of Picard number 1, we show that the degree of $f$ is bounded in terms of $X$ and $Y$ unless $X\cong\mathbb{P}^4$; hence, such $X$ does not admit any non-isomorphic surjective endomorphism. On the other hand, when $X=Y$ is ei… ▽ More

    Submitted 27 November, 2023; v1 submitted 29 November, 2022; originally announced November 2022.

    Comments: 30 pages, minor revisions, Journal of Algebra (to appear), comments are welcome!

    MSC Class: 08A35; 14E30; 14J35; 14J40; 14M25

    Journal ref: Journal of Algebra, Volume 639, 1 February 2024, Pages 678-707

  39. arXiv:2210.10271  [pdf, other]

    hep-ph nucl-th

    Different Coalescence Sources of Light Nuclei Production in Au-Au Collisions at $\sqrt{s_{NN}}=3$ GeV

    Authors: Rui-Qin Wang, Ji-Peng Lv, Yan-Hao Li, Jun Song, Feng-Lan Shao

    Abstract: We study the production of light nuclei in the coalescence mechanism in Au-Au collisions at midrapidity at $\sqrt{s_{NN}}=3$ GeV. We derive analytic formulas of momentum distributions of two bodies, three bodies and four nucleons coalescing into light nuclei, respectively. We naturally explain the transverse momentum spectra of the deuteron ($d$), triton ($t$), helium-3 ($^3$He) and helium-4 (… ▽ More

    Submitted 15 October, 2023; v1 submitted 18 October, 2022; originally announced October 2022.

    Comments: 5 figures, 6 tables

  40. arXiv:2209.14899  [pdf, other]

    cs.CL

    Generate-and-Retrieve: use your predictions to improve retrieval for semantic parsing

    Authors: Yury Zemlyanskiy, Michiel de Jong, Joshua Ainslie, Panupong Pasupat, Peter Shaw, Linlu Qiu, Sumit Sanghai, Fei Sha

    Abstract: A common recent approach to semantic parsing augments sequence-to-sequence models by retrieving and appending a set of training samples, called exemplars. The effectiveness of this recipe is limited by the ability to retrieve informative exemplars that help produce the correct parse, which is especially challenging in low-resource settings. Existing retrieval is commonly based on similarity of que… ▽ More

    Submitted 29 September, 2022; originally announced September 2022.

    Comments: To appear in the proceedings of COLING 2022

  41. arXiv:2208.02021  [pdf, ps, other]

    hep-ph

    Collision centrality and energy dependence of strange hadron production in Au + Au collisions at $\sqrt{s_{NN}}$ = 7.7-54.4 GeV

    Authors: Yanting Feng, Ziyao Song, Fenglan Shao, Jun Song

    Abstract: We apply an equal-velocity quark combination model to systematically study the transverse momentum ($p_{T}$) spectra of strange hadrons $K_{S}^{0}$, $φ$, $Λ$, $Ξ^{-}$, $Ω^{-}$, $\bar{Λ}$, $\bar{Ξ}^{+}$ and $\bar{Ω}^{+}$ at mid-rapidity in Au+Au collisions at $\sqrt{s_{NN}}$ = 7.7, 11.5, 19.6, 27, 39, 54.4 GeV. Relative deviation between the model calculation and experimental data of these eight hadrons is generally about 2-…

    Submitted 3 August, 2022; originally announced August 2022.

  42. arXiv:2208.00623  [pdf, other]

    cs.CV cs.MM eess.IV

    Quality Evaluation of Arbitrary Style Transfer: Subjective Study and Objective Metric

    Authors: Hangwei Chen, Feng Shao, Xiongli Chai, Yuese Gu, Qiuping Jiang, Xiangchao Meng, Yo-Sung Ho

    Abstract: Arbitrary neural style transfer is a vital topic with great research value and wide industrial application, which strives to render the structure of one image using the style of another. Recent researches have devoted great efforts on the task of arbitrary style transfer (AST) for improving the stylization quality. However, there are very few explorations about the quality evaluation of AST images…

    Submitted 29 January, 2023; v1 submitted 1 August, 2022; originally announced August 2022.

    Comments: Accepted by IEEE Transactions on Circuits and Systems for Video Technology 2022, Code and Dataset: https://github.com/Hangwei-Chen/AST-IQAD-SRQE

  43. arXiv:2207.05595  [pdf, other]

    nucl-th hep-ph nucl-ex

    Energy Dependence of the Breit-Wheeler process in Heavy-Ion Collisions and its Application to Nuclear Charge Radius Measurements

    Authors: Xiaofeng Wang, James Daniel Brandenburg, Lijuan Ruan, Fenglan Shao, Zhangbu Xu, Chi Yang, Wangmei Zha

    Abstract: The collision energy dependence of the cross section and the transverse momentum distribution of dielectrons from the Breit-Wheeler process in heavy-ion collisions are computed in the lowest-order QED and found to be sensitive to the nuclear charge distribution and the infrared-divergence of the ultra-Lorentz boosted Coulomb field. Within a given experimental kinematic acceptance, the cross sectio…

    Submitted 4 April, 2023; v1 submitted 12 July, 2022; originally announced July 2022.

    Comments: 3 figures

  44. arXiv:2205.14205  [pdf, other]

    cs.LG

    ALMA: Hierarchical Learning for Composite Multi-Agent Tasks

    Authors: Shariq Iqbal, Robby Costales, Fei Sha

    Abstract: Despite significant progress on multi-agent reinforcement learning (MARL) in recent years, coordination in complex domains remains a challenge. Work in MARL often focuses on solving tasks where agents interact with all other agents and entities in the environment; however, we observe that real-world tasks are often composed of several isolated instances of local agent interactions (subtasks), and…

    Submitted 25 September, 2022; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: NeurIPS 2022 Camera Ready

  45. arXiv:2205.12253  [pdf, other]

    cs.CL

    Evaluating the Impact of Model Scale for Compositional Generalization in Semantic Parsing

    Authors: Linlu Qiu, Peter Shaw, Panupong Pasupat, Tianze Shi, Jonathan Herzig, Emily Pitler, Fei Sha, Kristina Toutanova

    Abstract: Despite their strong performance on many tasks, pre-trained language models have been shown to struggle on out-of-distribution compositional generalization. Meanwhile, recent work has shown considerable improvements on many NLP tasks from model scaling. Can scaling up model size also improve compositional generalization in semantic parsing? We evaluate encoder-decoder models up to 11B parameters a…

    Submitted 24 October, 2022; v1 submitted 24 May, 2022; originally announced May 2022.

    Comments: EMNLP 2022

  46. Sterile Neutrino Portal Dark Matter in $ν$THDM

    Authors: Ang Liu, Feng-Lan Shao, Zhi-Long Han, Yi Jin, Honglei Li

    Abstract: In this paper, we propose the sterile neutrino portal dark matter in $ν$THDM. This model can naturally generate tiny neutrino mass with the neutrinophilic scalar doublet $Φ_ν$ and sterile neutrinos $N$ around TeV scale. Charged under a $Z_2$ symmetry, one Dirac fermion singlet $χ$ and one scalar singlet $φ$ are further introduced in the dark sector. The sterile neutrinos $N$ are the mediators betw…

    Submitted 24 May, 2022; originally announced May 2022.

    Comments: 27 pages, 14 figures

  47. Averaged transverse momentum correlations of hadrons in relativistic heavy-ion collisions

    Authors: Yan-ting Feng, Feng-lan Shao, Jun Song

    Abstract: We compile experimental data for the averaged transverse momentum ($\left\langle p_{T}\right\rangle $) of proton, $Λ$, $Ξ^{-}$, $Ω^{-}$ and $φ$ at mid-rapidity in Au+Au collisions at $\sqrt{s_{NN}}=$ 200, 39, 27, 19.6, 11.5, 7.7 GeV and in Pb+Pb collisions at $\sqrt{s_{NN}}=$ 2.76 TeV, and find that experimental data of these hadrons exhibit systematic correlations. We apply a quark combination mo…

    Submitted 18 April, 2022; originally announced April 2022.

    Comments: 10 pages, 7 figures

    Journal ref: Phys. Rev. C 106, 034910, 2022

  48. arXiv:2203.12686  [pdf, other]

    cs.LG cs.AI cs.NE stat.ML

    Possibility Before Utility: Learning And Using Hierarchical Affordances

    Authors: Robby Costales, Shariq Iqbal, Fei Sha

    Abstract: Reinforcement learning algorithms struggle on tasks with complex hierarchical dependency structures. Humans and other intelligent agents do not waste time assessing the utility of every high-level action in existence, but instead only consider ones they deem possible in the first place. By focusing only on what is feasible, or "afforded", at the present moment, an agent can spend more time both ev…

    Submitted 23 March, 2022; originally announced March 2022.

    Comments: ICLR 2022 camera-ready

  49. arXiv:2202.12588  [pdf, other]

    cs.CV

    Active Learning for Point Cloud Semantic Segmentation via Spatial-Structural Diversity Reasoning

    Authors: Feifei Shao, Yawei Luo, Ping Liu, Jie Chen, Yi Yang, Yulei Lu, Jun Xiao

    Abstract: The expensive annotation cost is notoriously known as the main constraint for the development of the point cloud semantic segmentation technique. Active learning methods endeavor to reduce such cost by selecting and labeling only a subset of the point clouds, yet previous attempts ignore the spatial-structural diversity of the selected samples, inducing the model to select clustered candidates wit…

    Submitted 18 April, 2022; v1 submitted 25 February, 2022; originally announced February 2022.

    Comments: 9 pages, 6 figures, 2 tables

  50. arXiv:2202.07808  [pdf, other]

    cs.LG

    Policy Learning and Evaluation with Randomized Quasi-Monte Carlo

    Authors: Sebastien M. R. Arnold, Pierre L'Ecuyer, Liyu Chen, Yi-fan Chen, Fei Sha

    Abstract: Reinforcement learning constantly deals with hard integrals, for example when computing expectations in policy evaluation and policy iteration. These integrals are rarely analytically solvable and typically estimated with the Monte Carlo method, which induces high variance in policy values and gradients. In this work, we propose to replace Monte Carlo samples with low-discrepancy point sets. We co…

    Submitted 21 February, 2022; v1 submitted 15 February, 2022; originally announced February 2022.

    Comments: AISTATS 2022 camera ready; more info at: http://seba1511.net/projects/qrl/