Showing 1–50 of 108 results for author: Yun, C

  1. arXiv:2405.20671  [pdf, other]

    cs.LG cs.AI cs.CL

    Position Coupling: Leveraging Task Structure for Improved Length Generalization of Transformers

    Authors: Hanseul Cho, Jaeyoung Cha, Pranjal Awasthi, Srinadh Bhojanapalli, Anupam Gupta, Chulhee Yun

    Abstract: Even for simple arithmetic tasks like integer addition, it is challenging for Transformers to generalize to longer sequences than those encountered during training. To tackle this problem, we propose position coupling, a simple yet effective method that directly embeds the structure of the tasks into the positional encoding of a (decoder-only) Transformer. Taking a departure from the vanilla absol…

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: 73 pages, 20 figures, 90 tables

  2. arXiv:2405.16002  [pdf, other]

    cs.LG math.OC stat.ML

    Does SGD really happen in tiny subspaces?

    Authors: Minhak Song, Kwangjun Ahn, Chulhee Yun

    Abstract: Understanding the training dynamics of deep neural networks is challenging due to their high-dimensional nature and intricate loss landscapes. Recent studies have revealed that, along the training trajectory, the gradient approximately aligns with a low-rank top eigenspace of the training loss Hessian, referred to as the dominant subspace. Given this alignment, this paper explores whether neural n…

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 22 pages

  3. arXiv:2405.09860  [pdf, other]

    quant-ph cs.NI

    Optimal Switching Networks for Paired-Egress Bell State Analyzer Pools

    Authors: Marii Koyama, Claire Yun, Amin Taherkhani, Naphan Benchasattabuse, Bernard Ousmane Sane, Michal Hajdušek, Shota Nagayama, Rodney Van Meter

    Abstract: To scale quantum computers to useful levels, we must build networks of quantum computational nodes that can share entanglement for use in distributed forms of quantum algorithms. In one proposed architecture, node-to-node entanglement is created when nodes emit photons entangled with stationary memories, with the photons routed through a switched interconnect to a shared pool of Bell state analyze…

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: 11 pages, 8 figures, 1 table

  4. arXiv:2405.02831  [pdf]

    cond-mat.mtrl-sci cond-mat.mes-hall cond-mat.str-el

    Nonvolatile optical control of interlayer stacking order in 1T-TaS2

    Authors: Junde Liu, Pei Liu, Liu Yang, Sung-Hoon Lee, Mojun Pan, Famin Chen, Jierui Huang, Bei Jiang, Mingzhe Hu, Yuchong Zhang, Zhaoyang Xie, Gang Wang, Mengxue Guan, Wei Jiang, Huaixin Yang, Jianqi Li, Chenxia Yun, Zhiwei Wang, Sheng Meng, Yugui Yao, Tian Qian, Xun Shi

    Abstract: Nonvolatile optical manipulation of material properties on demand is a highly sought-after feature in the advancement of future optoelectronic applications. While the discovery of such metastable transition in various materials holds good promise for achieving this goal, their practical implementation is still in the nascent stage. Here, we unravel the nature of the ultrafast laser-induced hidden…

    Submitted 5 May, 2024; originally announced May 2024.

  5. arXiv:2403.06624  [pdf, ps, other]

    math.AG math.AT math.CO

    On the topology of the moduli of tropical unramified $p$-covers

    Authors: Yassine El Maazouz, Paul Alexander Helminck, Felix Röhrle, Pedro Souza, Claudia He Yun

    Abstract: We study the topology of the moduli space of tropical unramified $\mathbb{Z}/p$-covers of tropical curves of genus $g \geq 2$ where $p$ is a prime number. We use recent techniques by Chan--Galatius--Payne to identify a contractible subcomplex of the moduli space. We then use this contractibility result to show that this moduli space is simply connected for all $g$ and $p$. In the case of genus…

    Submitted 12 March, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: 38 pages, 11 figures, 5 tables

    MSC Class: 14T20; 05E14; 14H10

  6. arXiv:2402.10475  [pdf, other]

    math.OC cs.LG

    Fundamental Benefit of Alternating Updates in Minimax Optimization

    Authors: Jaewook Lee, Hanseul Cho, Chulhee Yun

    Abstract: The Gradient Descent-Ascent (GDA) algorithm, designed to solve minimax optimization problems, takes the descent and ascent steps either simultaneously (Sim-GDA) or alternately (Alt-GDA). While Alt-GDA is commonly observed to converge faster, the performance gap between the two is not yet well understood theoretically, especially in terms of global convergence rates. To address this theory-practice…

    Submitted 15 July, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: Accepted to ICML 2024 (Spotlight). 76 pages, 2 figures. Additional experiments (quadratic game, GAN) and proofs

  7. arXiv:2401.17433  [pdf]

    q-bio.TO

    Coronary CTA and Quantitative Cardiac CT Perfusion (CCTP) in Coronary Artery Disease

    Authors: Hao Wu, Yingnan Song, Ammar Hoori, Ananya Subramaniam, Juhwan Lee, Justin Kim, Tao Hu, Sadeer Al-Kindi, Wei-Ming Huang, Chun-Ho Yun, Chung-Lieh Hung, Sanjay Rajagopalan, David L. Wilson

    Abstract: We assessed the benefit of combining stress cardiac CT perfusion (CCTP) myocardial blood flow (MBF) with coronary CT angiography (CCTA) using our innovative CCTP software. By combining CCTA and CCTP, one can uniquely identify a flow-limiting stenosis (obstructive-lesion + low-MBF) versus MVD (no-obstructive-lesion + low-MBF). We retrospectively evaluated 104 patients with suspected CAD, including 1…

    Submitted 30 January, 2024; originally announced January 2024.

  8. arXiv:2401.15554  [pdf]

    cs.CV

    Pericoronary adipose tissue feature analysis in CT calcium score images with comparison to coronary CTA

    Authors: Yingnan Song, Hao Wu, Juhwan Lee, Justin Kim, Ammar Hoori, Tao Hu, Vladislav Zimin, Mohamed Makhlouf, Sadeer Al-Kindi, Sanjay Rajagopalan, Chun-Ho Yun, Chung-Lieh Hung, David L. Wilson

    Abstract: We investigated the feasibility and advantages of using non-contrast CT calcium score (CTCS) images to assess pericoronary adipose tissue (PCAT) and its association with major adverse cardiovascular events (MACE). PCAT features from coronary CTA (CCTA) have been shown to be associated with cardiovascular risk but are potentially confounded by iodine. If PCAT in CTCS images can be similarly analyze…

    Submitted 27 January, 2024; originally announced January 2024.

    Comments: 24 pages, 10 figures

  9. arXiv:2312.14455  [pdf, other]

    cond-mat.mtrl-sci cond-mat.str-el

    Evidence for an Excitonic Insulator State in Ta$_2$Pd$_3$Te$_5$

    Authors: Jierui Huang, Bei Jiang, Jingyu Yao, Dayu Yan, Xincheng Lei, Jiacheng Gao, Zhaopeng Guo, Feng Jin, Yupeng Li, Zhenyu Yuan, Congcong Chai, Haohao Sheng, Mojun Pan, Famin Chen, Junde Liu, Shunye Gao, Gexing Qu, Bo Liu, Zhicheng Jiang, Zhengtai Liu, Xiaoyan Ma, Shiming Zhou, Yaobo Huang, Chenxia Yun, Qingming Zhang, et al. (8 additional authors not shown)

    Abstract: The excitonic insulator (EI) is an exotic ground state of narrow-gap semiconductors and semimetals arising from spontaneous condensation of electron-hole pairs bound by attractive Coulomb interaction. Despite research on EIs dating back to half a century ago, their existence in real materials remains a subject of ongoing debate. In this study, through systematic experimental and theoretical invest…

    Submitted 14 March, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

    Comments: 10 pages, 5 figures

    Journal ref: Phys. Rev. X 14, 011046, 2024

  10. arXiv:2311.15051  [pdf, other]

    cs.LG math.OC stat.ML

    Gradient Descent with Polyak's Momentum Finds Flatter Minima via Large Catapults

    Authors: Prin Phunyaphibarn, Junghyun Lee, Bohan Wang, Huishuai Zhang, Chulhee Yun

    Abstract: Although gradient descent with Polyak's momentum is widely used in modern machine and deep learning, a concrete understanding of its effects on the training trajectory remains elusive. In this work, we empirically show that for linear diagonal networks and nonlinear neural networks, momentum gradient descent with a large learning rate displays large catapults, driving the iterates towards much fla…

    Submitted 29 May, 2024; v1 submitted 25 November, 2023; originally announced November 2023.

    Comments: v3: major updates; 25 pages, 17 figures; the first two authors contributed equally. The preliminary version was accepted to the NeurIPS 2023 M3L Workshop (oral) under the title "Large Catapults in Momentum Gradient Descent with Warmup: An Empirical Study."

  11. arXiv:2310.18593  [pdf, other]

    stat.ML cs.CY cs.LG

    Fair Streaming Principal Component Analysis: Statistical and Algorithmic Viewpoint

    Authors: Junghyun Lee, Hanseul Cho, Se-Young Yun, Chulhee Yun

    Abstract: Fair Principal Component Analysis (PCA) is a problem setting where we aim to perform PCA while making the resulting representation fair in that the projected distributions, conditional on the sensitive attributes, match one another. However, existing approaches to fair PCA have two main problems: theoretically, there has been no statistical foundation of fair PCA in terms of learnability; practica…

    Submitted 28 October, 2023; originally announced October 2023.

    Comments: 42 pages, 5 figures, 4 tables. Accepted to the 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  12. arXiv:2310.08028  [pdf, other]

    cond-mat.mes-hall

    Time-resolved ARPES with probe energy of 6.0/7.2 eV and switchable resolution configuration

    Authors: Mojun Pan, Junde Liu, Famin Chen, Ji Wang, ChenXia Yun, Tian Qian

    Abstract: We present a detailed exposition of the design for time- and angle-resolved photoemission spectroscopy using a UV probe laser source that combines the nonlinear effects of β-BaB2O4 and KBe2BO3F2 optical crystals. The photon energy of the probe laser can be switched between 6.0 and 7.2 eV, with the flexibility to operate each photon energy setting under two distinct resolution configurations.…

    Submitted 12 October, 2023; originally announced October 2023.

  13. arXiv:2310.01082  [pdf, other]

    cs.LG cs.AI math.OC

    Linear attention is (maybe) all you need (to understand transformer optimization)

    Authors: Kwangjun Ahn, Xiang Cheng, Minhak Song, Chulhee Yun, Ali Jadbabaie, Suvrit Sra

    Abstract: Transformer training is notoriously difficult, requiring a careful design of optimizers and use of various heuristics. We make progress towards understanding the subtleties of training Transformers by carefully studying a simple yet canonical linearized shallow Transformer model. Specifically, we train linear Transformers to solve regression tasks, inspired by J. von Oswald et al. (ICML 2023), and…

    Submitted 13 March, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Published at ICLR 2024

  14. arXiv:2307.15777  [pdf, other]

    cs.PL

    Error Localization for Sequential Effect Systems (Extended Version)

    Authors: Colin S. Gordon, Chaewon Yun

    Abstract: We describe a new concrete approach to giving predictable error locations for sequential (flow-sensitive) effect systems. Prior implementations of sequential effect systems rely on either computing a bottom-up effect and comparing it to a declaration (e.g., method annotation) or leaning on constraint-based type inference. These approaches do not necessarily report program locations that precisely…

    Submitted 28 July, 2023; originally announced July 2023.

    Comments: Extended report of upcoming Static Analysis Symposium 2023 paper

  15. arXiv:2307.09265  [pdf, ps, other]

    math.AG math.RT

    PGL orbits in tree varieties

    Authors: Izzet Coskun, Demir Eken, Chris Yun

    Abstract: In this paper, we introduce tree varieties as a natural generalization of products of partial flag varieties. We study orbits of the PGL action on tree varieties. We characterize tree varieties with finitely many PGL orbits, generalizing a celebrated theorem of Magyar, Weyman and Zelevinsky. We give criteria that guarantee that a tree variety has a dense PGL orbit and provide many examples of tree…

    Submitted 18 July, 2023; originally announced July 2023.

    Comments: 25 pages

    MSC Class: Primary: 14L30; 14M15; 14M17. Secondary: 14L35; 51N30

  16. arXiv:2307.04204  [pdf, other]

    cs.LG math.OC stat.ML

    Trajectory Alignment: Understanding the Edge of Stability Phenomenon via Bifurcation Theory

    Authors: Minhak Song, Chulhee Yun

    Abstract: Cohen et al. (2021) empirically study the evolution of the largest eigenvalue of the loss Hessian, also known as sharpness, along the gradient descent (GD) trajectory and observe the Edge of Stability (EoS) phenomenon. The sharpness increases at the early phase of training (referred to as progressive sharpening), and eventually saturates close to the threshold of $2 / \text{(step size)}$. In this…

    Submitted 26 October, 2023; v1 submitted 9 July, 2023; originally announced July 2023.

    Comments: NeurIPS 2023 camera-ready; 51 pages

  17. arXiv:2307.01960  [pdf, ps, other]

    math.AG math.AT math.CO

    A Serre spectral sequence for the moduli space of tropical curves

    Authors: Christin Bibby, Melody Chan, Nir Gadish, Claudia He Yun

    Abstract: We construct, for all $g\geq 2$ and $n\geq 0$, a spectral sequence of rational $S_n$-representations which computes the $S_n$-equivariant reduced rational cohomology of the tropical moduli spaces of curves $Δ_{g,n}$ in terms of compactly supported cohomology groups of configuration spaces of $n$ points on graphs of genus $g$. Using the canonical $S_n$-equivariant isomorphisms…

    Submitted 15 April, 2024; v1 submitted 4 July, 2023; originally announced July 2023.

    Comments: 24 pages plus appendix

    MSC Class: 14H10; 14Q05; 14T20; 55N30; 55R80; 55T10

  18. arXiv:2307.01329  [pdf]

    cond-mat.mtrl-sci cond-mat.mes-hall

    Efficient current-induced spin torques and field-free magnetization switching in a room-temperature van der Waals magnet

    Authors: Chao Yun, Haoran Guo, Zhongchong Lin, Licong Peng, Zhongyu Liang, Miao Meng, Biao Zhang, Zijing Zhao, Leran Wang, Yifei Ma, Yajing Liu, Weiwei Li, Shuai Ning, Yanglong Hou, Jinbo Yang, Zhaochu Luo

    Abstract: The discovery of magnetism in van der Waals (vdW) materials has established unique building blocks for the research of emergent spintronic phenomena. In particular, owing to their intrinsically clean surface without dangling bonds, the vdW magnets hold the potential to construct a superior interface that allows for efficient electrical manipulation of magnetism. Despite several attempts in this di…

    Submitted 3 July, 2023; originally announced July 2023.

  19. arXiv:2306.15593  [pdf]

    cs.CV

    Cardiac CT perfusion imaging of pericoronary adipose tissue (PCAT) highlights potential confounds in coronary CTA

    Authors: Hao Wu, Yingnan Song, Ammar Hoori, Ananya Subramaniam, Juhwan Lee, Justin Kim, Tao Hu, Sadeer Al-Kindi, Wei-Ming Huang, Chun-Ho Yun, Chung-Lieh Hung, Sanjay Rajagopalan, David L. Wilson

    Abstract: Features of pericoronary adipose tissue (PCAT) assessed from coronary computed tomography angiography (CCTA) are associated with inflammation and cardiovascular risk. As PCAT is vascularly connected with coronary vasculature, the presence of iodine is a potential confounding factor on PCAT HU and textures that has not been adequately investigated. Use dynamic cardiac CT perfusion (CCTP) to inform…

    Submitted 27 June, 2023; originally announced June 2023.

    Comments: 13 pages, 8 figures

  20. arXiv:2306.13604  [pdf, other]

    math.CO hep-th math.AG

    Positive del Pezzo Geometry

    Authors: Nick Early, Alheydis Geiger, Marta Panizzut, Bernd Sturmfels, Claudia He Yun

    Abstract: Real, complex, and tropical algebraic geometry join forces in a new branch of mathematical physics called positive geometry. We develop the positive geometry of del Pezzo surfaces and their moduli spaces, viewed as very affine varieties. Their connected components are derived from polyhedral spaces with Weyl group symmetries. We study their canonical forms and scattering amplitudes, and we solve t…

    Submitted 23 June, 2023; originally announced June 2023.

    Comments: 34 pages, 4 figures

  21. arXiv:2306.10711  [pdf, other]

    cs.LG

    PLASTIC: Improving Input and Label Plasticity for Sample Efficient Reinforcement Learning

    Authors: Hojoon Lee, Hanseul Cho, Hyunseung Kim, Daehoon Gwak, Joonkee Kim, Jaegul Choo, Se-Young Yun, Chulhee Yun

    Abstract: In Reinforcement Learning (RL), enhancing sample efficiency is crucial, particularly in scenarios when data acquisition is costly and risky. In principle, off-policy RL algorithms can improve sample efficiency by allowing multiple updates per environment interaction. However, these multiple updates often lead the model to overfit to earlier interactions, which is referred to as the loss of plastic…

    Submitted 8 December, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: 26 pages, 6 figures, accepted to NeurIPS 2023

  22. arXiv:2306.09850  [pdf, other]

    cs.LG math.OC stat.ML

    Practical Sharpness-Aware Minimization Cannot Converge All the Way to Optima

    Authors: Dongkuk Si, Chulhee Yun

    Abstract: Sharpness-Aware Minimization (SAM) is an optimizer that takes a descent step based on the gradient at a perturbation $y_t = x_t + ρ\frac{\nabla f(x_t)}{\lVert \nabla f(x_t) \rVert}$ of the current point $x_t$. Existing studies prove convergence of SAM for smooth functions, but they do so by assuming decaying perturbation size $ρ$ and/or no gradient normalization in $y_t$, which is detached from pr…

    Submitted 27 October, 2023; v1 submitted 16 June, 2023; originally announced June 2023.

    Comments: 39 pages. v3 NeurIPS 2023 camera ready version

  23. arXiv:2306.00267  [pdf, other]

    cs.LG math.OC stat.ML

    Provable Benefit of Mixup for Finding Optimal Decision Boundaries

    Authors: Junsoo Oh, Chulhee Yun

    Abstract: We investigate how pair-wise data augmentation techniques like Mixup affect the sample complexity of finding optimal decision boundaries in a binary linear classification problem. For a family of data distributions with a separability constant $κ$, we analyze how well the optimal classifier in terms of training loss aligns with the optimal one in test accuracy (i.e., Bayes optimal classifier). For…

    Submitted 5 June, 2023; v1 submitted 31 May, 2023; originally announced June 2023.

    Comments: ICML 2023 camera-ready version; 48 pages

  24. arXiv:2305.03422  [pdf]

    cond-mat.mes-hall

    Electrically programmable magnetic coupling in an Ising network exploiting solid-state ionic gating

    Authors: Chao Yun, Zhongyu Liang, Aleš Hrabec, Zhentao Liu, Mantao Huang, Leran Wang, Yifei Xiao, Yikun Fang, Wei Li, Wenyun Yang, Yanglong Hou, Jinbo Yang, Laura J. Heyderman, Pietro Gambardella, Zhaochu Luo

    Abstract: Two-dimensional arrays of magnetically coupled nanomagnets provide a mesoscopic platform for exploring collective phenomena as well as realizing a broad range of spintronic devices. In particular, the magnetic coupling plays a critical role in determining the nature of the cooperative behaviour and providing new functionalities in nanomagnet-based devices. Here, we create coupled Ising-like nanoma…

    Submitted 5 May, 2023; originally announced May 2023.

    Journal ref: Nat Commun 14, 6367 (2023)

  25. Some thoughts and experiments on Bergman's compact amalgamation problem

    Authors: Michael Joswig, Mario Kummer, Andreas Thom, Claudia He Yun

    Abstract: We study the question whether copies of $S^1$ in $\mathrm{SU}(3)$ can be amalgamated in a compact group. This is the simplest instance of a fundamental open problem in the theory of compact groups raised by George Bergman in 1987. Considerable computational experiments suggest that the answer is positive in this case. We obtain a positive answer for a relaxed problem using theoretical consideratio…

    Submitted 13 July, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

    Comments: 15 pages, 2 figures, 3 tables; update contains minor changes that address referee comments

    MSC Class: 22C05; 18B99; 90-05; 90C90

  26. arXiv:2303.07160  [pdf, ps, other]

    cs.LG math.OC stat.ML

    Tighter Lower Bounds for Shuffling SGD: Random Permutations and Beyond

    Authors: Jaeyoung Cha, Jaewook Lee, Chulhee Yun

    Abstract: We study convergence lower bounds of without-replacement stochastic gradient descent (SGD) for solving smooth (strongly-)convex finite-sum minimization problems. Unlike most existing results focusing on final iterate lower bounds in terms of the number of components $n$ and the number of epochs $K$, we seek bounds for arbitrary weighted average iterates that are tight in all factors including the…

    Submitted 9 June, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

    Comments: 58 pages

  27. arXiv:2302.12444  [pdf, other]

    cs.LG math.OC

    On the Training Instability of Shuffling SGD with Batch Normalization

    Authors: David X. Wu, Chulhee Yun, Suvrit Sra

    Abstract: We uncover how SGD interacts with batch normalization and can exhibit undesirable training dynamics such as divergence. More precisely, we study how Single Shuffle (SS) and Random Reshuffle (RR) -- two widely used variants of SGD -- interact surprisingly differently in the presence of batch normalization: RR leads to much more stable evolution of training loss than SS. As a concrete example, for r…

    Submitted 14 August, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

    Comments: ICML 2023 camera-ready version, added references; 75 pages

  28. Oscillating cosmic evolution and constraints on big bang nucleosynthesis in the extended Starobinsky model

    Authors: Jubin Park, Chae-min Yun, Myung-Ki Cheoun, Dukjae Jang

    Abstract: We investigate the cosmic evolutions in the extended Starobinsky model (eSM) obtained by adding one $R^{ab}R_{ab}$ term to the Starobinsky model. We discuss the possibility of various cosmic evolutions with a special focus on the radiation-dominated era (RDE). Using simple assumptions, a second-order non-linear differential equation describing the various cosmic evolutions in the eSM is introduced…

    Submitted 1 May, 2023; v1 submitted 22 December, 2022; originally announced December 2022.

    Journal ref: Journal of Cosmology and Astroparticle Physics, Volume 2023, May 2023

  29. arXiv:2210.05995  [pdf, other]

    math.OC stat.ML

    SGDA with shuffling: faster convergence for nonconvex-PŁ minimax optimization

    Authors: Hanseul Cho, Chulhee Yun

    Abstract: Stochastic gradient descent-ascent (SGDA) is one of the main workhorses for solving finite-sum minimax optimization problems. Most practical implementations of SGDA randomly reshuffle components and sequentially use them (i.e., without-replacement sampling); however, there are few theoretical results on this approach for minimax algorithms, especially outside the easier-to-analyze (strongly-)monot…

    Submitted 20 February, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: ICLR 2023 camera-ready version; 46 pages

  30. arXiv:2209.01070  [pdf, ps, other]

    math.CO

    Discrete Morse theory for symmetric Delta-complexes

    Authors: Claudia He Yun

    Abstract: We generalize Forman's discrete Morse theory to the context of symmetric $Δ$-complexes. As an application, we prove that the coloop subcomplex of the link of the origin $LA^{\mathrm{trop},\mathrm{P}}_g$ in the moduli space of principally polarized tropical abelian varieties of dimension $g$ with respect to the perfect cone decomposition is contractible.

    Submitted 2 September, 2022; originally announced September 2022.

    Comments: 16 pages, 5 figures

    MSC Class: 57Q70; 14T15

  31. arXiv:2208.06286  [pdf]

    physics.acc-ph

    Simulated Lorentz Force Detuning Compensation With A Double Lever Tuner On A Dressed ILC/1.3 GHZ Cavity At Room Temperature

    Authors: C. Contreras-Martinez, Y. Pischalnikov, J. C. Yun

    Abstract: Pulsed SRF linacs with high accelerating gradients experience large frequency shifts caused by Lorentz force detuning (LFD). A piezoelectric actuator with a resonance control algorithm can maintain the cavity frequency at the nominal level, thus reducing the RF power. This study uses a double lever tuner with a piezoelectric actuator for compensation and another piezoelectric actuator to simulate…

    Submitted 12 August, 2022; originally announced August 2022.

    Report number: FERMILAB-CONF-22-581-TD

  32. arXiv:2208.04432  [pdf]

    physics.acc-ph

    Accelerated Lifetime Test of The SRF Dressed Cavity/Tuner System for LCLS II HE Project

    Authors: Y. Pischalnikov, T. Arkan, C. Contreras-Martinez, B. Hartsell, J. Kaluzny, R. Pilipenko, J. C. Yun, W. Lahmadi

    Abstract: The off-frequency detune method is being considered for application in the LCLS-II-HE superconducting linac to produce multi-energy electron beams for supporting multiple undulator lines simultaneously [1]. Design of the tuner has been changed to deliver roughly 3 times larger frequency tuning range. Working requirements for off-frequency operation (OFO) state that cavities be tuned at least twice…

    Submitted 8 August, 2022; originally announced August 2022.

    Report number: FERMILAB-CONF-22-555-TD

  33. An exact solution of the higher-order gravity in standard radiation-dominated era

    Authors: Chae-min Yun, Jubin Park, Myung-Ki Cheoun, Dukjae Jang

    Abstract: We report that the standard evolution of radiation-dominated era (RDE) universe $a \propto t^{1/2}$ is a sufficient condition for solving a sixth order gravitational field equation derived from the Lagrangian containing $B R^{ab}R_{ab} + C R {R^{;c}}_{c}$ as well as a polynomial $f(R)$ for a spatially flat radiation FLRW universe. By virtue of the similarity between $R^{ab}R_{ab}$ and $R^2$ models…

    Submitted 2 January, 2023; v1 submitted 20 July, 2022; originally announced July 2022.

  34. arXiv:2207.02800  [pdf, ps, other]

    math.AG

    Equivariant Hodge polynomials of heavy/light moduli spaces

    Authors: Siddarth Kannan, Stefano Serpente, Claudia He Yun

    Abstract: Let $\bar{\mathcal{M}}_{g, m|n}$ denote Hassett's moduli space of weighted pointed stable curves of genus $g$ for the heavy/light weight data $\left(1^{(m)}, 1/n^{(n)}\right)$, and let $\mathcal{M}_{g, m|n} \subset \bar{\mathcal{M}}_{g, m|n}$ be the locus parameterizing smooth, not necessarily distinctly marked curves. We give a change-of-variables formula which computes the generating function fo…

    Submitted 22 April, 2024; v1 submitted 6 July, 2022; originally announced July 2022.

    Comments: 21 pages, 3 tables. Edits based on referee suggestions

    MSC Class: 14H10

  35. arXiv:2204.02873   

    physics.geo-ph

    Multi-task Unscented Kalman Inversion for joint inversion of receiver function and surface wave dispersion

    Authors: Wang Longlong, Liu Youshan, Chen Yun, Du nanqiao

    Abstract: Based on the recently developed theory of Unscented Kalman Inversion in computational mathematics, we proposed a Bayesian joint inversion framework, i.e., Multi-task Unscented Kalman Inversion (MTUKI), and apply it to the joint inversion of receiver function (RF) and surface wave dispersion (SWD). This method can share information between different observations in a derivative-free way and provide…

    Submitted 15 January, 2023; v1 submitted 6 April, 2022; originally announced April 2022.

    Comments: This version duplicates another submission (arXiv:2202.09544)

  36. arXiv:2110.10342  [pdf, other]

    cs.LG math.OC stat.ML

    Minibatch vs Local SGD with Shuffling: Tight Convergence Bounds and Beyond

    Authors: Chulhee Yun, Shashank Rajput, Suvrit Sra

    Abstract: In distributed learning, local SGD (also known as federated averaging) and its simple baseline minibatch SGD are widely studied optimization methods. Most existing analyses of these methods assume independent and unbiased gradient estimates obtained via with-replacement sampling. In contrast, we study shuffling-based variants: minibatch and local Random Reshuffling, which draw stochastic gradients…

    Submitted 23 March, 2022; v1 submitted 19 October, 2021; originally announced October 2021.

    Comments: ICLR 2022 camera-ready (selected for an oral presentation); 76 pages, 3 figures

  37. The role of viral infectivity in oncolytic virotherapy outcomes: A mathematical study

    Authors: Pantea Pooladvand, Chae-Ok Yun, A-Rum Yoon, Peter S. Kim, Federico Frascoli

    Abstract: A model capturing the dynamics between virus and tumour cells in the context of oncolytic virotherapy is presented and analysed. The ability of the virus to be internalised by uninfected cells is described by an infectivity parameter, which is inferred from available experimental data. The parameter is also able to describe the effects of changes in the tumour environment that affect viral uptake…

    Submitted 7 October, 2021; originally announced October 2021.

    Comments: 29 pages, 13 figures, 1 table

    MSC Class: 92-10

    Journal ref: Mathematical Biosciences, 334: 108520 (2021)

  38. arXiv:2109.03302  [pdf, ps, other]

    math.CO math.AG

    Homology representations of compactified configurations on graphs applied to $\mathcal{M}_{2,n}$

    Authors: Christin Bibby, Melody Chan, Nir Gadish, Claudia He Yun

    Abstract: We obtain new calculations of the top weight rational cohomology of the moduli spaces $\mathcal{M}_{2,n}$, equivalently the rational homology of the tropical moduli spaces $Δ_{2,n}$, as a representation of $S_n$. These calculations are achieved fully for all $n\leq 10$, and partially -- for specific irreducible representations of $S_n$ -- for $n\le 22$. We also present conjectures, verified up to…

    Submitted 25 April, 2023; v1 submitted 7 September, 2021; originally announced September 2021.

    Comments: 18 pages, minor edits

    MSC Class: 05C10 (primary); 14H10; 14Q05; 14T20; 55R80; 55P65

  39. Observation of the Orbital Rashba-Edelstein Magnetoresistance

    Authors: Shilei Ding, Zhongyu Liang, Dongwook Go, Chao Yun, Mingzhu Xue, Zhou Liu, Sven Becker, Wenyun Yang, Honglin Du, Changsheng Wang, Yingchang Yang, Gerhard Jakob, Mathias Kläui, Yuriy Mokrousov, Jinbo Yang

    Abstract: We report the observation of magnetoresistance (MR) originating from the orbital angular momentum transport (OAM) in a Permalloy (Py) / oxidized Cu (Cu*) heterostructure: the orbital Rashba-Edelstein magnetoresistance. The angular dependence of the MR depends on the relative angle between the induced OAM and the magnetization in a similar fashion as the spin Hall magnetoresistance (SMR). Despite t…

    Submitted 11 May, 2021; v1 submitted 10 May, 2021; originally announced May 2021.

    Comments: 11 pages, 3 figures

  40. arXiv:2103.07079  [pdf, other]

    cs.LG math.OC

    Can Single-Shuffle SGD be Better than Reshuffling SGD and GD?

    Authors: Chulhee Yun, Suvrit Sra, Ali Jadbabaie

    Abstract: We propose matrix norm inequalities that extend the Recht-Ré (2012) conjecture on a noncommutative AM-GM inequality by supplementing it with another inequality that accounts for single-shuffle, which is a widely used without-replacement sampling scheme that shuffles only once in the beginning and is overlooked in the Recht-Ré conjecture. Instead of general positive semidefinite matrices, we restri…

    Submitted 11 March, 2021; originally announced March 2021.

    Comments: 26 pages, 2 figures

  41. arXiv:2010.13363  [pdf, other]

    cs.LG

    Provable Memorization via Deep Neural Networks using Sub-linear Parameters

    Authors: Sejun Park, Jaeho Lee, Chulhee Yun, Jinwoo Shin

    Abstract: It is known that $O(N)$ parameters are sufficient for neural networks to memorize arbitrary $N$ input-label pairs. By exploiting depth, we show that $O(N^{2/3})$ parameters suffice to memorize $N$ pairs, under a mild condition on the separation of input points. In particular, deeper networks (even with width $3$) are shown to memorize more pairs than shallow networks, which also agrees with the re…

    Submitted 2 November, 2021; v1 submitted 26 October, 2020; originally announced October 2020.

  42. arXiv:2010.11767  [pdf, other]

    math.CO math.AG

    Topology of tropical moduli spaces of weighted stable curves in higher genus

    Authors: Siddarth Kannan, Shiyue Li, Stefano Serpente, Claudia He Yun

    Abstract: Given integers $g \geq 0$, $n \geq 1$, and a vector $w \in (\mathbb{Q} \cap (0, 1])^n$ such that ${2g - 2 + \sum w_i > 0}$, we study the topology of the moduli space $Δ_{g, w}$ of $w$-stable tropical curves of genus $g$ with volume 1. The space $Δ_{g, w}$ is the dual complex of the divisor of singular curves in Hassett's moduli space of $w$-stable genus $g$ curves $\overline{\mathcal{M}}_{g, w}$.…

    Submitted 15 March, 2022; v1 submitted 22 October, 2020; originally announced October 2020.

    Comments: 14 pages; 1 figure; final version accepted at Advances in Geometry

    MSC Class: 14T05

  43. DML-GANR: Deep Metric Learning With Generative Adversarial Network Regularization for High Spatial Resolution Remote Sensing Image Retrieval

    Authors: Yun Cao, Yuebin Wang, Junhuan Peng, Liqiang Zhang, Linlin Xu, Kai Yan, Lihua Li

    Abstract: With a small number of labeled samples for training, it can save considerable manpower and material resources, especially when the amount of high spatial resolution remote sensing images (HSR-RSIs) increases considerably. However, many deep models face the problem of overfitting when using a small number of labeled samples. This might degrade HSR-RSI retrieval accuracy. Aiming at obtaining more acc…

    Submitted 6 October, 2020; originally announced October 2020.

    Comments: 17 pages

  44. SLCRF: Subspace Learning with Conditional Random Field for Hyperspectral Image Classification

    Authors: Yun Cao, Jie Mei, Yuebin Wang, Liqiang Zhang, Junhuan Peng, Bing Zhang, Lihua Li, Yibo Zheng

    Abstract: Subspace learning (SL) plays an important role in hyperspectral image (HSI) classification, since it can provide an effective solution to reduce the redundant information in the image pixels of HSIs. Previous works about SL aim to improve the accuracy of HSI recognition. Using a large number of labeled samples, related methods can train the parameters of the proposed solutions to obtain better rep…

    Submitted 6 October, 2020; originally announced October 2020.

    Comments: 13 pages, 6 figures

  45. arXiv:2010.02501  [pdf, other]

    cs.LG math.OC stat.ML

    A Unifying View on Implicit Bias in Training Linear Neural Networks

    Authors: Chulhee Yun, Shankar Krishnan, Hossein Mobahi

    Abstract: We study the implicit bias of gradient flow (i.e., gradient descent with infinitesimal step size) on linear neural network training. We propose a tensor formulation of neural networks that includes fully-connected, diagonal, and convolutional networks as special cases, and investigate the linear version of the formulation called linear tensor networks. With this formulation, we can characterize th…

    Submitted 10 September, 2021; v1 submitted 6 October, 2020; originally announced October 2020.

    Comments: 38 pages, 7 figures. Revision after ICLR 2021 camera-ready version. Figure 2 newly added, theorem statements revised, including correction of Theorem 2

  46. arXiv:2008.04426  [pdf, ps, other]

    math.AG math.CO

    The $S_n$-equivariant rational homology of the tropical moduli spaces $Δ_{2,n}$

    Authors: Claudia He Yun

    Abstract: We compute the $S_n$-equivariant rational homology of the tropical moduli spaces $Δ_{2,n}$ for $n\leq 8$ using a cellular chain complex for symmetric $Δ$-complexes in Sage.

    Submitted 10 August, 2020; originally announced August 2020.

    Comments: 17 pages, 2 figures, 6 tables

    MSC Class: 14T10 (Primary); 14Q05 (Secondary)

  47. arXiv:2006.14759  [pdf]

    math.FA math.NA

    Existence and convergence theorems for monotone generalized alpa-nonexpansive mappings in uniformly convex partially ordered hyperbolic metric spaces and its application

    Authors: Chang Il Rim, Jong Gyong Kim, Chol-Hui Yun

    Abstract: In this paper, we generalize the existence result in [14] and prove convergence theorems of the iterative scheme in [12, 16] for monotone generalized alpa-nonexpansive mappings in uniformly convex partially ordered hyperbolic metric spaces. We also give a numerical example to show that this scheme converges faster than the scheme in [14], and we apply the result to an integral equation.

    Submitted 25 June, 2020; originally announced June 2020.

  48. arXiv:2006.08859  [pdf, other]

    cs.LG stat.ML

    Minimum Width for Universal Approximation

    Authors: Sejun Park, Chulhee Yun, Jaeho Lee, Jinwoo Shin

    Abstract: The universal approximation property of width-bounded networks has been studied as a dual of classical universal approximation results on depth-bounded networks. However, the critical width enabling the universal approximation has not been exactly characterized in terms of the input dimension $d_x$ and the output dimension $d_y$. In this work, we provide the first definitive result in this directi…

    Submitted 15 June, 2020; originally announced June 2020.

  49. arXiv:2006.06946  [pdf, other]

    math.OC stat.ML

    SGD with shuffling: optimal rates without component convexity and large epoch requirements

    Authors: Kwangjun Ahn, Chulhee Yun, Suvrit Sra

    Abstract: We study without-replacement SGD for solving finite-sum optimization problems. Specifically, depending on how the indices of the finite-sum are shuffled, we consider the RandomShuffle (shuffle at the beginning of each epoch) and SingleShuffle (shuffle only once) algorithms. First, we establish minimax optimal convergence rates of these algorithms up to poly-log factors. Notably, our analysis is ge…

    Submitted 21 June, 2020; v1 submitted 12 June, 2020; originally announced June 2020.

    Comments: 53 pages; supersedes the preprint arXiv:2004.08657; v2 corrects an erroneous claim about SingleShuffle and newly adds Theorem 24 and Appendix F for SingleShuffle

  50. arXiv:2006.04862  [pdf, other]

    cs.LG stat.ML

    $O(n)$ Connections are Expressive Enough: Universal Approximability of Sparse Transformers

    Authors: Chulhee Yun, Yin-Wen Chang, Srinadh Bhojanapalli, Ankit Singh Rawat, Sashank J. Reddi, Sanjiv Kumar

    Abstract: Recently, Transformer networks have redefined the state of the art in many NLP tasks. However, these models suffer from quadratic computational cost in the input sequence length $n$ to compute pairwise attention in each layer. This has prompted recent research into sparse Transformers that sparsify the connections in the attention layers. While empirically promising for long sequences, fundamental…

    Submitted 19 December, 2020; v1 submitted 8 June, 2020; originally announced June 2020.

    Comments: 31 pages, NeurIPS 2020 Camera-ready