Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–8 of 8 results for author: Winsor, E

.
  1. arXiv:2312.10091  [pdf, other

    cs.IR cs.CL cs.LG

    Look Before You Leap: A Universal Emergent Decomposition of Retrieval Tasks in Language Models

    Authors: Alexandre Variengien, Eric Winsor

    Abstract: When solving challenging problems, language models (LMs) are able to identify relevant information from long and complicated contexts. To study how LMs solve retrieval tasks in diverse situations, we introduce ORION, a collection of structured retrieval tasks spanning six domains, from text understanding to coding. Each task in ORION can be represented abstractly by a request (e.g. a question) tha… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

  2. arXiv:2211.12312  [pdf, other

    cs.LG cs.AI

    Interpreting Neural Networks through the Polytope Lens

    Authors: Sid Black, Lee Sharkey, Leo Grinsztajn, Eric Winsor, Dan Braun, Jacob Merizian, Kip Parker, Carlos Ramón Guevara, Beren Millidge, Gabriel Alfour, Connor Leahy

    Abstract: Mechanistic interpretability aims to explain what a neural network has learned at a nuts-and-bolts level. What are the fundamental primitives of neural network representations? Previous mechanistic descriptions have used individual neurons or their linear combinations to understand the representations a network has learned. But there are clues that neurons and their linear combinations are not the… ▽ More

    Submitted 22 November, 2022; originally announced November 2022.

    Comments: 22/11/22 initial upload

  3. arXiv:2110.15343  [pdf, other

    cs.LG

    Scatterbrain: Unifying Sparse and Low-rank Attention Approximation

    Authors: Beidi Chen, Tri Dao, Eric Winsor, Zhao Song, Atri Rudra, Christopher Ré

    Abstract: Recent advances in efficient Transformers have exploited either the sparsity or low-rank properties of attention matrices to reduce the computational and memory bottlenecks of modeling long sequences. However, it is still challenging to balance the trade-off between model quality and efficiency to perform a one-size-fits-all approximation for different tasks. To better understand this trade-off, w… ▽ More

    Submitted 28 October, 2021; originally announced October 2021.

    Comments: NeurIPS 2021

  4. arXiv:1905.04746  [pdf, other

    math.CO cs.DM

    Generalized Lyndon Factorizations of Infinite Words

    Authors: Amanda Burcroff, Eric Winsor

    Abstract: A generalized lexicographic order on words is a lexicographic order where the total order of the alphabet depends on the position of the comparison. A generalized Lyndon word is a finite word which is strictly smallest among its class of rotations with respect to a generalized lexicographic order. This notion can be extended to infinite words: an infinite generalized Lyndon word is an infinite wor… ▽ More

    Submitted 20 June, 2019; v1 submitted 12 May, 2019; originally announced May 2019.

    Comments: 14 pages, 1 figure

    MSC Class: 05A05; 68R15

  5. A Refined Conjecture for the Variance of Gaussian Primes Across Sectors

    Authors: Ryan C. Chen, Yujin H. Kim, Jared D. Lichtman, Steven J. Miller, Alina Shubina, Shannon Sweitzer, Ezra Waxman, Eric Winsor, Jianing Yang

    Abstract: We derive a refined conjecture for the variance of Gaussian primes across sectors, with a power saving error term, by applying the L-functions Ratios Conjecture. We observe a bifurcation point in the main term, consistent with the Random Matrix Theory (RMT) heuristic previously proposed by Rudnick and Waxman. Our model also identifies a second bifurcation point, undetected by the RMT model, that e… ▽ More

    Submitted 22 February, 2021; v1 submitted 22 January, 2019; originally announced January 2019.

    Comments: 47 pages. Minor revisions. Appendix with relevant Mathematica code, included. Accepted for publication in Experimental Mathematics

    Journal ref: Experimental Mathematics (2020)

  6. arXiv:1810.03053  [pdf, ps, other

    math.NT

    Limiting Distributions in Generalized Zeckendorf Decompositions

    Authors: Alexandre Gueganic, Granger Carty, Yujin H. Kim, Steven J. Miller, Alina Shubina, Shannon Sweitzer, Eric Winsor, Jianing Yang

    Abstract: An equivalent definition of the Fibonacci numbers is that they are the unique sequence such that every integer can be written uniquely as a sum of non-adjacent terms. We can view this as we have bins of length 1, we can take at most one element from a bin, and if we choose an element from a bin we cannot take one from a neighboring bin. We generalize to allowing bins of varying length and restrict… ▽ More

    Submitted 6 October, 2018; originally announced October 2018.

    Comments: Version 1.0, 18 pages

    MSC Class: 11G05 (primary); 11G07; 11G40; 11M41 (secondary)

    Journal ref: The Fibonacci Quarterly 57 (2019) 109-125

  7. Lower-Order Biases Second Moments of Dirichlet Coefficients in Families of $L$-Functions

    Authors: Megumi Asada, Ryan Chen, Eva Fourakis, Yujin Kim, Andrew Kwon, Jared D. Lichtman, Blake Mackall, Steven J. Miller, Eric Winsor, Karl Winsor, Jianing Yang, Kevin Yang

    Abstract: Let $\mathcal E: y^2 = x^3 + A(T)x + B(T)$ be a nontrivial one-parameter family of elliptic curves over $\mathbb{Q}(T)$, with $A(T), B(T) \in \mathbb Z(T)$, and consider the $k$\textsuperscript{th} moments $A_{k,\mathcal{E}}(p) := \sum_{t (p)} a_{\mathcal{E}_t}(p)^k$ of the Dirichlet coefficients $a_{\mathcal{E}_t}(p) := p + 1 - |\mathcal{E}_t (\mathbb{F}_p)|$. Rosen and Silverman proved a conject… ▽ More

    Submitted 7 February, 2021; v1 submitted 18 August, 2018; originally announced August 2018.

    Comments: Version 1.0, 40 pages, 2 appendices

    MSC Class: 60B10; 11B39; 11B05 (primary) 65Q30 (secondary)

    Journal ref: Experimental Mathematics (2021)

  8. Spectral Statistics of Non-Hermitian Random Matrix Ensembles

    Authors: Ryan C. Chen, Yujin H. Kim, Jared D. Lichtman, Steven J. Miller, Shannon Sweitzer, Eric Winsor

    Abstract: Recently Burkhardt et. al. introduced the $k$-checkerboard random matrix ensembles, which have a split limiting behavior of the eigenvalues (in the limit all but $k$ of the eigenvalues are on the order of $\sqrt{N}$ and converge to semi-circular behavior, with the remaining $k$ of size $N$ and converging to hollow Gaussian ensembles). We generalize their work to consider non-Hermitian ensembles wi… ▽ More

    Submitted 10 April, 2018; v1 submitted 21 March, 2018; originally announced March 2018.

    Comments: Version 1.0, 35 pages, 5 figures

    MSC Class: 15B52 (primary); 15B57 (secondary)

    Journal ref: Random Matrices Theory Appl. 08 (2019) 1950005