Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–28 of 28 results for author: Wan, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.17458  [pdf, ps, other

    cs.CR cs.LG

    Expectations Versus Reality: Evaluating Intrusion Detection Systems in Practice

    Authors: Jake Hesford, Daniel Cheng, Alan Wan, Larry Huynh, Seungho Kim, Hyoungshick Kim, Jin B. Hong

    Abstract: Our paper provides empirical comparisons between recent IDSs to provide an objective comparison between them to help users choose the most appropriate solution based on their requirements. Our results show that no one solution is the best, but is dependent on external variables such as the types of attacks, complexity, and network environment in the dataset. For example, BoT_IoT and Stratosphere I… ▽ More

    Submitted 28 March, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: 10 pages

    MSC Class: 68M25; 68M20 ACM Class: C.4; D.m

  2. arXiv:2402.11782  [pdf, other

    cs.CL cs.LG

    What Evidence Do Language Models Find Convincing?

    Authors: Alexander Wan, Eric Wallace, Dan Klein

    Abstract: Retrieval-augmented language models are being increasingly tasked with subjective, contentious, and conflicting queries such as "is aspartame linked to cancer". To resolve these ambiguous queries, one must search through a large range of websites and consider "which, if any, of this evidence do I find convincing?". In this work, we study how LLMs answer this question. In particular, we construct C… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

  3. arXiv:2307.08771  [pdf, other

    cs.CV

    UPSCALE: Unconstrained Channel Pruning

    Authors: Alvin Wan, Hanxiang Hao, Kaushik Patnaik, Yueyang Xu, Omer Hadad, David Güera, Zhile Ren, Qi Shan

    Abstract: As neural networks grow in size and complexity, inference speeds decline. To combat this, one of the most effective compression techniques -- channel pruning -- removes channels from weights. However, for multi-branch segments of a model, channel removal can introduce inference-time memory copies. In turn, these copies increase inference latency -- so much so that the pruned model can be slower th… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: 29 pages, 26 figures, accepted to ICML 2023

  4. arXiv:2305.04356  [pdf, other

    cs.CL cs.LG

    Stanford MLab at SemEval-2023 Task 10: Exploring GloVe- and Transformer-Based Methods for the Explainable Detection of Online Sexism

    Authors: Hee Jung Choi, Trevor Chow, Aaron Wan, Hong Meng Yam, Swetha Yogeswaran, Beining Zhou

    Abstract: In this paper, we discuss the methods we applied at SemEval-2023 Task 10: Towards the Explainable Detection of Online Sexism. Given an input text, we perform three classification tasks to predict whether the text is sexist and classify the sexist text into subcategories in order to provide an additional explanation as to why the text is sexist. We explored many different types of models, including… ▽ More

    Submitted 7 May, 2023; originally announced May 2023.

  5. arXiv:2305.00944  [pdf, other

    cs.CL cs.CR cs.LG

    Poisoning Language Models During Instruction Tuning

    Authors: Alexander Wan, Eric Wallace, Sheng Shen, Dan Klein

    Abstract: Instruction-tuned LMs such as ChatGPT, FLAN, and InstructGPT are finetuned on datasets that contain user-submitted examples, e.g., FLAN aggregates numerous open-source datasets and OpenAI leverages examples submitted in the browser playground. In this work, we show that adversaries can contribute poison examples to these datasets, allowing them to manipulate model predictions whenever a desired tr… ▽ More

    Submitted 1 May, 2023; originally announced May 2023.

    Comments: ICML 2023

  6. arXiv:2304.12406  [pdf, other

    cs.CV

    AutoFocusFormer: Image Segmentation off the Grid

    Authors: Chen Ziwen, Kaushik Patnaik, Shuangfei Zhai, Alvin Wan, Zhile Ren, Alex Schwing, Alex Colburn, Li Fuxin

    Abstract: Real world images often have highly imbalanced content density. Some areas are very uniform, e.g., large patches of blue sky, while other areas are scattered with many small objects. Yet, the commonly used successive grid downsampling strategy in convolutional deep networks treats all areas equally. Hence, small objects are represented in very few spatial locations, leading to worse results in tas… ▽ More

    Submitted 25 October, 2023; v1 submitted 24 April, 2023; originally announced April 2023.

    Comments: CVPR 2023

    ACM Class: I.4.6; I.4.8

  7. arXiv:2302.10914  [pdf, other

    cs.LG cs.AI cs.CL

    GLUECons: A Generic Benchmark for Learning Under Constraints

    Authors: Hossein Rajaby Faghihi, Aliakbar Nafar, Chen Zheng, Roshanak Mirzaee, Yue Zhang, Andrzej Uszok, Alexander Wan, Tanawan Premsri, Dan Roth, Parisa Kordjamshidi

    Abstract: Recent research has shown that integrating domain knowledge into deep learning architectures is effective -- it helps reduce the amount of required data, improves the accuracy of the models' decisions, and improves the interpretability of models. However, the research community is missing a convened benchmark for systematically evaluating knowledge integration methods. In this work, we create a be… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

    Comments: 8 pages, Accepted in AAAI 2023 proceedings

  8. arXiv:2106.07708  [pdf

    cs.LG cs.AI cs.CV eess.IV

    CathAI: Fully Automated Interpretation of Coronary Angiograms Using Neural Networks

    Authors: Robert Avram, Jeffrey E. Olgin, Alvin Wan, Zeeshan Ahmed, Louis Verreault-Julien, Sean Abreau, Derek Wan, Joseph E. Gonzalez, Derek Y. So, Krishan Soni, Geoffrey H. Tison

    Abstract: Coronary heart disease (CHD) is the leading cause of adult death in the United States and worldwide, and for which the coronary angiography procedure is the primary gateway for diagnosis and clinical management decisions. The standard-of-care for interpretation of coronary angiograms depends upon ad-hoc visual assessment by the physician operator. However, ad-hoc visual interpretation of angiogram… ▽ More

    Submitted 14 June, 2021; originally announced June 2021.

    Comments: 62 pages, 3 main figures, 2 main tables

    ACM Class: I.4.9; I.2.10; J.3

  9. arXiv:2006.06868  [pdf, other

    cs.CV cs.LG

    SegNBDT: Visual Decision Rules for Segmentation

    Authors: Alvin Wan, Daniel Ho, Younjin Song, Henk Tillman, Sarah Adel Bargal, Joseph E. Gonzalez

    Abstract: The black-box nature of neural networks limits model decision interpretability, in particular for high-dimensional inputs in computer vision and for dense pixel prediction tasks like segmentation. To address this, prior work combines neural networks with decision trees. However, such models (1) perform poorly when compared to state-of-the-art segmentation models or (2) fail to produce decision rul… ▽ More

    Submitted 11 June, 2020; originally announced June 2020.

    Comments: 8 pages, 8 figures

  10. arXiv:2006.03677  [pdf, other

    cs.CV cs.LG eess.IV

    Visual Transformers: Token-based Image Representation and Processing for Computer Vision

    Authors: Bichen Wu, Chenfeng Xu, Xiaoliang Dai, Alvin Wan, Peizhao Zhang, Zhicheng Yan, Masayoshi Tomizuka, Joseph Gonzalez, Kurt Keutzer, Peter Vajda

    Abstract: Computer vision has achieved remarkable success by (a) representing images as uniformly-arranged pixel arrays and (b) convolving highly-localized features. However, convolutions treat all image pixels equally regardless of importance; explicitly model all concepts across all images, regardless of content; and struggle to relate spatially-distant concepts. In this work, we challenge this paradigm b… ▽ More

    Submitted 19 November, 2020; v1 submitted 5 June, 2020; originally announced June 2020.

  11. arXiv:2006.02049  [pdf, other

    cs.CV cs.LG cs.NE

    FBNetV3: Joint Architecture-Recipe Search using Predictor Pretraining

    Authors: Xiaoliang Dai, Alvin Wan, Peizhao Zhang, Bichen Wu, Zijian He, Zhen Wei, Kan Chen, Yuandong Tian, Matthew Yu, Peter Vajda, Joseph E. Gonzalez

    Abstract: Neural Architecture Search (NAS) yields state-of-the-art neural networks that outperform their best manually-designed counterparts. However, previous NAS methods search for architectures under one set of training hyper-parameters (i.e., a training recipe), overlooking superior architecture-recipe combinations. To address this, we present Neural Architecture-Recipe Search (NARS) to search both (a)… ▽ More

    Submitted 30 March, 2021; v1 submitted 3 June, 2020; originally announced June 2020.

  12. arXiv:2005.13164  [pdf, other

    cs.CR cs.CY

    CoVista: A Unified View on Privacy Sensitive Mobile Contact Tracing Effort

    Authors: David Culler, Prabal Dutta, Gabe Fierro, Joseph E. Gonzalez, Nathan Pemberton, Johann Schleier-Smith, K. Shankari, Alvin Wan, Thomas Zachariah

    Abstract: Governments around the world have become increasingly frustrated with tech giants dictating public health policy. The software created by Apple and Google enables individuals to track their own potential exposure through collated exposure notifications. However, the same software prohibits location tracking, denying key information needed by public health officials for robust contract tracing. Thi… ▽ More

    Submitted 27 May, 2020; originally announced May 2020.

  13. arXiv:2004.05565  [pdf, other

    cs.CV cs.AI cs.LG cs.NE

    FBNetV2: Differentiable Neural Architecture Search for Spatial and Channel Dimensions

    Authors: Alvin Wan, Xiaoliang Dai, Peizhao Zhang, Zijian He, Yuandong Tian, Saining Xie, Bichen Wu, Matthew Yu, Tao Xu, Kan Chen, Peter Vajda, Joseph E. Gonzalez

    Abstract: Differentiable Neural Architecture Search (DNAS) has demonstrated great success in designing state-of-the-art, efficient neural networks. However, DARTS-based DNAS's search space is small when compared to other search methods', since all candidate network layers must be explicitly instantiated in memory. To address this bottleneck, we propose a memory and computationally efficient DNAS variant: DM… ▽ More

    Submitted 12 April, 2020; originally announced April 2020.

    Comments: 8 pages, 10 figures, accepted to CVPR 2020

  14. arXiv:2004.00221  [pdf, other

    cs.CV cs.LG cs.NE

    NBDT: Neural-Backed Decision Trees

    Authors: Alvin Wan, Lisa Dunlap, Daniel Ho, Jihan Yin, Scott Lee, Henry Jin, Suzanne Petryk, Sarah Adel Bargal, Joseph E. Gonzalez

    Abstract: Machine learning applications such as finance and medicine demand accurate and justifiable predictions, barring most deep learning methods from use. In response, previous work combines decision trees with deep learning, yielding models that (1) sacrifice interpretability for accuracy or (2) sacrifice accuracy for interpretability. We forgo this dilemma by jointly improving accuracy and interpretab… ▽ More

    Submitted 27 January, 2021; v1 submitted 1 April, 2020; originally announced April 2020.

    Comments: 8 pages, 7 figures, accepted to ICLR 2021

  15. arXiv:1803.00101  [pdf, other

    cs.LG cs.AI stat.ML

    Model-Based Value Estimation for Efficient Model-Free Reinforcement Learning

    Authors: Vladimir Feinberg, Alvin Wan, Ion Stoica, Michael I. Jordan, Joseph E. Gonzalez, Sergey Levine

    Abstract: Recent model-free reinforcement learning algorithms have proposed incorporating learned dynamics models as a source of additional data with the intention of reducing sample complexity. Such methods hold the promise of incorporating imagined data coupled with a notion of model uncertainty to accelerate the learning of continuous control tasks. Unfortunately, they rely on heuristics that limit usage… ▽ More

    Submitted 28 February, 2018; originally announced March 2018.

  16. arXiv:1711.08141  [pdf, other

    cs.CV

    Shift: A Zero FLOP, Zero Parameter Alternative to Spatial Convolutions

    Authors: Bichen Wu, Alvin Wan, Xiangyu Yue, Peter Jin, Sicheng Zhao, Noah Golmant, Amir Gholaminejad, Joseph Gonzalez, Kurt Keutzer

    Abstract: Neural networks rely on convolutions to aggregate spatial information. However, spatial convolutions are expensive in terms of model size and computation, both of which grow quadratically with respect to kernel size. In this paper, we present a parameter-free, FLOP-free "shift" operation as an alternative to spatial convolutions. We fuse shifts and point-wise convolutions to construct end-to-end t… ▽ More

    Submitted 3 December, 2017; v1 submitted 22 November, 2017; originally announced November 2017.

    Comments: Source code will be released afterwards

  17. arXiv:1710.07368  [pdf, other

    cs.CV

    SqueezeSeg: Convolutional Neural Nets with Recurrent CRF for Real-Time Road-Object Segmentation from 3D LiDAR Point Cloud

    Authors: Bichen Wu, Alvin Wan, Xiangyu Yue, Kurt Keutzer

    Abstract: In this paper, we address semantic segmentation of road-objects from 3D LiDAR point clouds. In particular, we wish to detect and categorize instances of interest, such as cars, pedestrians and cyclists. We formulate this problem as a point- wise classification problem, and propose an end-to-end pipeline called SqueezeSeg based on convolutional neural networks (CNN): the CNN takes a transformed LiD… ▽ More

    Submitted 19 October, 2017; originally announced October 2017.

  18. arXiv:1612.01051  [pdf, other

    cs.CV

    SqueezeDet: Unified, Small, Low Power Fully Convolutional Neural Networks for Real-Time Object Detection for Autonomous Driving

    Authors: Bichen Wu, Alvin Wan, Forrest Iandola, Peter H. Jin, Kurt Keutzer

    Abstract: Object detection is a crucial task for autonomous driving. In addition to requiring high accuracy to ensure safety, object detection for autonomous driving also requires real-time inference speed to guarantee prompt vehicle control, as well as small model size and energy efficiency to enable embedded system deployment. In this work, we propose SqueezeDet, a fully convolutional neural network for… ▽ More

    Submitted 11 June, 2019; v1 submitted 3 December, 2016; originally announced December 2016.

    Comments: The supplementary material of this paper, which discusses the energy efficiency of SqueezeDet, is attached after the main paper. The source code of this work is open-source released at https://github.com/BichenWuUCB/squeezeDet

  19. arXiv:1506.01055  [pdf, ps, other

    cs.DM

    An inequality for the Fourier spectrum of parity decision trees

    Authors: Eric Blais, Li-Yang Tan, Andrew Wan

    Abstract: We give a new bound on the sum of the linear Fourier coefficients of a Boolean function in terms of its parity decision tree complexity. This result generalizes an inequality of O'Donnell and Servedio for regular decision trees. We use this bound to obtain the first non-trivial lower bound on the parity decision tree complexity of the recursive majority function.

    Submitted 20 May, 2015; originally announced June 2015.

  20. arXiv:1505.01072  [pdf, ps, other

    cs.CL cs.IR

    Mining Measured Information from Text

    Authors: Arun S. Maiya, Dale Visser, Andrew Wan

    Abstract: We present an approach to extract measured information from text (e.g., a 1370 degrees C melting point, a BMI greater than 29.9 kg/m^2 ). Such extractions are critically important across a wide range of domains - especially those involving search and exploration of scientific and technical documents. We first propose a rule-based entity extractor to mine measured quantities (i.e., a numeric value… ▽ More

    Submitted 5 May, 2015; originally announced May 2015.

    Comments: 4 pages; 38th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '15)

    ACM Class: I.2.7; H.3.3

  21. arXiv:1405.7028  [pdf, ps, other

    cs.CC

    Pseudorandomness and Fourier Growth Bounds for Width 3 Branching Programs

    Authors: Thomas Steinke, Salil Vadhan, Andrew Wan

    Abstract: We present an explicit pseudorandom generator for oblivious, read-once, width-$3$ branching programs, which can read their input bits in any order. The generator has seed length $\tilde{O}( \log^3 n ).$ The previously best known seed length for this model is $n^{1/2+o(1)}$ due to Impagliazzo, Meka, and Zuckerman (FOCS '12). Our work generalizes a recent result of Reingold, Steinke, and Vadhan (RAN… ▽ More

    Submitted 27 May, 2014; originally announced May 2014.

    Comments: arXiv admin note: text overlap with arXiv:1306.3004

  22. arXiv:1405.5268  [pdf, ps, other

    cs.LG cs.CC cs.DM

    Approximate resilience, monotonicity, and the complexity of agnostic learning

    Authors: Dana Dachman-Soled, Vitaly Feldman, Li-Yang Tan, Andrew Wan, Karl Wimmer

    Abstract: A function $f$ is $d$-resilient if all its Fourier coefficients of degree at most $d$ are zero, i.e., $f$ is uncorrelated with all low-degree parities. We study the notion of $\mathit{approximate}$ $\mathit{resilience}$ of Boolean functions, where we say that $f$ is $α$-approximately $d$-resilient if $f$ is $α$-close to a $[-1,1]$-valued $d$-resilient function in $\ell_1$ distance. We show that ap… ▽ More

    Submitted 9 July, 2014; v1 submitted 20 May, 2014; originally announced May 2014.

  23. arXiv:1312.3003  [pdf, ps, other

    cs.CC

    Decision Trees, Protocols, and the Fourier Entropy-Influence Conjecture

    Authors: Andrew Wan, John Wright, Chenggang Wu

    Abstract: Given $f:\{-1, 1\}^n \rightarrow \{-1, 1\}$, define the \emph{spectral distribution} of $f$ to be the distribution on subsets of $[n]$ in which the set $S$ is sampled with probability $\widehat{f}(S)^2$. Then the Fourier Entropy-Influence (FEI) conjecture of Friedgut and Kalai (1996) states that there is some absolute constant $C$ such that… ▽ More

    Submitted 10 December, 2013; originally announced December 2013.

    ACM Class: F.1.3

  24. arXiv:1312.1983  [pdf, ps, other

    cs.CC q-bio.PE

    Satisfiability and Evolution

    Authors: Adi Livnat, Christos Papadimitriou, Aviad Rubinstein, Gregory Valiant, Andrew Wan

    Abstract: We show that, if truth assignments on $n$ variables reproduce through recombination so that satisfaction of a particular Boolean function confers a small evolutionary advantage, then a polynomially large population over polynomially many generations (polynomial in $n$ and the inverse of the initial satisfaction probability) will end up almost certainly consisting exclusively of satisfying truth as… ▽ More

    Submitted 11 August, 2014; v1 submitted 6 December, 2013; originally announced December 2013.

    MSC Class: 92D15 ACM Class: F.0

  25. arXiv:1304.3754  [pdf, ps, other

    cs.DS

    Faster Private Release of Marginals on Small Databases

    Authors: Karthekeyan Chandrasekaran, Justin Thaler, Jonathan Ullman, Andrew Wan

    Abstract: We study the problem of answering \emph{$k$-way marginal} queries on a database $D \in (\{0,1\}^d)^n$, while preserving differential privacy. The answer to a $k$-way marginal query is the fraction of the database's records $x \in \{0,1\}^d$ with a given value in each of a given set of up to $k$ columns. Marginal queries enable a rich class of statistical analyses on a dataset, and designing effici… ▽ More

    Submitted 2 September, 2013; v1 submitted 12 April, 2013; originally announced April 2013.

  26. arXiv:1202.6680  [pdf, other

    cs.CC cs.DM math.PR

    On the Distribution of the Fourier Spectrum of Halfspaces

    Authors: Ilias Diakonikolas, Ragesh Jaiswal, Rocco A. Servedio, Li-Yang Tan, Andrew Wan

    Abstract: Bourgain showed that any noise stable Boolean function $f$ can be well-approximated by a junta. In this note we give an exponential sharpening of the parameters of Bourgain's result under the additional assumption that $f$ is a halfspace.

    Submitted 29 February, 2012; originally announced February 2012.

  27. arXiv:0909.4727  [pdf, ps, other

    cs.CC cs.DM

    A regularity lemma, and low-weight approximators, for low-degree polynomial threshold functions

    Authors: Ilias Diakonikolas, Rocco A. Servedio, Li-Yang Tan, Andrew Wan

    Abstract: We give a "regularity lemma" for degree-d polynomial threshold functions (PTFs) over the Boolean cube {-1,1}^n. This result shows that every degree-d PTF can be decomposed into a constant number of subfunctions such that almost all of the subfunctions are close to being regular PTFs. Here a "regular PTF is a PTF sign(p(x)) where the influence of each variable on the polynomial p(x) is a small fr… ▽ More

    Submitted 5 May, 2010; v1 submitted 25 September, 2009; originally announced September 2009.

    Comments: 23 pages, 0 figures

    ACM Class: F.1.3

  28. arXiv:0805.1765  [pdf, ps, other

    cs.CC

    Efficiently Testing Sparse GF(2) Polynomials

    Authors: Ilias Diakonikolas, Homin K. Lee, Kevin Matulef, Rocco A. Servedio, Andrew Wan

    Abstract: We give the first algorithm that is both query-efficient and time-efficient for testing whether an unknown function $f: \{0,1\}^n \to \{0,1\}$ is an $s$-sparse GF(2) polynomial versus $\eps$-far from every such polynomial. Our algorithm makes $\poly(s,1/\eps)$ black-box queries to $f$ and runs in time $n \cdot \poly(s,1/\eps)$. The only previous algorithm for this testing problem \cite{DLM+:07}… ▽ More

    Submitted 12 May, 2008; originally announced May 2008.

    Comments: Full version of ICALP 2008 paper