Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–6 of 6 results for author: Ibrahim, M A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.20297  [pdf, other

    cs.AR cs.DC

    Balanced Data Placement for GEMV Acceleration with Processing-In-Memory

    Authors: Mohamed Assem Ibrahim, Mahzabeen Islam, Shaizeen Aga

    Abstract: With unprecedented demand for generative AI (GenAI) inference, acceleration of primitives that dominate GenAI such as general matrix-vector multiplication (GEMV) is receiving considerable attention. A challenge with GEMVs is the high memory bandwidth this primitive demands. Multiple memory vendors have proposed commercially viable processing-in-memory (PIM) prototypes that attain bandwidth boost o… ▽ More

    Submitted 1 April, 2024; v1 submitted 29 March, 2024; originally announced March 2024.

  2. arXiv:2311.05716  [pdf, other

    cs.AR

    ML-based Real-Time Control at the Edge: An Approach Using hls4ml

    Authors: R. Shi, S. Ogrenci, J. M. Arnold, J. R. Berlioz, P. Hanlet, K. J. Hazelwood, M. A. Ibrahim, H. Liu, V. P. Nagaslaev, A. Narayanan 1, D. J. Nicklaus, J. Mitrevski, G. Pradhan, A. L. Saewert, B. A. Schupbach, K. Seiya, M. Thieme, R. M. Thurman-Keup, N. V. Tran

    Abstract: This study focuses on implementing a real-time control system for a particle accelerator facility that performs high energy physics experiments. A critical operating parameter in this facility is beam loss, which is the fraction of particles deviating from the accelerated proton beam into a cascade of secondary particles. Accelerators employ a large number of sensors to monitor beam loss. The data… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

  3. arXiv:2311.05034  [pdf, other

    cs.AR cs.DC

    Just-in-time Quantization with Processing-In-Memory for Efficient ML Training

    Authors: Mohamed Assem Ibrahim, Shaizeen Aga, Ada Li, Suchita Pati, Mahzabeen Islam

    Abstract: Data format innovations have been critical for machine learning (ML) scaling, which in turn fuels ground-breaking ML capabilities. However, even in the presence of low-precision formats, model weights are often stored in both high-precision and low-precision during training. Furthermore, with emerging directional data formats (e.g., MX9, MX6, etc.) multiple low-precision weight copies can be requi… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

  4. arXiv:2308.03973  [pdf, other

    cs.AR cs.DC

    Collaborative Acceleration for FFT on Commercial Processing-In-Memory Architectures

    Authors: Mohamed Assem Ibrahim, Shaizeen Aga

    Abstract: This paper evaluates the efficacy of recent commercial processing-in-memory (PIM) solutions to accelerate fast Fourier transform (FFT), an important primitive across several domains. Specifically, we observe that efficient implementations of FFT on modern GPUs are memory bandwidth bound. As such, the memory bandwidth boost availed by commercial PIM solutions makes a case for PIM to accelerate FFT.… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

  5. arXiv:2003.05443  [pdf

    cs.CL

    A Precisely Xtreme-Multi Channel Hybrid Approach For Roman Urdu Sentiment Analysis

    Authors: Faiza Memood, Muhammad Usman Ghani, Muhammad Ali Ibrahim, Rehab Shehzadi, Muhammad Nabeel Asim

    Abstract: In order to accelerate the performance of various Natural Language Processing tasks for Roman Urdu, this paper for the very first time provides 3 neural word embeddings prepared using most widely used approaches namely Word2vec, FastText, and Glove. The integrity of generated neural word embeddings is evaluated using intrinsic and extrinsic evaluation approaches. Considering the lack of publicly a… ▽ More

    Submitted 11 March, 2020; originally announced March 2020.

  6. arXiv:2003.01345  [pdf

    cs.CL cs.LG

    Benchmark Performance of Machine And Deep Learning Based Methodologies for Urdu Text Document Classification

    Authors: Muhammad Nabeel Asim, Muhammad Usman Ghani, Muhammad Ali Ibrahim, Sheraz Ahmad, Waqar Mahmood, Andreas Dengel

    Abstract: In order to provide benchmark performance for Urdu text document classification, the contribution of this paper is manifold. First, it pro-vides a publicly available benchmark dataset manually tagged against 6 classes. Second, it investigates the performance impact of traditional machine learning based Urdu text document classification methodologies by embedding 10 filter-based feature selection a… ▽ More

    Submitted 3 March, 2020; originally announced March 2020.