Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–15 of 15 results for author: Ayed, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.15939  [pdf, other

    cs.IR eess.SP

    Telco-RAG: Navigating the Challenges of Retrieval-Augmented Language Models for Telecommunications

    Authors: Andrei-Laurentiu Bornea, Fadhel Ayed, Antonio De Domenico, Nicola Piovesan, Ali Maatouk

    Abstract: The application of Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG) systems in the telecommunication domain presents unique challenges, primarily due to the complex nature of telecom standard documents and the rapid evolution of the field. The paper introduces Telco-RAG, an open-source RAG framework designed to handle the specific needs of telecommunications standards, particu… ▽ More

    Submitted 26 April, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

    Comments: 6 pages, 5 Figure, 4 Tables, submitted to IEEE Globecom 2024 (see https://github.com/netop-team/telco-rag)

  2. arXiv:2403.04666  [pdf, other

    cs.CL cs.LG

    Telecom Language Models: Must They Be Large?

    Authors: Nicola Piovesan, Antonio De Domenico, Fadhel Ayed

    Abstract: The increasing interest in Large Language Models (LLMs) within the telecommunications sector underscores their potential to revolutionize operational efficiency. However, the deployment of these sophisticated models is often hampered by their substantial size and computational demands, raising concerns about their viability in resource-constrained environments. Addressing this challenge, recent ad… ▽ More

    Submitted 25 June, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

  3. arXiv:2310.20457  [pdf, other

    cs.LG

    FlexTrain: A Dynamic Training Framework for Heterogeneous Devices Environments

    Authors: Mert Unsal, Ali Maatouk, Antonio De Domenico, Nicola Piovesan, Fadhel Ayed

    Abstract: As deep learning models become increasingly large, they pose significant challenges in heterogeneous devices environments. The size of deep learning models makes it difficult to deploy them on low-power or resource-constrained devices, leading to long inference times and high energy consumption. To address these challenges, we propose FlexTrain, a framework that accommodates the diverse storage an… ▽ More

    Submitted 23 November, 2023; v1 submitted 31 October, 2023; originally announced October 2023.

    Comments: Workshop on Advancing Neural Network Training (WANT) at NeurIPS 2023

  4. arXiv:2310.15051  [pdf, other

    cs.IT cs.AI cs.LG

    TeleQnA: A Benchmark Dataset to Assess Large Language Models Telecommunications Knowledge

    Authors: Ali Maatouk, Fadhel Ayed, Nicola Piovesan, Antonio De Domenico, Merouane Debbah, Zhi-Quan Luo

    Abstract: We introduce TeleQnA, the first benchmark dataset designed to evaluate the knowledge of Large Language Models (LLMs) in telecommunications. Comprising 10,000 questions and answers, this dataset draws from diverse sources, including standards and research articles. This paper outlines the automated question generation framework responsible for creating this dataset, along with how human input was i… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

  5. arXiv:2308.06013  [pdf, other

    cs.IT cs.AI cs.LG

    Large Language Models for Telecom: Forthcoming Impact on the Industry

    Authors: Ali Maatouk, Nicola Piovesan, Fadhel Ayed, Antonio De Domenico, Merouane Debbah

    Abstract: Large Language Models (LLMs), AI-driven models that can achieve general-purpose language understanding and generation, have emerged as a transformative force, revolutionizing fields well beyond Natural Language Processing (NLP) and garnering unprecedented attention. As LLM technology continues to progress, the telecom industry is facing the prospect of its impact on its landscape. To elucidate the… ▽ More

    Submitted 25 February, 2024; v1 submitted 11 August, 2023; originally announced August 2023.

  6. arXiv:2304.11039  [pdf, other

    cs.IT eess.SP

    An Optimization Framework For Anomaly Detection Scores Refinement With Side Information

    Authors: Ali Maatouk, Fadhel Ayed, Wenjie Li, Yu Wang, Hong Zhu, Jiantao Ye

    Abstract: This paper considers an anomaly detection problem in which a detection algorithm assigns anomaly scores to multi-dimensional data points, such as cellular networks' Key Performance Indicators (KPIs). We propose an optimization framework to refine these anomaly scores by leveraging side information in the form of a causality graph between the various features of the data points. The refinement bloc… ▽ More

    Submitted 30 August, 2023; v1 submitted 21 April, 2023; originally announced April 2023.

  7. arXiv:2302.06960  [pdf, other

    stat.ML cs.LG

    Data pruning and neural scaling laws: fundamental limitations of score-based algorithms

    Authors: Fadhel Ayed, Soufiane Hayou

    Abstract: Data pruning algorithms are commonly used to reduce the memory and computational cost of the optimization process. Recent empirical results reveal that random data pruning remains a strong baseline and outperforms most existing data pruning methods in the high compression regime, i.e., where a fraction of $30\%$ or less of the data is kept. This regime has recently attracted a lot of interest as a… ▽ More

    Submitted 6 November, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

  8. arXiv:2302.01002  [pdf, other

    stat.ML cs.LG math.OC

    Over-parameterised Shallow Neural Networks with Asymmetrical Node Scaling: Global Convergence Guarantees and Feature Learning

    Authors: Francois Caron, Fadhel Ayed, Paul Jung, Hoil Lee, Juho Lee, Hongseok Yang

    Abstract: We consider the optimisation of large and shallow neural networks via gradient flow, where the output of each hidden node is scaled by some positive parameter. We focus on the case where the node scalings are non-identical, differing from the classical Neural Tangent Kernel (NTK) parameterisation. We prove that, for large neural networks, with high probability, gradient flow converges to a global… ▽ More

    Submitted 2 February, 2023; originally announced February 2023.

  9. arXiv:2302.00623  [pdf, other

    cs.NI cs.LG eess.SP

    Accordion: A Communication-Aware Machine Learning Framework for Next Generation Networks

    Authors: Fadhel Ayed, Antonio De Domenico, Adrian Garcia-Rodriguez, David Lopez-Perez

    Abstract: In this article, we advocate for the design of ad hoc artificial intelligence (AI)/machine learning (ML) models to facilitate their usage in future smart infrastructures based on communication networks. To motivate this, we first review key operations identified by the 3GPP for transferring AI/ML models through 5G networks and the main existing techniques to reduce their communication overheads. W… ▽ More

    Submitted 12 January, 2023; originally announced February 2023.

  10. arXiv:2301.05589  [pdf, other

    cs.IT

    A Framework for the Evaluation of Network Reliability Under Periodic Demand

    Authors: Ali Maatouk, Fadhel Ayed, Shi Biao, Wenjie Li, Harvey Bao, Enrico Zio

    Abstract: In this paper, we study network reliability in relation to a periodic time-dependent utility function that reflects the system's functional performance. When an anomaly occurs, the system incurs a loss of utility that depends on the anomaly's timing and duration. We analyze the long-term average utility loss by considering exponential anomalies' inter-arrival times and general distributions of mai… ▽ More

    Submitted 13 January, 2023; originally announced January 2023.

  11. arXiv:2205.08187  [pdf, other

    stat.ML cs.LG math.PR math.ST

    Deep neural networks with dependent weights: Gaussian Process mixture limit, heavy tails, sparsity and compressibility

    Authors: Hoil Lee, Fadhel Ayed, Paul Jung, Juho Lee, Hongseok Yang, François Caron

    Abstract: This article studies the infinite-width limit of deep feedforward neural networks whose weights are dependent, and modelled via a mixture of Gaussian distributions. Each hidden node of the network is assigned a nonnegative random variable that controls the variance of the outgoing weights of that node. We make minimal assumptions on these per-node random variables: they are iid and their sum, in e… ▽ More

    Submitted 11 September, 2023; v1 submitted 17 May, 2022; originally announced May 2022.

    Comments: 96 pages, 15 figures, 9 tables

    MSC Class: 68T07 (Primary); 62M45; 60F99 (Secondary)

  12. arXiv:2106.03091  [pdf, other

    stat.ML cs.LG

    Regularization in ResNet with Stochastic Depth

    Authors: Soufiane Hayou, Fadhel Ayed

    Abstract: Regularization plays a major role in modern deep learning. From classic techniques such as L1,L2 penalties to other noise-based methods such as Dropout, regularization often yields better generalization properties by avoiding overfitting. Recently, Stochastic Depth (SD) has emerged as an alternative regularization technique for residual neural networks (ResNets) and has proven to boost the perform… ▽ More

    Submitted 6 June, 2021; originally announced June 2021.

    Comments: 24 pages, 15 figures

  13. arXiv:2007.15541  [pdf, other

    cs.LG stat.ML

    Anomaly Detection at Scale: The Case for Deep Distributional Time Series Models

    Authors: Fadhel Ayed, Lorenzo Stella, Tim Januschowski, Jan Gasthaus

    Abstract: This paper introduces a new methodology for detecting anomalies in time series data, with a primary application to monitoring the health of (micro-) services and cloud resources. The main novelty in our approach is that instead of modeling time series consisting of real values or vectors of real values, we model time series of probability distributions over real values (or vectors). This extension… ▽ More

    Submitted 30 July, 2020; originally announced July 2020.

  14. arXiv:1902.10693  [pdf, other

    stat.ME cs.SI

    Nonnegative Bayesian nonparametric factor models with completely random measures for community detection

    Authors: Fadhel Ayed, François Caron

    Abstract: We present a Bayesian nonparametric Poisson factorization model for modeling network data with an unknown and potentially growing number of overlapping communities. The construction is based on completely random measures and allows the number of communities to either increase with the number of nodes at a specified logarithmic or polynomial rate, or be bounded. We develop asymptotics for the numbe… ▽ More

    Submitted 27 February, 2019; originally announced February 2019.

  15. arXiv:1902.04714  [pdf, other

    stat.ML cs.LG

    Beyond the Chinese Restaurant and Pitman-Yor processes: Statistical Models with Double Power-law Behavior

    Authors: Fadhel Ayed, Juho Lee, François Caron

    Abstract: Bayesian nonparametric approaches, in particular the Pitman-Yor process and the associated two-parameter Chinese Restaurant process, have been successfully used in applications where the data exhibit a power-law behavior. Examples include natural language processing, natural images or networks. There is also growing empirical evidence that some datasets exhibit a two-regime power-law behavior: one… ▽ More

    Submitted 9 July, 2019; v1 submitted 12 February, 2019; originally announced February 2019.