Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–9 of 9 results for author: Karimi, M R

.
  1. arXiv:2407.05189  [pdf

    cs.CL

    Enhancing Language Learning through Technology: Introducing a New English-Azerbaijani (Arabic Script) Parallel Corpus

    Authors: Jalil Nourmohammadi Khiarak, Ammar Ahmadi, Taher Ak-bari Saeed, Meysam Asgari-Chenaghlu, Toğrul Atabay, Mohammad Reza Baghban Karimi, Ismail Ceferli, Farzad Hasanvand, Seyed Mahboub Mousavi, Morteza Noshad

    Abstract: This paper introduces a pioneering English-Azerbaijani (Arabic Script) parallel corpus, designed to bridge the technological gap in language learning and machine translation (MT) for under-resourced languages. Consisting of 548,000 parallel sentences and approximately 9 million words per language, this dataset is derived from diverse sources such as news articles and holy texts, aiming to enhance… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: This paper is accepted and published at NeTTT 2024 Conf

  2. arXiv:2311.16706  [pdf, ps, other

    cs.LG math.PR stat.ML

    Sinkhorn Flow: A Continuous-Time Framework for Understanding and Generalizing the Sinkhorn Algorithm

    Authors: Mohammad Reza Karimi, Ya-Ping Hsieh, Andreas Krause

    Abstract: Many problems in machine learning can be formulated as solving entropy-regularized optimal transport on the space of probability measures. The canonical approach involves the Sinkhorn iterates, renowned for their rich mathematical properties. Recently, the Sinkhorn algorithm has been recast within the mirror descent framework, thus benefiting from classical optimization theory insights. Here, we b… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

  3. arXiv:2311.02374  [pdf, other

    math.OC cs.LG

    Riemannian stochastic optimization methods avoid strict saddle points

    Authors: Ya-Ping Hsieh, Mohammad Reza Karimi, Andreas Krause, Panayotis Mertikopoulos

    Abstract: Many modern machine learning applications - from online principal component analysis to covariance matrix identification and dictionary learning - can be formulated as minimization problems on Riemannian manifolds, and are typically solved with a Riemannian stochastic gradient method (or some variant thereof). However, in many cases of interest, the resulting minimization problem is not geodesical… ▽ More

    Submitted 4 November, 2023; originally announced November 2023.

    Comments: 27 pages, 3 figures

    MSC Class: Primary 62L20; 37N40; secondary 90C15; 90C48

  4. arXiv:2211.01689  [pdf, other

    stat.ML cs.LG

    Isotropic Gaussian Processes on Finite Spaces of Graphs

    Authors: Viacheslav Borovitskiy, Mohammad Reza Karimi, Vignesh Ram Somnath, Andreas Krause

    Abstract: We propose a principled way to define Gaussian process priors on various sets of unweighted graphs: directed or undirected, with or without loops. We endow each of these sets with a geometric structure, inducing the notions of closeness and symmetries, by turning them into a vertex set of an appropriate metagraph. Building on this, we describe the class of priors that respect this structure and ar… ▽ More

    Submitted 25 February, 2023; v1 submitted 3 November, 2022; originally announced November 2022.

  5. arXiv:2210.13867  [pdf, ps, other

    cs.LG math.PR math.ST

    A Dynamical System View of Langevin-Based Non-Convex Sampling

    Authors: Mohammad Reza Karimi, Ya-Ping Hsieh, Andreas Krause

    Abstract: Non-convex sampling is a key challenge in machine learning, central to non-convex optimization in deep learning as well as to approximate probabilistic inference. Despite its significance, theoretically there remain many important challenges: Existing guarantees (1) typically only hold for the averaged iterates rather than the more desirable last iterates, (2) lack convergence metrics that capture… ▽ More

    Submitted 13 March, 2023; v1 submitted 25 October, 2022; originally announced October 2022.

    Comments: typos corrected, references added

    MSC Class: 62D05

  6. arXiv:2206.06795  [pdf, other

    math.OC cs.LG math.DS

    Riemannian stochastic approximation algorithms

    Authors: Mohammad Reza Karimi, Ya-Ping Hsieh, Panayotis Mertikopoulos, Andreas Krause

    Abstract: We examine a wide class of stochastic approximation algorithms for solving (stochastic) nonlinear problems on Riemannian manifolds. Such algorithms arise naturally in the study of Riemannian optimization, game theory and optimal transport, but their behavior is much less understood compared to the Euclidean case because of the lack of a global linear structure on the manifold. We overcome this dif… ▽ More

    Submitted 27 December, 2022; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: 33 pages, 2 figures; a one-page abstract of this paper was presented in COLT 2022

    MSC Class: Primary 62L20; 37N40; secondary 90C15; 90C47; 90C48

  7. arXiv:2010.09818  [pdf, other

    cs.LG stat.ML

    Online Active Model Selection for Pre-trained Classifiers

    Authors: Mohammad Reza Karimi, Nezihe Merve Gürel, Bojan Karlaš, Johannes Rausch, Ce Zhang, Andreas Krause

    Abstract: Given $k$ pre-trained classifiers and a stream of unlabeled data examples, how can we actively decide when to query a label so that we can distinguish the best model from the rest while making a small number of queries? Answering this question has a profound impact on a range of practical scenarios. In this work, we design an online selective sampling approach that actively selects informative exa… ▽ More

    Submitted 17 April, 2021; v1 submitted 19 October, 2020; originally announced October 2020.

  8. arXiv:1711.01566  [pdf, other

    cs.LG cs.DM stat.ML

    Stochastic Submodular Maximization: The Case of Coverage Functions

    Authors: Mohammad Reza Karimi, Mario Lucic, Hamed Hassani, Andreas Krause

    Abstract: Stochastic optimization of continuous objectives is at the heart of modern machine learning. However, many important problems are of discrete nature and often involve submodular objectives. We seek to unleash the power of stochastic continuous optimization, namely stochastic gradient descent and its variants, to such discrete problems. We first introduce the problem of stochastic submodular optimi… ▽ More

    Submitted 5 November, 2017; originally announced November 2017.

    Comments: 31st Conference on Neural Information Processing Systems (NIPS 2017)

  9. arXiv:1605.06855  [pdf, other

    cs.SI cs.LG stat.ML

    Smart broadcasting: Do you want to be seen?

    Authors: Mohammad Reza Karimi, Erfan Tavakoli, Mehrdad Farajtabar, Le Song, Manuel Gomez-Rodriguez

    Abstract: Many users in online social networks are constantly trying to gain attention from their followers by broadcasting posts to them. These broadcasters are likely to gain greater attention if their posts can remain visible for a longer period of time among their followers' most recent feeds. Then when to post? In this paper, we study the problem of smart broadcasting using the framework of temporal po… ▽ More

    Submitted 22 May, 2016; originally announced May 2016.

    Comments: To appear in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), San Francisco (CA, USA), 2016