Showing 1–10 of 10 results for author: Shtok, J

Searching in archive cs.
  1. arXiv:2404.00459  [pdf, other]

    cs.CL

    NumeroLogic: Number Encoding for Enhanced LLMs' Numerical Reasoning

    Authors: Eli Schwartz, Leshem Choshen, Joseph Shtok, Sivan Doveh, Leonid Karlinsky, Assaf Arbelle

    Abstract: Language models struggle with handling numerical data and performing arithmetic operations. We hypothesize that this limitation can be partially attributed to non-intuitive textual numbers representation. When a digit is read or generated by a causal language model it does not know its place value (e.g. thousands vs. hundreds) until the entire number is processed. To address this issue, we propose…

    Submitted 30 March, 2024; originally announced April 2024.

  2. arXiv:2111.14103  [pdf, other]

    cs.CV

    CHARTER: heatmap-based multi-type chart data extraction

    Authors: Joseph Shtok, Sivan Harary, Ophir Azulai, Adi Raz Goldfarb, Assaf Arbelle, Leonid Karlinsky

    Abstract: The digital conversion of information stored in documents is a great source of knowledge. In contrast to the documents text, the conversion of the embedded documents graphics, such as charts and plots, has been much less explored. We present a method and a system for end-to-end conversion of document charts into machine readable tabular data format, which can be easily stored and analyzed in the d…

    Submitted 28 November, 2021; originally announced November 2021.

    Comments: Joseph Shtok, Sivan Harary and Leonid Karlinsky had equal contribution

    Journal ref: Document Intelligence workshop at KDD 2021 conference

  3. arXiv:2104.09829  [pdf, other]

    cs.CV

    Detector-Free Weakly Supervised Grounding by Separation

    Authors: Assaf Arbelle, Sivan Doveh, Amit Alfassy, Joseph Shtok, Guy Lev, Eli Schwartz, Hilde Kuehne, Hila Barak Levi, Prasanna Sattigeri, Rameswar Panda, Chun-Fu Chen, Alex Bronstein, Kate Saenko, Shimon Ullman, Raja Giryes, Rogerio Feris, Leonid Karlinsky

    Abstract: Nowadays, there is an abundance of data involving images and surrounding free-form text weakly corresponding to those images. Weakly Supervised phrase-Grounding (WSG) deals with the task of using this data to learn to localize (or to ground) arbitrary text phrases in images without any additional annotations. However, most recent SotA methods for WSG assume the existence of a pre-trained object de…

    Submitted 20 April, 2021; originally announced April 2021.

  4. arXiv:2003.06798  [pdf, other]

    cs.CV

    StarNet: towards Weakly Supervised Few-Shot Object Detection

    Authors: Leonid Karlinsky, Joseph Shtok, Amit Alfassy, Moshe Lichtenstein, Sivan Harary, Eli Schwartz, Sivan Doveh, Prasanna Sattigeri, Rogerio Feris, Alexander Bronstein, Raja Giryes

    Abstract: Few-shot detection and classification have advanced significantly in recent years. Yet, detection approaches require strong annotation (bounding boxes) both for pre-training and for adaptation to novel classes, and classification approaches rarely provide localization of objects in the scene. In this paper, we introduce StarNet - a few-shot model featuring an end-to-end differentiable non-parametr…

    Submitted 17 September, 2020; v1 submitted 15 March, 2020; originally announced March 2020.

  5. arXiv:1902.09811  [pdf, other]

    cs.CV

    LaSO: Label-Set Operations networks for multi-label few-shot learning

    Authors: Amit Alfassy, Leonid Karlinsky, Amit Aides, Joseph Shtok, Sivan Harary, Rogerio Feris, Raja Giryes, Alex M. Bronstein

    Abstract: Example synthesis is one of the leading methods to tackle the problem of few-shot learning, where only a small number of samples per class are available. However, current synthesis approaches only address the scenario of a single category label per image. In this work, we propose a novel technique for synthesizing samples with multiple labels for the (yet unhandled) multi-label few-shot classifica…

    Submitted 26 February, 2019; originally announced February 2019.

  6. arXiv:1806.04734  [pdf, other]

    cs.CV

    Delta-encoder: an effective sample synthesis method for few-shot object recognition

    Authors: Eli Schwartz, Leonid Karlinsky, Joseph Shtok, Sivan Harary, Mattias Marder, Rogerio Feris, Abhishek Kumar, Raja Giryes, Alex M. Bronstein

    Abstract: Learning to classify new categories based on just one or a few examples is a long-standing challenge in modern computer vision. In this work, we propose a simple yet effective method for few-shot (and one-shot) object recognition. Our approach is based on a modified auto-encoder, denoted Delta-encoder, that learns to synthesize new samples for an unseen category just by seeing few examples from i…

    Submitted 29 November, 2018; v1 submitted 12 June, 2018; originally announced June 2018.

  7. arXiv:1806.04728  [pdf, other]

    cs.CV

    RepMet: Representative-based metric learning for classification and one-shot object detection

    Authors: Leonid Karlinsky, Joseph Shtok, Sivan Harary, Eli Schwartz, Amit Aides, Rogerio Feris, Raja Giryes, Alex M. Bronstein

    Abstract: Distance metric learning (DML) has been successfully applied to object classification, both in the standard regime of rich training data and in the few-shot scenario, where each category is represented by only a few examples. In this work, we propose a new method for DML that simultaneously learns the backbone network parameters, the embedding space, and the multi-modal distribution of each of the…

    Submitted 18 November, 2018; v1 submitted 12 June, 2018; originally announced June 2018.

  8. arXiv:1311.7251  [pdf, other]

    cs.CV cs.LG cs.NE

    Spatially-Adaptive Reconstruction in Computed Tomography using Neural Networks

    Authors: Joseph Shtok, Michael Zibulevsky, Michael Elad

    Abstract: We propose a supervised machine learning approach for boosting existing signal and image recovery methods and demonstrate its efficacy on example of image reconstruction in computed tomography. Our technique is based on a local nonlinear fusion of several image estimates, all obtained by applying a chosen reconstruction algorithm with different values of its control parameters. Usually such output…

    Submitted 28 November, 2013; originally announced November 2013.

  9. arXiv:1004.4373  [pdf, ps, other]

    cs.CV

    Spatially-Adaptive Reconstruction in Computed Tomography Based on Statistical Learning

    Authors: Joseph Shtok, Michael Zibulevsky, Michael Elad

    Abstract: We propose a direct reconstruction algorithm for Computed Tomography, based on a local fusion of a few preliminary image estimates by means of a non-linear fusion rule. One such rule is based on a signal denoising technique which is spatially adaptive to the unknown local smoothness. Another, more powerful fusion rule, is based on a neural network trained off-line with a high-quality training set…

    Submitted 25 April, 2010; originally announced April 2010.

    Comments: Submitted to IEEE Transactions on Image Processing

  10. Analysis of Basis Pursuit Via Capacity Sets

    Authors: Joseph Shtok, Michael Elad

    Abstract: Finding the sparsest solution $α$ for an under-determined linear system of equations $Dα=s$ is of interest in many applications. This problem is known to be NP-hard. Recent work studied conditions on the support size of $α$ that allow its recovery using L1-minimization, via the Basis Pursuit algorithm. These conditions are often relying on a scalar property of $D$ called the mutual-coherence. In t…

    Submitted 25 April, 2010; originally announced April 2010.

    Journal ref: Journal of Fourier Analysis and Applications, Volume 14, Numbers 5-6, December 2008, pp. 688-711