Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–4 of 4 results for author: Herdade, S

Searching in archive cs. Search in all archives.
.
  1. Fast Numerical Multivariate Multipoint Evaluation

    Authors: Sumanta Ghosh, Prahladh Harsha, Simão Herdade, Mrinal Kumar, Ramprasad Saptharishi

    Abstract: We design nearly-linear time numerical algorithms for the problem of multivariate multipoint evaluation over the fields of rational, real and complex numbers. We consider both \emph{exact} and \emph{approximate} versions of the algorithm. The input to the algorithms are (1) coefficients of an $m$-variate polynomial $f$ with degree $d$ in each variable, and (2) points $a_1,..., a_N$ each of whose c… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

    MSC Class: 68W30 ACM Class: F.2.1; I.1.2

    Journal ref: In Proc. 64th FOCS pages 1426-1439, 2023

  2. arXiv:2007.00145  [pdf, other

    cs.CV cs.CL cs.LG

    Modality-Agnostic Attention Fusion for visual search with text feedback

    Authors: Eric Dodds, Jack Culpepper, Simao Herdade, Yang Zhang, Kofi Boakye

    Abstract: Image retrieval with natural language feedback offers the promise of catalog search based on fine-grained visual features that go beyond objects and binary attributes, facilitating real-world applications such as e-commerce. Our Modality-Agnostic Attention Fusion (MAAF) model combines image and text features and outperforms existing approaches on two visual search with modifying phrase datasets, F… ▽ More

    Submitted 30 June, 2020; originally announced July 2020.

    Comments: 14 pages, 8 figures

  3. arXiv:1906.05963  [pdf, other

    cs.CV cs.CL

    Image Captioning: Transforming Objects into Words

    Authors: Simao Herdade, Armin Kappeler, Kofi Boakye, Joao Soares

    Abstract: Image captioning models typically follow an encoder-decoder architecture which uses abstract image feature vectors as input to the encoder. One of the most successful algorithms uses feature vectors extracted from the region proposals obtained from an object detector. In this work we introduce the Object Relation Transformer, that builds upon this approach by explicitly incorporating information a… ▽ More

    Submitted 11 January, 2020; v1 submitted 13 June, 2019; originally announced June 2019.

    Comments: 10 pages

  4. arXiv:1810.04652  [pdf, other

    cs.CV cs.IR cs.LG

    Learning Embeddings for Product Visual Search with Triplet Loss and Online Sampling

    Authors: Eric Dodds, Huy Nguyen, Simao Herdade, Jack Culpepper, Andrew Kae, Pierre Garrigues

    Abstract: In this paper, we propose learning an embedding function for content-based image retrieval within the e-commerce domain using the triplet loss and an online sampling method that constructs triplets from within a minibatch. We compare our method to several strong baselines as well as recent works on the DeepFashion and Stanford Online Product datasets. Our approach significantly outperforms the sta… ▽ More

    Submitted 10 October, 2018; originally announced October 2018.