FarsNewsQA: a deep learning-based question answering system for the Persian news articles
- Arefeh Kazemi,
- Zahra Zojaji,
- Mahdi Malverdi,
- Jamshid Mozafari,
- Fatemeh Ebrahimi,
- Negin Abadani,
- Mohammad Reza Varasteh,
- Mohammad Ali Nematbakhsh
Nowadays, a considerable volume of news articles is produced daily by news agencies worldwide. Since there is an extensive volume of news on the web, finding exact answers to the users’ questions is not a straightforward task. Developing Question ...
Shop by image: characterizing visual search in e-commerce
Visual search has become more popular in recent years, allowing users to search by an image they are taking using their mobile device or uploading from their photo library. One domain in which visual search is especially valuable is electronic ...
An in-depth study on adversarial learning-to-rank
In light of recent advances in adversarial learning, there has been strong and continuing interest in exploring how to perform adversarial learning-to-rank. The previous adversarial ranking methods [e.g., IRGAN by Wang et al. (IRGAN: a minimax ...
Investigating better context representations for generative question answering
Generating natural language answers for question-answering (QA) tasks has recently surged in popularity with the rise of task-based personalized assistants. Most QA research is on extractive QA, methods that find answer spans in text passages. ...
MuMUR: Multilingual Multimodal Universal Retrieval
- Avinash Madasu,
- Estelle Aflalo,
- Gabriela Ben Melech Stan,
- Shachar Rosenman,
- Shao-Yen Tseng,
- Gedas Bertasius,
- Vasudev Lal
Multi-modal retrieval has seen tremendous progress with the development of vision-language models. However, further improving these models require additional labelled data which is a huge manual effort. In this paper, we propose a framework MuMUR, ...
Heterogeneous graph attention networks for passage retrieval
- Lucas Albarede,
- Philippe Mulhem,
- Lorraine Goeuriot,
- Sylvain Marié,
- Claude Le Pape-Gardeux,
- Trinidad Chardin-Segui
This paper presents an exploration of the usage of Heterogeneous Graph Attention Networks, or HGATs, for the task of Passage Retrieval. More precisely, we study how these models perform to alleviate the problem of passage contextualization, that ...
Multimodal video retrieval with CLIP: a user study
Recent machine learning advances demonstrate the effectiveness of zero-shot models trained on large amounts of data collected from the internet. Among these, CLIP (Contrastive Language-Image Pre-training) has been introduced as a multimodal model ...
Constructing and meta-evaluating state-aware evaluation metrics for interactive search systems
Evaluation metrics such as precision, recall and normalized discounted cumulative gain have been widely applied in ad hoc retrieval experiments. They have facilitated the assessment of system performance in various topics over the past decade. ...
DeepQFM: a deep learning based query facets mining method
Search results from the search engine may be not enough to satisfy users’ search intent when the issued query is broad or ambiguous. In such cases, presenting to the user query facets, which include common query reformulations, may help ...
Privacy-aware document retrieval with two-level inverted indexing
Previous work on privacy-aware ranking has addressed the minimization of information leakage when scoring top k documents, and has not studied on how to retrieve these top documents and their features for ranking. This paper proposes a privacy-...
Tashaphyne0.4: a new arabic light stemmer based on rhyzome modeling approach
Stemming algorithms are crucial tools for enhancing the information retrieval process in natural language processing. This paper presents a novel Arabic light stemming algorithm called Tashaphyne0.4, the idea behind this algorithm is to extract ...
An in-depth analysis of passage-level label transfer for contextual document ranking
Pre-trained contextual language models such as BERT, GPT, and XLnet work quite well for document retrieval tasks. Such models are fine-tuned based on the query-document/query-passage level relevance labels to capture the ranking signals. However, ...