Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–27 of 27 results for author: Askari, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.12692  [pdf, other

    cs.CL cs.AI cs.DB cs.HC

    MAGIC: Generating Self-Correction Guideline for In-Context Text-to-SQL

    Authors: Arian Askari, Christian Poelitz, Xinye Tang

    Abstract: Self-correction in text-to-SQL is the process of prompting large language model (LLM) to revise its previously incorrectly generated SQL, and commonly relies on manually crafted self-correction guidelines by human experts that are not only labor-intensive to produce but also limited by the human ability in identifying all potential error patterns in LLM responses. We introduce MAGIC, a novel multi… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 20 pages, 17 figures

  2. arXiv:2404.18185  [pdf, other

    cs.IR cs.AI cs.CL cs.LG

    Ranked List Truncation for Large Language Model-based Re-Ranking

    Authors: Chuan Meng, Negar Arabzadeh, Arian Askari, Mohammad Aliannejadi, Maarten de Rijke

    Abstract: We study ranked list truncation (RLT) from a novel "retrieve-then-re-rank" perspective, where we optimize re-ranking by truncating the retrieved list (i.e., trim re-ranking candidates). RLT is crucial for re-ranking as it can improve re-ranking efficiency by sending variable-length candidate lists to a re-ranker on a per-query basis. It also has the potential to improve re-ranking effectiveness. D… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: Accepted for publication as a long paper at SIGIR 2024

    ACM Class: H.3.3

  3. arXiv:2404.01012  [pdf, other

    cs.IR cs.AI cs.CL cs.LG

    Query Performance Prediction using Relevance Judgments Generated by Large Language Models

    Authors: Chuan Meng, Negar Arabzadeh, Arian Askari, Mohammad Aliannejadi, Maarten de Rijke

    Abstract: Query performance prediction (QPP) aims to estimate the retrieval quality of a search system for a query without human relevance judgments. Previous QPP methods typically return a single scalar value and do not require the predicted values to approximate a specific information retrieval (IR) evaluation measure, leading to certain drawbacks: (i) a single scalar is insufficient to accurately represe… ▽ More

    Submitted 17 June, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

    ACM Class: H.3.3

  4. arXiv:2403.19056  [pdf, other

    cs.CL

    CAUSE: Counterfactual Assessment of User Satisfaction Estimation in Task-Oriented Dialogue Systems

    Authors: Amin Abolghasemi, Zhaochun Ren, Arian Askari, Mohammad Aliannejadi, Maarten de Rijke, Suzan Verberne

    Abstract: An important unexplored aspect in previous work on user satisfaction estimation for Task-Oriented Dialogue (TOD) systems is their evaluation in terms of robustness for the identification of user dissatisfaction: current benchmarks for user satisfaction estimation in TOD systems are highly skewed towards dialogues for which the user is satisfied. The effect of having a more balanced set of satisfac… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

  5. DungeonMaker: Embedding Tangible Creation and Destruction in Hybrid Board Games through Personal Fabrication Technology

    Authors: Evgeny Stemasov, Tobias Wagner, Ali Askari, Jessica Janek, Omid Rajabi, Anja Schikorr, Julian Frommel, Jan Gugenheimer, Enrico Rukzio

    Abstract: Hybrid board games (HBGs) augment their analog origins digitally (e.g., through apps) and are an increasingly popular pastime activity. Continuous world and character development and customization, known to facilitate engagement in video games, remain rare in HBGs. If present, they happen digitally or imaginarily, often leaving physical aspects generic. We developed DungeonMaker, a fabrication-aug… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: 16 pages, 8 figures, 1 table. Accepted to ACM CHI 2024 (ACM CHI conference on Human Factors in Computing Systems)

    ACM Class: H.5.m

    Journal ref: Proceedings of the CHI Conference on Human Factors in Computing Systems (CHI '24), May 11-16, 2024, Honolulu, HI, USA

  6. arXiv:2403.05975  [pdf, other

    cs.CL

    Measuring Bias in a Ranked List using Term-based Representations

    Authors: Amin Abolghasemi, Leif Azzopardi, Arian Askari, Maarten de Rijke, Suzan Verberne

    Abstract: In most recent studies, gender bias in document ranking is evaluated with the NFaiRR metric, which measures bias in a ranked list based on an aggregation over the unbiasedness scores of each ranked document. This perspective in measuring the bias of a ranked list has a key limitation: individual documents of a ranked list might be biased while the ranked list as a whole balances the groups' repres… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: Accepted at the 46th European Conference on Information Retrieval (ECIR 2024)

  7. arXiv:2402.11633  [pdf, other

    cs.CL

    Self-seeding and Multi-intent Self-instructing LLMs for Generating Intent-aware Information-Seeking dialogs

    Authors: Arian Askari, Roxana Petcu, Chuan Meng, Mohammad Aliannejadi, Amin Abolghasemi, Evangelos Kanoulas, Suzan Verberne

    Abstract: Identifying user intents in information-seeking dialogs is crucial for a system to meet user's information needs. Intent prediction (IP) is challenging and demands sufficient dialogs with human-labeled intents for training. However, manually annotating intents is resource-intensive. While large language models (LLMs) have been shown to be effective in generating synthetic data, there is no study o… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

  8. arXiv:2401.04852  [pdf, other

    cs.IR

    Answer Retrieval in Legal Community Question Answering

    Authors: Arian Askari, Zihui Yang, Zhaochun Ren, Suzan Verberne

    Abstract: The task of answer retrieval in the legal domain aims to help users to seek relevant legal advice from massive amounts of professional responses. Two main challenges hinder applying existing answer retrieval approaches in other domains to the legal domain: (1) a huge knowledge gap between lawyers and non-professionals; and (2) a mix of informal and formal content on legal QA websites. To tackle th… ▽ More

    Submitted 9 January, 2024; originally announced January 2024.

    Comments: accepted at ECIR 2024

  9. arXiv:2306.10979  [pdf, other

    cs.IR cs.AI

    Enhancing Documents with Multidimensional Relevance Statements in Cross-encoder Re-ranking

    Authors: Rishabh Upadhyay, Arian Askari, Gabriella Pasi, Marco Viviani

    Abstract: In this paper, we propose a novel approach to consider multiple dimensions of relevance beyond topicality in cross-encoder re-ranking. On the one hand, current multidimensional retrieval models often use naïve solutions at the re-ranking stage to aggregate multiple relevance scores into an overall one. On the other hand, cross-encoder re-rankers are effective in considering topicality but are not… ▽ More

    Submitted 19 June, 2023; originally announced June 2023.

  10. arXiv:2305.02320  [pdf, other

    cs.IR

    Generating Synthetic Documents for Cross-Encoder Re-Rankers: A Comparative Study of ChatGPT and Human Experts

    Authors: Arian Askari, Mohammad Aliannejadi, Evangelos Kanoulas, Suzan Verberne

    Abstract: We investigate the usefulness of generative Large Language Models (LLMs) in generating training data for cross-encoder re-rankers in a novel direction: generating synthetic documents instead of synthetic queries. We introduce a new dataset, ChatGPT-RetrievalQA, and compare the effectiveness of models fine-tuned on LLM-generated and human-generated data. Data generated with generative LLMs can be u… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

  11. arXiv:2303.01200  [pdf, other

    cs.IR

    Retrieval for Extremely Long Queries and Documents with RPRS: a Highly Efficient and Effective Transformer-based Re-Ranker

    Authors: Arian Askari, Suzan Verberne, Amin Abolghasemi, Wessel Kraaij, Gabriella Pasi

    Abstract: Retrieval with extremely long queries and documents is a well-known and challenging task in information retrieval and is commonly known as Query-by-Document (QBD) retrieval. Specifically designed Transformer models that can handle long input sequences have not shown high effectiveness in QBD tasks in previous work. We propose a Re-Ranker based on the novel Proportional Relevance Score (RPRS) to co… ▽ More

    Submitted 1 November, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

    Comments: Accepted at ACM Transactions on Information Systems (ACM TOIS journal)

  12. arXiv:2301.09728  [pdf, other

    cs.IR

    Injecting the BM25 Score as Text Improves BERT-Based Re-rankers

    Authors: Arian Askari, Amin Abolghasemi, Gabriella Pasi, Wessel Kraaij, Suzan Verberne

    Abstract: In this paper we propose a novel approach for combining first-stage lexical retrieval models and Transformer-based re-rankers: we inject the relevance score of the lexical model as a token in the middle of the input of the cross-encoder re-ranker. It was shown in prior work that interpolation between the relevance score of lexical and BERT-based re-rankers may not consistently result in higher eff… ▽ More

    Submitted 23 January, 2023; originally announced January 2023.

    Comments: Accepted at ECIR 2023

  13. On the Interpolation of Contextualized Term-based Ranking with BM25 for Query-by-Example Retrieval

    Authors: Amin Abolghasemi, Arian Askari, Suzan Verberne

    Abstract: Term-based ranking with pre-trained transformer-based language models has recently gained attention as they bring the contextualization power of transformer models into the highly efficient term-based retrieval. In this work, we examine the generalizability of two of these deep contextualized term-based models in the context of query-by-example (QBE) retrieval in which a seed document acts as the… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: Proceedings of the 2022 ACM SIGIR International Conference on the Theory of Information Retrieval

  14. arXiv:2209.11799  [pdf, other

    cs.AI cs.CL cs.LG stat.ME

    Augmenting Interpretable Models with LLMs during Training

    Authors: Chandan Singh, Armin Askari, Rich Caruana, Jianfeng Gao

    Abstract: Recent large language models (LLMs) have demonstrated remarkable prediction performance for a growing array of tasks. However, their proliferation into high-stakes domains (e.g. medicine) and compute-limited settings has created a burgeoning need for interpretability and efficiency. We address this need by proposing Augmented Interpretable Models (Aug-imodels), a framework for leveraging the knowl… ▽ More

    Submitted 24 April, 2023; v1 submitted 23 September, 2022; originally announced September 2022.

    Journal ref: Nature Communications, 2023

  15. arXiv:2205.13351  [pdf, other

    cs.IR

    LeiBi@COLIEE 2022: Aggregating Tuned Lexical Models with a Cluster-driven BERT-based Model for Case Law Retrieval

    Authors: Arian Askari, Georgios Peikos, Gabriella Pasi, Suzan Verberne

    Abstract: This paper summarizes our approaches submitted to the case law retrieval task in the Competition on Legal Information Extraction/Entailment (COLIEE) 2022. Our methodology consists of four steps; in detail, given a legal case as a query, we reformulate it by extracting various meaningful sentences or n-grams. Then, we utilize the pre-processed query case to retrieve an initial set of possible relev… ▽ More

    Submitted 26 May, 2022; originally announced May 2022.

    Comments: Accepted at the COLIEE Workshop in Proceedings of JURISIN 2022. Please cite the published version

  16. arXiv:2201.07667  [pdf, ps, other

    cs.IR

    Expert Finding in Legal Community Question Answering

    Authors: Arian Askari, Suzan Verberne, Gabriella Pasi

    Abstract: Expert finding has been well-studied in community question answering (QA) systems in various domains. However, none of these studies addresses expert finding in the legal domain, where the goal is for citizens to find lawyers based on their expertise. In the legal domain, there is a large knowledge gap between the experts and the searchers, and the content on the legal QA websites consist of a com… ▽ More

    Submitted 19 April, 2022; v1 submitted 19 January, 2022; originally announced January 2022.

    Comments: Accepted at Proceedings of the 44th European Conference on Information Retrieval, ECIR 2022. Please cite the published version

  17. arXiv:2108.03937  [pdf, other

    cs.IR

    DoSSIER@COLIEE 2021: Leveraging dense retrieval and summarization-based re-ranking for case law retrieval

    Authors: Sophia Althammer, Arian Askari, Suzan Verberne, Allan Hanbury

    Abstract: In this paper, we present our approaches for the case law retrieval and the legal case entailment task in the Competition on Legal Information Extraction/Entailment (COLIEE) 2021. As first stage retrieval methods combined with neural re-ranking methods using contextualized language models like BERT achieved great performance improvements for information retrieval in the web and news domain, we eva… ▽ More

    Submitted 9 August, 2021; originally announced August 2021.

    Comments: Published in COLIEE 2021

  18. Wise-SrNet: A Novel Architecture for Enhancing Image Classification by Learning Spatial Resolution of Feature Maps

    Authors: Mohammad Rahimzadeh, AmirAli Askari, Soroush Parvin, Elnaz Safi, Mohammad Reza Mohammadi

    Abstract: One of the main challenges since the advancement of convolutional neural networks is how to connect the extracted feature map to the final classification layer. VGG models used two sets of fully connected layers for the classification part of their architectures, which significantly increased the number of models' weights. ResNet and the next deep convolutional models used the Global Average Pooli… ▽ More

    Submitted 11 March, 2024; v1 submitted 25 April, 2021; originally announced April 2021.

    Comments: The code is shared at https://github.com/mr7495/image-classification-spatial

  19. arXiv:2006.08790  [pdf, other

    cs.LG stat.ME stat.ML

    FANOK: Knockoffs in Linear Time

    Authors: Armin Askari, Quentin Rebjock, Alexandre d'Aspremont, Laurent El Ghaoui

    Abstract: We describe a series of algorithms that efficiently implement Gaussian model-X knockoffs to control the false discovery rate on large scale feature selection problems. Identifying the knockoff distribution requires solving a large scale semidefinite program for which we derive several efficient methods. One handles generic covariance matrices, has a complexity scaling as $O(p^3)$ where $p$ is the… ▽ More

    Submitted 15 June, 2020; originally announced June 2020.

    Comments: For code see https://github.com/qrebjock/fanok

  20. arXiv:1908.06315  [pdf, other

    cs.LG math.OC stat.ML

    Implicit Deep Learning

    Authors: Laurent El Ghaoui, Fangda Gu, Bertrand Travacca, Armin Askari, Alicia Y. Tsai

    Abstract: Implicit deep learning prediction rules generalize the recursive rules of feedforward neural networks. Such rules are based on the solution of a fixed-point equation involving a single vector of hidden features, which is thus only implicitly defined. The implicit framework greatly simplifies the notation of deep learning, and opens up many new possibilities, in terms of novel architectures and alg… ▽ More

    Submitted 6 August, 2020; v1 submitted 17 August, 2019; originally announced August 2019.

  21. arXiv:1905.09884  [pdf, other

    cs.LG stat.ML

    Naive Feature Selection: Sparsity in Naive Bayes

    Authors: Armin Askari, Alexandre d'Aspremont, Laurent El Ghaoui

    Abstract: Due to its linear complexity, naive Bayes classification remains an attractive supervised learning method, especially in very large-scale settings. We propose a sparse version of naive Bayes, which can be used for feature selection. This leads to a combinatorial maximum-likelihood problem, for which we provide an exact solution in the case of binary data, or a bound in the multinomial case. We pro… ▽ More

    Submitted 30 July, 2019; v1 submitted 23 May, 2019; originally announced May 2019.

  22. arXiv:1811.08039  [pdf, other

    cs.LG stat.ML

    Fenchel Lifted Networks: A Lagrange Relaxation of Neural Network Training

    Authors: Fangda Gu, Armin Askari, Laurent El Ghaoui

    Abstract: Despite the recent successes of deep neural networks, the corresponding training problem remains highly non-convex and difficult to optimize. Classes of models have been proposed that introduce greater structure to the objective function at the cost of lifting the dimension of the problem. However, these lifted methods sometimes perform poorly compared to traditional neural networks. In this paper… ▽ More

    Submitted 14 November, 2019; v1 submitted 19 November, 2018; originally announced November 2018.

  23. arXiv:1811.02702  [pdf, other

    cs.LG stat.ML

    Greedy Frank-Wolfe Algorithm for Exemplar Selection

    Authors: Gary Cheng, Armin Askari, Kannan Ramchandran, Laurent El Ghaoui

    Abstract: In this paper, we consider the problem of selecting representatives from a data set for arbitrary supervised/unsupervised learning tasks. We identify a subset $S$ of a data set $A$ such that 1) the size of $S$ is much smaller than $A$ and 2) $S$ efficiently describes the entire data set, in a way formalized via convex optimization. In order to generate $|S| = k$ exemplars, our kernelizable algorit… ▽ More

    Submitted 22 February, 2020; v1 submitted 6 November, 2018; originally announced November 2018.

  24. Query Understanding via Entity Attribute Identification

    Authors: Arash Dargahi Nobari, Arian Askari, Faegheh Hasibi, Mahmood Neshati

    Abstract: Understanding searchers' queries is an essential component of semantic search systems. In many cases, search queries involve specific attributes of an entity in a knowledge base (KB), which can be further used to find query answers. In this study, we aim to move forward the understanding of queries by identifying their related entity attributes from a knowledge base. To this end, we introduce the… ▽ More

    Submitted 23 September, 2018; originally announced September 2018.

    Comments: Proceedings of the 27th International Conference on Information and Knowledge Management (CIKM '18), 2018

  25. arXiv:1806.06775  [pdf, other

    stat.ML cs.LG

    Kernel-based Outlier Detection using the Inverse Christoffel Function

    Authors: Armin Askari, Forest Yang, Laurent El Ghaoui

    Abstract: Outlier detection methods have become increasingly relevant in recent years due to increased security concerns and because of its vast application to different fields. Recently, Pauwels and Lasserre (2016) noticed that the sublevel sets of the inverse Christoffel function accurately depict the shape of a cloud of data using a sum-of-squares polynomial and can be used to perform outlier detection.… ▽ More

    Submitted 18 June, 2018; originally announced June 2018.

  26. arXiv:1805.01532  [pdf, other

    cs.LG stat.ML

    Lifted Neural Networks

    Authors: Armin Askari, Geoffrey Negiar, Rajiv Sambharya, Laurent El Ghaoui

    Abstract: We describe a novel family of models of multi- layer feedforward neural networks in which the activation functions are encoded via penalties in the training problem. Our approach is based on representing a non-decreasing activation function as the argmin of an appropriate convex optimiza- tion problem. The new framework allows for algo- rithms such as block-coordinate descent methods to be applied… ▽ More

    Submitted 20 June, 2018; v1 submitted 3 May, 2018; originally announced May 2018.

  27. arXiv:cs/0508102  [pdf, ps, other

    cs.CE

    Investigations of Process Damping Forces in Metal Cutting

    Authors: Emily Stone, Suhail Ahmed, Abe Askari, Hong Tat

    Abstract: Using finite element software developed for metal cutting by Third Wave Systems we investigate the forces involved in chatter, a self-sustained oscillation of the cutting tool. The phenomena is decomposed into a vibrating tool cutting a flat surface work piece, and motionless tool cutting a work piece with a wavy surface. While cutting the wavy surface, the shearplane was seen to oscillate in ad… ▽ More

    Submitted 23 August, 2005; originally announced August 2005.

    Comments: 27 pages, 27 figures, submitted to Journal of Computational Methods in Science and Engineering, Feb. 2005

    ACM Class: I.6.3; I.6.4