
Showing 1–50 of 146 results for author: Yilmaz, E

  1. arXiv:2408.16312  [pdf, other]

    cs.IR

    SynDL: A Large-Scale Synthetic Test Collection for Passage Retrieval

    Authors: Hossein A. Rahmani, Xi Wang, Emine Yilmaz, Nick Craswell, Bhaskar Mitra, Paul Thomas

    Abstract: Large-scale test collections play a crucial role in Information Retrieval (IR) research. However, according to the Cranfield paradigm and the research into publicly available datasets, the existing information retrieval research studies are commonly developed on small-scale datasets that rely on human assessors for relevance judgments - a time-intensive and expensive process. Recent studies have s…

    Submitted 30 August, 2024; v1 submitted 29 August, 2024; originally announced August 2024.

    Comments: 9 pages, resource paper

  2. arXiv:2408.08896  [pdf, other]

    cs.IR

    LLMJudge: LLMs for Relevance Judgments

    Authors: Hossein A. Rahmani, Emine Yilmaz, Nick Craswell, Bhaskar Mitra, Paul Thomas, Charles L. A. Clarke, Mohammad Aliannejadi, Clemencia Siro, Guglielmo Faggioli

    Abstract: The LLMJudge challenge is organized as part of the LLM4Eval workshop at SIGIR 2024. Test collections are essential for evaluating information retrieval (IR) systems. The evaluation and tuning of a search system is largely based on relevance labels, which indicate whether a document is useful for a specific search and user. However, collecting relevance judgments on a large scale is costly and reso…

    Submitted 9 August, 2024; originally announced August 2024.

    Comments: LLMJudge Challenge Overview, 3 pages

  3. arXiv:2408.05388  [pdf, other]

    cs.IR

    Report on the 1st Workshop on Large Language Model for Evaluation in Information Retrieval (LLM4Eval 2024) at SIGIR 2024

    Authors: Hossein A. Rahmani, Clemencia Siro, Mohammad Aliannejadi, Nick Craswell, Charles L. A. Clarke, Guglielmo Faggioli, Bhaskar Mitra, Paul Thomas, Emine Yilmaz

    Abstract: The first edition of the workshop on Large Language Model for Evaluation in Information Retrieval (LLM4Eval 2024) took place in July 2024, co-located with the ACM SIGIR Conference 2024 in the USA (SIGIR 2024). The aim was to bring information retrieval researchers together around the topic of LLMs for evaluation in information retrieval that gathered attention with the advancement of large languag…

    Submitted 9 August, 2024; originally announced August 2024.

    Comments: LLM4Eval Workshop Report

  4. arXiv:2407.21712  [pdf, other]

    cs.CL cs.IR

    Adaptive Retrieval-Augmented Generation for Conversational Systems

    Authors: Xi Wang, Procheta Sen, Ruizhe Li, Emine Yilmaz

    Abstract: Despite the success of integrating large language models into the development of conversational systems, many studies have shown the effectiveness of retrieving and augmenting external knowledge for informative responses. Hence, many existing studies commonly assume the always need for Retrieval Augmented Generation (RAG) in a conversational system without explicit control. This raises a research…

    Submitted 31 July, 2024; originally announced July 2024.

    Comments: 12 pages, under review

  5. arXiv:2407.05894  [pdf, other]

    math.NA math.AP

    On Nonlinear Closures for Moment Equations Based on Orthogonal Polynomials

    Authors: Eda Yilmaz, Georgii Oblapenko, Manuel Torrilhon

    Abstract: In the present work, an approach to the moment closure problem on the basis of orthogonal polynomials derived from Gram matrices is proposed. Its properties are studied in the context of the moment closure problem arising in gas kinetic theory, for which the proposed approach is proven to have multiple attractive mathematical properties. Numerical studies are carried out for model gas particle dis…

    Submitted 8 July, 2024; originally announced July 2024.

    MSC Class: 82C40; 35L60; 35Q70

  6. arXiv:2406.17803  [pdf, other]

    cs.CL cs.AI cs.IR

    Understanding the Role of User Profile in the Personalization of Large Language Models

    Authors: Bin Wu, Zhengyan Shi, Hossein A. Rahmani, Varsha Ramineni, Emine Yilmaz

    Abstract: Utilizing user profiles to personalize Large Language Models (LLMs) has been shown to enhance the performance on a wide range of tasks. However, the precise role of user profiles and their effect mechanism on LLMs remains unclear. This study first confirms that the effectiveness of user profiles is primarily due to personalization information rather than semantic information. Furthermore, we inves…

    Submitted 22 June, 2024; originally announced June 2024.

  7. arXiv:2406.12177  [pdf, other]

    cs.CV cs.LG

    Location-based Radiology Report-Guided Semi-supervised Learning for Prostate Cancer Detection

    Authors: Alex Chen, Nathan Lay, Stephanie Harmon, Kutsev Ozyoruk, Enis Yilmaz, Brad J. Wood, Peter A. Pinto, Peter L. Choyke, Baris Turkbey

    Abstract: Prostate cancer is one of the most prevalent malignancies in the world. While deep learning has potential to further improve computer-aided prostate cancer detection on MRI, its efficacy hinges on the exhaustive curation of manually annotated images. We propose a novel methodology of semisupervised learning (SSL) guided by automatically extracted clinical information, specifically the lesion locat…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 4 page paper accepted to IEEE International Symposium on Biomedical Imaging (ISBI 2024)

  8. arXiv:2406.01612  [pdf, ps, other]

    physics.soc-ph

    Universal behavior of the Covid-19 tails: Inverse power-law distribution

    Authors: E. Aydiner, E. Yilmaz

    Abstract: Power-law distribution is one of the most important laws known in nature. Such a special universal behavior is known to occur in very few physical systems. In this work, we analyzed the mortality distribution of the Covid-19 pandemic tails for different countries and continents to discuss the possible universal behavior of the pandemic. Surprisingly, we found that the mortality distribution of Cov…

    Submitted 30 May, 2024; originally announced June 2024.

    Comments: Submitted to Physica A

  9. arXiv:2405.14394  [pdf, other]

    cs.CL cs.AI

    Instruction Tuning With Loss Over Instructions

    Authors: Zhengyan Shi, Adam X. Yang, Bin Wu, Laurence Aitchison, Emine Yilmaz, Aldo Lipani

    Abstract: Instruction tuning plays a crucial role in shaping the outputs of language models (LMs) to desired styles. In this work, we propose a simple yet effective method, Instruction Modelling (IM), which trains LMs by applying a loss function to the instruction and prompt part rather than solely to the output part. Through experiments across 21 diverse benchmarks, we show that, in many scenarios, IM can…

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: Code is available at https://github.com/ZhengxiangShi/InstructionModelling

  10. arXiv:2405.10119  [pdf, other]

    quant-ph

    Applications of Quantum Machine Learning for Quantitative Finance

    Authors: Piotr Mironowicz, Akshata Shenoy H., Antonio Mandarino, A. Ege Yilmaz, Thomas Ankenbrand

    Abstract: Machine learning and quantum machine learning (QML) have gained significant importance, as they offer powerful tools for tackling complex computational problems across various domains. This work gives an extensive overview of QML uses in quantitative finance, an important discipline in the financial industry. We examine the connection between quantum computing and machine learning in financial app…

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: comments are welcome

  11. arXiv:2405.07767  [pdf, other]

    cs.IR cs.AI

    Synthetic Test Collections for Retrieval Evaluation

    Authors: Hossein A. Rahmani, Nick Craswell, Emine Yilmaz, Bhaskar Mitra, Daniel Campos

    Abstract: Test collections play a vital role in evaluation of information retrieval (IR) systems. Obtaining a diverse set of user queries for test collection construction can be challenging, and acquiring relevance judgments, which indicate the appropriateness of retrieved documents to a query, is often costly and resource-intensive. Generating synthetic datasets using Large Language Models (LLMs) has recen…

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: SIGIR 2024

  12. arXiv:2404.01849  [pdf, other]

    cs.SE cs.AI

    EV2Gym: A Flexible V2G Simulator for EV Smart Charging Research and Benchmarking

    Authors: Stavros Orfanoudakis, Cesar Diaz-Londono, Yunus E. Yılmaz, Peter Palensky, Pedro P. Vergara

    Abstract: As electric vehicle (EV) numbers rise, concerns about the capacity of current charging and power grid infrastructure grow, necessitating the development of smart charging solutions. While many smart charging simulators have been developed in recent years, only a few support the development of Reinforcement Learning (RL) algorithms in the form of a Gym environment, and those that do usually lack de…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 10 pages, 9 figures, and 6 tables

  13. arXiv:2403.05181  [pdf, other]

    cs.LG cs.CR cs.CV

    Adversarial Sparse Teacher: Defense Against Distillation-Based Model Stealing Attacks Using Adversarial Examples

    Authors: Eda Yilmaz, Hacer Yalim Keles

    Abstract: We introduce Adversarial Sparse Teacher (AST), a robust defense method against distillation-based model stealing attacks. Our approach trains a teacher model using adversarial examples to produce sparse logit responses and increase the entropy of the output distribution. Typically, a model generates a peak in its output corresponding to its prediction. By leveraging adversarial examples, AST modif…

    Submitted 20 July, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

    Comments: 14 pages, 3 figures, 11 tables

  14. arXiv:2402.08205  [pdf, other]

    cs.RO

    TurtleRabbit 2024 SSL Team Description Paper

    Authors: Linh Trinh, Alif Anzuman, Eric Batkhuu, Dychen Chan, Lisa Graf, Darpan Gurung, Tharunimm Jamal, Jigme Namgyal, Jason Ng, Wing Lam Tsang, X. Rosalind Wang, Eren Yilmaz, Oliver Obst

    Abstract: TurtleRabbit is a new RoboCup SSL team from Western Sydney University. This team description paper presents our approach in navigating some of the challenges in developing a new SSL team from scratch. SSL is dominated by teams with extensive experience and customised equipment that has been developed over many years. Here, we outline our approach in overcoming some of the complexities associated w…

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: Submitted paper as part of the qualification for RoboCup 2024

  15. arXiv:2402.01934  [pdf, other]

    cs.IR

    Clarifying the Path to User Satisfaction: An Investigation into Clarification Usefulness

    Authors: Hossein A. Rahmani, Xi Wang, Mohammad Aliannejadi, Mohammadmehdi Naghiaei, Emine Yilmaz

    Abstract: Clarifying questions are an integral component of modern information retrieval systems, directly impacting user satisfaction and overall system performance. Poorly formulated questions can lead to user frustration and confusion, negatively affecting the system's performance. This research addresses the urgent need to identify and leverage key features that contribute to the classification of clari…

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: EACL

  16. arXiv:2401.15214  [pdf, other]

    gr-qc astro-ph.CO hep-ph hep-th

    A quantitative analysis of the effect of box size in N-body simulations of the matter power spectrum

    Authors: Maxim Eingorn, Ezgi Yilmaz, A. Emrah Yükselci, Alexander Zhuk

    Abstract: We study the effect of box size on the matter power spectrum obtained via cosmological N-body simulations. Within the framework of the cosmic screening approach, we show that the relative deviation between the spectra for our largest comoving box with L = 5632 Mpc/h and those for L = 280, 560, 1680, 4480, 5120 Mpc/h boxes consistently increases with decreasing box size in the latter set in the red…

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: 8 pages, 2 tables, 3 figures

  17. arXiv:2401.12794  [pdf, other]

    cs.CL

    Benchmarking LLMs via Uncertainty Quantification

    Authors: Fanghua Ye, Mingming Yang, Jianhui Pang, Longyue Wang, Derek F. Wong, Emine Yilmaz, Shuming Shi, Zhaopeng Tu

    Abstract: The proliferation of open-source Large Language Models (LLMs) from various institutions has highlighted the urgent need for comprehensive evaluation methods. However, current evaluation platforms, such as the widely recognized HuggingFace open LLM leaderboard, neglect a crucial aspect -- uncertainty, which is vital for thoroughly assessing LLMs. To bridge this gap, we introduce a new benchmarking…

    Submitted 25 April, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

    Comments: 25 pages, preprints

  18. arXiv:2401.05424  [pdf, other]

    cs.CY cs.IR cs.LG stat.AP

    A Toolbox for Modelling Engagement with Educational Videos

    Authors: Yuxiang Qiu, Karim Djemili, Denis Elezi, Aaneel Shalman, María Pérez-Ortiz, Emine Yilmaz, John Shawe-Taylor, Sahan Bulathwela

    Abstract: With the advancement and utility of Artificial Intelligence (AI), personalising education to a global population could be a cornerstone of new educational systems in the future. This work presents the PEEKC dataset and the TrueLearn Python library, which contains a dataset and a series of online learner state models that are essential to facilitate research on learner engagement modelling. TrueLear…

    Submitted 30 December, 2023; originally announced January 2024.

    Comments: In Proceedings of AAAI Conference on Artificial Intelligence 2024. arXiv admin note: text overlap with arXiv:2309.11527

    ACM Class: H.3.3; J.1; I.2.0

  19. arXiv:2312.15682  [pdf, other]

    cs.HC

    A Grating Based High-Frequency Motion Stimulus Paradigm for Steady-State Motion Visual Evoked Potentials

    Authors: Bartu Atabek, Efecan Yilmaz, Cengiz Acarturk, Murat Perit Cakir

    Abstract: Objective: This paper proposes a novel type of stimulus in the shape of sinusoidal gratings displayed with an imperceptibly high-frequency motion. The stimulus has been designed for use in BCI (Brain Computer Interface) applications that employ visually evoked potentials (VEPs) in an effort to mitigate discomfort associated with VEPs. The stimuli set included traditional VEP stimuli, already estab…

    Submitted 17 May, 2024; v1 submitted 25 December, 2023; originally announced December 2023.

  20. arXiv:2311.17864  [pdf]

    cond-mat.mtrl-sci

    Direct Fabrication of Atomically Defined Pores in MXenes

    Authors: Matthew G. Boebinger, Dundar E. Yilmaz, Ayana Ghosh, Sudhajit Misra, Tyler S. Mathis, Sergei V. Kalinin, Stephen Jesse, Yury Gogotsi, Adri C. T. van Duin, Raymond R. Unocic

    Abstract: Controlled fabrication of nanopores in atomically thin two-dimensional material offers the means to create robust membranes needed for ion transport, nanofiltration, and DNA sensing. Techniques for creating nanopores have relied upon either plasma etching or direct irradiation using electrons or ions; however, aberration-corrected scanning transmission electron microscopy (STEM) offers the advanta…

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: Experimental and simulations on the electron beam interactions with MXene monolayers to form nanopores as a function of temperature

  21. arXiv:2310.16738  [pdf, other]

    cs.CL cs.IR

    Improving Conversational Recommendation Systems via Bias Analysis and Language-Model-Enhanced Data Augmentation

    Authors: Xi Wang, Hossein A. Rahmani, Jiqun Liu, Emine Yilmaz

    Abstract: Conversational Recommendation System (CRS) is a rapidly growing research area that has gained significant attention alongside advancements in language modelling techniques. However, the current state of conversational recommendation faces numerous challenges due to its relative novelty and limited existing contributions. In this study, we delve into benchmark datasets for developing CRS models and…

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: Accepted by EMNLP 2023 (Findings)

  22. arXiv:2310.09716  [pdf, other]

    cs.HC cs.AI cs.CL cs.IR

    Enhancing Conversational Search: Large Language Model-Aided Informative Query Rewriting

    Authors: Fanghua Ye, Meng Fang, Shenghui Li, Emine Yilmaz

    Abstract: Query rewriting plays a vital role in enhancing conversational search by transforming context-dependent user queries into standalone forms. Existing approaches primarily leverage human-rewritten queries as labels to train query rewriting models. However, human rewrites may lack sufficient information for optimal retrieval performance. To overcome this limitation, we propose utilizing large languag…

    Submitted 18 October, 2023; v1 submitted 14 October, 2023; originally announced October 2023.

    Comments: 22 pages, accepted to EMNLP Findings 2023

  23. arXiv:2309.11219  [pdf]

    physics.flu-dyn

    Magnetically Levitated Microrobotic Mixer

    Authors: Ecenur Can Yılmaz, Abdurrahim Yılmaz, Ali Anıl Demirçalı, Efehan Topçu, Lila Kaman, Hüseyin Üvet

    Abstract: Microfluidic systems, when combined with microrobots, offer enhanced precision in chemical synthesis by precisely controlling reaction conditions. These systems, when integrated with analytical tools, allow for real-time monitoring and are cost-efficient due to their minimal volume requirements, thereby reducing risks associated with hazardous chemicals. In our study, we have investigated the mixi…

    Submitted 20 September, 2023; originally announced September 2023.

    Comments: 5 pages, 2 figures, 1 table

    MSC Class: 76-05

  24. arXiv:2309.06878  [pdf, other]

    physics.ins-det cond-mat.mtrl-sci

    Performance of a plastic scintillator developed using styrene monomer polymerization

    Authors: A. Sadigov, F. Ahmadov, G. Ahmadov, E. Aksu, D. Berikov, S. Nuruyev, R. Akbarov, M. Holik, J. Nagiyev, S. Gurbuz Guner, A. Mammadli, N. Suleymanova, C. Abbasova, S. Melikova, E. Yilmaz, O. Tagiyev, S. Lyubchyk, Z. Sadygov

    Abstract: This paper presents a newly developed plastic scintillator produced in collaboration with Turkiye Energy, Nuclear and Mineral Research Agency (TENMAK). The scintillator is manufactured using thermal polymerization of commercially available styrene monomer. The absorption spectrum of the scintillator exhibited two absorption bands at 225 nm and 340 nm, with an absorption edge observed at 410 nm. Th…

    Submitted 13 September, 2023; originally announced September 2023.

    Comments: 7 pages, 7 figures

  25. Mass density vs. energy density at cosmological scales

    Authors: Maxim Eingorn, Ezgi Yilmaz, A. Emrah Yükselci, Alexander Zhuk

    Abstract: In the presence of the gravitational field, the energy density of matter no longer coincides with its mass density. A discrepancy exists, of course, also between the associated power spectra. Within the $Λ$CDM model, we derive a formula that relates the power spectrum of the energy density to that of the mass density and test it with the help of N-body simulations run in comoving boxes of 2.816 Gp…

    Submitted 5 April, 2024; v1 submitted 6 September, 2023; originally announced September 2023.

    Comments: 11 pages, 4 figures; matches the published version in Physics Letters B

    Journal ref: Phys. Lett. B 851, 138564 (2024)

  26. arXiv:2309.00536  [pdf]

    econ.GN

    Preventing Others from Commercializing Your Innovation: Evidence from Creative Commons Licenses

    Authors: Erdem Dogukan Yilmaz, Tim Meyer, Milan Miric

    Abstract: Online innovation communities are an important source of innovation for many organizations. While contributions to such communities are typically made without financial compensation, these contributions are often governed by licenses such as Creative Commons that may prevent others from building upon and commercializing them. While this can diminish the usefulness of contributions, there is limite…

    Submitted 1 September, 2023; originally announced September 2023.

  27. arXiv:2308.13063  [pdf, other]

    q-fin.CP quant-ph

    Grover Search for Portfolio Selection

    Authors: A. Ege Yilmaz, Stefan Stettler, Thomas Ankenbrand, Urs Rhyner

    Abstract: We present explicit oracles designed to be used in Grover's algorithm to match investor preferences. Specifically, the oracles select portfolios with returns and standard deviations exceeding and falling below certain thresholds, respectively. One potential use case for the oracles is selecting portfolios with the best Sharpe ratios. We have implemented these algorithms using quantum simulators.

    Submitted 24 August, 2023; originally announced August 2023.

  28. Multi-Modal Multi-Task (3MT) Road Segmentation

    Authors: Erkan Milli, Özgür Erkent, Asım Egemen Yılmaz

    Abstract: Multi-modal systems have the capacity of producing more reliable results than systems with a single modality in road detection due to perceiving different aspects of the scene. We focus on using raw sensor inputs instead of, as it is typically done in many SOTA works, leveraging architectures that require high pre-processing costs such as surface normals or dense depth predictions. By using raw se…

    Submitted 23 August, 2023; originally announced August 2023.

    Journal ref: in IEEE Robotics and Automation Letters, vol. 8, no. 9, pp. 5408-5415, Sept. 2023

  29. Suppression of matter density growth at scales exceeding the cosmic screening length

    Authors: Maxim Eingorn, Ezgi Yilmaz, A. Emrah Yükselci, Alexander Zhuk

    Abstract: One of the main objectives of modern cosmology is to explain the origin and evolution of cosmic structures at different scales. The principal force responsible for the formation of such structures is gravity. In a general relativistic framework, we have shown that matter density contrasts do not grow over time at scales exceeding the cosmic screening length, which corresponds to a cosmological sca…

    Submitted 28 May, 2024; v1 submitted 13 July, 2023; originally announced July 2023.

    Comments: 9 pages, 3 figures. This is the Accepted Manuscript version of an article accepted for publication in Journal of Cosmology and Astroparticle Physics. Neither SISSA Medialab Srl nor IOP Publishing Ltd is responsible for any errors or omissions in this version of the manuscript or any version derived from it. The Version of Record is available online at 10.1088/1475-7516/2024/05/083

    Journal ref: JCAP 05 (2024) 083

  30. arXiv:2305.16798  [pdf, other]

    cs.CL cs.AI

    Schema-Guided User Satisfaction Modeling for Task-Oriented Dialogues

    Authors: Yue Feng, Yunlong Jiao, Animesh Prasad, Nikolaos Aletras, Emine Yilmaz, Gabriella Kazai

    Abstract: User Satisfaction Modeling (USM) is one of the popular choices for task-oriented dialogue systems evaluation, where user satisfaction typically depends on whether the user's task goals were fulfilled by the system. Task-oriented dialogue systems use task schema, which is a set of task attributes, to encode the user's task goals. Existing studies on USM neglect explicitly modeling the user's task g…

    Submitted 26 May, 2023; originally announced May 2023.

  31. arXiv:2305.15933  [pdf, other]

    cs.IR

    A Survey on Asking Clarification Questions Datasets in Conversational Systems

    Authors: Hossein A. Rahmani, Xi Wang, Yue Feng, Qiang Zhang, Emine Yilmaz, Aldo Lipani

    Abstract: The ability to understand a user's underlying needs is critical for conversational systems, especially with limited input from users in a conversation. Thus, in such a domain, Asking Clarification Questions (ACQs) to reveal users' true intent from their queries or utterances arise as an essential task. However, it is noticeable that a key limitation of the existing ACQs studies is their incomparab…

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: ACL 2023, 17 pages

  32. arXiv:2305.13690  [pdf, other]

    cs.CL cs.IR

    Towards Asking Clarification Questions for Information Seeking on Task-Oriented Dialogues

    Authors: Yue Feng, Hossein A. Rahmani, Aldo Lipani, Emine Yilmaz

    Abstract: Task-oriented dialogue systems aim at providing users with task-specific services. Users of such systems often do not know all the information about the task they are trying to accomplish, requiring them to seek information about the task. To provide accurate and personalized task-oriented information seeking results, task-oriented dialogue systems need to address two potential issues: 1) users' i…

    Submitted 23 May, 2023; originally announced May 2023.

  33. arXiv:2305.13002  [pdf, other]

    cs.CL cs.AI cs.LG

    Rethinking Semi-supervised Learning with Language Models

    Authors: Zhengxiang Shi, Francesco Tonolini, Nikolaos Aletras, Emine Yilmaz, Gabriella Kazai, Yunlong Jiao

    Abstract: Semi-supervised learning (SSL) is a popular setting aiming to effectively utilize unlabelled data to improve model performance in downstream natural language processing (NLP) tasks. Currently, there are two popular approaches to make use of unlabelled data: Self-training (ST) and Task-adaptive pre-training (TAPT). ST uses a teacher model to assign pseudo-labels to the unlabelled data, while TAPT c…

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: Findings of ACL 2023. Code is available at https://github.com/amzn/pretraining-or-self-training

  34. arXiv:2305.12594  [pdf, other]

    cs.CL

    Modeling User Satisfaction Dynamics in Dialogue via Hawkes Process

    Authors: Fanghua Ye, Zhiyuan Hu, Emine Yilmaz

    Abstract: Dialogue systems have received increasing attention while automatically evaluating their performance remains challenging. User satisfaction estimation (USE) has been proposed as an alternative. It assumes that the performance of a dialogue system can be measured by user satisfaction and uses an estimator to simulate users. The effectiveness of USE depends heavily on the estimator. Existing estimat…

    Submitted 21 May, 2023; originally announced May 2023.

    Comments: To appear at ACL 2023

  35. arXiv:2305.07871  [pdf, other]

    cs.AI cs.CY cs.IR cs.LG

    Scalable Educational Question Generation with Pre-trained Language Models

    Authors: Sahan Bulathwela, Hamze Muse, Emine Yilmaz

    Abstract: The automatic generation of educational questions will play a key role in scaling online education, enabling self-assessment at scale when a global population is manoeuvring their personalised learning journeys. We develop EduQG, a novel educational question generation model built by adapting a large language model. Our extensive experiments demonstrate that EduQG can produce sup…

    Submitted 13 May, 2023; originally announced May 2023.

    Comments: To be published at the Int. Conf. on Artificial Intelligence in Education (Tokyo, 2023)

    ACM Class: H.3.3; J.1; I.2.0

  36. arXiv:2304.11752  [pdf, ps, other]

    cs.IR

    Query-specific Variable Depth Pooling via Query Performance Prediction towards Reducing Relevance Assessment Effort

    Authors: Debasis Ganguly, Emine Yilmaz

    Abstract: Due to the massive size of test collections, a standard practice in IR evaluation is to construct a 'pool' of candidate relevant documents comprised of the top-k documents retrieved by a wide range of different retrieval systems - a process called depth-k pooling. A standard practice is to set the depth (k) to a constant value for each query constituting the benchmark set. However, in this paper w…

    Submitted 23 April, 2023; originally announced April 2023.

    Comments: To appear in SIGIR 2023

  37. arXiv:2302.00654  [pdf, other]

    cs.IR

    Task2KB: A Public Task-Oriented Knowledge Base

    Authors: Procheta Sen, Xi Wang, Ruiqing Xu, Emine Yilmaz

    Abstract: Search engines and conversational assistants are commonly used to help users complete their every day tasks such as booking travel, cooking, etc. While there are some existing datasets that can be used for this purpose, their coverage is limited to very few domains. In this paper, we propose a novel knowledge base, 'Task2KB', which is constructed using data crawled from WikiHow, an online knowledg…

    Submitted 24 January, 2023; originally announced February 2023.

  38. arXiv:2212.11871  [pdf]

    cond-mat.mtrl-sci

    Atomic-scale modeling of the thermal decomposition of titanium(IV)-isopropoxide

    Authors: Benazir Fazlioglu Yalcin, Dundar E. Yilmaz, Adri CT van Duin, Roman Engel-Herbert

    Abstract: The metal-organic (MO) compound titanium(IV)-isopropoxide (Ti(OiPr)4, TTIP) has tremendous technological relevance for thin film growth and coating technologies, offering a low-temperature deposition route for titania and titanium-oxide-based compounds. Thermal decomposition via the release of organic ligands, a key process in any TTIP-based synthesis approach, is commonly assumed to take place on…

    Submitted 22 December, 2022; originally announced December 2022.

    Comments: 26 pages, 7 figures

  39. arXiv:2212.03869  [pdf, other]

    cs.CL cs.AI cs.CY cs.IR cs.LG stat.ML

    Pre-Training With Scientific Text Improves Educational Question Generation

    Authors: Hamze Muse, Sahan Bulathwela, Emine Yilmaz

    Abstract: With the boom of digital educational materials and scalable e-learning systems, the potential for realising AI-assisted personalised learning has skyrocketed. In this landscape, the automatic generation of educational questions will play a key role, enabling scalable self-assessment when a global population is manoeuvring their personalised learning journeys. We develop EduQG, a novel educational…

    Submitted 7 December, 2022; originally announced December 2022.

    Comments: In Proceedings of AAAI Conference on Artificial Intelligence 2023

    ACM Class: H.3.3; J.1; I.2.0

  40. arXiv:2210.12397  [pdf, other]

    cs.CL

    MetaASSIST: Robust Dialogue State Tracking with Meta Learning

    Authors: Fanghua Ye, Xi Wang, Jie Huang, Shenghui Li, Samuel Stern, Emine Yilmaz

    Abstract: Existing dialogue datasets contain lots of noise in their state annotations. Such noise can hurt model training and ultimately lead to poor generalization performance. A general framework named ASSIST has recently been proposed to train robust dialogue state tracking (DST) models. It introduces an auxiliary model to generate pseudo labels for the noisy training set. These pseudo labels are combine…

    Submitted 22 October, 2022; originally announced October 2022.

    Comments: To appear at EMNLP 2022, 13 pages

  41. arXiv:2210.12195  [pdf, other]

    cs.LG

    Just Mix Once: Worst-group Generalization by Group Interpolation

    Authors: Giorgio Giannone, Serhii Havrylov, Jordan Massiah, Emine Yilmaz, Yunlong Jiao

    Abstract: Advances in deep learning theory have revealed how average generalization relies on superficial patterns in data. The consequences are brittle models with poor performance with shift in group distribution at test time. When group annotation is available, we can use robust optimization tools to tackle the problem. However, identification and annotation are time-consuming, especially on large datase…

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: preprint

  42. Evaluation Metrics for Measuring Bias in Search Engine Results

    Authors: Gizem Gezici, Aldo Lipani, Yucel Saygin, Emine Yilmaz

    Abstract: Search engines decide what we see for a given search query. Since many people are exposed to information through search engines, it is fair to expect that search engines are neutral. However, search engine results do not necessarily cover all the viewpoints of a search query topic, and they can be biased towards a specific view since search engine results are returned based on relevance, which is…

    Submitted 3 February, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

    Journal ref: 24, 2021, 85-113

  43. arXiv:2207.02627  [pdf, other]

    math.GR

    Composition laws on the Fricke surface and Markov triples

    Authors: A. Muhammed Uludağ, Esra Ünal Yılmaz

    Abstract: We determine some composition laws related to the Fricke surface and the "double" Fricke surface. This latter surface admits the squares of Markov triples as its solutions.

    Submitted 6 July, 2022; originally announced July 2022.

  44. arXiv:2207.02620  [pdf, ps, other]

    math.NT math.QA

    Quantizations of Continued Fractions

    Authors: A. Muhammed Uludağ, Esra Ünal Yilmaz

    Abstract: We introduce a four-parameter deformation of continued fractions, which we call $U$-deformation. We study some particular cases and compare them with the q-deformation of continued fractions introduced recently by Morier-Genoud and Ovsienko.

    Submitted 6 July, 2022; originally announced July 2022.

  45. arXiv:2207.01504  [pdf, other]

    cs.CY cs.AI cs.DL stat.AP stat.ML

    Can Population-based Engagement Improve Personalisation? A Novel Dataset and Experiments

    Authors: Sahan Bulathwela, Meghana Verma, Maria Perez-Ortiz, Emine Yilmaz, John Shawe-Taylor

    Abstract: This work explores how population-based engagement prediction can address cold-start at scale in large learning resource collections. The paper introduces i) VLE, a novel dataset that consists of content and video based features extracted from publicly available scientific video lectures coupled with implicit and explicit signals related to learner engagement, ii) two standard tasks related to pre…

    Submitted 22 June, 2022; originally announced July 2022.

    Comments: To be presented at International Conference for Educational Data Mining 2022

    ACM Class: H.3.3; J.1; I.2.0

  46. arXiv:2206.10298  [pdf, other]

    cs.CL cs.SI

    ViralBERT: A User Focused BERT-Based Approach to Virality Prediction

    Authors: Rikaz Rameez, Hossein A. Rahmani, Emine Yilmaz

    Abstract: Recently, Twitter has become the social network of choice for sharing and spreading information to a multitude of users through posts called 'tweets'. Users can easily re-share these posts to other users through 'retweets', which allow information to cascade to many more users, increasing its outreach. Clearly, being able to know the extent to which a post can be retweeted has great value in adver…

    Submitted 17 May, 2022; originally announced June 2022.

    Comments: UMAP 2022

  47. arXiv:2206.09496  [pdf, other]

    cs.LG

    Integrated Weak Learning

    Authors: Peter Hayes, Mingtian Zhang, Raza Habib, Jordan Burgess, Emine Yilmaz, David Barber

    Abstract: We introduce Integrated Weak Learning, a principled framework that integrates weak supervision into the training process of machine learning models. Our approach jointly trains the end-model and a label model that aggregates multiple sources of weak supervision. We introduce a label model that can learn to aggregate weak supervision sources differently for different datapoints and takes into consi…

    Submitted 19 June, 2022; originally announced June 2022.

    Comments: 14 pages, 4 figures

  48. Impact of Tokenization on Language Models: An Analysis for Turkish

    Authors: Cagri Toraman, Eyup Halit Yilmaz, Furkan Şahinuç, Oguzhan Ozcelik

    Abstract: Tokenization is an important text preprocessing step to prepare input tokens for deep language models. WordPiece and BPE are de facto methods employed by important models, such as BERT and GPT. However, the impact of tokenization can be different for morphologically rich languages, such as Turkic languages, where many words can be generated by adding prefixes and suffixes. We compare five tokenize…

    Submitted 19 April, 2022; originally announced April 2022.

    Comments: submitted to ACM TALLIP

    Journal ref: ACM Transactions on Asian and Low-Resource Language Information Processing (2023) Volume 22 Issue 4 pp 1-21

  49. arXiv:2204.06677  [pdf, other]

    cs.CL cs.IR

    Dynamic Schema Graph Fusion Network for Multi-Domain Dialogue State Tracking

    Authors: Yue Feng, Aldo Lipani, Fanghua Ye, Qiang Zhang, Emine Yilmaz

    Abstract: Dialogue State Tracking (DST) aims to keep track of users' intentions during the course of a conversation. In DST, modelling the relations among domains and slots is still an under-studied problem. Existing approaches that have considered such relations generally fall short in: (1) fusing prior slot-domain membership relations and dialogue-aware dynamic slot relations explicitly, and (2) generaliz…

    Submitted 15 April, 2022; v1 submitted 13 April, 2022; originally announced April 2022.

    Comments: Accepted by ACL 2022

  50. arXiv:2204.04792  [pdf, other]

    cs.CR

    Robust Fingerprint of Location Trajectories Under Differential Privacy

    Authors: Yuzhou Jiang, Emre Yilmaz, Erman Ayday

    Abstract: Directly releasing those data raises privacy and liability (e.g., due to unauthorized distribution of such datasets) concerns since location data contain users' sensitive information, e.g., regular moving patterns and favorite spots. To address this, we propose a novel fingerprinting scheme that simultaneously identifies unauthorized redistribution of location datasets and provides differential pr…

    Submitted 21 April, 2023; v1 submitted 10 April, 2022; originally announced April 2022.