Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–49 of 49 results for author: Develder, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.18821  [pdf, other

    eess.SY cs.AI cs.LG

    Control Policy Correction Framework for Reinforcement Learning-based Energy Arbitrage Strategies

    Authors: Seyed Soroush Karimi Madahi, Gargya Gokhale, Marie-Sophie Verwee, Bert Claessens, Chris Develder

    Abstract: A continuous rise in the penetration of renewable energy sources, along with the use of the single imbalance pricing, provides a new opportunity for balance responsible parties to reduce their cost through energy arbitrage in the imbalance settlement mechanism. Model-free reinforcement learning (RL) methods are an appropriate choice for solving the energy arbitrage problem due to their outstanding… ▽ More

    Submitted 30 April, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

    Comments: ACM e-Energy 2024

  2. arXiv:2404.14836   

    eess.SY cs.LG

    Probabilistic forecasting of power system imbalance using neural network-based ensembles

    Authors: Jonas Van Gompel, Bert Claessens, Chris Develder

    Abstract: Keeping the balance between electricity generation and consumption is becoming increasingly challenging and costly, mainly due to the rising share of renewables, electric vehicles and heat pumps and electrification of industrial processes. Accurate imbalance forecasts, along with reliable uncertainty estimations, enable transmission system operators (TSOs) to dispatch appropriate reserve volumes,… ▽ More

    Submitted 24 April, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

    Comments: One of the co-authors objected with having it on Arxiv already

  3. arXiv:2404.14110  [pdf, other

    eess.SY cs.AR

    HomeLabGym: A real-world testbed for home energy management systems

    Authors: Toon Van Puyvelde, Marie-Sophie Verwee, Gargya Gokhale, Mehran Zareh Eshghdoust, Chris Develder

    Abstract: Amid growing environmental concerns and resulting energy costs, there is a rising need for efficient Home Energy Management Systems (HEMS). Evaluating such innovative HEMS solutions typically relies on simulations that may not model the full complexity of a real-world scenario. On the other hand, real-world testing, while more accurate, is labor-intensive, particularly when dealing with diverse as… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 3 pages, 2 figures, conference

  4. arXiv:2403.11947  [pdf, other

    eess.SY cs.LG

    Explainable Reinforcement Learning-based Home Energy Management Systems using Differentiable Decision Trees

    Authors: Gargya Gokhale, Bert Claessens, Chris Develder

    Abstract: With the ongoing energy transition, demand-side flexibility has become an important aspect of the modern power grid for providing grid support and allowing further integration of sustainable energy sources. Besides traditional sources, the residential sector is another major and largely untapped source of flexibility, driven by the increased adoption of solar PV, home batteries, and EVs. However,… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: 9 pages, 5 figures

  5. arXiv:2403.11907  [pdf, other

    eess.SY cs.LG

    Distill2Explain: Differentiable decision trees for explainable reinforcement learning in energy application controllers

    Authors: Gargya Gokhale, Seyed Soroush Karimi Madahi, Bert Claessens, Chris Develder

    Abstract: Demand-side flexibility is gaining importance as a crucial element in the energy transition process. Accounting for about 25% of final energy consumption globally, the residential sector is an important (potential) source of energy flexibility. However, unlocking this flexibility requires developing a control framework that (1) easily scales across different houses, (2) is easy to maintain, and (3… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: 14 pages, 6 figures, to be published in e-Energy 2024,

  6. arXiv:2401.12178  [pdf, other

    cs.CL cs.AI

    In-Context Learning for Extreme Multi-Label Classification

    Authors: Karel D'Oosterlinck, Omar Khattab, François Remy, Thomas Demeester, Chris Develder, Christopher Potts

    Abstract: Multi-label classification problems with thousands of classes are hard to solve with in-context learning alone, as language models (LMs) might lack prior knowledge about the precise classes or how to assign them, and it is generally infeasible to demonstrate every class in a prompt. We propose a general program, $\texttt{Infer--Retrieve--Rank}$, that defines multi-step interactions between LMs and… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

  7. arXiv:2401.00015  [pdf, other

    cs.LG cs.AI eess.SY

    Distributional Reinforcement Learning-based Energy Arbitrage Strategies in Imbalance Settlement Mechanism

    Authors: Seyed Soroush Karimi Madahi, Bert Claessens, Chris Develder

    Abstract: Growth in the penetration of renewable energy sources makes supply more uncertain and leads to an increase in the system imbalance. This trend, together with the single imbalance pricing, opens an opportunity for balance responsible parties (BRPs) to perform energy arbitrage in the imbalance settlement mechanism. To this end, we propose a battery control framework based on distributional reinforce… ▽ More

    Submitted 23 December, 2023; originally announced January 2024.

  8. arXiv:2312.03365  [pdf, other

    eess.SY cs.AI

    Demand response for residential building heating: Effective Monte Carlo Tree Search control based on physics-informed neural networks

    Authors: Fabio Pavirani, Gargya Gokhale, Bert Claessens, Chris Develder

    Abstract: To reduce global carbon emissions and limit climate change, controlling energy consumption in buildings is an important piece of the puzzle. Here, we specifically focus on using a demand response (DR) algorithm to limit the energy consumption of a residential building's heating system while respecting user's thermal comfort. In that domain, Reinforcement learning (RL) methods have been shown to be… ▽ More

    Submitted 21 May, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

  9. arXiv:2311.10905  [pdf, other

    cs.CL cs.AI

    Flexible Model Interpretability through Natural Language Model Editing

    Authors: Karel D'Oosterlinck, Thomas Demeester, Chris Develder, Christopher Potts

    Abstract: Model interpretability and model editing are crucial goals in the age of large language models. Interestingly, there exists a link between these two goals: if a method is able to systematically edit model behavior with regard to a human concept of interest, this editor method can help make internal representations more interpretable by pointing towards relevant representations and systematically m… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: Extended Abstract -- work in progress. BlackboxNLP2023

  10. arXiv:2311.06549  [pdf, other

    cs.CL

    Zero-Shot Cross-Lingual Sentiment Classification under Distribution Shift: an Exploratory Study

    Authors: Maarten De Raedt, Semere Kiros Bitew, Fréderic Godin, Thomas Demeester, Chris Develder

    Abstract: The brittleness of finetuned language model performance on out-of-distribution (OOD) test samples in unseen domains has been well-studied for English, yet is unexplored for multi-lingual models. Therefore, we study generalization to OOD test data specifically in zero-shot cross-lingual transfer settings, analyzing performance impacts of both language and domain shifts between train and test data.… ▽ More

    Submitted 11 November, 2023; originally announced November 2023.

    Comments: The 3rd Workshop on Multilingual Representation Learning (MRL@EMNLP2023)

  11. arXiv:2311.04088  [pdf, other

    cs.CL

    Personality Style Recognition via Machine Learning: Identifying Anaclitic and Introjective Personality Styles from Patients' Speech

    Authors: Semere Kiros Bitew, Vincent Schelstraete, Klim Zaporojets, Kimberly Van Nieuwenhove, Reitske Meganck, Chris Develder

    Abstract: In disentangling the heterogeneity observed in psychopathology, personality of the patients is considered crucial. While it has been demonstrated that personality traits are reflected in the language used by a patient, we hypothesize that this enables automatic inference of the personality type directly from speech utterances, potentially more accurately than through a traditional questionnaire-ba… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

  12. Transfer Learning in Transformer-Based Demand Forecasting For Home Energy Management System

    Authors: Gargya Gokhale, Jonas Van Gompel, Bert Claessens, Chris Develder

    Abstract: Increasingly, homeowners opt for photovoltaic (PV) systems and/or battery storage to minimize their energy bills and maximize renewable energy usage. This has spurred the development of advanced control algorithms that maximally achieve those goals. However, a common challenge faced while developing such controllers is the unavailability of accurate forecasts of household power consumption, especi… ▽ More

    Submitted 29 October, 2023; originally announced October 2023.

    Comments: 7 pages, 2 figures, workshop article at BALANCES, BuildSys'23

  13. Real-World Implementation of Reinforcement Learning Based Energy Coordination for a Cluster of Households

    Authors: Gargya Gokhale, Niels Tiben, Marie-Sophie Verwee, Manu Lahariya, Bert Claessens, Chris Develder

    Abstract: Given its substantial contribution of 40\% to global power consumption, the built environment has received increasing attention to serve as a source of flexibility to assist the modern power grid. In that respect, previous research mainly focused on energy management of individual buildings. In contrast, in this paper, we focus on aggregated control of a set of residential buildings, to provide gr… ▽ More

    Submitted 29 October, 2023; originally announced October 2023.

    Comments: 8 pages, 2 figures, workshop article accepted at RLEM'23 (BuildSys'23)

  14. arXiv:2310.15636  [pdf, other

    cs.CL cs.AI

    Career Path Prediction using Resume Representation Learning and Skill-based Matching

    Authors: Jens-Joris Decorte, Jeroen Van Hautte, Johannes Deleu, Chris Develder, Thomas Demeester

    Abstract: The impact of person-job fit on job satisfaction and performance is widely acknowledged, which highlights the importance of providing workers with next steps at the right time in their career. This task of predicting the next step in a career is known as career path prediction, and has diverse applications such as turnover prevention and internal job mobility. Existing methods to career path predi… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: Accepted to the 3nd Workshop on Recommender Systems for Human Resources (RecSys in HR 2023) as part of RecSys 2023

  15. arXiv:2310.06165  [pdf, other

    cs.CL cs.AI

    CAW-coref: Conjunction-Aware Word-level Coreference Resolution

    Authors: Karel D'Oosterlinck, Semere Kiros Bitew, Brandon Papineau, Christopher Potts, Thomas Demeester, Chris Develder

    Abstract: State-of-the-art coreference resolutions systems depend on multiple LLM calls per document and are thus prohibitively expensive for many use cases (e.g., information extraction with large corpora). The leading word-level coreference system (WL-coref) attains 96.6% of these SOTA systems' performance while being much more efficient. In this work, we identify a routine yet important failure case of W… ▽ More

    Submitted 19 October, 2023; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: Accepted at CRAC 2023

  16. arXiv:2307.16338  [pdf, other

    cs.CL

    Distractor generation for multiple-choice questions with predictive prompting and large language models

    Authors: Semere Kiros Bitew, Johannes Deleu, Chris Develder, Thomas Demeester

    Abstract: Large Language Models (LLMs) such as ChatGPT have demonstrated remarkable performance across various tasks and have garnered significant attention from both researchers and practitioners. However, in an educational context, we still observe a performance gap in generating distractors -- i.e., plausible yet incorrect answers -- with LLMs for multiple-choice questions (MCQs). In this study, we propo… ▽ More

    Submitted 30 July, 2023; originally announced July 2023.

    Comments: 16 pages, Accepted at the 1st International Tutorial and Workshop on Responsible Knowledge Discovery in Education

  17. arXiv:2307.10778  [pdf, other

    cs.CL

    Extreme Multi-Label Skill Extraction Training using Large Language Models

    Authors: Jens-Joris Decorte, Severine Verlinden, Jeroen Van Hautte, Johannes Deleu, Chris Develder, Thomas Demeester

    Abstract: Online job ads serve as a valuable source of information for skill requirements, playing a crucial role in labor market analysis and e-recruitment processes. Since such ads are typically formatted in free text, natural language processing (NLP) technologies are required to automatically process them. We specifically focus on the task of detecting skills (mentioned literally, or implicitly describe… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

    Comments: Accepted to the International workshop on AI for Human Resources and Public Employment Services (AI4HR&PES) as part of ECML-PKDD 2023

  18. arXiv:2306.01584  [pdf, other

    cs.CL

    Learning from Partially Annotated Data: Example-aware Creation of Gap-filling Exercises for Language Learning

    Authors: Semere Kiros Bitew, Johannes Deleu, A. Seza Doğruöz, Chris Develder, Thomas Demeester

    Abstract: Since performing exercises (including, e.g., practice tests) forms a crucial component of learning, and creating such exercises requires non-trivial effort from the teacher, there is a great value in automatic exercise generation in digital tools in education. In this paper, we particularly focus on automatic creation of gapfilling exercises for language learning, specifically grammar exercises. S… ▽ More

    Submitted 15 June, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: 12 pages, Accepted in the 18th Workshop on Innovative Use of NLP for Building Educational Applications

  19. arXiv:2305.19783  [pdf, other

    cs.CL

    IDAS: Intent Discovery with Abstractive Summarization

    Authors: Maarten De Raedt, Fréderic Godin, Thomas Demeester, Chris Develder

    Abstract: Intent discovery is the task of inferring latent intents from a set of unlabeled utterances, and is a useful step towards the efficient creation of new conversational agents. We show that recent competitive methods in intent discovery can be outperformed by clustering utterances based on abstractive summaries, i.e., "labels", that retain the core elements while removing non-essential information.… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: The 5th Workshop on NLP for Conversational AI (NLP4ConvAI@ACL)

  20. arXiv:2305.13395  [pdf, other

    cs.CL

    BioDEX: Large-Scale Biomedical Adverse Drug Event Extraction for Real-World Pharmacovigilance

    Authors: Karel D'Oosterlinck, François Remy, Johannes Deleu, Thomas Demeester, Chris Develder, Klim Zaporojets, Aneiss Ghodsi, Simon Ellershaw, Jack Collins, Christopher Potts

    Abstract: Timely and accurate extraction of Adverse Drug Events (ADE) from biomedical literature is paramount for public safety, but involves slow and costly manual labor. We set out to improve drug safety monitoring (pharmacovigilance, PV) through the use of Natural Language Processing (NLP). We introduce BioDEX, a large-scale resource for Biomedical adverse Drug Event Extraction, rooted in the historical… ▽ More

    Submitted 20 October, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: 28 pages. EMNLP Findings 2023

  21. arXiv:2302.02500  [pdf, other

    cs.CL

    TempEL: Linking Dynamically Evolving and Newly Emerging Entities

    Authors: Klim Zaporojets, Lucie-Aimee Kaffee, Johannes Deleu, Thomas Demeester, Chris Develder, Isabelle Augenstein

    Abstract: In our continuously evolving world, entities change over time and new, previously non-existing or unknown, entities appear. We study how this evolutionary scenario impacts the performance on a well established entity linking (EL) task. For that study, we introduce TempEL, an entity linking dataset that consists of time-stratified English Wikipedia snapshots from 2013 to 2022, from which we collect… ▽ More

    Submitted 5 February, 2023; originally announced February 2023.

  22. Learning to Reuse Distractors to support Multiple Choice Question Generation in Education

    Authors: Semere Kiros Bitew, Amir Hadifar, Lucas Sterckx, Johannes Deleu, Chris Develder, Thomas Demeester

    Abstract: Multiple choice questions (MCQs) are widely used in digital learning systems, as they allow for automating the assessment process. However, due to the increased digital literacy of students and the advent of social media platforms, MCQ tests are widely shared online, and teachers are continuously challenged to create new questions, which is an expensive and time-consuming task. A particularly sens… ▽ More

    Submitted 13 December, 2022; v1 submitted 25 October, 2022; originally announced October 2022.

    Comments: 24 pages and 4 figures Accepted for publication in IEEE Transactions on Learning technologies

  23. arXiv:2210.11805  [pdf, other

    cs.CL

    Robustifying Sentiment Classification by Maximally Exploiting Few Counterfactuals

    Authors: Maarten De Raedt, Fréderic Godin, Chris Develder, Thomas Demeester

    Abstract: For text classification tasks, finetuned language models perform remarkably well. Yet, they tend to rely on spurious patterns in training data, thus limiting their performance on out-of-distribution (OOD) test data. Among recent models aiming to avoid this spurious pattern problem, adding extra counterfactual samples to the training data has proven to be very effective. Yet, counterfactual data ge… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: EMNLP 2022

  24. arXiv:2210.06104  [pdf, other

    cs.CL

    EduQG: A Multi-format Multiple Choice Dataset for the Educational Domain

    Authors: Amir Hadifar, Semere Kiros Bitew, Johannes Deleu, Chris Develder, Thomas Demeester

    Abstract: We introduce a high-quality dataset that contains 3,397 samples comprising (i) multiple choice questions, (ii) answers (including distractors), and (iii) their source documents, from the educational domain. Each question is phrased in two forms, normal and close. Correct answers are linked to source documents with sentence-level annotations. Thus, our versatile dataset can be used for both questio… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

  25. arXiv:2209.05987  [pdf, other

    cs.CL

    Design of Negative Sampling Strategies for Distantly Supervised Skill Extraction

    Authors: Jens-Joris Decorte, Jeroen Van Hautte, Johannes Deleu, Chris Develder, Thomas Demeester

    Abstract: Skills play a central role in the job market and many human resources (HR) processes. In the wake of other digital experiences, today's online job market has candidates expecting to see the right opportunities based on their skill set. Similarly, enterprises increasingly need to use data to guarantee that the skills within their workforce remain future-proof. However, structured information about… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: Accepted to the 2nd Workshop on Recommender Systems for Human Resources (RecSys in HR 2022) as part of RecSys 2022

  26. CookDial: A dataset for task-oriented dialogs grounded in procedural documents

    Authors: Yiwei Jiang, Klim Zaporojets, Johannes Deleu, Thomas Demeester, Chris Develder

    Abstract: This work presents a new dialog dataset, CookDial, that facilitates research on task-oriented dialog systems with procedural knowledge understanding. The corpus contains 260 human-to-human task-oriented dialogs in which an agent, given a recipe document, guides the user to cook a dish. Dialogs in CookDial exhibit two unique features: (i) procedural alignment between the dialog flow and supporting… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: The dataset and codes are available at https://github.com/YiweiJiang2015/CookDial

    Journal ref: Applied Intelligence, 1-19 (2022)

  27. arXiv:2203.14078  [pdf, other

    cs.AI cs.LG eess.SY

    Computationally efficient joint coordination of multiple electric vehicle charging points using reinforcement learning

    Authors: Manu Lahariya, Nasrin Sadeghianpourhamami, Chris Develder

    Abstract: A major challenge in todays power grid is to manage the increasing load from electric vehicle (EV) charging. Demand response (DR) solutions aim to exploit flexibility therein, i.e., the ability to shift EV charging in time and thus avoid excessive peaks or achieve better balancing. Whereas the majority of existing research works either focus on control strategies for a single EV charger, or use a… ▽ More

    Submitted 26 March, 2022; originally announced March 2022.

  28. Optimized cost function for demand response coordination of multiple EV charging stations using reinforcement learning

    Authors: Manu Lahariya, Nasrin Sadeghianpourhamami, Chris Develder

    Abstract: Electric vehicle (EV) charging stations represent a substantial load with significant flexibility. The exploitation of that flexibility in demand response (DR) algorithms becomes increasingly important to manage and balance demand and supply in power grids. Model-free DR based on reinforcement learning (RL) is an attractive approach to balance such EV charging load. We build on previous research o… ▽ More

    Submitted 3 March, 2022; originally announced March 2022.

    Journal ref: Proceedings of the 6th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation (BuildSys 19), November 2019 Pages 344 345

  29. Defining a synthetic data generator for realistic electric vehicle charging sessions

    Authors: Manu Lahariya, Dries Benoit, Chris Develder

    Abstract: Electric vehicle (EV) charging stations have become prominent in electricity grids in the past years. Analysis of EV charging sessions is useful for flexibility analysis, load balancing, offering incentives to customers, etc. Yet, the limited availability of such EV sessions data hinders further development in these fields. Addressing this need for publicly available and realistic data, we develop… ▽ More

    Submitted 28 February, 2022; originally announced March 2022.

    Journal ref: e-Energy 20 Proceedings of the Eleventh ACM International Conference on Future Energy Systems June 2020 Pages 406 407

  30. arXiv:2202.12977  [pdf, other

    cs.RO cs.AI

    Learning physics-informed simulation models for soft robotic manipulation: A case study with dielectric elastomer actuators

    Authors: Manu Lahariya, Craig Innes, Chris Develder, Subramanian Ramamoorthy

    Abstract: Soft actuators offer a safe, adaptable approach to tasks like gentle grasping and dexterous manipulation. Creating accurate models to control such systems however is challenging due to the complex physics of deformable materials. Accurate Finite Element Method (FEM) models incur prohibitive computational complexity for closed-loop use. Using a differentiable simulator is an attractive alternative,… ▽ More

    Submitted 16 July, 2022; v1 submitted 25 February, 2022; originally announced February 2022.

  31. arXiv:2111.12066  [pdf, other

    eess.SP cs.LG eess.SY

    Physics Informed Neural Networks for Control Oriented Thermal Modeling of Buildings

    Authors: Gargya Gokhale, Bert Claessens, Chris Develder

    Abstract: This paper presents a data-driven modeling approach for developing control-oriented thermal models of buildings. These models are developed with the objective of reducing energy consumption costs while controlling the indoor temperature of the building within required comfort limits. To combine the interpretability of white/gray box physics models and the expressive power of neural networks, we pr… ▽ More

    Submitted 21 March, 2022; v1 submitted 23 November, 2021; originally announced November 2021.

    Comments: 14 pages, 7 figures

  32. arXiv:2109.09605  [pdf, other

    cs.CL

    JobBERT: Understanding Job Titles through Skills

    Authors: Jens-Joris Decorte, Jeroen Van Hautte, Thomas Demeester, Chris Develder

    Abstract: Job titles form a cornerstone of today's human resources (HR) processes. Within online recruitment, they allow candidates to understand the contents of a vacancy at a glance, while internal HR departments use them to organize and structure many of their processes. As job titles are a compact, convenient, and readily available data source, modeling them with high accuracy can greatly benefit many H… ▽ More

    Submitted 20 September, 2021; originally announced September 2021.

    Comments: Accepted to the International workshop on Fair, Effective And Sustainable Talent management using data science (FEAST) as part of ECML-PKDD 2021

  33. arXiv:2108.13530  [pdf, other

    cs.CL

    Towards Consistent Document-level Entity Linking: Joint Models for Entity Linking and Coreference Resolution

    Authors: Klim Zaporojets, Johannes Deleu, Yiwei Jiang, Thomas Demeester, Chris Develder

    Abstract: We consider the task of document-level entity linking (EL), where it is important to make consistent decisions for entity mentions over the full document jointly. We aim to leverage explicit "connections" among mentions within the document itself: we propose to join the EL task with that of coreference resolution (coref). This is complementary to related works that exploit either (i) implicit docu… ▽ More

    Submitted 1 July, 2022; v1 submitted 30 August, 2021; originally announced August 2021.

  34. arXiv:2107.02286  [pdf, other

    cs.CL

    Injecting Knowledge Base Information into End-to-End Joint Entity and Relation Extraction and Coreference Resolution

    Authors: Severine Verlinden, Klim Zaporojets, Johannes Deleu, Thomas Demeester, Chris Develder

    Abstract: We consider a joint information extraction (IE) model, solving named entity recognition, coreference resolution and relation extraction jointly over the whole document. In particular, we study how to inject information from a knowledge base (KB) in such IE model, based on unsupervised entity linking. The used KB entity representations are learned from either (i) hyperlinked text documents (Wikiped… ▽ More

    Submitted 5 July, 2021; originally announced July 2021.

  35. arXiv:2104.07944  [pdf, other

    cs.CL cs.AI

    A Million Tweets Are Worth a Few Points: Tuning Transformers for Customer Service Tasks

    Authors: Amir Hadifar, Sofie Labat, Véronique Hoste, Chris Develder, Thomas Demeester

    Abstract: In online domain-specific customer service applications, many companies struggle to deploy advanced NLP models successfully, due to the limited availability of and noise in their datasets. While prior research demonstrated the potential of migrating large open-domain pretrained models for domain-specific tasks, the appropriate (pre)training strategies have not yet been rigorously evaluated in such… ▽ More

    Submitted 16 April, 2021; originally announced April 2021.

  36. arXiv:2104.03630  [pdf, other

    cs.CL cs.LG

    A Simple Geometric Method for Cross-Lingual Linguistic Transformations with Pre-trained Autoencoders

    Authors: Maarten De Raedt, Fréderic Godin, Pieter Buteneers, Chris Develder, Thomas Demeester

    Abstract: Powerful sentence encoders trained for multiple languages are on the rise. These systems are capable of embedding a wide range of linguistic properties into vector representations. While explicit probing tasks can be used to verify the presence of specific linguistic properties, it is unclear whether the vector representations can be manipulated to indirectly steer such properties. For efficient l… ▽ More

    Submitted 21 September, 2021; v1 submitted 8 April, 2021; originally announced April 2021.

    Comments: EMNLP 2021 - Short Paper Track

  37. arXiv:2009.12626  [pdf, other

    cs.CL

    DWIE: an entity-centric dataset for multi-task document-level information extraction

    Authors: Klim Zaporojets, Johannes Deleu, Chris Develder, Thomas Demeester

    Abstract: This paper presents DWIE, the 'Deutsche Welle corpus for Information Extraction', a newly created multi-task dataset that combines four main Information Extraction (IE) annotation subtasks: (i) Named Entity Recognition (NER), (ii) Coreference Resolution, (iii) Relation Extraction (RE), and (iv) Entity Linking. DWIE is conceived as an entity-centric dataset that describes interactions and propertie… ▽ More

    Submitted 9 March, 2021; v1 submitted 26 September, 2020; originally announced September 2020.

  38. Solving Arithmetic Word Problems by Scoring Equations with Recursive Neural Networks

    Authors: Klim Zaporojets, Giannis Bekoulis, Johannes Deleu, Thomas Demeester, Chris Develder

    Abstract: Solving arithmetic word problems is a cornerstone task in assessing language understanding and reasoning capabilities in NLP systems. Recent works use automatic extraction and ranking of candidate solution equations providing the answer to arithmetic word problems. In this work, we explore novel approaches to score such candidate solution equations using tree-structured recursive neural network (T… ▽ More

    Submitted 9 March, 2021; v1 submitted 11 September, 2020; originally announced September 2020.

    Journal ref: Expert Systems with Applications, 174 (2021) 114704

  39. Block-wise Dynamic Sparseness

    Authors: Amir Hadifar, Johannes Deleu, Chris Develder, Thomas Demeester

    Abstract: Neural networks have achieved state of the art performance across a wide variety of machine learning tasks, often with large and computation-heavy models. Inducing sparseness as a way to reduce the memory and computation footprint of these models has seen significant research attention in recent years. In this paper, we present a new method for \emph{dynamic sparseness}, whereby part of the comput… ▽ More

    Submitted 14 January, 2020; originally announced January 2020.

  40. arXiv:1903.05396  [pdf, other

    cs.CL

    Sub-event detection from Twitter streams as a sequence labeling problem

    Authors: Giannis Bekoulis, Johannes Deleu, Thomas Demeester, Chris Develder

    Abstract: This paper introduces improved methods for sub-event detection in social media streams, by applying neural sequence models not only on the level of individual posts, but also directly on the stream level. Current approaches to identify sub-events within a given event, such as a goal during a soccer match, essentially do not exploit the sequential nature of social media streams. We address this sho… ▽ More

    Submitted 13 March, 2019; originally announced March 2019.

    Comments: NAACL 2019

  41. arXiv:1809.10679  [pdf, other

    cs.LG cs.AI stat.ML

    Definition and evaluation of model-free coordination of electrical vehicle charging with reinforcement learning

    Authors: Nasrin Sadeghianpourhamami, Johannes Deleu, Chris Develder

    Abstract: Initial DR studies mainly adopt model predictive control and thus require accurate models of the control problem (e.g., a customer behavior model), which are to a large extent uncertain for the EV scenario. Hence, model-free approaches, especially based on reinforcement learning (RL) are an attractive alternative. In this paper, we propose a new Markov decision process (MDP) formulation in the RL… ▽ More

    Submitted 28 November, 2018; v1 submitted 27 September, 2018; originally announced September 2018.

  42. arXiv:1808.08720  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Predefined Sparseness in Recurrent Sequence Models

    Authors: Thomas Demeester, Johannes Deleu, Fréderic Godin, Chris Develder

    Abstract: Inducing sparseness while training neural networks has been shown to yield models with a lower memory footprint but similar effectiveness to dense models. However, sparseness is typically induced starting from a dense model, and thus this advantage does not hold during training. We propose techniques to enforce sparseness upfront in recurrent sequence models for NLP applications, to also benefit t… ▽ More

    Submitted 27 August, 2018; originally announced August 2018.

    Comments: the SIGNLL Conference on Computational Natural Language Learning (CoNLL, 2018)

  43. arXiv:1808.06876  [pdf, other

    cs.CL

    Adversarial training for multi-context joint entity and relation extraction

    Authors: Giannis Bekoulis, Johannes Deleu, Thomas Demeester, Chris Develder

    Abstract: Adversarial training (AT) is a regularization method that can be used to improve the robustness of neural network methods by adding small perturbations in the training data. We show how to use AT for the tasks of entity recognition and relation extraction. In particular, we demonstrate that applying AT to a general purpose baseline model for jointly extracting entities and relations, allows improv… ▽ More

    Submitted 14 January, 2019; v1 submitted 21 August, 2018; originally announced August 2018.

    Comments: EMNLP 2018, code is available at https://github.com/bekou/multihead_joint_entity_relation_extraction

  44. arXiv:1806.09439  [pdf, other

    cs.CL

    Prior Attention for Style-aware Sequence-to-Sequence Models

    Authors: Lucas Sterckx, Johannes Deleu, Chris Develder, Thomas Demeester

    Abstract: We extend sequence-to-sequence models with the possibility to control the characteristics or style of the generated output, via attention that is generated a priori (before decoding) from a latent code vector. After training an initial attention-based sequence-to-sequence model, we use a variational auto-encoder conditioned on representations of input sequences and a latent code vector space to ge… ▽ More

    Submitted 25 June, 2018; originally announced June 2018.

    Comments: 6 pages, 6 figures

  45. Joint entity recognition and relation extraction as a multi-head selection problem

    Authors: Giannis Bekoulis, Johannes Deleu, Thomas Demeester, Chris Develder

    Abstract: State-of-the-art models for joint entity recognition and relation extraction strongly rely on external natural language processing (NLP) tools such as POS (part-of-speech) taggers and dependency parsers. Thus, the performance of such joint models depends on the quality of the features obtained from these NLP tools. However, these features are not always accurate for various languages and contexts.… ▽ More

    Submitted 17 December, 2018; v1 submitted 20 April, 2018; originally announced April 2018.

    Comments: Expert Systems with Applications, code is available at https://github.com/bekou/multihead_joint_entity_relation_extraction

    Journal ref: Expert Systems with Applications, Volume 114, 30 December 2018, Pages 34-45, ISSN 0957-4174

  46. An attentive neural architecture for joint segmentation and parsing and its application to real estate ads

    Authors: Giannis Bekoulis, Johannes Deleu, Thomas Demeester, Chris Develder

    Abstract: In processing human produced text using natural language processing (NLP) techniques, two fundamental subtasks that arise are (i) segmentation of the plain text into meaningful subunits (e.g., entities), and (ii) dependency parsing, to establish relations between subunits. In this paper, we develop a relatively simple and effective neural joint model that performs both segmentation and dependency… ▽ More

    Submitted 19 March, 2018; v1 submitted 27 September, 2017; originally announced September 2017.

    Comments: Preprint - Accepted for publication in Expert Systems with Applications

    Journal ref: Expert Systems with Applications, Volume 102, 15 July 2018, Pages 100-112, ISSN 0957-4174

  47. arXiv:1708.03492  [pdf, other

    cs.CL

    Break it Down for Me: A Study in Automated Lyric Annotation

    Authors: Lucas Sterckx, Jason Naradowsky, Bill Byrne, Thomas Demeester, Chris Develder

    Abstract: Comprehending lyrics, as found in songs and poems, can pose a challenge to human and machine readers alike. This motivates the need for systems that can understand the ambiguity and jargon found in such creative texts, and provide commentary to aid readers in reaching the correct interpretation. We introduce the task of automated lyric annotation (ALA). Like text simplification, a goal of ALA is t… ▽ More

    Submitted 11 August, 2017; originally announced August 2017.

    Comments: To appear in Proceedings of EMNLP 2017

  48. Predicting Relevance based on Assessor Disagreement: Analysis and Practical Applications for Search Evaluation

    Authors: Thomas Demeester, Robin Aly, Djoerd Hiemstra, Dong Nguyen, Chris Develder

    Abstract: Evaluation of search engines relies on assessments of search results for selected test queries, from which we would ideally like to draw conclusions in terms of relevance of the results for general (e.g., future, unknown) users. In practice however, most evaluation scenarios only allow us to conclusively determine the relevance towards the particular assessor that provided the judgments. A factor… ▽ More

    Submitted 23 November, 2015; originally announced November 2015.

    Comments: Accepted for publication in Springer Information Retrieval Journal, special issue on Information Retrieval Evaluation using Test Collections

  49. arXiv:1511.06219  [pdf, other

    cs.CL cs.LG

    Knowledge Base Population using Semantic Label Propagation

    Authors: Lucas Sterckx, Thomas Demeester, Johannes Deleu, Chris Develder

    Abstract: A crucial aspect of a knowledge base population system that extracts new facts from text corpora, is the generation of training data for its relation extractors. In this paper, we present a method that maximizes the effectiveness of newly trained relation extractors at a minimal annotation cost. Manual labeling can be significantly reduced by Distant Supervision, which is a method to construct tra… ▽ More

    Submitted 3 March, 2016; v1 submitted 19 November, 2015; originally announced November 2015.

    Comments: Submitted to Knowledge Based Systems, special issue on Knowledge Bases for Natural Language Processing