Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- ArticleAugust 2023
DAT@Z21: A Comprehensive Multimodal Dataset for Rumor Classification in Microblogs
AbstractMicroblogs have become popular media platforms for reporting and propagating news. However, they also enable the proliferation of misleading information that can cause serious damage. Thus, many efforts have been taken to defeat rumors ...
- articleAugust 2023
Diversity, Equity and Inclusion Activities in Database Conferences: A 2022 Report
- Sihem Amer-Yahia,
- Divyakant Agrawal,
- Yael Amsterdamer,
- Sourav S. Bhowmick,
- Angela Bonifati,
- Renata Borovica-Gajic,
- Jesús Camacho-Rodríguez,
- Barbara Catania,
- Panos K. Chrysanthis,
- Carlo Curino,
- Jérôme Darmont,
- Gillian Dobbie,
- Amr El Abbadi,
- Avrilia Floratou,
- Juliana Freire,
- Alekh Jindal,
- Vana Kalogeraki,
- Sujaya Maiyya,
- Alexandra Meliou,
- Madhulika Mohanty,
- Behrooz Omidvar-Tehrani,
- Fatma Özcan,
- Liat Peterfreund,
- Wenny Rahayu,
- Shazia Sadiq,
- Sana Sellami,
- Utku Sirin,
- Wang-Chiew Tan,
- Bhavani Thuraisingham,
- Yuanyuan Tian,
- Pinar Tözün,
- Genoveva Vargas-Solar,
- Neeraja Yadwadkar,
- Victor Zakhary,
- Meihui Zhang
The Diversity, Equity and Inclusion (DEI) initiative started as the Diversity/Inclusion initiative in 2020 [4]. The current report summarizes our activities in 2022. Our responsibility as a community is to ensure that attendees of DB conferences feel ...
- research-articleMay 2023
DLBench+: A benchmark for quantitative and qualitative data lake assessment
AbstractIn the last few years, the concept of data lake has become trendy for data storage and analysis. Thus, several approaches have been proposed to build data lake systems. However, such proposals are difficult to evaluate as there are no commonly ...
- research-articleApril 2023
A survey on implementations of homomorphic encryption schemes
The Journal of Supercomputing (JSCO), Volume 79, Issue 13Pages 15098–15139https://doi.org/10.1007/s11227-023-05233-zAbstractWith the increased need for data confidentiality in various applications of our daily life, homomorphic encryption (HE) has emerged as a promising cryptographic topic. HE enables to perform computations directly on encrypted data (ciphertexts) ...
- ArticleSeptember 2022
Dimensional Data KNN-Based Imputation
Advances in Databases and Information SystemsPages 315–329https://doi.org/10.1007/978-3-031-15740-0_23AbstractData Warehouses (DWs) are core components of Business Intelligence (BI). Missing data in DWs have a great impact on data analyses. Therefore, missing data need to be completed. Unlike other existing data imputation methods mainly adapted for facts,...
-
- ArticleAugust 2022
Automatic Machine Learning-Based OLAP Measure Detection for Tabular Data
AbstractNowadays, it is difficult for companies and organisations without Business Intelligence (BI) experts to carry out data analyses. Existing automatic data warehouse design methods cannot treat with tabular data commonly defined without schema. ...
- research-articleAugust 2022
Rumor Classification through a Multimodal Fusion Framework and Ensemble Learning
Information Systems Frontiers (KLU-ISFI), Volume 25, Issue 5Pages 1795–1810https://doi.org/10.1007/s10796-022-10315-zAbstractThe proliferation of rumors on social media has become a major concern due to its ability to create a devastating impact. Manually assessing the veracity of social media messages is a very time-consuming task that can be much helped by machine ...
- articleJuly 2022
Diversity and Inclusion Activities in Database Conferences: A 2021 Report
- Sihem Amer-Yahia,
- Yael Amsterdamer,
- Sourav S. Bhowmick,
- Angela Bonifati,
- Philippe Bonnet,
- Renata Borovica-Gajic,
- Barbara Catania,
- Tania Cerquitelli,
- Silvia Chiusano,
- Panos K. Chrysanthis,
- Carlo Curino,
- Jérôme Darmont,
- Amr El Abbadi,
- Avrilia Floratou,
- Juliana Freire,
- Alekh Jindal,
- Vana Kalogeraki,
- Georgia Koutrika,
- Arun Kumar,
- Sujaya Maiyya,
- Alexandra Meliou,
- Madhulika Mohanty,
- Felix Naumann,
- Nele Sina Noack,
- Fatma Özcan,
- Liat Peterfreund,
- Wenny Rahayu,
- Wang-Chiew Tan,
- Yuanyuan Tian,
- Pinar Tözün,
- Genoveva Vargas-Solar,
- Neeraja Yadwadkar,
- Meihui Zhang
Diversity and Inclusion (D&I) are core to fostering innovative thinking. Existing theories demonstrate that to facilitate inclusion, multiple types of exclusionary dynamics, such as self-segregation, communication apprehension, and stereotyping and ...
- ArticleSeptember 2021
Benchmarking Data Lakes Featuring Structured and Unstructured Data with DLBench
AbstractIn the last few years, the concept of data lake has become trendy for data storage and analysis. Thus, several approaches have been proposed to build data lake systems. However, these proposals are difficult to evaluate as there are no commonly ...
- ArticleSeptember 2021
Internal Data Imputation in Data Warehouse Dimensions
AbstractMissing data occur commonly in data warehouses and may generate data usefulness problems. Thus, it is essential to address missing data to carry out a better analysis. There exists data imputation methods for missing data in fact tables, but not ...
- ArticleSeptember 2021
Calling to CNN-LSTM for Rumor Detection: A Deep Multi-channel Model for Message Veracity Classification in Microblogs
Machine Learning and Knowledge Discovery in Databases. Applied Data Science TrackPages 497–513https://doi.org/10.1007/978-3-030-86517-7_31AbstractReputed by their low-cost, easy-access, real-time and valuable information, social media also wildly spread unverified or fake news. Rumors can notably cause severe damage on individuals and the society. Therefore, rumor detection on social media ...
- ArticleAugust 2021
Joint Management and Analysis of Textual Documents and Tabular Data Within the AUDAL Data Lake
Advances in Databases and Information SystemsPages 88–101https://doi.org/10.1007/978-3-030-82472-3_8AbstractIn 2010, the concept of data lake emerged as an alternative to data warehouses for big data management. Data lakes follow a schema-on-read approach to provide rich and flexible analyses. However, although trendy in both the industry and academia, ...
- ArticleAugust 2021
MONITOR: A Multimodal Fusion Framework to Assess Message Veracity in Social Networks
AbstractUsers of social networks tend to post and share content with little restraint. Hence, rumors and fake news can quickly spread on a huge scale. This may pose a threat to the credibility of social media and can cause serious consequences in real ...
- research-articleJuly 2021
The Forgotten Document-Oriented Database Management Systems: An Overview and Benchmark of Native XML DODBMSes in Comparison with JSON DODBMSes
AbstractIn the current context of Big Data, a multitude of new NoSQL solutions for storing, managing, and extracting information and patterns from semi-structured data have been proposed and implemented. These solutions were developed to ...
- research-articleSeptember 2021
An Automatic Schema-Instance Approach for Merging Multidimensional Data Warehouses
IDEAS '21: Proceedings of the 25th International Database Engineering & Applications SymposiumPages 232–241https://doi.org/10.1145/3472163.3472268Using data warehouses to analyse multidimensional data is a significant task in company decision-making. The need for analyzing data stored in different data warehouses generates the requirement of merging them into one integrated data warehouse. The ...
- research-articleSeptember 2021
ArchaeoDAL: A Data Lake for Archaeological Data Management and Analytics
IDEAS '21: Proceedings of the 25th International Database Engineering & Applications SymposiumPages 252–262https://doi.org/10.1145/3472163.3472266With new emerging technologies, such as satellites and drones, archaeologists collect data over large areas. However, it becomes difficult to process such data in time. Archaeological data also have many different formats (images, texts, sensor data) ...
- research-articleFebruary 2021
On data lake architectures and metadata management
Journal of Intelligent Information Systems (JIIS), Volume 56, Issue 1Pages 97–120https://doi.org/10.1007/s10844-020-00608-7AbstractOver the past two decades, we have witnessed an exponential increase of data production in the world. So-called big data generally come from transactional systems, and even more so from the Internet of Things and social media. They are mainly ...
- research-articleFebruary 2021
TextBenDS: a Generic Textual Data Benchmark for Distributed Systems
Information Systems Frontiers (KLU-ISFI), Volume 23, Issue 1Pages 81–100https://doi.org/10.1007/s10796-020-09999-yAbstractExtracting top-k keywords and documents using weighting schemes are popular techniques employed in text mining and machine learning for different analysis and retrieval tasks. The weights are usually computed in the data preprocessing step, as ...