Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleJuly 2024
Data harmonization and federated learning for multi-cohort dementia research using the OMOP common data model: A Netherlands consortium of dementia cohorts case study
- Pedro Mateus,
- Justine Moonen,
- Magdalena Beran,
- Eva Jaarsma,
- Sophie M. van der Landen,
- Joost Heuvelink,
- Mahlet Birhanu,
- Alexander G.J. Harms,
- Esther Bron,
- Frank J. Wolters,
- Davy Cats,
- Hailiang Mei,
- Julie Oomens,
- Willemijn Jansen,
- Miranda T. Schram,
- Andre Dekker,
- Inigo Bermejo
Journal of Biomedical Informatics (JOBI), Volume 155, Issue CJul 2024https://doi.org/10.1016/j.jbi.2024.104661Graphical abstractDisplay Omitted
Abstract BackgroundEstablishing collaborations between cohort studies has been fundamental for progress in health research. However, such collaborations are hampered by heterogeneous data representations across cohorts and legal constraints to data ...
- surveyApril 2024
Semantic Data Integration and Querying: A Survey and Challenges
- Maroua Masmoudi,
- Sana Ben Abdallah Ben Lamine,
- Mohamed Hedi Karray,
- Bernard Archimede,
- Hajer Baazaoui Zghal
ACM Computing Surveys (CSUR), Volume 56, Issue 8Article No.: 209, Pages 1–35https://doi.org/10.1145/3653317Digital revolution produces massive, heterogeneous and isolated data. These latter remain underutilized, unsuitable for integrated querying and knowledge discovering. Hence the importance of this survey on data integration which identifies challenging ...
- research-articleApril 2024
AlterEgo: A Dedicated Blockchain Node For Analytics
EdgeSys '24: Proceedings of the 7th International Workshop on Edge Systems, Analytics and NetworkingApril 2024, Pages 7–12https://doi.org/10.1145/3642968.3654814Blockchains today amass terabytes of transaction data that demand efficient and insightful real-time analytics for applications such as smart contract hack detection, price arbitrage on decentralized exchanges, or trending token analysis. Conventional ...
- research-articleJanuary 2024
An efficient hybrid optimization of ETL process in data warehouse of cloud architecture
Journal of Cloud Computing: Advances, Systems and Applications (JOCCASA), Volume 13, Issue 1Jan 2024https://doi.org/10.1186/s13677-023-00571-yAbstractIn big data, analysis data is collected from different sources in various formats, transforming into the aspect of cleansing the data, customization, and loading it into a Data Warehouse. Extracting data in other formats and transforming it to the ...
- research-articleJune 2024
Stack Overflow Data Warehouse System
ICISE '23: Proceedings of the 2023 8th International Conference on Information Systems EngineeringDecember 2023, Pages 108–115https://doi.org/10.1145/3641032.3641057This research addresses the challenge of extracting and analyzing data from Stack Overflow to uncover insights into programming language trends, community contributions, and talent availability. The aim is to provide developers, organizations, and ...
-
- research-articleFebruary 2024
Evaluation of OMOP CDM, i2b2 and ICGC ARGO for supporting data harmonization in a breast cancer use case of a multicentric European AI project
- Santiago Frid,
- Guillem Bracons Cucó,
- Jessyca Gil Rojas,
- Antonio López-Rueda,
- Xavier Pastor Duran,
- Olga Martínez-Sáez,
- Raimundo Lozano-Rubí
Journal of Biomedical Informatics (JOBI), Volume 147, Issue CNov 2023https://doi.org/10.1016/j.jbi.2023.104505Graphical abstractDisplay Omitted
Highlights- Choosing a common data model for health research in cancer is challenging.
- ICGC ARGO’s rigid design of its data model hampers its implementation.
- i2b2 performed very similarly to OMOP CDM on all domains evaluated.
- i2b2′s lack ...
Observational research in cancer poses great challenges regarding adequate data sharing and consolidation based on a homogeneous data semantic base. Common Data Models (CDMs) can help consolidate health data repositories from different ...
- research-articleAugust 2023
Eos and OMOCL: Towards a seamless integration of openEHR records into the OMOP Common Data Model
- Severin Kohler,
- Diego Boscá,
- Florian Kärcher,
- Birger Haarbrandt,
- Manuel Prinz,
- Michael Marschollek,
- Roland Eils
Journal of Biomedical Informatics (JOBI), Volume 144, Issue CAug 2023https://doi.org/10.1016/j.jbi.2023.104437Abstract Background:The reuse of data from electronic health records (EHRs) for research purposes promises to improve the data foundation for clinical trials and may even support to enable them. Nevertheless, EHRs are characterized by both, heterogeneous ...
Graphical abstractDisplay Omitted
- ArticleApril 2023
ChainSync: A Real-Time Multi-chain ETL System for dApp Development
Database Systems for Advanced ApplicationsApr 2023, Pages 707–711https://doi.org/10.1007/978-3-031-30678-5_60AbstractThe increasing number of public blockchains has led to a need for infrastructures and tools for decentralized application (dApp) development. The blockchain Extract-Transform-Load (ETL) process, which involves extracting on-chain data, converting ...
- invited-talkApril 2023
Graph-Inceptor: Towards Extreme Data Ingestion, Massive Graph Creation and Storage
ICPE '23 Companion: Companion of the 2023 ACM/SPEC International Conference on Performance EngineeringApril 2023, Pages 253–254https://doi.org/10.1145/3578245.3585339Graph processing is increasingly popular given the wide range of phenomena represented as graphs (e.g., social media networks, pharmaceutical drug compounds, or fraud networks, among others). The increasing amount of data available requires new ...
- research-articleDecember 2022
Distributed real-time ETL architecture for unstructured big data
Knowledge and Information Systems (KAIS), Volume 64, Issue 12Dec 2022, Pages 3419–3445https://doi.org/10.1007/s10115-022-01757-7AbstractReal-time extract transform load (ETL) is the integral part of increasing demand of faster business decisions targeting large number of modern applications. Multi-source unstructured data stream extraction and transformation using disk data in ...
- ArticleJanuary 2023
Towards a Model-Driven Approach for Big Data Analytics in the Genomics Field
AbstractThe use of techniques such as Next Generation Sequencing has allowed a fast increase in data generation due to the reduction of processing costs. What at the beginning seemed to be an important step forward for the development of new approaches ...
- tutorialSeptember 2022
Training and Deploying Multi-Stage Recommender Systems
RecSys '22: Proceedings of the 16th ACM Conference on Recommender SystemsSeptember 2022, Pages 706–707https://doi.org/10.1145/3523227.3547372Industrial recommender systems are made up of complex pipelines requiring multiple steps including feature engineering and preprocessing, a retrieval model for candidate generation, filtering, a feature store query, a ranking model for scoring, and an ...
- research-articleSeptember 2022
BigData oriented to business decision making: a real case study in constructel
Computational & Mathematical Organization Theory (CMOT), Volume 28, Issue 3Sep 2022, Pages 271–291https://doi.org/10.1007/s10588-021-09330-3AbstractAnalyze and understand how to combine data warehouse with business intelligence tools, and other useful information or tools to visualize KPIs are critical factors in achieving the goal of raising competencies and business results of an ...
- research-articleAugust 2022
Development of a generalizable multi-site and multi-modality clinical data cloud infrastructure for pediatric patient care
- Andrew Hornback,
- Wenqi Shi,
- Felipe O. Giuste,
- Yuanda Zhu,
- Ashley M. Carpenter,
- Coleman Hilton,
- Vinieth N. Bijanki,
- Hiram Stahl,
- Gary S. Gottesman,
- Chad Purnell,
- Henry J. Iwinski,
- J. Michael Wattenbarger,
- May D. Wang
BCB '22: Proceedings of the 13th ACM International Conference on Bioinformatics, Computational Biology and Health InformaticsAugust 2022, Article No.: 23, Pages 1–10https://doi.org/10.1145/3535508.3545565World-renowned pediatric patient care in scoliosis, craniofacial, orthopedic, and other life-altering conditions is provided at the international Shriners Children's hospital system. The impact of scoliosis can be extreme with significant curvature of ...
- ArticleJune 2022
Enabling Knowledge Extraction on Bike Sharing Systems Throughout Open Data
HCI in Mobility, Transport, and Automotive SystemsJun 2022, Pages 570–585https://doi.org/10.1007/978-3-031-04987-3_39AbstractBike Sharing Systems (BSS) have changed urban mobility patterns. Their study as part of the overall transport system in cities is attracting growing attention in recent years. Nevertheless, some deficiencies such as the lack of convention in data ...
- research-articleMay 2022
An embedding driven approach to automatically detect identifiers and references in document stores
Data & Knowledge Engineering (DAKE), Volume 139, Issue CMay 2022https://doi.org/10.1016/j.datak.2022.102003AbstractNoSQL stores have become ubiquitous since they offer a new cost-effective and schema-free system. Although NoSQL systems are widely accepted today, Business Intelligence & Analytics (BI&A) wields relational data sources. Exploiting ...
- research-articleMay 2022
FLASc: a formal algebra for labeled property graph schema
Automated Software Engineering (KLU-AUSE), Volume 29, Issue 1May 2022https://doi.org/10.1007/s10515-022-00336-yAbstractContemporary labeled property graph databases are either schema-less or schema-optional to support frequent changes in the structure of data found in domains requiring high flexibility. However, the lack of structure impacts data transformation ...
- research-articleMay 2022
An ETL-like platform for the processing of mobility data
- Maxime Masson,
- Cécile Cayèré,
- Marie-Noëlle Bessagnet,
- Christian Sallaberry,
- Philippe Roose,
- Cyril Faucher
SAC '22: Proceedings of the 37th ACM/SIGAPP Symposium on Applied ComputingApril 2022, Pages 547–555https://doi.org/10.1145/3477314.3507057In this article, we introduce a novel platform dedicated to the extraction, transformation and visualization of mobility data. This platform was developed in the framework of a French regional project (DA3T project) aiming at improving the management ...
- articleMarch 2022
Healthcare Data Analytics Using Power BI
International Journal of Software Innovation (IJSI), Volume 10, Issue 1Sep 2022, Pages 1–10https://doi.org/10.4018/IJSI.293267Innovations in computer technologies have revolutionized attention in recent years. Data analytics has emerged as a promising tool for determination problems in various health care connected disciplines. The effective utilization of knowledge mining ...
- research-articleMarch 2022
Dynamic multi-variant relational scheme-based intelligent ETL framework for healthcare management
Soft Computing - A Fusion of Foundations, Methodologies and Applications (SOFC), Volume 27, Issue 1Jan 2023, Pages 605–614https://doi.org/10.1007/s00500-022-06938-8AbstractThe growth of information technology has opened the gate for the organizations to maintain their data in various forms and at various volumes. This increases the volume and dimension of data being maintained. However, they store their data in ...