No abstract available.
Front Matter
Front Matter
Reference Architecture for Running Large Scale Data Integration Experiments
This paper contributes a reference architecture of a reusable infrastructure for scientific experiments on data processing and data integration. The architecture is based on containerization and is integrated with an external machine learning ...
Subgroup Discovery with Consecutive Erosion on Discontinuous Intervals
The subgroup discovery problem aims to identify a subset of objects which exhibit interesting characteristics according to a quality measure defined on a target attribute. In this paper, we propose a new optimized approach, called SD-CEDI, which ...
Fast SQL/Row Pattern Recognition Query Processing Using Parallel Primitives on GPUs
SQL/Row Pattern Recognition (SQL/RPR), a row matching query processing for sequence data stored in a database, has been standardized in SQL:2016. So far, many studies have focused on developing technology to perform SQL/RPR for large-scale ...
Front Matter
Augmented Lineage: Traceability of Data Analysis Including Complex UDFs
Data lineage allows information to be traced to its origin in data analysis by showing how the results were derived. Although many methods have been proposed to identify the source data from which the analysis results are derived, analysis is ...
Neural Ordinary Differential Equations for the Regression of Macroeconomics Data Under the Green Solow Model
We are interested in the regression of data to a parameterised system of differential equations formalising a dynamical system. We study the case of the Green Solow model, a neoclassical economics model for sustainable growth. Faced with the ...
A Quantum-Inspired Neural Network Model for Predictive BPaaS Management
Nowadays, companies are more and more adopting cloud technologies in the management of their business processes rising, then, the Business Process as a Service (BPaaS) model. In order to guarantee the consistency of the provisioned BPaaS, cloud ...
Improving Billboard Advertising Revenue Using Transactional Modeling and Pattern Mining
Billboard advertisement is among the dominant modes of outdoor advertisements. The billboard operator has an opportunity to improve its revenue by satisfying the advertising demands of an increased number of clients by means of exploiting the user ...
Sarcasm Detection for Japanese Text Using BERT and Emoji
In this paper, we propose methods to detect sarcasm from Japanese text on Twitter by using the BERT language model and analyzing emoji as well as text. After constructing a Japanese Twitter dataset, we extract feature vector for both text and ...
Sigmalaw PBSA - A Deep Learning Model for Aspect-Based Sentiment Analysis for the Legal Domain
- Isanka Rajapaksha,
- Chanika Ruchini Mudalige,
- Dilini Karunarathna,
- Nisansa de Silva,
- Amal Shehan Perera,
- Gathika Ratnayaka
Legal information retrieval holds a significant importance to lawyers and legal professionals. Its significance has grown as a result of the vast and rapidly increasing amount of legal documents available via electronic means. Legal documents, ...
BERT-Based Sentiment Analysis: A Software Engineering Perspective
Sentiment analysis can provide a suitable lead for the tools used in software engineering along with the API recommendation systems and relevant libraries to be used. In this context, the existing tools like SentiCR, SentiStrength-SE, etc. ...
A Stochastic Block Model Based Approach to Detect Outliers in Networks
Finding outliers in networks is a central task in different application domains. Here, we exploit the stochastic block model framework to study the network from a generative point of view and design a score able to highlight those nodes whose ...
Medical-Based Text Classification Using FastText Features and CNN-LSTM Model
Text classification is a fundamental task that is often carried out upstream of natural language processing (NLP) techniques. Therefore, this task plays an essential role in information retrieval and extraction, and has a wide range of ...
Front Matter
EHUCM: An Efficient Algorithm for Mining High Utility Co-location Patterns from Spatial Datasets with Feature-specific Utilities
High utility co-location pattern mining is still computationally expensive in terms of both runtime and memory consumption. In this paper, an efficient high utility co-location pattern mining algorithm, named EHUCM, is proposed to address this ...
BERT-Based Multi-Task Learning for Aspect-Based Opinion Mining
Aspect-Based Opinion Mining (ABOM) mainly focuses on mining the aspect terms (product’s features) and related opinion polarities (e.g., Positive, Negative, and Neutral) from user’s reviews. The most prominent neural network-based methods to ...
GPU-Accelerated Vertex Orbit Counting for 5-Vertex Subgraphs
In this paper, we propose a parallel 5-vertex orbit counting method using GPUs. Given a graph and a set of subgraph patterns, the vertex orbit counting problem is to output, for each vertex in the graph, the number of subgraph patterns that ...
Front Matter
Efficient Discovery of Partial Periodic-Frequent Patterns in Temporal Databases
- So Nakamura,
- R. Uday Kiran,
- P. Likhitha,
- P. Ravikumar,
- Yutaka Watanobe,
- Minh Son Dao,
- Koji Zettsu,
- Masashi Toyoda
Partial periodic-frequent pattern mining is an important knowledge discovery technique in data mining. It involves identifying all frequent patterns that have exhibited partial periodic behavior in a temporal database. The following two ...
Database Framework for Supporting Retention Policies
Compliance with data retention laws and legislation is an important aspect of data management. As new laws governing personal data management are introduced (e.g., California Consumer Privacy Act enacted in 2020) and a greater emphasis is placed ...
Internal Data Imputation in Data Warehouse Dimensions
Missing data occur commonly in data warehouses and may generate data usefulness problems. Thus, it is essential to address missing data to carry out a better analysis. There exists data imputation methods for missing data in fact tables, but not ...
Purging Data from Backups by Encryption
Data retention laws establish rules intended to protect privacy. These define both retention durations (how long data must be kept) and purging deadlines (when the data must be destroyed in storage). To comply with the laws and to minimize ...