Risk modelling and multi-objective optimization problems have been at the center of attention for supply chain managers. In this paper, we introduce a dataset for risk modelling in sophisticated supply chain networks based on formal mathematical models. We discuss the methodology and simulation tools used to synthesize the dataset. Additionally, the underlying mathematical models are discussed in granular detail, along with directions for conducting statistical analyses or building neural machine learning models on the data. The simulation is performed using MATLAB™ Simulink, and the models are illustrated as well.
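The dataset and Simulink models themselves are not reproduced here; as a purely illustrative sketch of the kind of multi-objective cost-versus-risk analysis such a dataset is intended to support, the following Python snippet generates synthetic scenario data (all column names, distributions, and weights are hypothetical) and ranks scenarios with a simple weighted-sum scalarization.

```python
# Hypothetical, synthetic illustration only; not the paper's dataset or Simulink models.
import numpy as np
import pandas as pd

rng = np.random.default_rng(seed=0)
n = 1000  # number of simulated supplier-route scenarios

df = pd.DataFrame({
    "lead_time_days":  rng.gamma(shape=4.0, scale=2.5, size=n),   # delivery delay
    "unit_cost":       rng.normal(loc=10.0, scale=1.5, size=n),   # procurement cost
    "disruption_prob": rng.beta(a=2.0, b=20.0, size=n),           # chance of supply failure
})

# Weighted-sum scalarization of a two-objective (cost vs. risk) trade-off;
# tracing the full Pareto front would vary the weights instead of fixing them.
w_cost, w_risk = 0.6, 0.4
cost_norm = (df["unit_cost"] - df["unit_cost"].min()) / (df["unit_cost"].max() - df["unit_cost"].min())
risk = df["disruption_prob"] * df["lead_time_days"]
df["score"] = w_cost * cost_norm + w_risk * risk / risk.max()

print(df.nsmallest(5, "score"))  # most attractive scenarios under these weights
```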
The Z-value is an attempt to estimate the statistical significance of a Smith-Waterman dynamic alignment score (SW-score) through the use of a Monte-Carlo process. It partly reduces the bias induced by the composition and length of the sequences. This paper is not a theoretical study of the distribution of SW-scores and Z-values. Rather, it presents a statistical analysis of Z-values on large datasets of protein sequences, leading to a law of probability that the experimental Z-values follow. First, we determine the relationships between the computed Z-value, an estimate of its variance, and the number of randomizations in the Monte-Carlo process. Next, we illustrate that Z-values are less correlated with sequence lengths than SW-scores. We then show that pairwise alignments, performed on 'quasi-real' sequences (i.e., randomly shuffled sequences of the same length and amino acid composition as the real ones), lead to Z-value distributions that statistically fit the extreme val...
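As a minimal sketch of the Monte-Carlo idea behind the Z-value (not the paper's exact protocol, and with a simplified match/mismatch/gap scoring scheme standing in for a substitution-matrix-based SW-score), one can shuffle one sequence repeatedly and measure how far the true score lies above the shuffled-score distribution:

```python
# Simplified sketch: linear-gap Smith-Waterman score plus Monte-Carlo Z-value.
import random
import statistics

def sw_score(a, b, match=2, mismatch=-1, gap=-1):
    """Smith-Waterman local alignment score with linear gap penalties."""
    prev = [0] * (len(b) + 1)
    best = 0
    for ca in a:
        cur = [0]
        for j, cb in enumerate(b, start=1):
            diag = prev[j - 1] + (match if ca == cb else mismatch)
            cur.append(max(0, diag, prev[j] + gap, cur[j - 1] + gap))
            best = max(best, cur[j])
        prev = cur
    return best

def z_value(a, b, n_shuffles=100, seed=0):
    """How many standard deviations the true score lies above scores against
    shuffled versions of one sequence (same length and composition)."""
    rng = random.Random(seed)
    true_score = sw_score(a, b)
    shuffled_scores = []
    for _ in range(n_shuffles):
        perm = list(b)
        rng.shuffle(perm)                      # preserves length and composition
        shuffled_scores.append(sw_score(a, "".join(perm)))
    mu = statistics.mean(shuffled_scores)
    sigma = statistics.stdev(shuffled_scores)
    return (true_score - mu) / sigma

print(z_value("HEAGAWGHEE", "PAWHEAE"))
```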
Classification and regression trees are becoming increasingly popular for partitioning data and identifying local structure in small and large datasets. Classification trees include those models in which the dependent variable (the predicted variable) is categorical. Regression trees include those in which it is continuous. This paper discusses pitfalls in the use of these methods and highlights where they are
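A quick way to see the categorical-versus-continuous distinction is to fit both tree types with scikit-learn; the iris data and depth limit below are illustrative choices, not the paper's, and capping the depth guards against one of the common pitfalls (overfitting):

```python
# Illustrative only: iris data and model settings are not taken from the paper.
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier, DecisionTreeRegressor

X, y = load_iris(return_X_y=True)

# Classification tree: the dependent variable is a categorical class label.
clf = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X, y)
print("predicted class:", clf.predict(X[:1]))

# Regression tree: the dependent variable is continuous (here, petal width
# predicted from the other three measurements).
reg = DecisionTreeRegressor(max_depth=3, random_state=0).fit(X[:, :3], X[:, 3])
print("predicted petal width:", reg.predict(X[:1, :3]))
```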
Outlier (or anomaly) detection is an important problem in many domains, including fraud detection, risk analysis, network intrusion and medical diagnosis, and the discovery of significant outliers is becoming an integral aspect of data mining. This paper presents CURIO, a novel algorithm that uses quantisation and implied distance metrics to achieve running time linear in the number of objects while requiring only two sequential scans of disk-resident datasets. CURIO includes a novel direct quantisation technique and the ...
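The CURIO algorithm itself is not reproduced here, but the following toy sketch conveys the general quantisation idea behind grid-based outlier detection: count points per grid cell in one pass, then flag points in sparsely populated cells in a second pass (cell width and threshold are arbitrary illustrative values):

```python
# Sketch of the general quantisation idea behind grid-based outlier detection,
# not CURIO itself; cell width and density threshold are illustrative choices.
from collections import Counter
import numpy as np

def grid_outliers(X, cell_width=1.0, min_count=3):
    """Mark points whose grid cell holds fewer than min_count points."""
    cells = [tuple(c) for c in np.floor(X / cell_width).astype(int)]
    counts = Counter(cells)                                   # pass 1: cell occupancy
    return np.array([counts[c] < min_count for c in cells])   # pass 2: flag sparse cells

rng = np.random.default_rng(1)
X = np.vstack([rng.normal(0, 1, size=(200, 2)),   # dense cluster
               [[8.0, 8.0]]])                      # isolated point
print(np.where(grid_outliers(X))[0])               # the isolated point (index 200) is flagged
```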
In two studies, this thesis depicts the relationship between minority group status in the United States, perceived discrimination, and coping with stress. Past literature on coping and its types – problem-focused versus emotion-focused – is inconsistent in terms of differences between minority status groups and majority groups. It remains unknown whether or why Black Americans and lesbian or gay Americans may demonstrate coping patterns that differ from White Americans and heterosexual Americans, respectively. What is altogether absent from the literature is the possible mediating factor of perceived discrimination experienced by these minority groups. That is, differences in internal, stable coping processes that manage stress may have been molded by one’s experience with discrimination. Study 1 examines the relationship between race (Black versus White) and coping, mediated by perceived discrimination. Study 2 examines the relationship between sexual orientation (lesbian or gay versus heterosexual) and coping, mediated by perceived discrimination. Both studies confirm the thesis that minority group members exhibit maladaptive, emotion-focused coping more than majority group members – but that this difference is explained by the minority group members’ perceived discrimination. Historical and political relevance, social implications, and possible limitations in design and interpretation are discussed.
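As a rough illustration of the mediation structure the two studies test (group membership, perceived discrimination as mediator, emotion-focused coping as outcome), the following sketch estimates indirect and direct effects on synthetic data; the variable names and effect sizes are hypothetical and are not the thesis's data:

```python
# Illustrative mediation sketch on synthetic data; not the thesis's measures or results.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)
n = 500
group = rng.integers(0, 2, size=n)                          # 1 = minority group member
discrim = 0.8 * group + rng.normal(size=n)                  # perceived discrimination (mediator)
coping = 0.5 * discrim + 0.1 * group + rng.normal(size=n)   # emotion-focused coping score

# Path a: group -> mediator.
a = sm.OLS(discrim, sm.add_constant(group)).fit().params[1]
# Path b and direct effect c': mediator and group -> outcome.
model = sm.OLS(coping, sm.add_constant(np.column_stack([discrim, group]))).fit()
b, c_prime = model.params[1], model.params[2]

print(f"indirect (mediated) effect a*b = {a*b:.3f}, direct effect c' = {c_prime:.3f}")
```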
Clustering is a division of data into groups of similar objects. K-means has been used in much clustering work because of the simplicity of the algorithm. Our main effort is to parallelize the k-means clustering algorithm. The parallel version is implemented based on the inherent parallelism of the Distance Calculation and Centroid Update phases. The parallel K-means algorithm is designed so that each of the P participating nodes is responsible for handling n/P data points. We run the program on a Linux cluster with a maximum of eight nodes using a message-passing programming model. We evaluated performance based on the percentage of correct answers and the speed-up obtained. The outcome shows that our parallel K-means program performs relatively well on large datasets.
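A minimal sketch of the data-parallel scheme described above, using mpi4py as an assumed stand-in for the message-passing implementation (the paper's code is not reproduced): each of the P ranks owns n/P points, computes assignments locally in the Distance Calculation phase, and the Centroid Update phase is a global reduction.

```python
# Sketch only; initialisation and convergence testing are simplified.
# Run with e.g. `mpiexec -n 4 python parallel_kmeans.py`.
import numpy as np
from mpi4py import MPI

comm = MPI.COMM_WORLD
rank, P = comm.Get_rank(), comm.Get_size()

k, dim, n_local, iters = 3, 2, 1000, 20
rng = np.random.default_rng(rank)              # each rank holds its own n/P points
X_local = rng.normal(size=(n_local, dim))

centroids = np.empty((k, dim))
if rank == 0:
    centroids = rng.choice(X_local, size=k, replace=False)
comm.Bcast(centroids, root=0)                  # everyone starts from the same centroids

for _ in range(iters):
    # Distance Calculation phase (local): assign each point to its nearest centroid.
    d = np.linalg.norm(X_local[:, None, :] - centroids[None, :, :], axis=2)
    labels = d.argmin(axis=1)

    # Centroid Update phase (global): sum partial statistics across all ranks.
    sums = np.zeros((k, dim))
    counts = np.zeros(k)
    for j in range(k):
        sums[j] = X_local[labels == j].sum(axis=0)
        counts[j] = (labels == j).sum()
    comm.Allreduce(MPI.IN_PLACE, sums, op=MPI.SUM)
    comm.Allreduce(MPI.IN_PLACE, counts, op=MPI.SUM)
    centroids = sums / np.maximum(counts, 1)[:, None]

if rank == 0:
    print(centroids)
```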
The ability of Minkowski Functionals to characterize local structure in different biological tissue types has been demonstrated in a variety of medical image processing tasks. We introduce anisotropic Minkowski Functionals (AMFs) as a novel variant that captures the inherent anisotropy of the underlying gray-level structures. To quantify the anisotropy characterized by our approach, we further introduce a method to compute a quantitative measure motivated by a technique utilized in MR diffusion tensor imaging, namely fractional anisotropy. We showcase the applicability of our method in the research context of characterizing the local structure properties of trabecular bone micro-architecture in the proximal femur as visualized on multi-detector CT. To this end, AMFs were computed locally for each pixel of ROIs extracted from the head, neck and trochanter regions. Fractional anisotropy was then used to quantify the local anisotropy of the trabecular structures found in these ROIs and to compare its distribution in different anatomical regions. Our results suggest a significantly greater concentration of anisotropic trabecular structures in the head and neck regions when compared to the trochanter region (p < 10⁻⁴). We also evaluated the ability of such AMFs to predict bone strength in the femoral head of proximal femur specimens obtained from 50 donors. Our results suggest that such AMFs, when used in conjunction with multi-regression models, can outperform more conventional features such as BMD in predicting failure load. We conclude that such anisotropic Minkowski Functionals can capture valuable information regarding directional attributes of local structure, which may be useful in a wide scope of biomedical imaging applications.
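The fractional anisotropy measure borrowed from diffusion tensor imaging has a standard closed form in terms of the eigenvalues of a local tensor; the following sketch (with synthetic tensors, not data from the paper) shows the computation:

```python
# Standard fractional anisotropy formula applied to synthetic 3x3 tensors;
# the example tensors are not data from the paper.
import numpy as np

def fractional_anisotropy(tensor):
    """FA = sqrt(3/2) * ||lambda - mean(lambda)|| / ||lambda|| for eigenvalues lambda."""
    lam = np.linalg.eigvalsh(tensor)           # eigenvalues of a symmetric tensor
    return np.sqrt(1.5) * np.linalg.norm(lam - lam.mean()) / np.linalg.norm(lam)

isotropic = np.diag([1.0, 1.0, 1.0])           # no preferred direction -> FA = 0
elongated = np.diag([5.0, 1.0, 1.0])           # one dominant direction -> FA closer to 1
print(fractional_anisotropy(isotropic), fractional_anisotropy(elongated))
```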
In a world where massive amounts of data are recorded on a large scale, we need data mining technologies to gain knowledge from the data in a reasonable time. The Top Down Induction of Decision Trees (TDIDT) algorithm is a very widely used technique for predicting the classification of newly recorded data. However, alternative techniques have been developed that often produce better rules but do not scale well on large datasets. Such an alternative to TDIDT is the PrismTCS algorithm. PrismTCS performs particularly well on noisy data but ...
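As a minimal illustration of the greedy, entropy-based split selection at the heart of TDIDT-style tree induction (PrismTCS itself induces rules directly and is not reproduced here), the toy data and attribute names below are invented:

```python
# Toy data and attribute names are invented; this shows TDIDT's split criterion only.
import math
from collections import Counter

def entropy(labels):
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def information_gain(rows, labels, attr):
    """Reduction in class entropy obtained by partitioning the rows on attribute `attr`."""
    splits = {}
    for row, y in zip(rows, labels):
        splits.setdefault(row[attr], []).append(y)
    remainder = sum(len(part) / len(labels) * entropy(part) for part in splits.values())
    return entropy(labels) - remainder

rows = [{"outlook": "sunny", "windy": False}, {"outlook": "sunny", "windy": True},
        {"outlook": "rain",  "windy": True},  {"outlook": "rain",  "windy": False}]
labels = ["no", "no", "yes", "yes"]

best = max(["outlook", "windy"], key=lambda a: information_gain(rows, labels, a))
print("split on:", best)   # "outlook" separates the classes perfectly here
```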
The efficiency of frequent itemset mining algorithms is determined mainly by three factors: the way candidates are generated, the data structure that is used, and the implementation details. Most papers focus on the first factor, some describe the underlying data structures, but implementation details are almost always neglected. In this paper we show that the effect of implementation can be more important than the selection of the algorithm. Ideas that seem to be quite promising may turn out to be ineffective if we ...
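To make concrete where the three factors above show up, here is a toy Apriori-style level-wise miner; real implementations differ precisely in the candidate-generation, data-structure, and counting details this sketch glosses over:

```python
# Toy Apriori-style level-wise miner, included only to make the three factors concrete;
# not an efficient or production implementation.
from collections import defaultdict

def frequent_itemsets(transactions, min_support):
    transactions = [frozenset(t) for t in transactions]
    counts = defaultdict(int)
    for t in transactions:                     # L1: frequent single items
        for item in t:
            counts[frozenset([item])] += 1
    level = {s for s, c in counts.items() if c >= min_support}
    result = {s: counts[s] for s in level}

    k = 2
    while level:
        # Candidate generation: join frequent (k-1)-itemsets.
        candidates = {a | b for a in level for b in level if len(a | b) == k}
        counts = defaultdict(int)
        for t in transactions:                 # counting pass (the data-structure-sensitive step)
            for c in candidates:
                if c <= t:
                    counts[c] += 1
        level = {s for s, c in counts.items() if c >= min_support}
        result.update((s, counts[s]) for s in level)
        k += 1
    return result

txns = [{"a", "b", "c"}, {"a", "b"}, {"a", "c"}, {"b", "c"}, {"a", "b", "c"}]
print(frequent_itemsets(txns, min_support=3))
```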
Nowadays, Web sites tend to be more and more social: users can upload any kind of information to collaborative platforms and can express their opinions about the content they enjoyed through textual feedback or reviews. These platforms allow users to annotate resources they like with freely chosen keywords (called tags). The main advantage of these tools is that they perfectly fit user needs, since the use of tags allows organizing the information in a way that closely follows the user's mental model, making ...
When applying multivariate analysis techniques in information systems and social science disciplines, such as management information systems (MIS) and marketing, the assumption that the empirical data originate from a single homogeneous population is often unrealistic. When applying a causal modeling approach, such as partial least squares (PLS) path modeling, segmentation is a key issue in coping with the problem of heterogeneity in estimated cause-and-effect relationships. This chapter presents a new PLS path modeling approach which classifies units on the basis of the heterogeneity of the estimates in the inner model. If unobserved heterogeneity significantly affects the estimated path model relationships at the aggregate data level, the methodology allows homogeneous groups of observations to be created that exhibit distinctive path model estimates. The approach thus provides differentiated analytical outcomes that permit more precise interpretations of each segment formed. An application to a large dataset in an example of the American Customer Satisfaction Index (ACSI) substantiates the methodology's effectiveness in evaluating PLS path modeling results.
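The chapter's actual mixture-based PLS procedure is not reproduced here; as a much-simplified stand-in for the response-based segmentation idea, the following sketch fits a pooled regression, then alternates between re-estimating one regression per segment and reassigning observations to the segment whose model explains them best (all data are synthetic):

```python
# Much-simplified stand-in for response-based segmentation; not the chapter's PLS procedure.
import numpy as np
from sklearn.linear_model import LinearRegression

rng = np.random.default_rng(0)
n = 400
x = rng.normal(size=(n, 1))
true_seg = rng.integers(0, 2, size=n)                  # unobserved heterogeneity
y = np.where(true_seg == 0, 0.2, 1.5) * x[:, 0] + rng.normal(scale=0.3, size=n)

pooled = LinearRegression().fit(x, y)
print("pooled slope:", round(pooled.coef_[0], 2))      # blends the two true effects

# Initialise segments from the pooled residuals, then refine with hard EM.
resid = y - pooled.predict(x)
seg = (x[:, 0] * resid > 0).astype(int)
for _ in range(10):
    models = [LinearRegression().fit(x[seg == g], y[seg == g]) for g in (0, 1)]
    errs = np.column_stack([np.abs(y - m.predict(x)) for m in models])
    seg = errs.argmin(axis=1)

print("segment slopes:", [round(m.coef_[0], 2) for m in models])  # close to 0.2 and 1.5
```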
This paper presents a novel approach to the task of automatic music genre classification which is based on multiple feature vectors and an ensemble of classifiers. Multiple feature vectors are extracted from a single music piece. First, three 30-second music segments, one from the beginning, one from the middle and one from the end part of a music piece, are selected and feature vectors are extracted from each segment. Individual classifiers are trained to account for each feature vector extracted from each music segment. At classification time, the outputs provided by each individual classifier are combined through simple combination rules such as the majority vote, max, sum and product rules, with the aim of improving music genre classification accuracy. Experiments carried out on a large dataset containing more than 3,000 music samples from ten different Latin music genres have shown that, for the task of automatic music genre classification, the features extracted from the middle part of the music provide better results than using segments from the beginning or end part of the music. Furthermore, the proposed ensemble approach, which combines the multiple feature vectors, provides better accuracy than using single classifiers and any individual music segment.
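The fixed combination rules mentioned above are straightforward to express over the class-probability outputs of the three per-segment classifiers; the probability values below are made up purely to illustrate the rules:

```python
# Illustration of the majority-vote, max, sum and product combination rules;
# the probability values are invented, not outputs from the paper's classifiers.
import numpy as np

# Rows: one classifier per 30-second segment; columns: genre class probabilities.
probs = np.array([
    [0.2, 0.5, 0.3],   # classifier trained on the beginning segment
    [0.1, 0.7, 0.2],   # classifier trained on the middle segment
    [0.4, 0.3, 0.3],   # classifier trained on the end segment
])

majority = np.bincount(probs.argmax(axis=1), minlength=probs.shape[1]).argmax()
max_rule = probs.max(axis=0).argmax()
sum_rule = probs.sum(axis=0).argmax()
prod_rule = probs.prod(axis=0).argmax()

print(majority, max_rule, sum_rule, prod_rule)   # predicted class index under each rule
```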
Many scientific applications can benefit from efficient clustering algorithms for massively large high-dimensional datasets. However, most of the developed algorithms are impractical to use when the amount of data is very large. Given N objects, each defined by an M-dimensional feature vector, any clustering technique for handling very large datasets in high-dimensional space should run in time
This paper presents a novel approach to knowledge extraction from large-scale datasets using a neural network, applied to the real-world problem of payment card fraud detection. Fraud is a serious and long-term threat to a peaceful and democratic society. We present SOAR (Sparse Oracle-based Adaptive Rule) extraction, a practical approach to process large datasets and extract key generalizing
Significant payment flows now take place on-line, giving rise to a requirement for efficient and effective systems for the detection of credit card fraud. A particular aspect of this problem is that it is highly dynamic, as fraudsters continually adapt their strategies in response to the increasing sophistication of detection systems. Hence, system training by exposure to examples of previous fraudulent transactions can lead to fraud detection systems which are susceptible to new patterns of fraudulent transactions. The nature of the problem suggests that Artificial Immune Systems (AIS) may have particular utility for inclusion in fraud detection systems, as AIS can be constructed to flag 'non-standard' transactions without having seen examples of all possible such transactions during training of the algorithm. In this paper, we investigate the effectiveness of Artificial Immune Systems (AIS) for credit card fraud detection using a large dataset obtained from an on-line retailer. Three AIS algorithms were implemented and their performance was benchmarked against a logistic regression model. The results suggest that AIS algorithms have potential for inclusion in fraud detection systems, but that further work is required to realize their full potential in this domain.
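As an illustrative sketch of one AIS family (negative selection; not necessarily one of the three algorithms the paper implements, and with synthetic data in place of the retailer's dataset): random candidate detectors that lie too close to known-legitimate "self" transactions are discarded, and the survivors flag non-standard transactions.

```python
# Toy negative-selection sketch; synthetic data, not the paper's algorithms or dataset.
import numpy as np

rng = np.random.default_rng(0)

# "Self": normalised features of legitimate transactions (synthetic stand-in).
self_set = rng.normal(loc=0.5, scale=0.1, size=(500, 2)).clip(0, 1)

self_radius = 0.15
candidates = rng.uniform(0, 1, size=(2000, 2))
# Keep only detectors that do NOT match any self sample.
dists = np.linalg.norm(candidates[:, None, :] - self_set[None, :, :], axis=2)
detectors = candidates[dists.min(axis=1) > self_radius]

def is_suspicious(tx, detectors, radius=0.15):
    """Flag a transaction if it falls within any detector's radius."""
    return bool((np.linalg.norm(detectors - tx, axis=1) < radius).any())

print(is_suspicious(np.array([0.5, 0.5]), detectors))    # typical transaction -> False
print(is_suspicious(np.array([0.95, 0.05]), detectors))  # unusual transaction -> likely True
```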