Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2888451acmotherconferencesBook PagePublication PagesikddConference Proceedingsconference-collections
CODS '16: Proceedings of the 3rd IKDD Conference on Data Science, 2016
ACM2016 Proceeding
  • General Chairs:
  • Madhav Marathe,
  • Mukesh Mohania,
  • Program Chairs:
  • Mausam,
  • Prateek Jain
Publisher:
  • Association for Computing Machinery
  • New York
  • NY
  • United States
Conference:
CODS '16: IKDD Conference on Data Science, 2016 Pune India March 13 - 16, 2016
ISBN:
978-1-4503-4217-9
Published:
13 March 2016
In-Cooperation:
ACM India
Recommend ACM DL
ALREADY A SUBSCRIBER?SIGN IN

Reflects downloads up to 25 Jan 2025Bibliometrics
Skip Abstract Section
Abstract

This volume contains the papers presented at CoDS 2016: Third IKDD Conference on Data Sciences held on March 13-16, 2016 in Pune.

Skip Table Of Content Section
SESSION: Full Papers
research-article
SocialStories: Segmenting Stories within Trending Twitter Topics
Article No.: 1, Pages 1–7https://doi.org/10.1145/2888451.2888453

This study present SocialStories - a system based on incremental clustering for streaming tweets, for identifying fine-grained stories within a broader trending topic on Twitter. The contributions include a novel tf-metric, called the inverse cluster ...

research-article
Learning to Collectively Link Entities
Article No.: 2, Pages 1–9https://doi.org/10.1145/2888451.2888454

Recently Kulkarni et al. [20] proposed an approach for collective disambiguation of entity mentions occurring in natural language text. Their model achieves disambiguation by efficiently computing exact MAP inference in a binary labeled Markov Random ...

research-article
Learning DTW-Shapelets for Time-Series Classification
Article No.: 3, Pages 1–8https://doi.org/10.1145/2888451.2888456

Shapelets are discriminative patterns in time series, that best predict the target variable when their distances to the respective time series are used as features for a classifier. Since the shapelet is simply any time series of some length less than ...

research-article
Modeling Spatio-temporal Change Pattern using Mathematical Morphology
Article No.: 4, Pages 1–10https://doi.org/10.1145/2888451.2888458

Detection and assessment of spatio-temporal change pattern is a challenging task, and may provide insights into various spatio-temporal changes, like urban sprawl monitoring, surveillance of epidemics due to infectious diseases etc. The existing spatio-...

research-article
Detecting Community Structures in Social Networks by Graph Sparsification
Article No.: 5, Pages 1–9https://doi.org/10.1145/2888451.2888479

Community structures are inherent in social networks and finding them is an interesting and well-studied problem. Finding community structures in social networks is similar to locating densely connected clusters of nodes in a graph. One of the popular ...

SESSION: Short Papers
short-paper
On the Dynamics of Username Changing Behavior on Twitter
Article No.: 6, Pages 1–6https://doi.org/10.1145/2888451.2888452

People extensively use username to lookup users, their profiles and tweets that mention them via Twitter search engine. Often, the searched username is outdated due to a recent username change and no longer refers to the user of interest. Search by the ...

short-paper
Audience Prism: Segmentation and Early Classification of Visitors Based on Reading Interests
Article No.: 7, Pages 1–6https://doi.org/10.1145/2888451.2888459

The largest Media and Entertainment (M&E) web portals today cater to more than 100 Million unique visitors every month. In Customer Relationship Management, customer segmentation plays an important role, with the goal of targeting different products for ...

short-paper
Investigating the Potential of Aggregated Tweets as Surrogate Data for Forecasting Civil Protests
Article No.: 8, Pages 1–6https://doi.org/10.1145/2888451.2888466

Online Micro-blogging Social Media websites like Twitter are being used as a real-time platform for information sharing and communication during planning and mobilization of civil unrest events. We conduct a study of more than 1.5 million English Tweets ...

short-paper
Learning transition models of biological regulatory and signaling networks from noisy data
Article No.: 9, Pages 1–6https://doi.org/10.1145/2888451.2888469

In this paper, we present an extended 2-step probabilistic LGTS (PLGTS) transition system which aims to identify the network structure and stochastic nature of biological processes using time series data. This work is a step towards system ...

short-paper
Some algorithms for correlated bandits with non-stationary rewards: Regret bounds and applications
Article No.: 10, Pages 1–6https://doi.org/10.1145/2888451.2888475

We first propose an online learning model wherein rewards for different actions/arms used by the user can be correlated and the reward stream can be non-stationary. Thus, this extends the standard multi-armed bandit learning model. We propose two ...

short-paper
Events Describe Places: Tagging Places with Event Based Social Network Data
Article No.: 11, Pages 1–6https://doi.org/10.1145/2888451.2888477

Location based services and Geospatial web applications have become popular in recent years due to wide adoption of mobile devices. Search and recommendation of places or Points of Interests (PoIs) are prominent services available on them. The ...

POSTER SESSION: Poster Papers
research-article
Smart filters for social retrieval
Article No.: 12, Pages 1–2https://doi.org/10.1145/2888451.2888457

Social media platform are increasingly becoming a rich source of information for capturing the views and opinions of online customers. Major brands listen to the social streams to understand the general pulse of their online community. The foremost task ...

research-article
Learning from Gurus: Analysis and Modeling of Reopened Questions on Stack Overflow
Article No.: 13, Pages 1–2https://doi.org/10.1145/2888451.2888460

Community-driven Question Answering (Q&A) platforms are gaining popularity now-a-days and the number of posts on such platforms are increasing tremendously. Thus, the challenge to keep these platforms noise-free is attracting the interest of research ...

research-article
Exploiting Local and Global Context In PPI networks For Efficient Protein Function Prediction
Article No.: 14, Pages 1–2https://doi.org/10.1145/2888451.2888461

Protein-protein interaction (PPI) networks are valuable biological data source which contain rich information useful for protein function prediction. The PPI network data obtained from high-throughput experiments is known to be noisy and incomplete. In ...

research-article
Feature Creation based Slicing for Privacy Preserving Data Mining
Article No.: 15, Pages 1–8https://doi.org/10.1145/2888451.2888462

In the digital era vast amount of data are collected and shared for purpose of research and analysis. These data contain sensitive information about the people and organizations which needs to be protected during the process of data mining. This work ...

research-article
CitizenPulse: A Text Analytics framework for Proactive e-Governance - A Case Study of Mygov.in
Article No.: 16, Pages 1–2https://doi.org/10.1145/2888451.2888463

Indian Citizens are beginning to express themselves via social media on a regular basis on various issues. Government of India have started an initiated called as Mygov.in as a collaborative portal where citizens can voice their opinions via free form ...

research-article
Trustworthiness of t-Distributed Stochastic Neighbour Embedding
Article No.: 17, Pages 1–2https://doi.org/10.1145/2888451.2888465

A well known technique for embedding high dimensional objects in two or three dimensional space is the t-distributed stochastic neighbour embedding (t-SNE). The t-SNE minimizes the Kullback-Liebler (KL) divergence between two probability distributions, ...

research-article
Weighted Linear Loss Twin Support Vector Clustering
Article No.: 18, Pages 1–2https://doi.org/10.1145/2888451.2888467

Traditional point based clustering methods such as k-means [1], k-median [2], etc. work by partitioning the data into clusters based on the cluster prototype points. These methods perform poorly in case when data is not distributed around several ...

research-article
Using Sort-Union to Enhance Economically-Efficient Sentiment Stream Analysis
Article No.: 19, Pages 1–2https://doi.org/10.1145/2888451.2888468

Sentiment drifts due to people changing their opinions instantly on microblogs e.g. Twitter, are a major challenge in sentiment analysis. In this paper, we have developed a method that selects most frequent messages from a relevant message set ...

research-article
Mining Multi-source Data to Study Workplace Activity Patterns
Article No.: 20, Pages 1–2https://doi.org/10.1145/2888451.2888470

Examining work activity patterns is a problem of enduring research in organizations. The fortuitous availability of a whole new set of data collection mechanisms such as mobiles, activity loggers, GPS based location detectors, provide us new ways of ...

research-article
Consensus Clustering Approach for Discovering Overlapping Nodes in Social Networks
Article No.: 21, Pages 1–2https://doi.org/10.1145/2888451.2888471

Community discovery is an important problem that has been addressed in social networks through multiple perspectives. Most of these algorithms discover disjoint communities and yield widely varying results with regard to number of communities as well as ...

research-article
An Approach to Allocate Advertisement Slots for Banner Advertising
Article No.: 22, Pages 1–2https://doi.org/10.1145/2888451.2888472

In the banner advertising scenario, an advertiser aims to reach the maximum number of potential visitors and a publisher tries to meet the requests of increased number of advertisers to maximize the revenue. In the literature, a model was introduced to ...

research-article
Competing Algorithm Detection from Research Papers
Article No.: 23, Pages 1–2https://doi.org/10.1145/2888451.2888473

We propose an unsupervised approach to extract all competing algorithms present in a given scholarly article. The algorithm names are treated as named entities and natural language processing techniques are used to extract them. All extracted entity ...

research-article
Query Classification using LDA Topic Model and Sparse Representation Based Classifier
Article No.: 24, Pages 1–2https://doi.org/10.1145/2888451.2888474

Users often seek for information by submitting query consisting of keywords may belong to multiple topics, representing overlapping concepts. Objective of the work is to classify the query into a topic class label by considering the query keywords ...

research-article
Scalable Quick Reduct Algorithm: Iterative MapReduce Approach
Article No.: 25, Pages 1–2https://doi.org/10.1145/2888451.2888476

Feature selection by reduct computation is the key technique for knowledge acquistion using rough set theory. Existing MapReduce based reduct algorithms use Hadoop Map Reduce framework, which is not suitable for iterative algorithms. Paper aims to ...

research-article
Improving Urban Transportation through Social Media Analytics
Article No.: 26, Pages 1–2https://doi.org/10.1145/2888451.2888478

Citizens tend to discuss issues in public forums, social media, and web blogs. Given that issues related to public transportation are most actively reported across web-based sources, we present a holistic framework for collection, categorization, ...

research-article
AMEO 2015: A dataset comprising AMCAT test scores, biodata details and employment outcomes of job seekers
Article No.: 27, Pages 1–2https://doi.org/10.1145/2888451.2892037

More than a million engineers enter the global workforce every year. A relevant question is what determines the jobs and salaries these engineers are offered right after graduation. Previous studies have shown the influence of various factors such as ...

Contributors
  • Indraprastha Institute of Information Technology, Delhi

Recommendations