Proceedings of the 3rd IKDD Conference on Data Science, 2016

CODS '16: Proceedings of the 3rd IKDD Conference on Data Science, 2016

March 2016

General Chairs:
Madhav Marathe
Virginia Polytechnic Institute and State University, Blacksburg (VA), USA
Mukesh Mohania
IBM India Research Lab, New Delhi, India

Program Chairs:
Mausam
Indian Institute of Technology, Delhi, India
Prateek Jain
Microsoft Research Lab, Bangalore, India

Publisher:

Association for Computing Machinery, New York, NY, United States

Conference:

CODS '16: IKDD Conference on Data Science, 2016, Pune, India, March 13-16, 2016

ISBN:

978-1-4503-4217-9

Published:

13 March 2016

In-Cooperation:

ACM India

Abstract

This volume contains the papers presented at CoDS 2016: Third IKDD Conference on Data Sciences held on March 13-16, 2016 in Pune.

SESSION: Full Papers

research-article

SocialStories: Segmenting Stories within Trending Twitter Topics

Kokil Jaidka,
Kaushik Ramachandran,
Prakhar Gupta,
Sajal Rustagi

Article No.: 1, Pages 1–7, https://doi.org/10.1145/2888451.2888453

This study presents SocialStories, a system based on incremental clustering for streaming tweets, for identifying fine-grained stories within a broader trending topic on Twitter. The contributions include a novel tf-metric, called the inverse cluster ...

research-article

Learning to Collectively Link Entities

Ashish Kulkarni,
Kanika Agarwal,
Pararth Shah,
Sunny Raj Rathod,
Ganesh Ramakrishnan

Article No.: 2, Pages 1–9, https://doi.org/10.1145/2888451.2888454

Recently, Kulkarni et al. [20] proposed an approach for collective disambiguation of entity mentions occurring in natural language text. Their model achieves disambiguation by efficiently computing exact MAP inference in a binary-labeled Markov Random ...

research-article

Learning DTW-Shapelets for Time-Series Classification

Mit Shah,
Josif Grabocka,
Nicolas Schilling,
Martin Wistuba,
Lars Schmidt-Thieme

Article No.: 3, Pages 1–8, https://doi.org/10.1145/2888451.2888456

Shapelets are discriminative patterns in time series that best predict the target variable when their distances to the respective time series are used as features for a classifier. Since the shapelet is simply any time series of some length less than ...

research-article

Modeling Spatio-temporal Change Pattern using Mathematical Morphology

Monidipa Das,
Soumya K. Ghosh

Article No.: 4, Pages 1–10, https://doi.org/10.1145/2888451.2888458

Detection and assessment of spatio-temporal change patterns is a challenging task and may provide insights into various spatio-temporal changes, with applications such as urban sprawl monitoring and surveillance of epidemics due to infectious diseases. The existing spatio-...

research-article

Detecting Community Structures in Social Networks by Graph Sparsification

Partha Basuchowdhuri,
Satyaki Sikdar,
Sonu Shreshtha,
Subhashis Majumder

Article No.: 5, Pages 1–9, https://doi.org/10.1145/2888451.2888479

Community structures are inherent in social networks and finding them is an interesting and well-studied problem. Finding community structures in social networks is similar to locating densely connected clusters of nodes in a graph. One of the popular ...

SESSION: Short Papers

short-paper

On the Dynamics of Username Changing Behavior on Twitter

Paridhi Jain,
Ponnurangam Kumaraguru

Article No.: 6, Pages 1–6, https://doi.org/10.1145/2888451.2888452

People extensively use usernames to look up users, their profiles, and tweets that mention them via the Twitter search engine. Often, the searched username is outdated due to a recent username change and no longer refers to the user of interest. Search by the ...

short-paper

Audience Prism: Segmentation and Early Classification of Visitors Based on Reading Interests

Lilly Kumari,
Sunny Dhamnani,
Akshat Bhatnagar,
Atanu R. Sinha,
Ritwik Sinha

Article No.: 7, Pages 1–6, https://doi.org/10.1145/2888451.2888459

The largest Media and Entertainment (M&E) web portals today cater to more than 100 million unique visitors every month. In Customer Relationship Management, customer segmentation plays an important role, with the goal of targeting different products for ...

short-paper

Investigating the Potential of Aggregated Tweets as Surrogate Data for Forecasting Civil Protests

Swati Agarwal,
Ashish Sureka

Article No.: 8, Pages 1–6, https://doi.org/10.1145/2888451.2888466

Online micro-blogging social media websites like Twitter are being used as a real-time platform for information sharing and communication during planning and mobilization of civil unrest events. We conduct a study of more than 1.5 million English Tweets ...

short-paper

Learning transition models of biological regulatory and signaling networks from noisy data

Deepika Vatsa,
Sumeet Agarwal,
Ashwin Srinivasan

Article No.: 9, Pages 1–6, https://doi.org/10.1145/2888451.2888469

In this paper, we present an extended 2-step probabilistic LGTS (PLGTS) transition system which aims to identify the network structure and stochastic nature of biological processes using time series data. This work is a step towards system ...

short-paper

Some algorithms for correlated bandits with non-stationary rewards: Regret bounds and applications

Prathamesh Mayekar,
Nandyala Hemachandra

Article No.: 10, Pages 1–6, https://doi.org/10.1145/2888451.2888475

We first propose an online learning model wherein rewards for different actions/arms used by the user can be correlated and the reward stream can be non-stationary. Thus, this extends the standard multi-armed bandit learning model. We propose two ...

short-paper

Events Describe Places: Tagging Places with Event Based Social Network Data

Vinod Hegde,
Alessandra Mileo,
Alexei Pozdnoukhov

Article No.: 11, Pages 1–6, https://doi.org/10.1145/2888451.2888477

Location-based services and geospatial web applications have become popular in recent years due to the wide adoption of mobile devices. Search and recommendation of places or Points of Interest (PoIs) are prominent services available on them. The ...

POSTER SESSION: Poster Papers

research-article

Smart filters for social retrieval

Balaji Vasan Srinivasan,
Tanya Goyal,
Nikhil Mohan Nainani,
Kartik Sreenivasan

Article No.: 12, Pages 1–2, https://doi.org/10.1145/2888451.2888457

Social media platforms are increasingly becoming a rich source of information for capturing the views and opinions of online customers. Major brands listen to the social streams to understand the general pulse of their online community. The foremost task ...

research-article

Learning from Gurus: Analysis and Modeling of Reopened Questions on Stack Overflow

Rishabh Gupta,
P. Krishna Reddy

Article No.: 13, Pages 1–2, https://doi.org/10.1145/2888451.2888460

Community-driven Question Answering (Q&A) platforms are gaining popularity nowadays and the number of posts on such platforms is increasing tremendously. Thus, the challenge to keep these platforms noise-free is attracting the interest of research ...

research-article

Exploiting Local and Global Context In PPI networks For Efficient Protein Function Prediction

D. Satheesh Kumar,
Siddharth Goyal,
V. Prashant Reddy,
Ramesh Loganathan

Article No.: 14, Pages 1–2, https://doi.org/10.1145/2888451.2888461

Protein-protein interaction (PPI) networks are a valuable biological data source containing rich information useful for protein function prediction. The PPI network data obtained from high-throughput experiments is known to be noisy and incomplete. In ...

research-article

Feature Creation based Slicing for Privacy Preserving Data Mining

R. Praveena Priyadarsini,
M. L. Valarmathi,
S. Sivakumari

Article No.: 15, Pages 1–8, https://doi.org/10.1145/2888451.2888462

In the digital era, vast amounts of data are collected and shared for the purpose of research and analysis. These data contain sensitive information about people and organizations, which needs to be protected during the process of data mining. This work ...

research-article

CitizenPulse: A Text Analytics framework for Proactive e-Governance - A Case Study of Mygov.in

Ankit Lamba,
Deepak Yadav,
Abhijit Lele

Article No.: 16, Pages 1–2, https://doi.org/10.1145/2888451.2888463

Indian citizens are beginning to express themselves via social media on a regular basis on various issues. The Government of India has started an initiative called Mygov.in as a collaborative portal where citizens can voice their opinions via free form ...

research-article

Trustworthiness of t-Distributed Stochastic Neighbour Embedding

Shishir Pandey,
Rahul Vaze

Article No.: 17, Pages 1–2, https://doi.org/10.1145/2888451.2888465

A well-known technique for embedding high dimensional objects in two or three dimensional space is the t-distributed stochastic neighbour embedding (t-SNE). The t-SNE minimizes the Kullback-Leibler (KL) divergence between two probability distributions, ...

research-article

Weighted Linear Loss Twin Support Vector Clustering

Reshma Khemchandani,
Aman Pal

Article No.: 18, Pages 1–2, https://doi.org/10.1145/2888451.2888467

Traditional point based clustering methods such as k-means [1], k-median [2], etc. work by partitioning the data into clusters based on the cluster prototype points. These methods perform poorly when the data is not distributed around several ...

research-article

Using Sort-Union to Enhance Economically-Efficient Sentiment Stream Analysis

Prateek Goel,
Manajit Chakraborty,
C. Ravindranath Chowdary

Article No.: 19, Pages 1–2, https://doi.org/10.1145/2888451.2888468

Sentiment drifts, caused by people changing their opinions instantly on microblogs such as Twitter, are a major challenge in sentiment analysis. In this paper, we have developed a method that selects the most frequent messages from a relevant message set ...

research-article

Mining Multi-source Data to Study Workplace Activity Patterns

Sachin Patel,
Ravi Mahamuni,
Meghendra Singh,
David Clarance,
Mayuri Duggirala,
Shivani Sharma,
Vinay Katiyar,
Gauri Deshpande,
Amruta Deshmukh,
Vaibhav,
Vivek Balaraman

Article No.: 20, Pages 1–2, https://doi.org/10.1145/2888451.2888470

Examining work activity patterns is a problem of enduring research in organizations. The fortuitous availability of a whole new set of data collection mechanisms, such as mobiles, activity loggers, and GPS-based location detectors, provides us new ways of ...

research-article

Consensus Clustering Approach for Discovering Overlapping Nodes in Social Networks

D. Shiva Shankar,
S. Durga Bhavani

Article No.: 21, Pages 1–2, https://doi.org/10.1145/2888451.2888471

Community discovery is an important problem that has been addressed in social networks through multiple perspectives. Most of these algorithms discover disjoint communities and yield widely varying results with regard to the number of communities as well as ...

research-article

An Approach to Allocate Advertisement Slots for Banner Advertising

Vaddadi Naga Sai Kavya,
P. Krishna Reddy

Article No.: 22, Pages 1–2, https://doi.org/10.1145/2888451.2888472

In the banner advertising scenario, an advertiser aims to reach the maximum number of potential visitors and a publisher tries to meet the requests of an increased number of advertisers to maximize the revenue. In the literature, a model was introduced to ...

research-article

Competing Algorithm Detection from Research Papers

Soumyajit Ganguly,
Vikram Pudi

Article No.: 23, Pages 1–2, https://doi.org/10.1145/2888451.2888473

We propose an unsupervised approach to extract all competing algorithms present in a given scholarly article. The algorithm names are treated as named entities and natural language processing techniques are used to extract them. All extracted entity ...

research-article

Query Classification using LDA Topic Model and Sparse Representation Based Classifier

Indrani Bhattacharya,
Jaya Sil

Article No.: 24, Pages 1–2, https://doi.org/10.1145/2888451.2888474

Users often seek information by submitting a query consisting of keywords that may belong to multiple topics, representing overlapping concepts. The objective of the work is to classify the query into a topic class label by considering the query keywords ...

research-article

Scalable Quick Reduct Algorithm: Iterative MapReduce Approach

Praveen Kumar Singh,
P. S. V. S. Sai Prasad

Article No.: 25, Pages 1–2, https://doi.org/10.1145/2888451.2888476

Feature selection by reduct computation is the key technique for knowledge acquisition using rough set theory. Existing MapReduce based reduct algorithms use the Hadoop MapReduce framework, which is not suitable for iterative algorithms. The paper aims to ...

research-article

Improving Urban Transportation through Social Media Analytics

Manjira Sinha,
Preethy Varma,
Gayatri Sivakumar,
Mridula Singh,
Tridib Mukherjee,
Deepthi Chander,
Koustuv Dasgupta

Article No.: 26, Pages 1–2, https://doi.org/10.1145/2888451.2888478

Citizens tend to discuss issues in public forums, social media, and web blogs. Given that issues related to public transportation are most actively reported across web-based sources, we present a holistic framework for collection, categorization, ...

research-article

AMEO 2015: A dataset comprising AMCAT test scores, biodata details and employment outcomes of job seekers

Varun Aggarwal,
Shashank Srikant,
Harsh Nisar

Article No.: 27, Pages 1–2, https://doi.org/10.1145/2888451.2892037

More than a million engineers enter the global workforce every year. A relevant question is what determines the jobs and salaries these engineers are offered right after graduation. Previous studies have shown the influence of various factors such as ...
