
Showing 1–44 of 44 results for author: Bhattacharya, P

Searching in archive cs.
  1. arXiv:2406.11704 [pdf, other]

    cs.CL cs.AI cs.LG

    Nemotron-4 340B Technical Report

    Authors: Nvidia, Bo Adler, Niket Agarwal, Ashwath Aithal, Dong H. Anh, Pallab Bhattacharya, Annika Brundyn, Jared Casper, Bryan Catanzaro, Sharon Clay, Jonathan Cohen, Sirshak Das, Ayush Dattagupta, Olivier Delalleau, Leon Derczynski, Yi Dong, Daniel Egert, Ellie Evans, Aleksander Ficek, Denys Fridman, Shaona Ghosh, Boris Ginsburg, Igor Gitman, Tomasz Grzegorzek, et al. (58 additional authors not shown)

    Abstract: We release the Nemotron-4 340B model family, including Nemotron-4-340B-Base, Nemotron-4-340B-Instruct, and Nemotron-4-340B-Reward. Our models are open access under the NVIDIA Open Model License Agreement, a permissive model license that allows distribution, modification, and use of the models and its outputs. These models perform competitively to open access models on a wide range of evaluation be…

    Submitted 6 August, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  2. arXiv:2405.07447 [pdf]

    cs.HC

    From traces to measures: Large language models as a tool for psychological measurement from text

    Authors: Joseph J. P. Simons, Wong Liang Ze, Prasanta Bhattacharya, Brandon Siyuan Loh, Wei Gao

    Abstract: Digital trace data provide potentially valuable resources for understanding human behaviour, but their value has been limited by issues of unclear measurement. The growth of large language models provides an opportunity to address this limitation in the case of text data. Specifically, recognizing cases where their responses are a form of psychological measurement (the use of observable indicators…

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: 12 pages, 2 figures, 1 table

  3. arXiv:2404.12674 [pdf, other]

    cs.DC cs.LG cs.PF

    Towards Universal Performance Modeling for Machine Learning Training on Multi-GPU Platforms

    Authors: Zhongyi Lin, Ning Sun, Pallab Bhattacharya, Xizhou Feng, Louis Feng, John D. Owens

    Abstract: Characterizing and predicting the training performance of modern machine learning (ML) workloads on compute systems with compute and communication spread between CPUs, GPUs, and network devices is not only the key to optimization and planning but also a complex goal to achieve. The primary challenges include the complexity of synchronization and load balancing between CPUs and GPUs, the variance i…

    Submitted 27 April, 2024; v1 submitted 19 April, 2024; originally announced April 2024.

    Comments: 12 pages, 11 figures, 4 tables

  4. arXiv:2310.16673 [pdf, other]

    cs.SE cs.AI cs.IR

    Exploring Large Language Models for Code Explanation

    Authors: Paheli Bhattacharya, Manojit Chakraborty, Kartheek N S N Palepu, Vikas Pandey, Ishan Dindorkar, Rakesh Rajpurohit, Rishabh Gupta

    Abstract: Automating code documentation through explanatory text can prove highly beneficial in code understanding. Large Language Models (LLMs) have made remarkable strides in Natural Language Processing, especially within software engineering tasks such as code generation and code summarization. This study specifically delves into the task of generating natural-language summaries for code snippets, using…

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: Accepted at the Forum for Information Retrieval Evaluation 2023 (IRSE Track)

    ACM Class: D.2.3; I.7

  5. arXiv:2310.09848 [pdf]

    cs.CL

    Enhancing Stance Classification with Quantified Moral Foundations

    Authors: Hong Zhang, Prasanta Bhattacharya, Wei Gao, Liang Ze Wong, Brandon Siyuan Loh, Joseph J. P. Simons, Jisun An

    Abstract: This study enhances stance detection on social media by incorporating deeper psychological attributes, specifically individuals' moral foundations. These theoretically-derived dimensions aim to provide a comprehensive profile of an individual's moral concerns which, in recent work, has been linked to behaviour in a range of domains, including society, politics, health, and the environment. In this…

    Submitted 15 October, 2023; originally announced October 2023.

    Comments: 11 pages, 5 figures

  6. arXiv:2306.14142 [pdf, other]

    cs.SI stat.ME

    Estimating Policy Effects in a Social Network with Independent Set Sampling

    Authors: Eugene Ang, Prasanta Bhattacharya, Andrew Lim

    Abstract: Evaluating the impact of policy interventions on respondents who are embedded in a social network is often challenging due to the presence of network interference within the treatment groups, as well as between treatment and non-treatment groups throughout the network. In this paper, we propose a modeling strategy that combines existing work on stochastic actor-oriented models (SAOM) with a novel…

    Submitted 25 February, 2024; v1 submitted 25 June, 2023; originally announced June 2023.

  7. arXiv:2301.02959 [pdf, other]

    cs.LG cs.DC cs.IR cs.PF

    FlexShard: Flexible Sharding for Industry-Scale Sequence Recommendation Models

    Authors: Geet Sethi, Pallab Bhattacharya, Dhruv Choudhary, Carole-Jean Wu, Christos Kozyrakis

    Abstract: Sequence-based deep learning recommendation models (DLRMs) are an emerging class of DLRMs showing great improvements over their prior sum-pooling based counterparts at capturing users' long term interests. These improvements come at immense system cost however, with sequence-based DLRMs requiring substantial amounts of data to be dynamically materialized and communicated by each accelerator during…

    Submitted 7 January, 2023; originally announced January 2023.

  8. arXiv:2212.13897 [pdf, other]

    cs.IR

    What You Like: Generating Explainable Topical Recommendations for Twitter Using Social Annotations

    Authors: Parantapa Bhattacharya, Saptarshi Ghosh, Muhammad Bilal Zafar, Soumya K. Ghosh, Niloy Ganguly

    Abstract: With over 500 million tweets posted per day, in Twitter, it is difficult for Twitter users to discover interesting content from the deluge of uninteresting posts. In this work, we present a novel, explainable, topical recommendation system, that utilizes social annotations, to help Twitter users discover tweets, on topics of their interest. A major challenge in using traditional rating dependent r…

    Submitted 23 December, 2022; originally announced December 2022.

  9. arXiv:2212.12594 [pdf, other]

    cs.SI

    Analyzing Regrettable Communications on Twitter: Characterizing Deleted Tweets and Their Authors

    Authors: Parantapa Bhattacharya, Saptarshi Ghosh, Niloy Ganguly

    Abstract: Over 500 million tweets are posted in Twitter each day, out of which about 11% tweets are deleted by the users posting them. This phenomenon of widespread deletion of tweets leads to a number of questions: what kind of content posted by users makes them want to delete them later? Are users of certain predispositions more likely to post regr…

    Submitted 23 December, 2022; originally announced December 2022.

  10. arXiv:2212.09045 [pdf, other]

    cs.CL cs.CY cs.HC cs.SI

    Task Preferences across Languages on Community Question Answering Platforms

    Authors: Sebastin Santy, Prasanta Bhattacharya, Rishabh Mehrotra

    Abstract: With the steady emergence of community question answering (CQA) platforms like Quora, StackExchange, and WikiHow, users now have an unprecedented access to information on various kind of queries and tasks. Moreover, the rapid proliferation and localization of these platforms spanning geographic and linguistic boundaries offer a unique opportunity to study the task requirements and preferences of u…

    Submitted 18 December, 2022; originally announced December 2022.

    Comments: 7 pages, 4 figures

  11. arXiv:2211.01338 [pdf, other]

    eess.AS cs.CL cs.MM cs.SD eess.IV

    Technology Pipeline for Large Scale Cross-Lingual Dubbing of Lecture Videos into Multiple Indian Languages

    Authors: Anusha Prakash, Arun Kumar, Ashish Seth, Bhagyashree Mukherjee, Ishika Gupta, Jom Kuriakose, Jordan Fernandes, K V Vikram, Mano Ranjith Kumar M, Metilda Sagaya Mary, Mohammad Wajahat, Mohana N, Mudit Batra, Navina K, Nihal John George, Nithya Ravi, Pruthwik Mishra, Sudhanshu Srivastava, Vasista Sai Lodagala, Vandan Mujadia, Kada Sai Venkata Vineeth, Vrunda Sukhadia, Dipti Sharma, Hema Murthy, Pushpak Bhattacharya, et al. (2 additional authors not shown)

    Abstract: Cross-lingual dubbing of lecture videos requires the transcription of the original audio, correction and removal of disfluencies, domain term discovery, text-to-text translation into the target language, chunking of text using target language rhythm, text-to-speech synthesis followed by isochronous lipsyncing to the original video. This task becomes challenging when the source and target languages…

    Submitted 1 November, 2022; originally announced November 2022.

  12. arXiv:2210.09421 [pdf, other]

    cs.CR cs.CL cs.LG

    Deepfake Text Detection: Limitations and Opportunities

    Authors: Jiameng Pu, Zain Sarwar, Sifat Muhammad Abdullah, Abdullah Rehman, Yoonjin Kim, Parantapa Bhattacharya, Mobin Javed, Bimal Viswanath

    Abstract: Recent advances in generative models for language have enabled the creation of convincing synthetic text or deepfake text. Prior work has demonstrated the potential for misuse of deepfake text to mislead content consumers. Therefore, deepfake text detection, the task of discriminating between human and machine-generated text, is becoming increasingly critical. Several defenses have been proposed f…

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: Accepted to IEEE S&P 2023; First two authors contributed equally to this work; 18 pages, 7 figures

  13. arXiv:2210.07544 [pdf, other]

    cs.CL cs.IR

    Legal Case Document Summarization: Extractive and Abstractive Methods and their Evaluation

    Authors: Abhay Shukla, Paheli Bhattacharya, Soham Poddar, Rajdeep Mukherjee, Kripabandhu Ghosh, Pawan Goyal, Saptarshi Ghosh

    Abstract: Summarization of legal case judgement documents is a challenging problem in Legal NLP. However, not much analyses exist on how different families of summarization models (e.g., extractive vs. abstractive) perform when applied to legal case documents. This question is particularly important since many recent transformer-based abstractive summarization models have restrictions on the number of input…

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: Accepted at The 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing (AACL-IJCNLP), 2022

  14. arXiv:2209.12474 [pdf, other]

    cs.IR

    Legal Case Document Similarity: You Need Both Network and Text

    Authors: Paheli Bhattacharya, Kripabandhu Ghosh, Arindam Pal, Saptarshi Ghosh

    Abstract: Estimating the similarity between two legal case documents is an important and challenging problem, having various downstream applications such as prior-case retrieval and citation recommendation. There are two broad approaches for the task -- citation network-based and text-based. Prior citation network-based approaches consider citations only to prior-cases (also called precedents) (PCNet). This…

    Submitted 26 September, 2022; originally announced September 2022.

    Comments: This work has been published in Information Processing and Management, Elsevier, vol. 59, issue 6, November 2022

  15. TPP: Transparent Page Placement for CXL-Enabled Tiered-Memory

    Authors: Hasan Al Maruf, Hao Wang, Abhishek Dhanotia, Johannes Weiner, Niket Agarwal, Pallab Bhattacharya, Chris Petersen, Mosharaf Chowdhury, Shobhit Kanaujia, Prakash Chauhan

    Abstract: The increasing demand for memory in hyperscale applications has led to memory becoming a large portion of the overall datacenter spend. The emergence of coherent interfaces like CXL enables main memory expansion and offers an efficient solution to this problem. In such systems, the main memory can constitute different memory technologies with varied characteristics. In this paper, we characterize…

    Submitted 28 May, 2023; v1 submitted 6 June, 2022; originally announced June 2022.

  16. arXiv:2204.08601 [pdf, other]

    cs.CV cs.GR

    A Tour of Visualization Techniques for Computer Vision Datasets

    Authors: Bilal Alsallakh, Pamela Bhattacharya, Vanessa Feng, Narine Kokhlikyan, Orion Reblitz-Richardson, Rahul Rajan, David Yan

    Abstract: We survey a number of data visualization techniques for analyzing Computer Vision (CV) datasets. These techniques help us understand properties and latent patterns in such data, by applying dataset-level analysis. We present various examples of how such analysis helps predict the potential impact of the dataset properties on CV models and informs appropriate mitigation of their shortcomings. Final…

    Submitted 18 April, 2022; originally announced April 2022.

  17. arXiv:2106.15876 [pdf, other]

    cs.CL cs.IR

    Incorporating Domain Knowledge for Extractive Summarization of Legal Case Documents

    Authors: Paheli Bhattacharya, Soham Poddar, Koustav Rudra, Kripabandhu Ghosh, Saptarshi Ghosh

    Abstract: Automatic summarization of legal case documents is an important and practical challenge. Apart from many domain-independent text summarization algorithms that can be used for this purpose, several algorithms have been developed specifically for summarizing legal case documents. However, most of the existing algorithms do not systematically incorporate domain knowledge that specifies what informati…

    Submitted 30 June, 2021; originally announced June 2021.

    Comments: Accepted at the 18th International Conference on Artificial Intelligence and Law (ICAIL) 2021

  18. arXiv:2106.06292 [pdf, other]

    cs.CL

    A Discussion on Building Practical NLP Leaderboards: The Case of Machine Translation

    Authors: Sebastin Santy, Prasanta Bhattacharya

    Abstract: Recent advances in AI and ML applications have benefited from rapid progress in NLP research. Leaderboards have emerged as a popular mechanism to track and accelerate progress in NLP through competitive model development. While this has increased interest and participation, the over-reliance on single, and accuracy-based metrics have shifted focus from other important metrics that might be equally…

    Submitted 30 December, 2022; v1 submitted 11 June, 2021; originally announced June 2021.

    Comments: pre-print

  19. arXiv:2104.05158 [pdf, other]

    cs.DC cs.AI cs.LG cs.PF

    Software-Hardware Co-design for Fast and Scalable Training of Deep Learning Recommendation Models

    Authors: Dheevatsa Mudigere, Yuchen Hao, Jianyu Huang, Zhihao Jia, Andrew Tulloch, Srinivas Sridharan, Xing Liu, Mustafa Ozdal, Jade Nie, Jongsoo Park, Liang Luo, Jie Amy Yang, Leon Gao, Dmytro Ivchenko, Aarti Basant, Yuxi Hu, Jiyan Yang, Ehsan K. Ardestani, Xiaodong Wang, Rakesh Komuravelli, Ching-Hsiang Chu, Serhat Yilmaz, Huayu Li, Jiyuan Qian, Zhuobo Feng, et al. (28 additional authors not shown)

    Abstract: Deep learning recommendation models (DLRMs) are used across many business-critical services at Facebook and are the single largest AI application in terms of infrastructure demand in its data-centers. In this paper we discuss the SW/HW co-designed solution for high-performance distributed training of large-scale DLRMs. We introduce a high-performance scalable software stack based on PyTorch and pa…

    Submitted 26 February, 2023; v1 submitted 11 April, 2021; originally announced April 2021.

  20. Jekyll: Attacking Medical Image Diagnostics using Deep Generative Models

    Authors: Neal Mangaokar, Jiameng Pu, Parantapa Bhattacharya, Chandan K. Reddy, Bimal Viswanath

    Abstract: Advances in deep neural networks (DNNs) have shown tremendous promise in the medical domain. However, the deep learning tools that are helping the domain, can also be used against it. Given the prevalence of fraud in the healthcare domain, it is important to consider the adversarial use of DNNs in manipulating sensitive data that is crucial to patient healthcare. In this work, we present the desig…

    Submitted 5 April, 2021; originally announced April 2021.

    Comments: Published in proceedings of the 5th European Symposium on Security and Privacy (EuroS&P '20)

  21. arXiv:2103.04263 [pdf, other]

    cs.CR cs.CV

    Deepfake Videos in the Wild: Analysis and Detection

    Authors: Jiameng Pu, Neal Mangaokar, Lauren Kelly, Parantapa Bhattacharya, Kavya Sundaram, Mobin Javed, Bolun Wang, Bimal Viswanath

    Abstract: AI-manipulated videos, commonly known as deepfakes, are an emerging problem. Recently, researchers in academia and industry have contributed several (self-created) benchmark deepfake datasets, and deepfake detection algorithms. However, little effort has gone towards understanding deepfake videos in the wild, leading to a limited understanding of the real-world applicability of research contributi…

    Submitted 10 March, 2021; v1 submitted 6 March, 2021; originally announced March 2021.

    Comments: Accepted to The Web Conference 2021; First two authors contributed equally to this work; 12 pages, 6 tables

  22. arXiv:2012.02594 [pdf, other]

    cs.CL cs.IR cs.LG

    To Schedule or not to Schedule: Extracting Task Specific Temporal Entities and Associated Negation Constraints

    Authors: Barun Patra, Chala Fufa, Pamela Bhattacharya, Charles Lee

    Abstract: State of the art research for date-time entity extraction from text is task agnostic. Consequently, while the methods proposed in literature perform well for generic date-time extraction from texts, they don't fare as well on task specific date-time entity extraction where only a subset of the date-time entities present in the text are pertinent to solving the task. Furthermore, some tasks require…

    Submitted 15 November, 2020; originally announced December 2020.

    Comments: Proceedings of EMNLP 2020

  23. arXiv:2007.03225 [pdf, other]

    cs.IR cs.CY cs.SI

    Hier-SPCNet: A Legal Statute Hierarchy-based Heterogeneous Network for Computing Legal Case Document Similarity

    Authors: Paheli Bhattacharya, Kripabandhu Ghosh, Arindam Pal, Saptarshi Ghosh

    Abstract: Computing similarity between two legal case documents is an important and challenging task in Legal IR, for which text-based and network-based measures have been proposed in literature. All prior network-based similarity methods considered a precedent citation network among case documents only (PCNet). However, this approach misses an important source of legal knowledge -- the hierarchy of legal s…

    Submitted 7 July, 2020; originally announced July 2020.

    Comments: Accepted at the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2020 (Short Paper)

  24. arXiv:2004.13274 [pdf]

    cs.MM cs.HC

    Exploring the contextual factors affecting multimodal emotion recognition in videos

    Authors: Prasanta Bhattacharya, Raj Kumar Gupta, Yinping Yang

    Abstract: Emotional expressions form a key part of user behavior on today's digital platforms. While multimodal emotion recognition techniques are gaining research attention, there is a lack of deeper understanding on how visual and non-visual features can be used to better recognize emotions in certain contexts, but not others. This study analyzes the interplay between the effects of multimodal emotion fea…

    Submitted 30 June, 2021; v1 submitted 28 April, 2020; originally announced April 2020.

    Comments: Accepted version at IEEE Transactions on Affective Computing

  25. arXiv:2004.12307 [pdf, other]

    cs.SI cs.CL cs.IR cs.LG

    Methods for Computing Legal Document Similarity: A Comparative Study

    Authors: Paheli Bhattacharya, Kripabandhu Ghosh, Arindam Pal, Saptarshi Ghosh

    Abstract: Computing similarity between two legal documents is an important and challenging task in the domain of Legal Information Retrieval. Finding similar legal documents has many applications in downstream tasks, including prior-case retrieval, recommendation of legal articles, and so on. Prior works have proposed two broad ways of measuring similarity between legal documents - analyzing the precedent c…

    Submitted 26 April, 2020; originally announced April 2020.

    Comments: This paper was published at the LDA 2019 workshop in the JURIX 2019 conference

  26. arXiv:2003.04988 [pdf, other]

    cs.CL cs.LG stat.ML

    ScopeIt: Scoping Task Relevant Sentences in Documents

    Authors: Vishwas Suryanarayanan, Barun Patra, Pamela Bhattacharya, Chala Fufa, Charles Lee

    Abstract: Intelligent assistants like Cortana, Siri, Alexa, and Google Assistant are trained to parse information when the conversation is synchronous and short; however, for email-based conversational agents, the communication is asynchronous, and often contains information irrelevant to the assistant. This makes it harder for the system to accurately detect intents, extract entities relevant to those inte…

    Submitted 15 November, 2020; v1 submitted 22 February, 2020; originally announced March 2020.

    Comments: Accepted in COLING 2020

    ACM Class: I.2.7; I.7.5

  27. arXiv:1911.05405 [pdf, ps, other]

    cs.IR

    Identification of Rhetorical Roles of Sentences in Indian Legal Judgments

    Authors: Paheli Bhattacharya, Shounak Paul, Kripabandhu Ghosh, Saptarshi Ghosh, Adam Wyner

    Abstract: Automatically understanding the rhetorical roles of sentences in a legal case judgement is an important problem to solve, since it can help in several downstream tasks like summarization of legal judgments, legal search, and so on. The task is challenging since legal case documents are usually not well-structured, and these rhetorical roles may be subjective (as evident from variation of opinions…

    Submitted 13 November, 2019; originally announced November 2019.

    Comments: Accepted at the 32nd International Conference on Legal Knowledge and Information Systems (JURIX) 2019

  28. Removing Stripes, Scratches, and Curtaining with Non-Recoverable Compressed Sensing

    Authors: Jonathan Schwartz, Yi Jiang, Yongjie Wang, Anthony Aiello, Pallab Bhattacharya, Hui Yuan, Zetian Mi, Nabil Bassim, Robert Hovden

    Abstract: Highly-directional image artifacts such as ion mill curtaining, mechanical scratches, or image striping from beam instability degrade the interpretability of micrographs. These unwanted, aperiodic features extend the image along a primary direction and occupy a small wedge of information in Fourier space. Deleting this wedge of data replaces stripes, scratches, or curtaining, with more complex str…

    Submitted 23 January, 2019; originally announced January 2019.

    Comments: 15 pages, 5 figures

  29. arXiv:1712.05492 [pdf, other]

    cs.CG

    Constant Approximation Algorithms for Guarding Simple Polygons using Vertex Guards

    Authors: Pritam Bhattacharya, Subir Kumar Ghosh, Sudebkumar Pal

    Abstract: The art gallery problem enquires about the least number of guards sufficient to ensure that an art gallery, represented by a simple polygon $P$, is fully guarded. Most standard versions of this problem are known to be NP-hard. In 1987, Ghosh provided a deterministic $\mathcal{O}(\log n)$-approximation algorithm for the case of vertex guards and edge guards in simple polygons. In the same paper, Gh…

    Submitted 11 April, 2018; v1 submitted 14 December, 2017; originally announced December 2017.

    Comments: 39 pages, 31 figures

  30. arXiv:1712.00988 [pdf, ps, other]

    cs.AI

    End-to-End Relation Extraction using Markov Logic Networks

    Authors: Sachin Pawar, Pushpak Bhattacharya, Girish K. Palshikar

    Abstract: The task of end-to-end relation extraction consists of two sub-tasks: i) identifying entity mentions along with their types and ii) recognizing semantic relations among the entity mention pairs. It has been shown that for bette…

    Submitted 4 December, 2017; originally announced December 2017.

  31. arXiv:1712.00640 [pdf, other]

    cs.LG

    Learning Sparse Adversarial Dictionaries For Multi-Class Audio Classification

    Authors: Vaisakh Shaj, Puranjoy Bhattacharya

    Abstract: Audio events are quite often overlapping in nature, and more prone to noise than visual signals. There has been increasing evidence for the superior performance of representations learned using sparse dictionaries for applications like audio denoising and speech enhancement. This paper concentrates on modifying the traditional reconstructive dictionary learning algorithms, by incorporating a discr…

    Submitted 2 December, 2017; originally announced December 2017.

    Comments: Accepted in Asian Conference of Pattern Recognition (ACPR-2017)

  32. arXiv:1706.03249 [pdf, other]

    cs.HC cs.IR cs.MM

    Characterizing and Predicting Supply-side Engagement on Crowd-contributed Video Sharing Platforms

    Authors: Rishabh Mehrotra, Prasanta Bhattacharya

    Abstract: Video sharing and entertainment websites have rapidly grown in popularity and now constitute some of the most visited websites on the Internet. Despite the active user engagement on these online video-sharing platforms, most of recent research on online media platforms have restricted themselves to networking based social media sites, like Facebook or Twitter. We depart from previous studies in th…

    Submitted 10 June, 2017; originally announced June 2017.

    Comments: 8 pages, ICTIR 2017

  33. arXiv:1608.01561 [pdf, ps, other]

    cs.CL

    Using Word Embeddings for Query Translation for Hindi to English Cross Language Information Retrieval

    Authors: Paheli Bhattacharya, Pawan Goyal, Sudeshna Sarkar

    Abstract: Cross-Language Information Retrieval (CLIR) has become an important problem to solve in the recent years due to the growth of content in multiple languages in the Web. One of the standard methods is to use query translation from source to target language. In this paper, we propose an approach based on word embeddings, a method that captures contextual clues for a particular word in the source lang…

    Submitted 4 August, 2016; originally announced August 2016.

    Comments: 17th International Conference on Intelligent Text Processing and Computational Linguistics

  34. arXiv:1512.06469 [pdf]

    cs.SI physics.soc-ph stat.AP

    A Co-evolution Model of Network Structure and User Behavior in Online Social Networks: The Case of Network-Driven Content Generation

    Authors: Prasanta Bhattacharya, Tuan Q. Phan, Xue Bai, Edoardo Airoldi

    Abstract: With the rapid growth of online social network sites (SNS), it has become imperative for platform owners and online marketers to investigate what drives content production on these platforms. However, previous research has found it difficult to statistically model these factors from observational data due to the inability to separately assess the effects of network formation and network influence.…

    Submitted 26 November, 2018; v1 submitted 20 December, 2015; originally announced December 2015.

  35. arXiv:1409.4621 [pdf, other]

    cs.CG

    Approximability of Guarding Weak Visibility Polygons

    Authors: Pritam Bhattacharya, Subir Kumar Ghosh, Bodhayan Roy

    Abstract: The art gallery problem enquires about the least number of guards that are sufficient to ensure that an art gallery, represented by a polygon $P$, is fully guarded. In 1998, the problems of finding the minimum number of point guards, vertex guards, and edge guards required to guard $P$ were shown to be APX-hard by Eidenbenz, Widmayer and Stamm. In 1987, Ghosh presented approximation algorithms for…

    Submitted 30 April, 2016; v1 submitted 16 September, 2014; originally announced September 2014.

    Comments: 23 pages, 21 figures, 30 citations

  36. arXiv:1407.8476 [pdf]

    cs.CE

    A comparative study between seasonal wind speed by Fourier and Wavelet analysis

    Authors: Sabyasachi Mukhopadhyay, Debadatta Dash, Asish Mitra, Paritosh Bhattacharya

    Abstract: Wind Energy is a useful resource for Renewable energy purpose. Wind speed plays a vital role for wind energy calculation of certain location. So, it is very much necessary to know the wind speed data characteristics. In this paper fourier and wavelet transform are applied to study the wind speed data. We have compared wind speed of winter with summer by taking their speed into account using variou…

    Submitted 6 August, 2014; v1 submitted 31 July, 2014; originally announced July 2014.

  37. An ANN Based Call Handoff Management Scheme for Mobile Cellular Network

    Authors: P. P. Bhattacharya, Ananya Sarkar, Indranil Sarkar, Subhajit Chatterjee

    Abstract: Handoff decisions are usually signal strength based because of simplicity and effectiveness. Apart from the conventional techniques, such as threshold and hysteresis based schemes, recently many artificial intelligent techniques such as Fuzzy Logic, Artificial Neural Network (ANN) etc. are also used for taking handoff decision. In this paper, an Artificial Neural Network based handoff algorithm is…

    Submitted 10 January, 2014; originally announced January 2014.

    Comments: 11 pages. arXiv admin note: text overlap with arXiv:1004.1794 by other authors

  38. arXiv:1310.1590 [pdf, ps, other]

    cs.CL

    Evolution of the Modern Phase of Written Bangla: A Statistical Study

    Authors: Paheli Bhattacharya, Arnab Bhattacharya

    Abstract: Active languages such as Bangla (or Bengali) evolve over time due to a variety of social, cultural, economic, and political issues. In this paper, we analyze the change in the written form of the modern phase of Bangla quantitatively in terms of character-level, syllable-level, morpheme-level and word-level features. We collect three different types of corpora---classical, newspapers and blogs---a…

    Submitted 6 October, 2013; originally announced October 2013.

    Comments: LCC 2013

    ACM Class: I.2.7

  39. arXiv:1309.3513 [pdf]

    cs.DM math.CO

    Application of Vertex coloring in a particular triangular closed path structure and in Krafts inequality

    Authors: Sabyasachi Mukhopadhyay, Paritosh Bhattacharya, B. B. Ghosh

    Abstract: A good deal of research has been done and published on coloring of the vertices of graphs for several years while studying of the excellent work of those maestros, we get inspire to work on the vertex coloring of graphs in case of a particular triangular closed path structure what we achieve from the front view of a pyramidal structure. From here we achieve a repetitive nature of vertex coloring i…

    Submitted 22 August, 2013; originally announced September 2013.

  40. arXiv:1210.2940 [pdf]

    cs.NI

    A review on routing protocols for application in wireless sensor networks

    Authors: Neha Rathi, Jyoti Saraswat, Partha Pratim Bhattacharya

    Abstract: Wireless sensor networks are harshly restricted by storage capacity, energy and computing power. So it is essential to design effective and energy aware protocol in order to enhance the network lifetime. In this paper, a review on routing protocol in WSNs is carried out which are classified as data-centric, hierarchical and location based depending on the network structure. Then some of the multip…

    Submitted 10 October, 2012; originally announced October 2012.

    Comments: 20 pages, 16 figures, 2 tables

  41. arXiv:1205.2269 [pdf]

    cs.NI

    Performance improvement in OFDM system by PAPR reduction

    Authors: Suverna Sengar, Partha Pratim Bhattacharya

    Abstract: Orthogonal Frequency Division Multiplexing (OFDM) is an efficient method of data transmission for high speed communication systems. However, the main drawback of OFDM system is the high Peak to Average Power Ratio (PAPR) of the transmitted signals. OFDM consist of large number of independent subcarriers, as a result of which the amplitude of such a signal can have high peak values. Coding, phase r…

    Submitted 9 May, 2012; originally announced May 2012.

    Comments: 13 pages, 8 figures, 1 Table, Signal & Image Processing : An International Journal (SIPIJ) Vol.3, No.2, April 2012

  42. arXiv:1201.1964 [pdf]

    cs.NI cs.GT

    A Survey on Dynamic Spectrum Access Techniques for Cognitive Radio

    Authors: Anita Garhwal, Partha Pratim Bhattacharya

    Abstract: Cognitive radio (CR) is a new paradigm that utilizes the available spectrum band. The key characteristic of CR system is to sense the electromagnetic environment to adapt their operation and dynamically vary its radio operating parameters. The technique of dynamically accessing the unused spectrum band is known as Dynamic Spectrum Access (DSA). The dynamic spectrum access technology helps to minim…

    Submitted 9 January, 2012; originally announced January 2012.

    Comments: arXiv admin note: text overlap with http://www.ijetch.org/papers/206-Z058.pdf by other authors

  43. arXiv:1112.2248 [pdf]

    cs.NI

    A Survey on Cooperative Diversity and Its Applications in Various Wireless Networks

    Authors: Gurpreet Kaur, Partha Pratim Bhattacharya

    Abstract: Cooperative diversity is a technique in which various radio terminals relay signals for each other. Cooperative diversity results when cooperative communications is used primarily to leverage the spatial diversity available among distributed radios. In this paper different cooperative diversity schemes and their applications in various wireless networks are discussed. In this paper the impact of c…

    Submitted 9 December, 2011; originally announced December 2011.

    Comments: 20 pages

    Journal ref: International Journal of Computer Science & Engineering Survey (IJCSES) Vol.2, No.4, November 2011

  44. arXiv:1109.0257 [pdf]

    cs.NI

    Smart Radio Spectrum Management for Cognitive Radio

    Authors: Partha Pratim Bhattacharya, Ronak Khandelwal, Rishita Gera, Anjali Agarwal

    Abstract: Today's wireless networks are characterized by fixed spectrum assignment policy. The limited available spectrum and the inefficiency in the spectrum usage necessitate a new communication paradigm to exploit the existing wireless spectrum opportunistically. Cognitive radio is a paradigm for wireless communication in which either a network or a wireless node changes its transmission or reception par…

    Submitted 5 August, 2011; originally announced September 2011.

    Comments: 13 pages, 11 figures

    Journal ref: International Journal of Parallel and Distributed Systems, Vol. 2, NO 4, July 2011