Showing 1–4 of 4 results for author: Balabantaray, R C

Searching in archive cs.
  1. arXiv:2405.04829  [pdf, other]

    cs.CL

    Fine-tuning Pre-trained Named Entity Recognition Models For Indian Languages

    Authors: Sankalp Bahad, Pruthwik Mishra, Karunesh Arora, Rakesh Chandra Balabantaray, Dipti Misra Sharma, Parameswari Krishnamurthy

    Abstract: Named Entity Recognition (NER) is a useful component in Natural Language Processing (NLP) applications. It is used in various tasks such as Machine Translation, Summarization, Information Retrieval, and Question-Answering systems. The research on NER is centered around English and some other major languages, whereas limited attention has been given to Indian languages. We analyze the challenges an…

    Submitted 10 May, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

    Comments: 8 pages, accepted in NAACL-SRW, 2024

  2. arXiv:2012.15023  [pdf]

    cs.CL

    Language Identification of Devanagari Poems

    Authors: Priyankit Acharya, Aditya Ku. Pathak, Rakesh Ch. Balabantaray, Anil Ku. Singh

    Abstract: Language Identification is a very important part of several text processing pipelines. Extensive research has been done in this field. This paper proposes a procedure for automatic language identification of poems for poem analysis task, consisting of 10 Devanagari based languages of India i.e. Angika, Awadhi, Braj, Bhojpuri, Chhattisgarhi, Garhwali, Haryanvi, Hindi, Magahi, and Maithili. We colla…

    Submitted 29 December, 2020; originally announced December 2020.

  3. arXiv:1901.08625  [pdf]

    cs.CL

    Automatic Parallel Corpus Creation for Hindi-English News Translation Task

    Authors: Aditya Kumar Pathak, Priyankit Acharya, Dilpreet Kaur, Rakesh Chandra Balabantaray

    Abstract: The parallel corpus for multilingual NLP tasks, deep learning applications like Statistical Machine Translation Systems is very important. The parallel corpus of Hindi-English language pair available for news translation task till date is of very limited size as per the requirement of the systems are concerned. In this work we have developed an automatic parallel corpus generation system prototype…

    Submitted 24 January, 2019; originally announced January 2019.

  4. arXiv:1502.07938  [pdf]

    cs.IR

    Document Clustering using K-Means and K-Medoids

    Authors: Rakesh Chandra Balabantaray, Chandrali Sarma, Monica Jha

    Abstract: With the huge upsurge of information in day-to-days life, it has become difficult to assemble relevant information in nick of time. But people, always are in dearth of time, they need everything quick. Hence clustering was introduced to gather the relevant information in a cluster. There are several algorithms for clustering information out of which in this paper, we accomplish K-means and K-Medoi…

    Submitted 27 February, 2015; originally announced February 2015.

    Journal ref: International Journal of Knowledge Based Computer Systems, Volume 1 Issue 1 (2013)