
Showing 1–12 of 12 results for author: Sarveswaran, K

Searching in archive cs.
  1. arXiv:2409.14657  [pdf]

    cs.CL

    Building Tamil Treebanks

    Authors: Kengatharaiyer Sarveswaran

    Abstract: Treebanks are important linguistic resources, which are structured and annotated corpora with rich linguistic annotations. These resources are used in Natural Language Processing (NLP) applications, supporting linguistic analyses, and are essential for training and evaluating various computational models. This paper discusses the creation of Tamil treebanks using three distinct approaches: manual…

    Submitted 22 September, 2024; originally announced September 2024.

    Comments: 10 pages

    Journal ref: Sarveswaran, K. (2024). Building Tamil Treebanks. In Proceedings of the International Conference on Tamil Computing and Information Technology (ICTCIT 2024)/23rd Tamil Internet Conference (pp. 22-32). INFITT. ISSN: 2313-4887

  2. arXiv:2409.11501  [pdf]

    cs.CL cs.AI

    Egalitarian Language Representation in Language Models: It All Begins with Tokenizers

    Authors: Menan Velayuthan, Kengatharaiyer Sarveswaran

    Abstract: Tokenizers act as a bridge between human language and the latent space of language models, influencing how language is represented in these models. Due to the immense popularity of English-Centric Large Language Models (LLMs), efforts are being made to adapt them for other languages. However, we demonstrate that, from a tokenization standpoint, not all tokenizers offer fair representation for comp…

    Submitted 17 September, 2024; originally announced September 2024.

    Comments: Content - 8 pages, References - 3 pages

    ACM Class: I.2.7

  3. arXiv:2407.08618  [pdf]

    cs.CL

    Tamil Language Computing: the Present and the Future

    Authors: Kengatharaiyer Sarveswaran

    Abstract: This paper delves into the text processing aspects of Language Computing, which enables computers to understand, interpret, and generate human language. Focusing on tasks such as speech recognition, machine translation, sentiment analysis, text summarization, and language modelling, language computing integrates disciplines including linguistics, computer science, and cognitive psychology to creat…

    Submitted 12 August, 2024; v1 submitted 11 July, 2024; originally announced July 2024.

    Comments: 11 pages, This is the write-up of the address delivered at the 30th Annual Sessions of the Jaffna Science Association, held from March 29-31, 2023, at the University of Jaffna, Sri Lanka

    Journal ref: Sarveswaran, K. (2024). Tamil Language Computing: the Present and the Future. Proceedings of Jaffna Science Association, Vol. 30(2), 27-37

  4. arXiv:2401.08367  [pdf]

    cs.CL

    Morphology and Syntax of the Tamil Language

    Authors: Kengatharaiyer Sarveswaran

    Abstract: This paper provides an overview of the morphology and syntax of the Tamil language, focusing on its contemporary usage. The paper also highlights the complexity and richness of Tamil in terms of its morphological and syntactic features, which will be useful for linguists analysing the language and conducting comparative studies. In addition, the paper will be useful for those developing computatio…

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: 45 pages

  5. arXiv:2309.06085  [pdf]

    cs.CL

    BHASA: A Holistic Southeast Asian Linguistic and Cultural Evaluation Suite for Large Language Models

    Authors: Wei Qi Leong, Jian Gang Ngui, Yosephine Susanto, Hamsawardhini Rengarajan, Kengatharaiyer Sarveswaran, William Chandra Tjhi

    Abstract: The rapid development of Large Language Models (LLMs) and the emergence of novel abilities with scale have necessitated the construction of holistic, diverse and challenging benchmarks such as HELM and BIG-bench. However, at the moment, most of these benchmarks focus only on performance in English and evaluations that include Southeast Asian (SEA) languages are few in number. We therefore propose…

    Submitted 18 September, 2023; v1 submitted 12 September, 2023; originally announced September 2023.

    Comments: 86 pages, 7 figures, added link to repository in abstract, minor formatting changes and typo corrections

  6. arXiv:2012.13436  [pdf, other]

    cs.CL

    ThamizhiUDp: A Dependency Parser for Tamil

    Authors: Kengatharaiyer Sarveswaran, Gihan Dias

    Abstract: This paper describes how we developed a neural-based dependency parser, namely ThamizhiUDp, which provides a complete pipeline for the dependency parsing of the Tamil language text using Universal Dependency formalism. We have considered the phases of the dependency parsing pipeline and identified tools and resources in each of these phases to improve the accuracy and to tackle data scarcity. Tham…

    Submitted 24 December, 2020; originally announced December 2020.

    Comments: 5 Pages, Published at ICON2020: 17th International Conference on Natural Language Processing (December 18-21, 2020)

  7. arXiv:2004.05319  [pdf, other]

    eess.IV cs.CV cs.LG

    KD-MRI: A knowledge distillation framework for image reconstruction and image restoration in MRI workflow

    Authors: Balamurali Murugesan, Sricharan Vijayarangan, Kaushik Sarveswaran, Keerthi Ram, Mohanasankar Sivaprakasam

    Abstract: Deep learning networks are being developed in every stage of the MRI workflow and have provided state-of-the-art results. However, this has come at the cost of increased computation requirement and storage. Hence, replacing the networks with compact models at various stages in the MRI workflow can significantly reduce the required storage space and provide considerable speedup. In computer vision,…

    Submitted 11 April, 2020; originally announced April 2020.

    Comments: Accepted in MIDL 2020. Code available

  8. arXiv:2001.02387  [pdf, other]

    eess.IV cs.CV

    A context based deep learning approach for unbalanced medical image segmentation

    Authors: Balamurali Murugesan, Kaushik Sarveswaran, Vijaya Raghavan S, Sharath M Shankaranarayana, Keerthi Ram, Mohanasankar Sivaprakasam

    Abstract: Automated medical image segmentation is an important step in many medical procedures. Recently, deep learning networks have been widely used for various medical image segmentation tasks, with U-Net and generative adversarial nets (GANs) being some of the commonly used ones. Foreground-background class imbalance is a common occurrence in medical images, and U-Net has difficulty in handling class im…

    Submitted 8 January, 2020; originally announced January 2020.

    Comments: Accepted in ISBI 2020

  9. arXiv:1908.09262  [pdf, other]

    eess.IV cs.CV

    Recon-GLGAN: A Global-Local context based Generative Adversarial Network for MRI Reconstruction

    Authors: Balamurali Murugesan, Vijaya Raghavan S, Kaushik Sarveswaran, Keerthi Ram, Mohanasankar Sivaprakasam

    Abstract: Magnetic resonance imaging (MRI) is one of the best medical imaging modalities as it offers excellent spatial resolution and soft-tissue contrast. But, the usage of MRI is limited by its slow acquisition time, which makes it expensive and causes patient discomfort. In order to accelerate the acquisition, multiple deep learning networks have been proposed. Recently, Generative Adversarial Networks…

    Submitted 25 August, 2019; originally announced August 2019.

    Comments: Accepted at MLMIR-MICCAIW 2019

  10. arXiv:1908.05311  [pdf, other]

    cs.CV

    Conv-MCD: A Plug-and-Play Multi-task Module for Medical Image Segmentation

    Authors: Balamurali Murugesan, Kaushik Sarveswaran, Sharath M Shankaranarayana, Keerthi Ram, Jayaraj Joseph, Mohanasankar Sivaprakasam

    Abstract: For the task of medical image segmentation, fully convolutional network (FCN) based architectures have been extensively used with various modifications. A rising trend in these architectures is to employ joint-learning of the target region with an auxiliary task, a method commonly known as multi-task learning. These approaches help impose smoothness and shape priors, which vanilla FCN approaches d…

    Submitted 14 August, 2019; originally announced August 2019.

    Comments: Accepted in MLMI 2019

  11. arXiv:1902.04099  [pdf, other]

    cs.CV

    Psi-Net: Shape and boundary aware joint multi-task deep network for medical image segmentation

    Authors: Balamurali Murugesan, Kaushik Sarveswaran, Sharath M Shankaranarayana, Keerthi Ram, Mohanasankar Sivaprakasam

    Abstract: Image segmentation is a primary task in many medical applications. Recently, many deep networks derived from U-Net have been extensively used in various medical image segmentation tasks. However, in most of the cases, networks similar to U-net produce coarse and non-smooth segmentations with lots of discontinuities. To improve and refine the performance of U-Net like networks, we propose the use o…

    Submitted 14 August, 2019; v1 submitted 11 February, 2019; originally announced February 2019.

    Comments: Accepted at EMBC 2019

  12. arXiv:1901.08824  [pdf, other]

    cs.CV

    Joint shape learning and segmentation for medical images using a minimalistic deep network

    Authors: Balamurali Murugesan, Kaushik Sarveswaran, Sharath M Shankaranarayana, Keerthi Ram, Mohanasankar Sivaprakasam

    Abstract: Recently, state-of-the-art results have been achieved in semantic segmentation using fully convolutional networks (FCNs). Most of these networks employ encoder-decoder style architecture similar to U-Net and are trained with images and the corresponding segmentation maps as a pixel-wise classification task. Such frameworks only exploit class information by using the ground truth segmentation maps.…

    Submitted 25 January, 2019; originally announced January 2019.

    Comments: Under review at MIDL 2019