
Bipul Syam Purkayastha

Quantum computing, which is governed by the laws of quantum mechanics, is an attractive approach to increasing computational speed. Unlike classical systems, quantum computers are not limited to two states; they encode information as quantum bits, or qubits. A qubit is realized by atoms, ions, photons or electrons, together with their control devices, which work together as computer memory and processor. The power of quantum computing lies in its multi-state representation, which is described as making it a million times more powerful than classical computer systems. Quantum computers also exploit another aspect of quantum mechanics known as entanglement, a property in which the states of multiple objects are linked together. A message-passing library standard, the Message Passing Interface (MPI), can be used to achieve high-performance computing. The goal of MPI is to provide a standardized framework for ...
Term Frequency and Inverse Document Frequency (TF-IDF) is reported to contribute significantly to text categorization, document clustering and many other text mining tasks. This work examines a collection of applications and enhancements of the TF-IDF based document representation technique. The document representation algorithm is essential in the field of text mining. It converts unstructured data into a vector space model in which each document is a point in the vector space. Related documents lie close to one another, while documents that are not coherent with each other lie far apart. In this paper, four feature selection techniques are implemented to discover patterns in a repository of unstructured data using the correlation similarity measure. An analysis and comparison with other existing techniques is also included. The patterns formed are validated using silhouette values. Experiments are conducted to compare performance. The results indicate that TDMp1 performs poorly compared to the others.
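The vector space representation and the correlation similarity measure mentioned above can be illustrated with a minimal sketch. It assumes plain TF-IDF weighting (relative term frequency times the logarithm of the inverse document frequency) and Pearson correlation; the function names and the toy documents are hypothetical and are not taken from the paper, whose four feature selection techniques are not reproduced here.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return (vocabulary, TF-IDF vectors) for a list of tokenized documents."""
    vocab = sorted({term for doc in docs for term in doc})
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        counts = Counter(doc)
        vectors.append([
            (counts[t] / len(doc)) * math.log(n_docs / df[t])
            for t in vocab
        ])
    return vocab, vectors


def correlation(u, v):
    """Pearson correlation between two equal-length vectors."""
    mean_u, mean_v = sum(u) / len(u), sum(v) / len(v)
    du = [x - mean_u for x in u]
    dv = [y - mean_v for y in v]
    num = sum(a * b for a, b in zip(du, dv))
    den = math.sqrt(sum(a * a for a in du) * sum(b * b for b in dv))
    return num / den if den else 0.0


# Hypothetical toy corpus: two related documents and one unrelated document.
docs = [
    "rain attenuates the satellite signal".split(),
    "heavy rain attenuates the signal".split(),
    "the tagger labels each word".split(),
]
vocab, vecs = tfidf_vectors(docs)
related = correlation(vecs[0], vecs[1])
unrelated = correlation(vecs[0], vecs[2])
```

In this toy corpus the term "the" occurs in every document, so its weight is zero, and the two rain-related documents correlate more strongly with each other than with the third.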
Machine Translation (MT), perhaps the earliest NLP application, is the translation of a sentence from one human language into another by a computer or other machine. The aim of this paper is to develop an MT system for Nepali that translates an English sentence into its most probable Nepali sentence using the Statistical Machine Translation (SMT) approach. The system is implemented using three tools: MOSES for decoding, GIZA++ for generating the translation model, and IRSTLM for estimating the target language model probabilities. An English-Nepali parallel corpus is used for training and a raw English corpus for testing. Both corpora were collected from TDIL (Technology Development for Indian Languages). The system was evaluated manually on two parameters, fluency and adequacy, and achieves an average score of 2.7 out of 4, i.e. approximately 68%. Although the implemented system achieves an accuracy of 68%, research on handling OoV (Out of Vocabulary) words is still continuing. A brief comparison with an existing English-Nepali MT system is also made.
Language is one of the most important aspects of human life. It is one of the most effective modes of communication between people belonging to different communities and cultures. Language acts as a bridge among us and helps create a bond among our cultures. It is therefore very important to learn our mother tongue as well as new languages. The dictionary is one of the important tools for learning new languages. A word is basically an association of linguistic sound and meaning. The spelling does not always correlate easily with the sound of a word. A dictionary helps us with both the spelling and the pronunciation of such words. Electronic dictionaries are very popular nowadays and can be accessed online by many users simultaneously. This paper describes the development of the Multilingual Assamese Electronic Dictionary (MAED). The MAED contains four languages, namely Assamese, Bengali, English and Hindi. We have develo...
With the fast growth of digital text documents on the internet and in digital repositories, the pool of digital documents is piling up day by day. Because of this growth, an efficient and effective technique is required to handle such an enormous amount of data. To mine the documents, it is extremely important to understand them properly. Text similarity measurement plays a major role in finding coherence among documents. The goal of similarity computation is to identify cohesion among text documents and to make the text ready for applications such as document organization, plagiarism detection and query matching. This task is one of the most fundamental in information retrieval, information extraction, document organization, plagiarism detection and text mining. The effectiveness of document clustering in particular depends heavily on it. In this paper four similarity measures are implemented and their descriptive statistics...
Assamese is the state language of Assam and English is the associate language of India. A dictionary is one of the most important tools for language learning, for education and for looking up related information in everyday life. An Electronic Dictionary (E-Dictionary) is a powerful tool that enables students to improve their learning ability and achievements. An Assamese-English Bilingual Electronic Dictionary (AEBED) is an important resource for Natural Language Processing tasks and linguistic work. This paper describes the implementation of an Assamese-English Bilingual Electronic (Online) Dictionary. The AEBED is user friendly: users can easily look up the English meaning of Assamese words, with related information, and likewise the Assamese meaning of English words. This dictionary will be beneficial for Assamese people as well as other people living in India.
Natural Language Processing (NLP) is mainly concerned with the development of computational models and tools for aspects of human (natural) language processing. Part of Speech (POS) tagging is a well-studied topic and one of the most fundamental preprocessing steps for any language in NLP. Natural language processing of Nepali still lacks significant research effort in India. POS tagging is a necessary component of most NLP applications for Nepali; it analyses the construction and behavior of the language and can be used to develop automated tools for language processing. The literature survey and related work show that little work has been done on POS tagging for Nepali in India, owing to the lack of a comprehensive tagged corpus or correct hand-written rules. This paper discusses Hidden Markov Model (HMM) based POS tagging for Nepali. The HMM is the most widely used statistical model for POS tagging and requires little knowledge about the language apart from contextual information. The tagger is evaluated using corpora collected from TDIL (Technology Development for Indian Languages) and the BIS tagset of 42 tags. The tagset has been designed to meet the morpho-syntactic requirements of the Nepali language. Apart from the corpora and the tagset, the Python programming language and the NLTK (Natural Language Toolkit) library have been used for the implementation. The tagger achieves an accuracy of over 96% for known words; for unknown words, the research is still continuing.
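As a rough illustration of HMM-based tagging, the sketch below trains a bigram HMM from counts and decodes with the Viterbi algorithm. It is not the paper's implementation: it uses only the Python standard library rather than NLTK, the add-one smoothing is an assumption made here so that unseen words and transitions keep a non-zero probability, and the function names, the English toy corpus and the three-tag tagset are hypothetical stand-ins for the TDIL corpora and the BIS tagset.

```python
import math
from collections import Counter, defaultdict


def train_hmm(tagged_sentences):
    """Count tag transitions and word emissions from (word, tag) sentences."""
    trans = defaultdict(Counter)  # trans[previous_tag][tag]
    emit = defaultdict(Counter)   # emit[tag][word]
    for sentence in tagged_sentences:
        prev = "<s>"  # sentence-start marker
        for word, tag in sentence:
            trans[prev][tag] += 1
            emit[tag][word] += 1
            prev = tag
    return trans, emit


def log_prob(counter, key, n_outcomes):
    """Add-one smoothed log probability (the smoothing is an assumption)."""
    return math.log((counter[key] + 1) / (sum(counter.values()) + n_outcomes))


def viterbi(words, trans, emit):
    """Return the most probable tag sequence for a non-empty word list."""
    tags = list(emit)
    # Known vocabulary plus one slot reserved for unknown words.
    vocab_size = len({w for c in emit.values() for w in c}) + 1
    best = {
        t: (log_prob(trans["<s>"], t, len(tags))
            + log_prob(emit[t], words[0], vocab_size), [t])
        for t in tags
    }
    for word in words[1:]:
        new_best = {}
        for t in tags:
            # Best previous tag for reaching tag t at this position.
            score, path = max(
                (best[p][0] + log_prob(trans[p], t, len(tags)), best[p][1])
                for p in tags
            )
            new_best[t] = (score + log_prob(emit[t], word, vocab_size),
                           path + [t])
        best = new_best
    return max(best.values())[1]


# Hypothetical toy training data.
train = [
    [("the", "DET"), ("dog", "NOUN"), ("runs", "VERB")],
    [("a", "DET"), ("cat", "NOUN"), ("sleeps", "VERB")],
]
trans, emit = train_hmm(train)
predicted = viterbi(["the", "cat", "runs"], trans, emit)
```

Scores are kept in log space to avoid numerical underflow on longer sentences.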
WordNet is a key resource that aids several NLP tasks. WordNet is used as the sense inventory for sense tagging of a corpus. Sense tagging is the task of tagging each word in a sentence with its correct sense in the given context. Sense tagging helps validate WordNet and improve its quality. It is one of the toughest annotation tasks, and this paper discusses the sense tagging tool, the procedures involved in sense tagging the Nepali corpus, and the challenges encountered. The Nepali WordNet is used as the sense inventory for sense tagging of the Nepali corpus. Accurate sense tagging of voluminous data requires a standard and definitive lexicon. In this work the Nepali corpus is taken from the newspaper domain.
Hybrid PSO/Self-Adaptive Improved EP for Economic Dispatch with Nonsmooth Cost Function. Nidul Sinha (Member, IEEE), Bipul Syam Purkayastha and Biswajit Purkayastha.
This paper investigates the hybridization of PSO and self-adaptive evolutionary programming techniques for solving the economic dispatch (ED) problem with non-smooth cost curves, where conventional gradient-based methods are inapplicable. The convergence capability of the evolutionary programming technique is enhanced by hybridizing self-adaptive evolutionary programming with PSO intelligence. Three types of hybridization between PSO and self-adaptive classical EP (CEP), i.e. PSO-CEP, CEP-PSO and CEP-PSO, are examined. The performance of the hybrid algorithms is demonstrated on a moderately large power system with 40 units, and a comparison is drawn among the floating point GA (GAF), CEP, PSO, PSO-CEP, CEP-PSO and CEP-PSO methods in terms of solution quality and computational efficiency. The simulation results show that the CEP-PSO method is the most efficient in finding higher quality solutions to non-convex ED problems.
This paper proposes a hybrid method that integrates the main features of particle swarm optimization (PSO) and evolutionary programming (EP) for the solution of nonconvex economic load dispatch (ELD) problems with nonlinearities such as valve point loadings. Algorithms based on PSO, EP and PSO-embedded EP techniques have been developed and tested on a practical nonconvex ELD problem with valve
In this paper an efficient method for data clustering is proposed. The proposed algorithm is a modified psFCM, called the pshFCM clustering algorithm, which finds better cluster centers for a given data set than the cluster centers obtained by the psFCM. The computational performance of the proposed pshFCM algorithm is comparable with the psFCM and the FCM.
Part-of-speech (POS) tagging is the process of marking up the words in a text (corpus) as corresponding to a particular part of speech, based on both their definition and their context, i.e. the relationship with adjacent and related words in a phrase, sentence, or paragraph. In other words, it assigns the appropriate part of speech or lexical category to each word in a natural language sentence. POS tagging is an important part of Natural Language Processing (NLP) and is useful for most NLP applications. It is often the first stage of natural language processing, after which further processing such as chunking and parsing is done. There are a number of approaches to implementing a part of speech tagger (1): the rule-based approach, the statistical approach and the hybrid approach. A rule-based tagger uses linguistic rules to assign the correct tags to the words in the sentence or file. Statistical Part of Speech tagger is based on the probabilities of occur...
Part-of-speech tagging (POS tagging or POST), also called grammatical tagging or word-category disambiguation, is the process of marking up words in a text as corresponding to a particular part of speech. This marking is based on both definition and context, i.e. the relationship with adjacent and related words in a phrase, sentence, or paragraph.
Stemming is the process of removing the affixes from inflected words without doing a complete morphological analysis. A stemming algorithm reduces all inflected words with the same stem to a common form. It is useful in many areas of computational linguistics and information retrieval. Search engines use this technique to find the best matches for a query. The algorithm is the basic building block of a stemmer, which is mainly used in information retrieval systems to improve performance. This paper presents a stemmer for Manipuri that uses a brute force algorithm together with a suffix stripping technique. The stemmer can be used as an important tool in information retrieval systems for the Manipuri language.
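The combination of a brute force algorithm with suffix stripping can be sketched as follows, reading "brute force" as a lookup table of known inflected forms that is consulted before any suffix is stripped. This reading, the function names, the minimum stem length and the English suffixes and table entries are all assumptions made for illustration; the paper's Manipuri suffix list and lookup table are not reproduced here.

```python
def strip_suffix(word, suffixes, min_stem=3):
    """Remove the longest matching suffix, keeping at least min_stem letters."""
    for suffix in sorted(suffixes, key=len, reverse=True):
        if word.endswith(suffix) and len(word) - len(suffix) >= min_stem:
            return word[: -len(suffix)]
    return word


def stem(word, lookup, suffixes):
    """Brute-force table lookup first, then fall back to suffix stripping."""
    if word in lookup:
        return lookup[word]
    return strip_suffix(word, suffixes)


# Hypothetical English stand-ins for a language-specific table and suffix list.
LOOKUP = {"went": "go", "mice": "mouse"}
SUFFIXES = {"ing", "ed", "s"}

stems = [stem(w, LOOKUP, SUFFIXES) for w in ["went", "walking", "sing", "cats"]]
```

The minimum stem length guards against over-stripping short words such as "sing", which would otherwise lose its final letters to the "ing" rule.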
Web mining is a cutting-edge technology that covers the gathering and classification of information over the web. This paper presents the concepts of document pre-processing, which is achieved by extracting keywords from documents fetched from the web, processing them, and generating a term-document matrix, TF-IDF (term frequency-inverse document frequency) and its different variants for each document. The last step is clustering these results with the K-Means algorithm and comparing the performance of each approach. The algorithm is realized on an x64 architecture and coded in Java and Matlab. The results are tabulated.
The Direct-To-Home (DTH) TV service is a smart option where audio-video clarity is a concern. During heavy rain and windy weather, the signal attenuates and becomes unavailable. This paper evaluates the rain attenuation in a DTH system in the frequency range of 950-2150 MHz. Rain data collected at Silchar City during this year is used in the evaluation. Link budget analysis and attenuation approximation are a very important part of any communication system. The DAH model, now adopted by the ITU, is used in the evaluation. The region is compared with the ITU recommended world regions. It is observed that the attenuation is minimal even in heavy rain, although it has an immense effect on the service.
3G mobile systems have been launched, offering content-rich services, wireless broadband access to the Internet, worldwide roaming, GPS and much more. However, the broadcast nature of wireless communication and the increased popularity of wireless devices introduce problems in many directions. Mobile users and providers must be assured of error-free, successful data communication. The present research aims to improve the quality of mobile services in the heavy rain zones of sub-tropical Indian regions by identifying the communication problems in the existing network and providing reliable and cost-effective solutions. This paper discusses the rainfall and rain attenuation of various capital cities of the North East Indian region.
Web Services are a relatively new and relevant area. The security of Web Services in a distributed environment is a major research concern and one of the thrust areas of research in both industry and academia. The present work focuses mainly on special security issues of Web Services. The study is carried out under the general framework of security issues, and the special security issues are discussed with reference to present Web service technology. Multi-Part Multi-Signature Document (MPMSD) based IT implementation is an emerging approach. Along with the emergence and development of this approach, XML signature security is also a matter of concern. Henceforth, the application of Multi-Part Multi-Signature Documents with respect to security issues needs further experimentation and exploration. The main aim of this work is to describe the process and applications of the Multi-Part Multi-Signature Document in the workflow environment and to compare XML signature security issues with the MPMSD. In addition, this work highlights future issues for research and experimentation under the MPMSD approach.
In this paper, Parikh matrices over a ternary alphabet are investigated. An algorithm is developed to display the Parikh matrices of words over a ternary alphabet. A set of equations for finding ternary words from their Parikh matrix is discussed. A theorem on the relations between the entries of 4 × 4 Parikh matrices is proved. Some other results in this regard are also discussed. The significance of the graphical representation of binary amiable words is explained, and an extension of this notion to ternary amiable words is introduced.
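For orientation, the following sketch computes a Parikh matrix using the standard definition: each letter of an ordered alphabet of size k maps to the (k+1) × (k+1) identity matrix with one extra 1 just above the diagonal, and the matrix of a word is the product of the matrices of its letters. This is not the paper's display algorithm; the function name and the example words are chosen here for illustration.

```python
def parikh_matrix(word, alphabet="abc"):
    """Return the Parikh matrix of word over an ordered alphabet."""
    k = len(alphabet)
    # Start from the (k+1) x (k+1) identity matrix.
    m = [[int(i == j) for j in range(k + 1)] for i in range(k + 1)]
    for ch in word:
        q = alphabet.index(ch)
        # Right-multiplying by I + E[q][q+1] adds column q to column q+1.
        for i in range(k + 1):
            m[i][q + 1] += m[i][q]
    return m


ternary = parikh_matrix("abc")

# Two distinct binary words that share a Parikh matrix (amiable words).
left = parikh_matrix("abba", alphabet="ab")
right = parikh_matrix("baab", alphabet="ab")
```

For a ternary alphabet the result is a 4 × 4 upper triangular matrix whose entries above the diagonal count scattered subwords, for example the number of occurrences of a, ab and abc in the first row. The words abba and baab yield the same matrix, which is what makes them amiable.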
In this paper the ratio property of words is investigated. The concepts of ratio property and weak ratio property are extended to nth order alphabets. A relationship between the ratio property and M-ambiguity is established. Various lemmas already proved about the ratio property over the ternary alphabet are investigated for tertiary alphabets. M-ambiguous words are formed by concatenating words that satisfy the ratio property.
The issue of synchronization of authorization flow with work object flow in a document production workflow environment is presented and discussed in this paper. We have shown how a work object flow is synchronized with the authorization flow using a central arbiter in Web service paradigms. The co-ordination of Web services is done using WS-BPEL which supports orchestration and XACML
In this paper, a new technique based on telepathy is used to design a telepathy-based network system (TNS). The main objective of the system is to perform secure data communication in wireless networks. A point to point (P2P) ad hoc network with two nodes, a sender and a receiver, is used in the system. It is seen that if the receiver is
In this paper, a system known as FTWN (File Transfer in Wireless Networks Environment) is developed. In wireless networks, sending a large file of more than 100 MB is a major issue. The problem is addressed by a system developed in Java, in which FTP is used to transfer the file. The transfer time is recorded, and the system is found to be very useful in wireless environments.
... Nidul Sinha, Bipul Syam Purkayastha and Biswajit Purkayastha Abstract ... Res., vol. 26, pp. 179-186, 1993. [9] CS Chang, KP Wong and B. Fan,“Security-constrained multiobjective generation dispatch using bicriterion global optimization,” IEE Proc.- Gener. Transm. Distrib., ...
This paper focuses on the morphological analysis of Kokborok words in order to incorporate them into a Kokborok dictionary and a Kokborok machine translator. So far, no attempt has been made to integrate existing work into a concrete computational output. In this paper we particularly emphasize bringing work on morphological analysis into a single frame, with the goal of producing a Kokborok dictionary as well as a machine translator, which will provide a unified base that fits into the already developed universal conversion systems of UNL. We explain the ...