Background: A question that lies at the very heart of language acquisition research is how childr... more Background: A question that lies at the very heart of language acquisition research is how children learn semi-regular systems with exceptions (e.g., the English plural rule that yields cats, dogs, etc, with exceptions feet and men). We investigated this question for Hindi ergative ne marking; another semi-regular but exception-filled system. Generally, in the past tense, the subject of two-participant transitive verbs (e.g., Ram broke the cup) is marked with ne, but there are exceptions. How, then, do children learn when ne marking is required, when it is optional, and when it is ungrammatical? Methods: We conducted two studies using (a) acceptability judgment and (b) elicited production methods with children (aged 4-5, 5-6 and 9-10 years) and adults. Results: All age groups showed effects of statistical preemption: the greater the frequency with which a particular verb appears with versus without ne marking on the subject – relative to other verbs – the greater the extent to which...
This paper describes our system submission for the GermEval 2018 shared task on the identificatio... more This paper describes our system submission for the GermEval 2018 shared task on the identification of German hate speech in Tweets at Konvens 2018. We trained and tested a Logistic Regression classifier with 10-fold cross validation using character ngrams as features. We achieved a macro F1 of 76.72 for the coarse-grained classification task and 47.17 for the fine-grained task when testing the classifiers on a small development set we created
The present work deals with image segmentation which results in the subdivision of an image into ... more The present work deals with image segmentation which results in the subdivision of an image into its constituent regions or objects. The result of image segmentation is a set of segments that collectively cover the entire image or a set of contours extracted from the image. Each of the pixels in a region are similar with respect to some characteristic or computed property, such as color, intensity or texture. Specifically this project deals with texture segmentation of an image to find out the different types of textures present in the image. In this project different type of procedures have been followed to carry out texture segmentation. Procedures starting from fundamental filter transforms till multi-resolution technique using wavelet transform have been considered. Many texture-segmentation schemes are based on a filter-bank model, where the filters called Gabor filters are derived from Gabor elementary functions. Both linear and circular Gabor filters are studied and analyzed ...
The IJCNLP 2017 shared task on Customer Feedback Analysis focuses on classifying customer feedbac... more The IJCNLP 2017 shared task on Customer Feedback Analysis focuses on classifying customer feedback into one of a predefined set of categories or classes. In this paper, we describe our approach to this problem and the results on four languages, i.e. English, French, Japanese and Spanish. Our system implemented a bidirectional LSTM (Graves and Schmidhuber, 2005) using pre-trained glove (Pennington et al., 2014) and fastText (Joulin et al., 2016) embeddings, and SVM (Cortes and Vapnik, 1995) with TF-IDF vectors for classifying the feedback data which is described in the later sections. We also tried different machine learning techniques and compared the results in this paper. Out of the 12 participating teams, our systems obtained 0.65, 0.86, 0.70 and 0.56 exact accuracy score in English, Spanish, French and Japanese respectively. We observed that our systems perform better than the baseline systems in three languages while we match the baseline accuracy for Japanese on our submitted ...
Background: A question that lies at the very heart of language acquisition research is how childr... more Background: A question that lies at the very heart of language acquisition research is how children learn semi-regular systems with exceptions (e.g., the English plural rule that yields cats, dogs, etc, with exceptions feet and men). We investigated this question for Hindi ergative ne marking; another semi-regular but exception-filled system. Generally, in the past tense, the subject of two-participant transitive verbs (e.g., Ram broke the cup) is marked with ne, but there are exceptions. How, then, do children learn when ne marking is required, when it is optional, and when it is ungrammatical? Methods: We conducted two studies using (a) acceptability judgment and (b) elicited production methods with children (aged 4-5, 5-6 and 9-10 years) and adults. Results: All age groups showed effects of statistical preemption: the greater the frequency with which a particular verb appears with versus without ne marking on the subject – relative to other verbs – the greater the extent to which...
This paper describes our system submission for the GermEval 2018 shared task on the identificatio... more This paper describes our system submission for the GermEval 2018 shared task on the identification of German hate speech in Tweets at Konvens 2018. We trained and tested a Logistic Regression classifier with 10-fold cross validation using character ngrams as features. We achieved a macro F1 of 76.72 for the coarse-grained classification task and 47.17 for the fine-grained task when testing the classifiers on a small development set we created
The present work deals with image segmentation which results in the subdivision of an image into ... more The present work deals with image segmentation which results in the subdivision of an image into its constituent regions or objects. The result of image segmentation is a set of segments that collectively cover the entire image or a set of contours extracted from the image. Each of the pixels in a region are similar with respect to some characteristic or computed property, such as color, intensity or texture. Specifically this project deals with texture segmentation of an image to find out the different types of textures present in the image. In this project different type of procedures have been followed to carry out texture segmentation. Procedures starting from fundamental filter transforms till multi-resolution technique using wavelet transform have been considered. Many texture-segmentation schemes are based on a filter-bank model, where the filters called Gabor filters are derived from Gabor elementary functions. Both linear and circular Gabor filters are studied and analyzed ...
The IJCNLP 2017 shared task on Customer Feedback Analysis focuses on classifying customer feedbac... more The IJCNLP 2017 shared task on Customer Feedback Analysis focuses on classifying customer feedback into one of a predefined set of categories or classes. In this paper, we describe our approach to this problem and the results on four languages, i.e. English, French, Japanese and Spanish. Our system implemented a bidirectional LSTM (Graves and Schmidhuber, 2005) using pre-trained glove (Pennington et al., 2014) and fastText (Joulin et al., 2016) embeddings, and SVM (Cortes and Vapnik, 1995) with TF-IDF vectors for classifying the feedback data which is described in the later sections. We also tried different machine learning techniques and compared the results in this paper. Out of the 12 participating teams, our systems obtained 0.65, 0.86, 0.70 and 0.56 exact accuracy score in English, Spanish, French and Japanese respectively. We observed that our systems perform better than the baseline systems in three languages while we match the baseline accuracy for Japanese on our submitted ...
Uploads
Papers by Pruthwik Mishra