Shahid Mushtaq
Northern Borders University, Languages and Translation, Faculty Member
- Buraydah Private College, Dentistry, Faculty Memberadd
- Applied Linguistics, Teaching English as a Second Language, Teaching English As A Foreign Language, Computational Linguistics, Kashmiri Resource Creation, Treebanking, and 24 moreDependency Grammars, Corpus Linguistics, Syntax, Semantics, Morphology, Psycholinguistics, Language Planning and Policy, Technology-enhanced Learning, Instructional Technology, Instructional Design, E-learning, Educational Technology, Hybrid Learning, Turkish Corpus Linguistics, Philosophy, Education, Communication, Gender Studies, Data Mining, Literature, Languages and Linguistics, Research Methodology, English, and English Literatureedit
Morphological analyzer is a fundamental tool in Natural Language Processing (NLP) that generates the morphological analyses of a given word-form. It can be used in enhancing the accuracy of POS-Tagging, Chunking, Syntactic Parsing, Word... more
Morphological analyzer is a fundamental tool in Natural Language Processing (NLP) that generates the morphological analyses of a given word-form. It can be used in enhancing the accuracy of POS-Tagging, Chunking, Syntactic Parsing, Word Sense Disambiguation (WSD), Information Retrieval (IR) & Machine Translation (MT) Systems. This paper describes an ongoing effort to develop Nepali morphological analyzer, using an open source platform-Apertium (LT-Toolbox). Since, it is the initial stage of this projectwe have confined our work to inflectional morphology. So far, we have covered all the possible categories, as per LDC-IL1 POS tag-set of Nepali. Currently, the coverage of Nepali Morph-Analyzer is 20,000 words, classified into 219 paradigms
Research Interests:
Research Interests:
The present paper explores passives in Kashmiri, a Northwestern Dardic language of the Indo-Aryan family. Though Kashmiri has some special features like V-2 phenomenon, pronominal clitics etc. it has an analytic passive construction like... more
The present paper explores passives in Kashmiri, a Northwestern Dardic language of the Indo-Aryan family. Though Kashmiri has some special features like V-2 phenomenon, pronominal clitics etc. it has an analytic passive construction like its Indo-Aryan counterparts. The internal argument surfaces as the subject of the passive, where the participial/infinitival verbal form - nI is added to the verb root followed by a periphrastic auxiliary yun ‘to come’ in perfective form. The agent of the action is in the form of athi or zaryi ( by / through ) and is preferably omitted. This optionality casts a doubt on its status – whether it is an adjunct or an argument. The promotion of the internal argument to the subject position is another key issue. The present paper investigates the above issues and claims that the Kashmiri passive construction is also a kind of ACTIVE-Passive and not really passive as in English. It is argued that in Kashmiri passives, the underlying subject remains an active subject and the underlying object does not become the surface subject. To prove this claim, some tests based on anaphora binding, pronominal co-reference, control, etc. are applied.
Research Interests:
Treebank is a basic language resource for training and testing syntactic parser which forms a key module in various NLP systems like machine translation system. This paper reports an ongoing research of building dependency treebank for... more
Treebank is a basic language resource for training and testing syntactic parser which forms a key module in various NLP systems like machine translation system. This paper reports an ongoing research of building dependency treebank for Kashmiri (KashTreeBank) and discusses some main annotation issues. The paper is based on the pilot annotation of 500 sentences.
Research Interests:
Research Interests:
The author, while working on a corpus linguistics project, has realized the importance of language resources, mainly the syntactically annotated text corpus, but has felt a severe vacuum in terms of such resources and the related research... more
The author, while working on a corpus linguistics project, has realized the importance of language resources, mainly the syntactically annotated text corpus, but has felt a severe vacuum in terms of such resources and the related research in his native language, Kashmiri. Considering this scenario, he has tried to address this problem in his PhD research. He started his work from scratch and developed a small-scale dependency treebank for Kashmiri (KashTreebank). This book is based on his PhD dissertation. It provides the necessary information about the theoretical and practical issues of developing a treebank, particularly in a resource-poor scenario. Since, these days, treebanks are in high demand, not only for training and testing syntactic parsers but also for promoting empirical research in linguistics, this book can serve as a basic source of information for developing large-scale treebanks. It will be more helpful for resource creation projects, aspiring computational linguists and language engineers, especially those interested in syntactic parsing, corpus linguistics and experimental syntax.