Environment is the predominant factor for deciding the appropriation and abundanceof Orthoptera s... more Environment is the predominant factor for deciding the appropriation and abundanceof Orthoptera species globally. During present study, the impact of major environmental factors on abrupt occurrence of Orthoptera species was examined. Orthopterans landed on vast agricultural fields of cotton, wheat, maize, sugarcane and other standing crops in fertile parts and sandy regions of district Rahim Yar Khan, Punjab, Pakistan. The survey on different localities of district Rahim Yar Khan, Punjab was conducted from January to December 2019, special attention was paid during rainy season. The total 4908 specimens of order Orthoptera, suborder Caelifera belonged to 8 species were collected from different localities of tehsils Sadiqabad, Rahim Yar Khan, Khanpur and Liaqatpur of district Rahim Yar Khan. The highest number of population, 2080 belonged to Schistocerca gregaria and lowest number of individuals belonged to Acrida exaltata were 135. However, other six species i.e., Peokilocerus pict...
This study describes a Natural Language Processing (NLP) toolkit, as the first contribution of a ... more This study describes a Natural Language Processing (NLP) toolkit, as the first contribution of a larger project, for an under-resourced language—Urdu. In previous studies, standard NLP toolkits have been developed for English and many other languages. There is also a dire need for standard text processing tools and methods for Urdu, despite it being widely spoken in different parts of the world with a large amount of digital text being readily available. This study presents the first version of the UNLT (Urdu Natural Language Toolkit) which contains three key text processing tools required for an Urdu NLP pipeline; word tokenizer, sentence tokenizer, and part-of-speech (POS) tagger. The UNLT word tokenizer employs a morpheme matching algorithm coupled with a state-of-the-art stochastic n-gram language model with back-off and smoothing characteristics for the space omission problem. The space insertion problem for compound words is tackled using a dictionary look-up technique. The UN...
Belt and Road Initiative (BRI) projects in Tajikistan are significant due to Tajikistan's geo... more Belt and Road Initiative (BRI) projects in Tajikistan are significant due to Tajikistan's geostrategic importance in the region. Tajikistan has channeled its geo-strategic values and established diplomatic and economic relations with regional and extra-regional states. China and Tajikistan's diplomatic relations were started in 1997, which marked another milestone when countries resolved their border disputes bilaterally in 2021. Since China introduced its BRI, its existing diplomatic relations with Tajikistan have changed to close economic ties. Hence, it uplifted the pace of existing economic projects between China and Tajikistan, with the introduction of new projects, some branded as BRI projects, and others are complementing it. In this scenario, China through the BRI provides an opportunity for Pakistan and Tajikistan to enhance and strengthen their economic and diplomatic ties. Both Pakistan and Tajikistan need a great deal of cooperation for further progress of both s...
The Cross-Language English-Urdu Corpus (CLEU) has source text in English while the derived text i... more The Cross-Language English-Urdu Corpus (CLEU) has source text in English while the derived text is in Urdu. It contains in total 3,235 sentence/passage pairs manually tagged into three categories i.e., near copy, paraphrased copy and independently written.
Language resources, such as corpora, are important for various natural language processing tasks.... more Language resources, such as corpora, are important for various natural language processing tasks. Urdu has millions of speakers around the world but it is under-resourced in terms of standard evaluation resources. This paper reports the construction of a benchmark corpus for Urdu summaries (abstracts) to facilitate the development and evaluation of single document summarization systems for Urdu language. In Urdu, space does not always mark word boundary. Therefore, we created two versions of the same corpus. In the first version, words are separated by space. In contrast, proper word boundaries are manually tagged in the second version. We further apply normalization, part-of-speech tagging, morphological analysis, lemmatization, and stemming for the articles and their summaries in both versions. In order to apply these annotations, we re-implemented some NLP tools for Urdu. We provide Urdu Summary Corpus, all these annotations and the needed software tools (as open-source) for rese...
2016 Sixth International Conference on Innovative Computing Technology (INTECH), 2016
Textual description of business process models can be generated either manually or automatically.... more Textual description of business process models can be generated either manually or automatically. Both ways of generating descriptions have their strengths and weaknesses. However, it is not clear, how similar or different are the two descriptions (manually generated and automatic generated). Once the answer to this question is known, it may lead to several conclusions, such as, the two descriptions can be used as an alternative to each other or not. To answer that question, in this study we have generated textual descriptions of 552 process models using two approaches (1) manual textual description approach and (2) automatic textual description approach. Subsequently, we apply three similarity estimation models including η-gram overlap, Longest Common Subsequence, and Vector Space Model to compute the degree of similarity between manual-automatic textual description pairs. Results show that the two types of textual descriptions are significantly different from each other and thus t...
This paper describes our participation for the Bots and Gender Profiling task at PAN 2019. The ai... more This paper describes our participation for the Bots and Gender Profiling task at PAN 2019. The aim of this task is to first discriminate a profile either as bot or human. If the profile is written by a human, it should be further classified as male or female. Our proposed approach is based on language independent stylometry features. A total of 27 language independent stylometry features were used to build the system for Bots and Gender Profiling (18 features are character based and remaining 9 are emotion based). On training dataset, for English language, Accuracy scores of 0.97 and 0.80 are obtained for bot and human classification task and male / female classification task respectively. For Spanish language, Accuracy of 0.93 and 0.75 is obtained for bot and human classification task and male / female classification task respectively. On test dataset 1, for English language, Accuracy scores of 0.92 and 0.76 are obtained for bot and human classification task and male / female class...
Baku archipelago is well-known for high oil and gas production even at the great depth due to hig... more Baku archipelago is well-known for high oil and gas production even at the great depth due to high sedimentation rate and low compaction during the formation of basin. This paper represent the Garnulomtric analysis of core samples, in which grain size fractions are determined along with porosity of the cores. Four types of rock groups (clayey-aleuritic sand, clayey-sandy aleurolites, sandy-clayey aleurolites and clayey sandy loams) were taken into account. In first group (clayey-aleuritic sand) the dominant grain size fraction is 0.175 mm which lead to increase in porosity, in second group the dominant fractions are 0.055 mm and 0.175 which tend to increase in porosity. In clayey-sandy aleurolites fractions like 0.055 mm and 0.25 mm have negative effect on porosity. Moreover, in clayey-sandy loam deposits, the coarser grains have positive impact on porosity compared to fine grains. On the basis of results obtained, the central part of Baku Archipelago is very promising for oil and g...
Author profiling task aims to identify different traits of an author by analyzing his/her written... more Author profiling task aims to identify different traits of an author by analyzing his/her written text. This study presents a Stylometry-based approach for detection of author traits (gender and age) for cross-genre author profiles. In our proposed approach, we used different types of stylistic features including 7 lexical features, 16 syntactic features, 26 character-based features and 6 vocabulary richness (total 56 stylistic features). On the training corpus, the proposed approach obtained promising results with an accuracy of 0.787 for gender, 0.983 for age and 0.780 for both (jointly detecting age and gender). On the test corpus, proposed system gave an accuracy of 0.576 for gender, 0.371 for age and 0.256 for both.
Nanofluid is well known as smart fluid which has high ability to recover oil. Therefore, it gains... more Nanofluid is well known as smart fluid which has high ability to recover oil. Therefore, it gains more significant effect in oil and gas industry. With the low concentration of nanofiller in nanofluid is used to enhance the numerous characteristics for oil recovery applications. Then, the main feature is the size of reinforcing agent and properties along matrix medium. Nano dimensional particles suspension in polymeric matrix have major advantages are stable sedimentation, optical, mechanical, electrical, and rheological properties that can be affected during the synthesis of nanofluids. Therefore nanoparticles/polymeric nanofluid have exceptional characteristics over the conventional fluid. Mixed nanoparticles/polymeric nanofluid in the presence of surfactant have effective interfacial tension and wettability which is evident for the development of nanofluids for oil recovery. In this context, the designed experimental study of silica/PVP nanofluids is synthesized via two step meth...
ABSTRACT: The post embryonic developmental stages of Marpissa bengalensis (Araneae; Salticidac), ... more ABSTRACT: The post embryonic developmental stages of Marpissa bengalensis (Araneae; Salticidac), the 2nd most abundant predatory species in citrus orchard were collected from the experimental fruit garden, department of Horticulture, located at the campus of University of Agriculture Faisalabad andstudied. Life cycle was observed in the laboratory, which started from egg sac collected from the field along with gravid female and released into the spider cages. The incubation period ranged from 5-15 days. The average eggs hatched were 23.8 eggs/cocoon and hatching % under laboratory conditions was recorded as 73.18%. The average duration of spiderlings span on their mothers back was 7 days. An overall, mean duration of 3rd spiderling stage was of 7.46 days. All spiderlings hatched from 8 cocoons. At the 4th spiderling stage, the average duration was of 10.23 days. In the 5th spiderling stage, the spiderling spent an average of 19.82 days. The 6th and 7th spiderlings stages lasted a to...
In the recent years, many benchmark author profiling corpora have been developed for various genr... more In the recent years, many benchmark author profiling corpora have been developed for various genres including Twitter, social media, blogs, hotel reviews and e-mail, etc. However, no such standard evaluation resource has been developed for Short Messaging Service (SMS), a popular medium of communication, which is very useful for author profiling. The primary aim of this study is to develop a large multilingual (English and Roman Urdu) benchmark SMS-based author profiling corpus. The proposed corpus contains 810 author profiles, wherein each profile consists of an aggregation of SMS messages as a single document of an author, along with seven demographic traits associated with each author profile: gender, age, native language, native city, qualification, occupation and personality type (introvert/extrovert). The secondary aims of this study include the following: (1) annotating the proposed corpus for code-switching annotations at the lexical level (approximately 0.69 million tokens ...
Environment is the predominant factor for deciding the appropriation and abundanceof Orthoptera s... more Environment is the predominant factor for deciding the appropriation and abundanceof Orthoptera species globally. During present study, the impact of major environmental factors on abrupt occurrence of Orthoptera species was examined. Orthopterans landed on vast agricultural fields of cotton, wheat, maize, sugarcane and other standing crops in fertile parts and sandy regions of district Rahim Yar Khan, Punjab, Pakistan. The survey on different localities of district Rahim Yar Khan, Punjab was conducted from January to December 2019, special attention was paid during rainy season. The total 4908 specimens of order Orthoptera, suborder Caelifera belonged to 8 species were collected from different localities of tehsils Sadiqabad, Rahim Yar Khan, Khanpur and Liaqatpur of district Rahim Yar Khan. The highest number of population, 2080 belonged to Schistocerca gregaria and lowest number of individuals belonged to Acrida exaltata were 135. However, other six species i.e., Peokilocerus pict...
This study describes a Natural Language Processing (NLP) toolkit, as the first contribution of a ... more This study describes a Natural Language Processing (NLP) toolkit, as the first contribution of a larger project, for an under-resourced language—Urdu. In previous studies, standard NLP toolkits have been developed for English and many other languages. There is also a dire need for standard text processing tools and methods for Urdu, despite it being widely spoken in different parts of the world with a large amount of digital text being readily available. This study presents the first version of the UNLT (Urdu Natural Language Toolkit) which contains three key text processing tools required for an Urdu NLP pipeline; word tokenizer, sentence tokenizer, and part-of-speech (POS) tagger. The UNLT word tokenizer employs a morpheme matching algorithm coupled with a state-of-the-art stochastic n-gram language model with back-off and smoothing characteristics for the space omission problem. The space insertion problem for compound words is tackled using a dictionary look-up technique. The UN...
Belt and Road Initiative (BRI) projects in Tajikistan are significant due to Tajikistan's geo... more Belt and Road Initiative (BRI) projects in Tajikistan are significant due to Tajikistan's geostrategic importance in the region. Tajikistan has channeled its geo-strategic values and established diplomatic and economic relations with regional and extra-regional states. China and Tajikistan's diplomatic relations were started in 1997, which marked another milestone when countries resolved their border disputes bilaterally in 2021. Since China introduced its BRI, its existing diplomatic relations with Tajikistan have changed to close economic ties. Hence, it uplifted the pace of existing economic projects between China and Tajikistan, with the introduction of new projects, some branded as BRI projects, and others are complementing it. In this scenario, China through the BRI provides an opportunity for Pakistan and Tajikistan to enhance and strengthen their economic and diplomatic ties. Both Pakistan and Tajikistan need a great deal of cooperation for further progress of both s...
The Cross-Language English-Urdu Corpus (CLEU) has source text in English while the derived text i... more The Cross-Language English-Urdu Corpus (CLEU) has source text in English while the derived text is in Urdu. It contains in total 3,235 sentence/passage pairs manually tagged into three categories i.e., near copy, paraphrased copy and independently written.
Language resources, such as corpora, are important for various natural language processing tasks.... more Language resources, such as corpora, are important for various natural language processing tasks. Urdu has millions of speakers around the world but it is under-resourced in terms of standard evaluation resources. This paper reports the construction of a benchmark corpus for Urdu summaries (abstracts) to facilitate the development and evaluation of single document summarization systems for Urdu language. In Urdu, space does not always mark word boundary. Therefore, we created two versions of the same corpus. In the first version, words are separated by space. In contrast, proper word boundaries are manually tagged in the second version. We further apply normalization, part-of-speech tagging, morphological analysis, lemmatization, and stemming for the articles and their summaries in both versions. In order to apply these annotations, we re-implemented some NLP tools for Urdu. We provide Urdu Summary Corpus, all these annotations and the needed software tools (as open-source) for rese...
2016 Sixth International Conference on Innovative Computing Technology (INTECH), 2016
Textual description of business process models can be generated either manually or automatically.... more Textual description of business process models can be generated either manually or automatically. Both ways of generating descriptions have their strengths and weaknesses. However, it is not clear, how similar or different are the two descriptions (manually generated and automatic generated). Once the answer to this question is known, it may lead to several conclusions, such as, the two descriptions can be used as an alternative to each other or not. To answer that question, in this study we have generated textual descriptions of 552 process models using two approaches (1) manual textual description approach and (2) automatic textual description approach. Subsequently, we apply three similarity estimation models including η-gram overlap, Longest Common Subsequence, and Vector Space Model to compute the degree of similarity between manual-automatic textual description pairs. Results show that the two types of textual descriptions are significantly different from each other and thus t...
This paper describes our participation for the Bots and Gender Profiling task at PAN 2019. The ai... more This paper describes our participation for the Bots and Gender Profiling task at PAN 2019. The aim of this task is to first discriminate a profile either as bot or human. If the profile is written by a human, it should be further classified as male or female. Our proposed approach is based on language independent stylometry features. A total of 27 language independent stylometry features were used to build the system for Bots and Gender Profiling (18 features are character based and remaining 9 are emotion based). On training dataset, for English language, Accuracy scores of 0.97 and 0.80 are obtained for bot and human classification task and male / female classification task respectively. For Spanish language, Accuracy of 0.93 and 0.75 is obtained for bot and human classification task and male / female classification task respectively. On test dataset 1, for English language, Accuracy scores of 0.92 and 0.76 are obtained for bot and human classification task and male / female class...
Baku archipelago is well-known for high oil and gas production even at the great depth due to hig... more Baku archipelago is well-known for high oil and gas production even at the great depth due to high sedimentation rate and low compaction during the formation of basin. This paper represent the Garnulomtric analysis of core samples, in which grain size fractions are determined along with porosity of the cores. Four types of rock groups (clayey-aleuritic sand, clayey-sandy aleurolites, sandy-clayey aleurolites and clayey sandy loams) were taken into account. In first group (clayey-aleuritic sand) the dominant grain size fraction is 0.175 mm which lead to increase in porosity, in second group the dominant fractions are 0.055 mm and 0.175 which tend to increase in porosity. In clayey-sandy aleurolites fractions like 0.055 mm and 0.25 mm have negative effect on porosity. Moreover, in clayey-sandy loam deposits, the coarser grains have positive impact on porosity compared to fine grains. On the basis of results obtained, the central part of Baku Archipelago is very promising for oil and g...
Author profiling task aims to identify different traits of an author by analyzing his/her written... more Author profiling task aims to identify different traits of an author by analyzing his/her written text. This study presents a Stylometry-based approach for detection of author traits (gender and age) for cross-genre author profiles. In our proposed approach, we used different types of stylistic features including 7 lexical features, 16 syntactic features, 26 character-based features and 6 vocabulary richness (total 56 stylistic features). On the training corpus, the proposed approach obtained promising results with an accuracy of 0.787 for gender, 0.983 for age and 0.780 for both (jointly detecting age and gender). On the test corpus, proposed system gave an accuracy of 0.576 for gender, 0.371 for age and 0.256 for both.
Nanofluid is well known as smart fluid which has high ability to recover oil. Therefore, it gains... more Nanofluid is well known as smart fluid which has high ability to recover oil. Therefore, it gains more significant effect in oil and gas industry. With the low concentration of nanofiller in nanofluid is used to enhance the numerous characteristics for oil recovery applications. Then, the main feature is the size of reinforcing agent and properties along matrix medium. Nano dimensional particles suspension in polymeric matrix have major advantages are stable sedimentation, optical, mechanical, electrical, and rheological properties that can be affected during the synthesis of nanofluids. Therefore nanoparticles/polymeric nanofluid have exceptional characteristics over the conventional fluid. Mixed nanoparticles/polymeric nanofluid in the presence of surfactant have effective interfacial tension and wettability which is evident for the development of nanofluids for oil recovery. In this context, the designed experimental study of silica/PVP nanofluids is synthesized via two step meth...
ABSTRACT: The post embryonic developmental stages of Marpissa bengalensis (Araneae; Salticidac), ... more ABSTRACT: The post embryonic developmental stages of Marpissa bengalensis (Araneae; Salticidac), the 2nd most abundant predatory species in citrus orchard were collected from the experimental fruit garden, department of Horticulture, located at the campus of University of Agriculture Faisalabad andstudied. Life cycle was observed in the laboratory, which started from egg sac collected from the field along with gravid female and released into the spider cages. The incubation period ranged from 5-15 days. The average eggs hatched were 23.8 eggs/cocoon and hatching % under laboratory conditions was recorded as 73.18%. The average duration of spiderlings span on their mothers back was 7 days. An overall, mean duration of 3rd spiderling stage was of 7.46 days. All spiderlings hatched from 8 cocoons. At the 4th spiderling stage, the average duration was of 10.23 days. In the 5th spiderling stage, the spiderling spent an average of 19.82 days. The 6th and 7th spiderlings stages lasted a to...
In the recent years, many benchmark author profiling corpora have been developed for various genr... more In the recent years, many benchmark author profiling corpora have been developed for various genres including Twitter, social media, blogs, hotel reviews and e-mail, etc. However, no such standard evaluation resource has been developed for Short Messaging Service (SMS), a popular medium of communication, which is very useful for author profiling. The primary aim of this study is to develop a large multilingual (English and Roman Urdu) benchmark SMS-based author profiling corpus. The proposed corpus contains 810 author profiles, wherein each profile consists of an aggregation of SMS messages as a single document of an author, along with seven demographic traits associated with each author profile: gender, age, native language, native city, qualification, occupation and personality type (introvert/extrovert). The secondary aims of this study include the following: (1) annotating the proposed corpus for code-switching annotations at the lexical level (approximately 0.69 million tokens ...
Uploads
Papers by MUHAMMAD NAWAB