An NLP library for the Urdu language. It comes with many batteries-included features to help you process Urdu data as easily as possible.
Updated Jan 4, 2024 - Python
UTRNet: High-Resolution Urdu Text Recognition In Printed Documents (ICDAR'23)
Compilation of a manually tagged Roman Urdu dataset (Urdu written in Latin/Roman script), along with other helpful Roman Urdu NLP resources
Repository dedicated to a collection of resources and helpful material for Urdu language processing tasks
Pashto Natural Language Processing Toolkit
Fake news detection using Naïve Bayes in Python, with a confusion matrix computed using sklearn.
An Urdu word spell checker using the noisy channel model, implemented in Python 3.
Urdu poetry generation using spaCy in Python. Poetry is generated with unigram, bigram, and trigram models, as well as a bidirectional bigram model and a backward bigram model.
A simple Python-based Urdu stemmer that tries to find a word's stem using a list of affixes.
A Sequence to Sequence Model Implementation of Urdu Natural Language Processing
A list of most frequently used Roman Urdu words with different spellings and usages to help make Roman Urdu text processing easier.
High-quality synthetic text data generation for Urdu Text Recognition
The first Urdu search engine crawler for the web.
UrduFeel: Deep Learning Sentiment Analysis for Emotional Insights
AI-based train reservation system that users chat with in Urdu.
This project contains Urdu characters and some preprocessing functions
List of the most-used words (stop words) of Urdu, approximately 300 words.
We present a new dataset for question-answering models. It contains 27 Urdu paragraphs taken from various available resources, i.e. Urdu Wikipedia, YouTube, news articles, etc. Each selected paragraph has 3 to 7 questions on average, along with 1 to 3 possible answers each. The data c…
This project is a grapheme-to-phoneme (G2P) converter for the Urdu language. It can generate lexicons for Urdu words using a deep learning model.
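Several entries above describe rule-based techniques, such as the stemmer that looks up a stem from a list of affixes. As a rough illustration of that idea only, the sketch below strips the longest matching suffix from a word. It is not taken from any of the listed repositories: the function name `strip_suffix`, the `min_stem_len` parameter, and the three-entry `SUFFIXES` list are hypothetical and chosen for illustration.

```python
# A minimal sketch of affix-based stemming, under stated assumptions.
# SUFFIXES is a tiny illustrative list (common plural endings), not the
# affix inventory of any listed project; a real stemmer would need a much
# larger, linguistically curated list and would handle prefixes too.
SUFFIXES = ["وں", "یں", "ات"]


def strip_suffix(word: str, suffixes=SUFFIXES, min_stem_len: int = 2) -> str:
    """Return `word` with the longest matching suffix removed, if any.

    The word is left unchanged when no suffix matches or when stripping
    would leave fewer than `min_stem_len` characters.
    """
    # Try longer suffixes first so the longest match wins.
    for suffix in sorted(suffixes, key=len, reverse=True):
        if word.endswith(suffix) and len(word) - len(suffix) >= min_stem_len:
            return word[: -len(suffix)]
    return word


if __name__ == "__main__":
    for w in ["کتابوں", "کتابیں", "سوالات"]:
        print(w, "->", strip_suffix(w))
```

Pure suffix stripping like this over-stems words that merely happen to end in one of the listed character sequences, which is why practical stemmers usually add exception lists or dictionary checks.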