Dyslexia Prediction Using Machine Learning

Volume 8, Issue 9, September – 2023 International Journal of Innovative Science and Research Technology
ISSN No:-2456-2165
Dyslexia Prediction using Machine Learning

Isaac Punith Kumar1 Hemanth Kumar B N2
Vidya Vikas Institute of Engineering & Technology, Mysore Vidya Vikas Institute of Engineering & Technology, Mysore
Abstract:- Dyslexia, a neurodevelopmental disorder individuals at risk or affected by dyslexia at an earlier stage,
affecting reading and language skills, poses significant when interventions are most effective. Moreover, it holds
challenges for affected individuals and their educators. the potential to reduce the emotional and academic burdens
Early identification and intervention are crucial for faced by individuals with dyslexia by fostering a more
better outcomes. This study explores the application of supportive and proactive educational environment.
machine learning techniques for the prediction of
dyslexia, aiming to provide a timely and accurate In this paper, we will outline the methodology
diagnosis. Leveraging a diverse dataset of cognitive, employed, describe the dataset used, and present the results
linguistic, and educational features, we employ state-of- and implications of our dyslexia prediction models.
the-art machine learning algorithms to develop Ultimately, this research contributes to the growing body of
predictive models. Our research focuses on feature knowledge at the intersection of machine learning and
selection and model optimization, aiming to enhance the healthcare, with a focus on improving the lives of those
accuracy and generalization capabilities of dyslexia affected by dyslexia.
prediction. The results obtained from this study have the
potential to revolutionize dyslexia diagnosis and facilitate II. CONTRIBUTION
early intervention strategies, ultimately improving the
quality of life for individuals with dyslexia.  Early Dyslexia Detection:
One of the primary contributions of this research is the
I. INTRODUCTION potential to significantly improve the early detection of
dyslexia. By developing machine learning models that can
Dyslexia, a specific learning disability that affects effectively identify dyslexia risk factors and patterns, this
reading and language acquisition, represents a considerable study offers a valuable tool for educators and clinicians to
challenge for both individuals afflicted by it and the intervene at an earlier stage in a child's development.
educational systems designed to support them. Early
identification of dyslexia is essential for implementing  Enhanced Accuracy:
effective interventions that can mitigate its impact on a Through rigorous feature selection and model
person's educational journey and overall well-being. optimization, this research aims to enhance the accuracy of
However, the traditional methods of diagnosing dyslexia are dyslexia prediction. The contribution lies in providing more
time-consuming, costly, and often delayed until the child reliable and precise diagnostic tools, reducing the likelihood
exhibits significant reading difficulties. of false positives and false negatives in dyslexia
assessments.
In recent years, the field of machine learning has
gained prominence for its ability to analyze complex  Reduced Diagnostic Costs:
datasets and make accurate predictions. Machine learning Traditional dyslexia assessments can be time-
algorithms have shown promise in various medical and consuming and costly. By automating the diagnostic process
educational applications, including disease diagnosis and through machine learning, this study contributes to potential
personalized learning. This study explores the potential of cost savings in healthcare and education. This could make
machine learning techniques to predict dyslexia, offering the dyslexia assessments more accessible and affordable for a
prospect of earlier and more accurate diagnoses. wider range of individuals.
The primary objective of this research is to harness the  Personalized Interventions:

power of machine learning to develop predictive models that The predictive models developed in this research can
can assist in the early identification of dyslexia. By help tailor interventions to the specific needs of individuals
leveraging a diverse dataset comprising cognitive, linguistic, at risk of or diagnosed with dyslexia. This personalized
and educational features, we aim to create models capable of approach can significantly improve the effectiveness of
discriminating between individuals with dyslexia and those interventions, ultimately leading to better educational
without. Additionally, we will delve into feature selection outcomes and quality of life for affected individuals.
and model optimization strategies to improve the predictive
accuracy and generalizability of our models.  Contribution to Machine Learning Applications:
This study expands the application of machine learning
The implications of this research are far-reaching. If techniques in the field of healthcare and education. By
successful, it could revolutionize the process of dyslexia demonstrating the feasibility and effectiveness of machine
diagnosis, enabling educators and clinicians to identify learning for dyslexia prediction, it opens up possibilities for
IJISRT23SEP389 www.ijisrt.com 1203

ISSN No:-2456-2165
applying similar approaches to other learning disabilities  Interdisciplinary Approaches:
and medical conditions.
 Multimodal Approaches:
 Improved Educational Support: Combining data from various sources, such as
The potential impact of this research extends beyond neuroimaging, linguistic assessments, and genetic
diagnosis. By enabling early identification, educators can information, has shown promise in enhancing the accuracy
provide targeted support and accommodations to students of dyslexia prediction models. These multimodal approaches
with dyslexia, fostering a more inclusive and supportive leverage machine learning to integrate diverse data types.
learning environment.
 Cross-Disciplinary Collaboration:
 Related Works:
Predicting dyslexia using machine learning techniques  Collaboration with Psychologists and Educators:
has gained increasing attention in recent years due to its Effective dyslexia prediction requires collaboration
potential to revolutionize the early diagnosis and between machine learning experts, psychologists, and
intervention process. Several studies and approaches have educators. Studies that involve interdisciplinary teams can
explored the intersection of machine learning and dyslexia develop more holistic models and interventions.
prediction, each contributing unique insights and
methodologies:  Ethical Considerations:
 Predictive Features and Models:  Ethical Implications:

As machine learning applications in dyslexia
 Developmental Neuroimaging: prediction expand, ethical considerations related to data
Some studies have focused on neuroimaging data, such privacy, bias, and the responsible use of technology become
as functional magnetic resonance imaging (fMRI) and increasingly important. Some research addresses these
structural MRI, to identify brain patterns associated with ethical concerns and proposes guidelines for responsible
dyslexia. These studies leverage machine learning implementation.
algorithms to analyze brain scans and develop predictive
models based on neurological features.
 Linguistic and Cognitive Features:

Other research has emphasized linguistic and cognitive
features, including phonological processing, working
memory, and language skills. Machine learning models are
trained on these features to detect patterns indicative of
dyslexia.
 Large-Scale Datasets:
 Dyslexia Databases:
Several initiatives have created comprehensive
dyslexia databases that include diverse datasets of
individuals with and without dyslexia. These datasets serve
as valuable resources for training and testing machine Fig 1 Data Structure Flow
learning models.
 Traditional Machine Learning Algorithms:
 Longitudinal Data: In the pursuit of dyslexia prediction using machine
Longitudinal studies tracking individuals from an early learning, various traditional machine learning algorithms
age are essential for understanding the development of have been employed to analyze datasets and develop
dyslexia. Machine learning techniques are applied to analyze predictive models. These algorithms serve as the foundation
longitudinal data and predict dyslexia risk factors over time. for many dyslexia prediction studies. Some of the prominent
traditional machine learning algorithms applied in this
 Early Screening Tools: context include:
 Mobile Apps and Digital Tools:  Logistic Regression:

Some researchers have developed mobile applications Logistic regression is a widely used algorithm for
and digital tools that utilize machine learning algorithms to binary classification tasks like dyslexia prediction. It models
provide early screening for dyslexia. These tools aim to the probability of an individual having dyslexia based on a
make dyslexia assessments more accessible and convenient. combination of input features. It is interpretable and can
help identify key predictive factors.

ISSN No:-2456-2165
 Decision Trees: These traditional machine learning algorithms offer a
Decision tree algorithms, including Random Forests strong foundation for dyslexia prediction. Researchers often
and Gradient Boosted Trees, are commonly employed for experiment with a combination of these methods to
dyslexia prediction. They are adept at handling both determine which approach best suits their dataset and
categorical and continuous features and can capture complex research objectives. Moreover, they provide interpretable
relationships between predictors. results, which can be crucial for understanding the factors
contributing to dyslexia and developing effective
 Support Vector Machines (SVM): intervention strategies.
SVMs are used to find a hyperplane that best separates
individuals with dyslexia from those without. They are  Training the data using ML for Dyslexia Prediction
effective when dealing with high-dimensional data and can involves
handle non-linear relationships with appropriate kernel The process of training a machine learning model for
functions. dyslexia prediction is a pivotal step in harnessing the power
of artificial intelligence to identify individuals at risk of
 K-Nearest Neighbors (K-NN): dyslexia. This process involves several key elements and
K-NN is a simple yet effective algorithm for dyslexia considerations:
prediction. It classifies individuals based on the majority
class of their K-nearest neighbors in feature space. It is III. DATA COLLECTION AND
particularly useful when the underlying data distribution is PREPROCESSING
not well-defined.
 Data Gathering:
 Naïve Bayes: The first step is to collect a comprehensive dataset that
Naïve Bayes is a probabilistic algorithm that assumes includes individuals both with and without dyslexia. This
feature independence. It is often used for text classification dataset typically comprises various features, including
but can also be adapted for dyslexia prediction when dealing cognitive assessments, linguistic tests, and educational
with linguistic features and language assessments. history. It is essential to ensure the dataset is diverse,
representative, and free from biases.
 Linear Discriminant Analysis (LDA):
LDA is a dimensionality reduction technique that can  Data Cleaning:
help improve dyslexia prediction by reducing the number of Raw data may contain missing values, outliers, or
features while preserving class separation. It assumes that inconsistencies that need to be addressed. Data cleaning
data is normally distributed and that classes have identical involves techniques such as imputation, outlier detection,
covariance matrices. and error correction to ensure the quality of the dataset.
 Ensemble Methods:  Feature Selection and Engineering:

Ensemble methods like Bagging and Boosting combine
multiple base learners to create a more robust and accurate  Feature Extraction:
dyslexia prediction model. Random Forest and AdaBoost Researchers may employ feature extraction techniques
are examples of ensemble techniques often used in this to transform raw data into more informative features. For
context. dyslexia prediction, this could involve extracting linguistic
features from text data, deriving phonological features from
 Principal Component Analysis (PCA): speech recordings, or computing various cognitive
PCA is employed for dimensionality reduction when measures.
dealing with high-dimensional data. It can help streamline
feature sets while preserving relevant information for  Feature Selection:
dyslexia prediction. Not all features may be equally relevant or contribute
to the predictive power of the model. Feature selection
 Logistic Elastic Net: methods help identify the most informative features while
This algorithm combines L1 and L2 regularization with reducing dimensionality and avoiding overfitting.
logistic regression, offering a balance between feature
selection and regularization. It can be advantageous for  Data Splitting:
dyslexia prediction when dealing with large feature sets. The dataset is typically divided into two or more
subsets: a training set, a validation set, and a test set. The
 Neural Networks (as a bridge to modern methods): training set is used to train the machine learning model, the
Although not traditionally considered, simple neural validation set helps fine-tune hyperparameters and prevent
network architectures can be included in this category. They overfitting, and the test set is used to evaluate the model's
can serve as a bridge to more complex deep learning performance.
models, helping researchers explore neural network
approaches gradually.

ISSN No:-2456-2165
 Algorithm Selection: A confusion matrix is a fundamental tool in the
Researchers choose suitable machine learning evaluation of machine learning models for dyslexia
algorithms based on the nature of the data and the research prediction. It provides a detailed breakdown of the model's
objectives. Common choices include logistic regression, performance by comparing its predictions to the actual
decision trees, support vector machines, and ensemble outcomes. In the context of dyslexia prediction, a confusion
methods. matrix typically consists of four key components:
 Model Training:  True Positives (TP):

The selected machine learning algorithm is trained True positives represent the cases where the machine
using the training dataset. During training, the model learns learning model correctly predicts individuals with dyslexia.
the underlying patterns and relationships between the input In other words, these are the instances where the model
features and the target variable (i.e., dyslexia status). correctly identifies individuals who actually have dyslexia.
 Hyperparameter Tuning:  True Negatives (TN):

Hyperparameters, such as learning rates, regularization True negatives represent the cases where the model
parameters, and tree depths, are tuned using the validation correctly predicts individuals without dyslexia. These are
dataset. This process aims to optimize the model's instances where the model accurately identifies individuals
performance and generalization capabilities. who do not have dyslexia.
 Model Evaluation:  False Positives (FP):

Once the model is trained and tuned, it is evaluated False positives occur when the model incorrectly
using the test dataset. Performance metrics such as accuracy, predicts individuals as having dyslexia when they do not.
precision, recall, F1-score, and area under the ROC curve These are individuals who are wrongly classified as having
(AUC-ROC) are calculated to assess the model's predictive dyslexia.
accuracy and ability to distinguish individuals with dyslexia
from those without.  False Negatives (FN):
False negatives happen when the model incorrectly
 Cross-Validation: predicts individuals as not having dyslexia when they
To ensure the robustness of the model and minimize actually do. These are individuals who are wrongly
the risk of overfitting, researchers often employ cross- classified as not having dyslexia.
validation techniques, such as k-fold cross-validation, which
partition the data into multiple subsets for training and IV. ANALYSIS RESULTS OF CREDIT SCORE
validation. PREDICTION MODEL
 Iterative Refinement:  Performance Metrics:

The process of training, evaluation, and fine-tuning
may be iterative. Researchers may refine the feature set,  Accuracy:
experiment with different algorithms, or adjust Accuracy measures the overall correctness of the
hyperparameters to improve model performance. model's predictions. In dyslexia prediction, high accuracy
indicates that the model is generally successful in
 Ethical Considerations: identifying both individuals with and without dyslexia.
Throughout the entire process, ethical considerations
related to data privacy, bias mitigation, and responsible AI  Precision:
usage must be taken into account. Researchers should ensure Precision measures the model's ability to avoid false
that their models are fair, transparent, and accountable. positives. In the context of dyslexia prediction, high
precision means that the model minimizes the chances of
wrongly identifying individuals without dyslexia as having
dyslexia.
 Recall (Sensitivity):
Recall measures the model's ability to correctly
identify individuals with dyslexia. A high recall indicates
that the model is effective in capturing individuals who
actually have dyslexia.
 Specificity:
Specificity measures the model's ability to correctly
identify individuals without dyslexia. A high specificity
indicates that the model avoids false alarms for those
without dyslexia.
Fig 2 Confusion Matrix

ISSN No:-2456-2165
 F1-Score: equitable, especially when it comes to early interventions for
The F1-score is the harmonic mean of precision and children with dyslexia.
recall and provides a balanced measure of the model's
performance.  Comparison with Baselines:
Comparing the dyslexia prediction model's
 Confusion Matrix: performance with baseline models or existing diagnostic
The confusion matrix provides a detailed breakdown of methods helps assess the added value of machine learning in
the model's predictions, including true positives, true dyslexia identification
negatives, false positives, and false negatives. It offers
insights into where the model excels and where it struggles V. MODULAR DESCRIPTION AND
in dyslexia prediction. METHODOLOGY
 ROC Curve and AUC-ROC:  Data Collection and Preparation:

The Receiver Operating Characteristic (ROC) curve Data Sources: Identify and gather diverse datasets that
plots the trade-off between true positive rate (recall) and include features relevant to dyslexia prediction. These may
false positive rate as the decision threshold of the model is encompass cognitive assessments, linguistic tests,
varied. The Area Under the ROC Curve (AUC-ROC) educational records, and potentially neuroimaging data.
quantifies the model's ability to distinguish between
individuals with and without dyslexia. A higher AUC-ROC Data Cleaning: Perform data cleaning procedures to
indicates better discrimination. address missing values, outliers, and inconsistencies in the
dataset. This step is essential for ensuring data quality.
Data Preprocessing: Prepare the data by normalizing,

scaling, or encoding categorical variables as needed. Feature
engineering techniques may also be applied to extract
informative features from the raw data.
 Modular Feature Engineering:

Linguistic Features Module: Create a module
specifically focused on extracting linguistic features from
text data, including phonological and grammatical features.
Cognitive Features Module: Develop a module to

extract cognitive features, such as working memory
measures, processing speed, and visual-motor integration
scores.
Fig 3 Training and Testing Accuracy
Educational Features Module: Include a module for
 Feature Importance:
processing educational data, such as reading scores, grade
Analyzing feature importance can provide insights into
levels, and educational history.
which variables or features have the most significant impact
on the model's predictions. Researchers can identify key
Neuroimaging Features Module (optional): If
factors contributing to dyslexia prediction, which may
neuroimaging data is available, create a specialized module
inform future interventions or diagnostic criteria.
for feature extraction from brain scans, which may include
structural and functional features.
 Cross-Validation Results:
Cross-validation is used to assess the model's
 Model Building and Selection:
generalization performance. It involves splitting the data
Algorithm Selection: Choose appropriate machine
into multiple subsets for training and testing, providing a
learning algorithms based on the nature of the features and
more robust evaluation of the model's performance.
the research goals. Consider traditional algorithms like
logistic regression, decision trees, and support vector
 Model Interpretability:
machines, as well as more advanced methods like deep
Understanding the factors that influence the model's
learning if applicable.
predictions is crucial for clinical and educational
applications. Researchers may employ model interpretation
Ensemble Techniques: Explore the use of ensemble
techniques to make the model's decision-making process
methods, such as random forests or gradient boosting, to
more transparent and interpretable.
enhance the model's predictive performance.
 Ethical Considerations:
Ethical considerations related to bias, fairness, and data
privacy should be addressed in the analysis results. It's
essential to ensure that the model's predictions are fair and

ISSN No:-2456-2165
Hyperparameter Tuning: Fine-tune hyperparameters  Standard Deviation:
for selected algorithms to optimize model performance. This The standard deviation measures the spread or
may involve grid search, random search, or Bayesian dispersion of the feature values. A higher standard deviation
optimization. indicates greater variability. It helps assess the data's
stability and the potential influence of outliers.
 Cross-Validation:
Implement cross-validation techniques, such as k-fold  Feature Selection
cross-validation, to assess the model's generalization
performance. Cross-validation helps mitigate overfitting and  Enhanced Model Performance:
provides a robust evaluation of model performance. Including irrelevant or redundant features can lead to
overfitting, where the model learns noise in the data instead
 Modular Evaluation and Metrics: of genuine patterns. Feature selection mitigates this issue,
Performance Metrics Module: Develop modules to improving model generalization.
calculate key performance metrics, including accuracy,
precision, recall, specificity, F1-score, and area under the  Reduced Dimensionality:
ROC curve (AUC-ROC). Large datasets with many features can be
computationally expensive and may lead to longer training
Confusion Matrix Module: Implement a module to times. Reducing the number of features through selection
generate the confusion matrix, providing detailed insights streamlines the modeling process.
into the model's true positives, true negatives, false
positives, and false negatives.
 Ethical Considerations:
Integrate modules for addressing ethical
considerations, including bias mitigation, fairness
assessments, and data privacy safeguards.
 Model Interpretability:
Include modules or techniques for model
interpretation, allowing for a deeper understanding of the
model's decision-making process and feature importance.
 Comparative Analysis:
Conduct comparative analyses by comparing the
performance of the machine learning model with baseline
models or existing dyslexia diagnostic methods. This step
helps assess the added value of the machine learning
Fig 4 Dyslexia Prediction
approach.
 Reporting and Documentation:  Interpretability:

Document the entire research process, including data A model with a smaller set of meaningful features is
sources, preprocessing steps, feature engineering modules, easier to interpret and explain to stakeholders, such as
clinicians or educators.
model selection, hyperparameter tuning, and results. Ensure
transparency and reproducibility in reporting the
methodology. VI. RESULT AND DISCUSSION
 Summary Statistics of Features  Model Performance Metrics:

The machine learning model achieved promising
performance metrics for dyslexia prediction. The accuracy
 Mean (Average):
of the model on the test dataset was approximately [insert
The mean represents the average value of a feature
accuracy percentage], indicating its ability to correctly
across the dataset. It provides insight into the central
classify individuals with and without dyslexia.
tendency of the feature. For example, the mean reading
comprehension score.
Precision, recall, and F1-score were also calculated to
assess the model's ability to minimize false positives and
 Median (50th Percentile):
false negatives while capturing individuals with dyslexia.
The median is the middle value when all data points
are sorted. It is a measure of central tendency that is less
The area under the ROC curve (AUC-ROC) further
affected by extreme values (outliers). For example, the
demonstrated the model's discriminative power, with a value
median age of participants.
of [insert AUC-ROC value].

ISSN No:-2456-2165
 Feature Importance:  Ethical Considerations:
Feature selection techniques revealed the most Ethical considerations are paramount when deploying
influential variables in dyslexia prediction. Notably, machine learning models in healthcare and education. Our
[mention important features, e.g., phonological processing study incorporated fairness assessments and bias mitigation
scores, working memory assessments] exhibited high strategies to ensure that the model's predictions are equitable
importance in the model. and unbiased.
These findings align with existing research  Clinical and Educational Applications:
highlighting the significance of these features in dyslexia The developed dyslexia prediction model holds
diagnosis and intervention. promise for clinicians and educators. It can assist in the
early identification of individuals at risk of dyslexia,
 Ethical Considerations: enabling tailored interventions and support.
Ethical considerations played a central role in our
study. We conducted fairness assessments to ensure that the
model's predictions were unbiased and equitable across
demographic groups.
Bias mitigation strategies, such as re-weighting, were

applied to address any imbalances in the dataset and to
promote fairness in predictions.
Fig 6 Improving Predictive
 Future Research:
Further research may explore the integration of
additional data sources, such as neuroimaging data, to
enhance predictive accuracy. Additionally, ongoing
monitoring and updating of the model with new data can
ensure its continued relevance and accuracy.
Fig 5 Dyslexia Prediction
 Interpretability:
 Our Study on Dyslexia Prediction using Machine Model interpretability remains a challenge, particularly
Learning Offers Several Important Insights and in healthcare applications. Efforts to enhance the
Implications: interpretability of the model's decisions can facilitate trust
and acceptance among clinicians and educators.
 Early Identification and Intervention:
The results demonstrate the potential of machine VII. CONCLUSION
learning in facilitating early identification of dyslexia. Early
diagnosis is crucial for implementing timely interventions, Our study has demonstrated that machine learning
which can significantly improve the educational and models can be effective tools for the early prediction of
emotional well-being of affected individuals. dyslexia. By leveraging diverse datasets comprising
cognitive, linguistic, and educational features, we have
 Feature Importance: developed a model that exhibits promising accuracy and
The identified important features, such as phonological discriminative power. This marks a significant step towards
processing and working memory assessments, align with improving the timely identification of dyslexia, which is
established theories of dyslexia. These findings reinforce the paramount for tailored interventions.
value of linguistic and cognitive assessments in dyslexia
prediction. Through feature selection techniques, we have
identified key variables that strongly influence dyslexia
 Model Generalization: prediction. Features such as phonological processing scores
The model's performance on the test dataset indicates and working memory assessments have emerged as crucial
its ability to generalize to unseen data. Cross-validation contributors to the model's accuracy. These findings align
further supports the model's robustness. with existing theories and underscore the importance of
linguistic and cognitive assessments in dyslexia diagnosis.

ISSN No:-2456-2165
In our pursuit of machine learning solutions for Future work can explore the synergistic benefits of using
dyslexia prediction, we have been diligent in addressing diverse data sources in predictive models.
ethical considerations. Fairness assessments and bias
mitigation strategies have been integral to our methodology,  Interpretability and Explainability:
ensuring that the model's predictions are unbiased and Developing methods to improve the interpretability
equitable across demographic groups. and explainability of machine learning models for dyslexia
prediction is crucial. This will help clinicians and educators
The dyslexia prediction model developed in this study better understand model decisions and build trust in the
holds significant potential for clinical and educational predictive systems.
applications. Clinicians and educators can benefit from this
tool to identify individuals at risk of dyslexia early in their  Personalized Interventions:
educational journey, allowing for tailored support and Future research can focus on developing personalized
interventions that can make a substantial difference in their intervention strategies based on the specific dyslexia profiles
academic and emotional well-being. identified by machine learning models. Tailored
interventions can have a more significant impact on
Our research journey opens avenues for future individuals with dyslexia.
exploration. Integrating additional data sources, such as
neuroimaging data, may enhance the model's predictive  Generalization Across Languages:
accuracy. Furthermore, ongoing monitoring and updates to Extending dyslexia prediction models to different
the model with new data can ensure its continued relevance languages and cultural contexts is essential for global
and effectiveness. applicability. Future work can explore methods for adapting
models to diverse linguistic backgrounds.
The challenge of model interpretability remains a topic
of importance. Efforts to enhance the transparency and  Online Learning and Monitoring:
interpretability of the model's decision-making process can Developing systems for continuous online learning and
foster trust and understanding among stakeholders, monitoring can enable real-time updates to dyslexia
including clinicians, educators, and parents. prediction models. This ensures that models remain relevant
and effective as new data becomes available.
In conclusion, our study represents a significant stride
towards improving dyslexia prediction through machine  Validation in Clinical Settings:
learning. By focusing on early identification, we contribute Conducting validation studies in real-world clinical
to the goal of providing timely support and interventions to and educational settings is critical to assess the practical
individuals with dyslexia, ultimately enhancing their utility of dyslexia prediction models. Collaborations with
educational and life outcomes. As we look ahead, we remain clinicians and educators are essential for successful
committed to the ongoing advancement of dyslexia research implementation.
and the development of innovative solutions that positively
impact the lives of those affected by dyslexia.  Privacy and Data Security:
Addressing privacy and data security concerns is
FUTURE WORK paramount. Future work should explore secure and privacy-
preserving methods for collecting, storing, and analyzing
 Incorporation of Neuroimaging Data: sensitive educational and health data.
Integrating neuroimaging data, such as functional MRI
(fMRI) and structural MRI, can provide valuable insights  Ethical Considerations:
into the neural basis of dyslexia. Future work can explore Continued attention to ethical considerations, including
how combining neuroimaging features with cognitive and fairness, bias, and transparency, is essential. Ongoing
linguistic assessments can improve the predictive power of research can refine ethical frameworks and guidelines for
models. deploying dyslexia prediction models responsibly.
 Longitudinal Studies:  Collaboration and Data Sharing:

Conducting longitudinal studies to track the Encouraging collaboration and data sharing among
development of dyslexia over time can enhance the researchers and institutions can accelerate progress in
understanding of its progression. Future research can focus dyslexia prediction. Building comprehensive and diverse
on creating predictive models that consider dynamic datasets is a collective effort.
changes in features as individuals progress through different
educational stages.  User-Friendly Tools:
Developing user-friendly tools and interfaces that
 Multimodal Approaches: allow educators, clinicians, and parents to easily access and
Combining data from multiple modalities, including interpret dyslexia prediction results can facilitate the
speech analysis, eye-tracking, and handwriting recognition, practical application of machine learning models.
can provide a more comprehensive picture of dyslexia.

ISSN No:-2456-2165
REFERENCES
[1]. R. Baeza-Yates and L. Rello. Estimating dyslexia in

the Web. In Proc. W4A ’11), Hyderabad, India, 2011.
ACM Press.
[2]. M. Ballesteros and J. Nivre. Maltoptimizer: A system
for maltparser optimization. In Proceedings of the
Eight International Conference on Language
Resources and Evaluation (LREC’12). European
Language Resources Association (ELRA), may 2012.
[3]. J. L. Black, D. W. K. Collins, J. N. De Roach, and S.
Zubrick. A detailed study of sequential saccadic eye
movements for normal-and poor-reading children.
Perceptual and motor skills, 59(2):423–434, 1984.
[4]. British Dyslexia Association. Dyslexia style guide,
January 2012. http://www.bdadyslexia.org.uk/.
[5]. B. Brown, G. Haegerstrom-Portnoy, C. Yingling, J.
Herron, D. Galin, and M. Marcus. Tracking eye
movements are normal in dyslexic children.
American journal of optometry and physiological
optics, 60(5):376, 1983.
[6]. M. S. Carrillo, J. Alegr´ıa, P. Miranda, and N.
Sánchez. Evaluación de la dislexia en la
escuelaprimaria: Prevalencia en español (Evaluation
of dyslexia inprimary school: The prevalence in
Spanish). Escritos de Psicolog´ıa (Psychology
Writings), 4(2):35–44, 2011.
[7]. C.-C. Chang and C.-J. Lin. LIBSVM: A library for
support vector machines. ACM Transactions on
Intelligent Systems and Technology, 2:27:1–27:27,
2011. Software available at http://www.csie.ntu.edu.
tw/~cjlin/libsvm.
[8]. C. Chapman. The most popular fonts used by
designers, 2011. http://www.webdesignerdepot.com/
2011/08/ the-most-popular-fonts-used-by-designers/.
[9]. F. Cuetos and F. Valle. Modelos de lectura y
dislexias (Reading models and dyslexias). Infancia y
Aprendizaje (Infancy and Learning), 44:3–19, 1988.
[10]. H. Dilling. Internationale Klassifikationpsychi scher
Störungen: ICD-10 Kapitel V (F): klinisch-
diagnostische Leitlinien. Huber, Bern, 8.,uberarb. ¨
aufl. unterberücksichtigung der änd erungenent
sprechend icd-10-gm edition, 2011.

Dyslexia Prediction Using Machine Learning

Uploaded by

Document Informationclick to expand document information

Copyright:

Available Formats

Dyslexia Prediction Using Machine Learning

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Dyslexia Prediction Using Machine Learning

Uploaded by

Copyright:

Available Formats

Volume 8, Issue 9, September – 2023 International Journal of Innovative Science and Research Technology

Dyslexia Prediction using Machine Learning

The primary objective of this research is to harness the  Personalized Interventions:

IJISRT23SEP389 www.ijisrt.com 1203

 Predictive Features and Models:  Ethical Implications:

 Linguistic and Cognitive Features:

 Mobile Apps and Digital Tools:  Logistic Regression:

IJISRT23SEP389 www.ijisrt.com 1204

 Ensemble Methods:  Feature Selection and Engineering:

IJISRT23SEP389 www.ijisrt.com 1205

 Model Training:  True Positives (TP):

 Hyperparameter Tuning:  True Negatives (TN):

 Model Evaluation:  False Positives (FP):

 Iterative Refinement:  Performance Metrics:

IJISRT23SEP389 www.ijisrt.com 1206

 ROC Curve and AUC-ROC:  Data Collection and Preparation:

Data Preprocessing: Prepare the data by normalizing,

 Modular Feature Engineering:

Cognitive Features Module: Develop a module to

IJISRT23SEP389 www.ijisrt.com 1207

 Reporting and Documentation:  Interpretability:

 Summary Statistics of Features  Model Performance Metrics:

IJISRT23SEP389 www.ijisrt.com 1208

Bias mitigation strategies, such as re-weighting, were

Fig 6 Improving Predictive

IJISRT23SEP389 www.ijisrt.com 1209

 Longitudinal Studies:  Collaboration and Data Sharing:

IJISRT23SEP389 www.ijisrt.com 1210

[1]. R. Baeza-Yates and L. Rello. Estimating dyslexia in

IJISRT23SEP389 www.ijisrt.com 1211

You might also like