PROJECT REPORT For Machine Learning
PROJECT REPORT For Machine Learning
PROJECT REPORT For Machine Learning
PROJECT REPORT
On
Language detection with machine learning
Submitted to Centurion University of Technology& Management
in partial fulfillment of the requirement for award of the degree of
B. TECH
in
COMPUTER SCIENCE & ENGINEERING
Submitted By
M.Syamla
210101120103
CUTM, Paralakhemundi-761211
2
CERTIFICATE
EVALUATION SHEET
1.Title of the Project: Language detection with
machine learning
2. Year of submission: 2023
3. Name of the degree: B. TECH (C.S.E)
4. Date of Examination / Viva:
5. Student Name: M.SYAMLA
6. Reg No: 210101120103
[APPROVED/REJECTED]
CANDIDATE’S DECLARATION
Signature of Student
INDEX
5
Abstract :-
Introduction
Use-Cases
Dataset
Text preparation
16
CountVectorizer
Both the input and the output features must
take the form of numbers. We will use the
CountVectorizer’s Bag of Words model to
convert text into numerical form.
from
sklearn.feature_extraction.text
import CountVectorizer
17
cv = CountVectorizer()
X =
cv.fit_transform(text_list).toarra
y()
X.shape
Model evaluation
After the successful completion of training,
test, and prediction, the next thing we always
want to do is model evaluation and
assessment.
from sklearn.metrics import
accuracy_score, confusion_matrix,
classification_reportaccuracy =
accuracy_score(y_test,
y_prediction)
confusion_m =
confusion_matrix(y_test,
19
y_prediction)print("The accuracy
is :",accuracy)
Visualization
visualization.
plt.figure(figsize=(15,10))
sns.heatmap(confusion_m, annot =
True)
plt.show()
21
Conclusion
We have come to the end of this article, I hope
you now have a better understanding of how
to predict language using machine learning.
The data has to be evaluated and then
preprocessed as necessary. The text data you
have becomes represented using a bag of
words model. In order to make accurate
predictions in NLP, text extraction and
vectorization are crucial tasks. In these text
classification issues, Naive Bayes consistently
proves to be a stronger model, leading to
more accurate results.