Internship
Internship
Internship
An Internship On Report
on
“Machine Learning With AI Using Python”
Along with the project
“Admission Prediction Analysis”
Submitted as partial fulfillment for the award of
BACHELOR OF TECHNOLOGY, DEGREE
in
Electrical and Electronics Engineering
By
PRABHA SINGH (0198CE211030)
Under the guidance of
Ms. Neela Santhosh Kumar
(HUMAN RESOURCES & ACADEMIC HEAD )
Session: 2024-25 (VII Semester)
CERTIFICATE
This is to certify that the work embodied in this internship report entitled
“Machine Learning With AI Using Python” has been satisfactorily
completed by PRABHA SINGH (0198CE211030), It is a bonafide piece
of work, carried out under our guidance in the Department of Electrical and
Electronics Engineering, Prestige Institute of Management and Research,
Bhopal for the partial fulfillment of the Bachelor in Technology during the
academic year 2023-24.
Approved By
Date :
Signature of student
(Name: PRABHA SINGH)
(Roll No: 0198CE211030)
Semester:-VII
Branch:- EX
Date:..........................
i
ABSTRACT
Many students nowadays are pursuing their education outside of their home nations. These
international students are primarily interested in the United States of America, Canada, Ireland, and
Germany. India and China account for the majority of international students in the United States. The
number of Indian students pursuing postgraduate education in the United States has surged
dramatically during the last decade. With the growing number of international students studying in
the United States, each candidate must compete fiercely for admission to their preferred university.
In educational institutions, the issue of student admittance is critical. This research focuses on using
machine learning models to predict a student's chances of being accepted into a master's degree.
Students will be able to see ahead of time if they have a probability of being admitted. This project
predicts a student's admittance based on a variety of factors such as the university's rating, the student's
undergraduate GPA, GRE score, research experience, and so on. This forecasts whether or not the
student will get admitted to the university of his choice. I employed a variety of methods in this study,
including linear regression, artificial neural networks (ANN), random forest regression, and decision
tree regression. Finally, I put this model on a Web-based GUI to check a student's acceptance
possibilities, and it worked perfectly.
ii
ACKNOWLEDGEMENT
Apart from above, we would also like to thanks DR. Mohammad Ilyas
Khan PRINCIPAL, Prestige Institute of Management and Research,
Bhopal, for his continuous support.
We are also thankful to all the staff members of the Department of Electrical
and Electronics Engineering, Prestige Institute of Management and
Research, Bhopal, for their cooperation in our work.
Signature of student
(Name: PRABHA SINGH)
(Roll No: 0198CE211030)
Semester:-VII
Branch:- EX
iii
CONTENTS
Declaration i
Abstract ii
Acknowledgement iii
Contents iv
List of Figures vi
3 TASK PERFORMED 9
4 REFLECTION NOTES 12
4.1 Experience 12
4.2 Technical Outcomes 12
iv
4.2.1 System Requirement Specification
4.3 System Analysis and Design 13
4.3.1 Existing System
4.3.2 Disadvantages of the Existing System
4.3.3 Proposed System
4.3.4 Advantages of the Proposed System
4.4 System Architecture 14
4.4.1 Data Flow Diagram
4.4.2 UML Diagram
4.4.3 USE CASE Diagram
4.4.4 Class Diagram
4.4.5 Sequence Diagram
4.4.6 Activity Diagram
4.5 Implementation 16
4.5.1 Modules
4.6 Screen Shots 18
5 CONCLUSION 20
BIBLIOGRAPHY 21
v
LIST OF FIGURES
Screenshots
vi
CHAPTER - 1
COMPANY PROFILE
Another company, Quantum Learnings, is a Microsoft Global Training Partner and their
daughter company, and MentorrBuddy is their own placement assistance tool that helps
students find the best job preparation and applications.
The subject matter experts at KSI are highly qualified. These experts have extensive
experience in their respective fields and are also certified. They are enthusiastic about the
subjects they teach, and their webinars and courses reflect this enthusiasm.
They have the best technical training delivered across the country, whether anyone
looking for Mobile application development, Progressive Web application development,
Angular Js, Python, IoT, ML, AI, Data Science, Adv Excel, or Digital Marketing. All of
their courses begin with no prerequisites, and their team makes every effort to ensure that
a candidate only completes the program after acquiring the necessary skills.
1
Admission Prediction Analysis Company Profile
1.1.1 Objectives
Their goal is to consistently deliver success to students by going the extra mile. To help
their students meet their technological skills and career opportunities, they offer the right
people, solutions, and services.
By leveraging leading technologies and industry best practices, they provide their
students with the most efficient and effective training.
The race for digital transformation is on. In this globally connected on-demand world
with rapid advancements in internet technologies, businesses worldwide are under
constant pressure to add innovative real-time capabilities to their applications to
respond to market opportunities.
Skills have become the global currency of the 21st century. In a world where competition
for jobs, pay increases, and academic success continues to increase, certifications offer hope
because they are a credible, third-party assessment of one’s skill and knowledge for a given
subject. Some of the key benefits achieved by the students by certification are Validation
of knowledge, Increased marketability, Increased earning power, Enhanced academic
performance, Improved reputation, Enhanced credibility, Increased confidence, Respect
from peers. By Knowledge Solution India’s certification, students has improved academic
performance having, higher grade point average for certified college students from 6.9 to
7.8, higher graduation rates for certified college students: 78.4% to 94.5% and the dropout
rates are reduced to 0.2% to 1.0%.
Our super energetic and massive team at KSI is our core strength, forming an excellent
blend of IT minds with a creative bent. Their goal is to keep improving and delivering the
skills that will help students have a successful career in the IT industry.
Taking advantage of our highly skilled and experienced trainers. We are primarily a
student-centered organization dedicated to exceeding students' expectations in terms of
meeting their needs. They successfully hosted a group of seasoned professionals.
Trainers who collaborate in order to provide their students with the knowledge they need
to advance in their careers. They take pride in being a sought-after Skill development after
delivering successful internships. They have successfully delivered value to our students
as well as colleges over the years. They truly believe that the success of their students is
their success, and they do not consider themselves to be a vendor for their program. We'd
like to hear some of their stories and learn how far they've gone to ensure the success of
our students, and they'll do everything they can to make that happen.
Training / Internships form a very important part of students over all development that's
why AICTE and Universities have made it mandatory for every engineer and MCA to
undergo the same, we help students in achieving this goal by helping them acquire latest
Learn Machine learning, an application of artificial intelligence (AI) that provides systems
the ability to automatically learn and improve from experience without being explicitly
Learn Data science and how to use scientific methods, processes, algorithms and systems
to extract knowledge and insights from structured and unstructured data as one of the
hottest professions in the market today, bundled with Microsoft MTA Certification
Learn Java one of the most popular programming languages used in the development of
Web and Mobile applications. It is designed for flexibility, allowing developers to write
code that would run on any machine, regardless of architecture or platform Bundled with
Learn the ethical way of how to do penetration testing and other testing methodologies
5. Internet of Things
Learn how to work with connected devices use sensors and raspberry PI3 and connect
these devices to cloud to identify patterns and extract meaning-full information out of it,
6. Business Analytics
Learn Business Analytics and how it enables companies to automate and optimize their
business processes in-fact Data-driven companies treat their data as a corporate asset and
leverage it for a competitive advantage as they are able to use the insights to find new
7. Digital Marketing
Learn Digital Marketing and how its used for promoting products or services online via
internet, companies are gaining higher profitability and return on investment by having
their Digital marketing strategies in place the program is bundled with Google
Certification
6
Admission Prediction Analysis About the Department
2.3 Testing
Testing was done according to the Corporate Standards. As each component was being built,
Unit testing was performed in order to check if the desired functionality is obtained. Each
component in turn is tested with multiple test cases to verify if it is properly working. These
unit tested components are integrated with the existing built components and then integration
testing is performed. Here again, multiple test cases are run to ensure the newly built
component runs in co-ordination with the existing components. Unit and Integration testing are
iteratively performed until the complete product is built. Once the complete product is built, it
is again tested against multiple test cases and all the functionalities.
The product could be working fine in the developer’s environment but might not necessarily
work well in all other environments that the users could be using. Hence, the product is also
tested under multiple environments (Various operating systems and devices). At every step, if
a flaw is observed, the component is rebuilt to fix the bugs. This way, testing is done
hierarchically and iteratively.
➢ Python Programming
➢ Artificial Intelligence
➢ Machine Learning Algorithms
DATA SET
This section describes, in brief, the data that has been used for the research. Data from
multiple sources was used in this project, the major amount of data was extracted from public
website Yocket (Yocket.com), data regarding the rankings, fees and enrolment in colleges was
obtained from a leading educational consultancy firm The Mentors Circle in India. Data from
both the sources was integrated together to form a staging data-set. For predicting the chance
of a student getting shortlisted in universities the final data-set was divided into multiple data-
sets each representing a particular university. For predicting the list of universities suitable for
students based on their profile data of all the students the staging data-set was updated only to
have records of students who had successfully secured admission in the universities. Below
table shows the different features of the data-sets.
FIELD DESCRIPTION
GRE Marks scored by the student in GRE
TOEFL Score Marks scored by the student in English Proficiency Test
Ranking The University Ranking
SOP Quality of Statement of Purpose or Statement of Intent
LOR Quality of Letter of Recommendations documents
CGPA Result of the student in their Undergraduate Course
Research Relevant experience in Research field.
9
Admission Prediction Analysis Task Performed
Data related to the college ranking was collected in .csv format, the data related to students’
profile was extracted using data extraction tool provided by (Mozenda (n.d.)) in .csv files. Data
being from public portal had multiple records with missing and irrelevant values; data cleaning
was performed in Microsoft Excel by deleting the records having unwanted and missing values.
Unwanted columns were removed from the data-set. Once the data-set was cleaned data was
transformed to be suitable for the model. The original data-set had TOEFL score as a
representation of language, to have a consistent metrics for the language score. Similarly, the
Undergraduate score of the students were represented in terms of percentage and CGPA; all the
records of percentage were converted to CGPA by multiplying percentage score by 9.5.
Algorithms
➢ Linear Regression
Linear Regression is a machine learning algorithm based on supervised learning. It performs a
regression task. Regression models a target prediction value based on independent variables. It is
mostly used for finding out the relationship between variables and forecasting. Different
regression models differ based on – the kind of relationship between dependent and independent
variables they are considering, and the number of independent variables getting used.
• Training Phase
As the final task, a main project was developed using machine learning models to predict the
chance of a student to be admitted to a master’s program. This will assist students to know in
advance if they have a chance to get accepted. This project predicts the admission of a student
based on different features including university rating, student’s undergraduate GPA, GRE
score, research experience and etc. This predicts that how much chances are there that the
student will get admission in his selected university or not. In this project I have used multiple
algorithms including linear regression, artificial neural network (ANN), random forest
regressor, decision tree regressor. In the end I have deployed this model on a Web Based GUI
to check student’s admission chances and these models are working fine.
4.1 Experience
According to our internship experience, Knowledge Solutions India offers a positive work
culture and courteous personnel at all levels, from staff to management. The instructors are
knowledgeable in their subjects and treat everyone fairly. There are no distinctions made
between new graduates and corporate executives, and everyone is treated equally. Every
activity, no matter how difficult or simple, requires a lot of teamwork, and the mood is always
peaceful and welcoming. Because of the excellent communication and support available, there
is a lot of room for self-improvement. Interns were well treated and educated, and all of our
questions and concerns about the training or the firms were addressed. All in all, Knowledge
Solutions India was a great place for a fresher to start career and also for a corporate to boost
his/her career. It has been a great experience to be an intern in such a reputed organization.
HARDWARE REQUIREMENTS:
SOFTWARE REQUIREMENTS:
12
Admission Prediction Analysis Reflection
(Bibodi et al. (n.d.)) used multiple machine learning models to create a system that would help
the students to shortlist the universities suitable for them also a second model was created to
help the colleges to decide on enrolment of the student. Nave Bayes algorithm was used to
predict the likelihood of success of an application, and multiple classification algorithms like
Decision Tree, Random Forest, Nave Bayes and SVM were compared and evaluated based on
their accuracy to select the best candidates for the college.
GRADE system was developed by (Waters and Miikkulainen (2013)) to support the admission
process for the graduate students in the University of Texas Austin Department of Computer
Science. The main objective of the project was to develop a system that can help the admission
committee of the university to take better and faster decisions. Logistic regression and SVM
were used to create the model, both models performed equally well and the final system was
developed using Logistic regression due to its simplicity. The time required by the admission
committee to review the applications was reduced by 74% but human intervention was
required to make the final decision on status if the application. (Nandeshwar et al. (2014))
created a similar model to predict the enrolment of the student in the university based on the
factors like SAT score, GPA score, residency race etc. The Model was created using the
Multiple Logistic regression algorithm, it was able to achieve accuracy rate of 67% only.
• Limitation of this system only relied on the GRE, TOEFL and Undergraduate Score
of the student and missed on taking into consideration other important factors like
SOP and LOR.
• The existing system lagged the factor of the research work in the related field.
The principal objective of the research is to help the students who are aspiring to pursue their
education in the USA. The Graduate Admissions Prediction system will help them to evaluate
the chances of success in any university without being dependent on any education consultancy
firm. It will help them in saving a huge amount of time and money spent in the application
process. Also, it will help them to limit the number of applications made by the students by
suggesting them the best universities where they have high chances of securing admission
thereby by saving the amount of money spent by the students by applying in universities where
they have less chance to secure admit based on their profile.
• Information about the prediction analysis is clear to enter all the required information
to predict the admission.
• The user interface code will interact with the Linear Regression, ANN, random forest
regressor, decision tree regressor to provide the users with the required result.
• The ANN algorithm and Linear Regression Algorithm will be used to determine the
chance of the student of securing admission in a particular university based on his/her
profile.
• Once the models have been executed the result will be provided to the student as the
output on the user interface.
The machine learning models are trained with the given dataset. The machine learning models
used in this project are linear regression, artificial neural network (ANN), random forest
regressor, decision tree regressor. Once the models are trained, the student’s profile details are
entered to predict the chances of getting the admit to the university.
4.2 Implementation
4.2.1 Modules
2. Data Visualization
MODULES DESCRIPTION
Data Visualization: Using data visualization, I summarized the data with graphs, pictures and
maps, so that the human mind has an easier time processing and understanding the given data. Data
visualization plays a significant role in the representation of both small and large data sets, but it is
especially useful when we have large data sets, in which it is impossible to see all of our data, let
alone process and understand it manually.
Training and Testing: In this project, datasets are split into two subsets. The first subset is known
as the training data - it's a portion of our actual dataset that is fed into the machine learning model
to discover and learn patterns. In this way, it trains our model. The other subset is known as the
testing data.
Train and Evaluate Linear Regression: Simple linear regression is an approach for predicting
a quantitative response using a single feature (or "predictor" or "input variable"). It takes the
following form: y=β0+β1x
4.4 Screenshots
CHAPTER – 4
CONCLUSION
The major goal of this study was to create a prototype of a system that students interested in studying in
the United States might use. For this study, several machine learning algorithms were created and used.
When compared to the Logistic regression model, Linear Regression demonstrated to be the greatest fit
for system development. The programme was designed with a basic user interface to make it interactive
and simple to use for non-technical people.
The ultimate goal of the study was met since the approach allows students to save time and money that
they would otherwise spend on education advisors and application fees for colleges where they have a
lower chance of being accepted. It will also assist students in making better and faster decisions on
university applications.
BIBLIOGRAPHY
• Bibodi, J., Vadodaria, A., Rawat, A. and Patel, J. (n.d.). Admission Prediction System
Using Machine Learning.