0% found this document useful (0 votes)

16 views

Python Cod1

Uploaded by

Monica H N

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views

Python Cod1

Uploaded by

Monica H N

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

Python Code:

import pandas as pd

import numpy as np

from sklearn.model_selection import train_test_split

from sklearn.linear_model import LogisticRegression

from sklearn.metrics import accuracy_score

# Step 1: Load the dataset

url = "https://archive.ics.uci.edu/ml/machine-learning-databases/heart-disease/heart.csv"

df = pd.read_csv(url)

# Step 2: Display the first few rows of the dataset

print("Initial Data:\n", df.head())

# Step 3: Check for missing values

print("Missing Values:\n", df.isnull().sum())

# Step 4: Handle missing values (if any)

# For this dataset, there are no missing values, but if there were, you could use:

# df.fillna(method='ffill', inplace=True) # Forward fill or drop missing values

# Step 5: Display the data types

print("Data Types:\n", df.dtypes)

# Step 6: String manipulation example (if needed)

# Example: Clean a string column (if applicable)

# df['gender'] = df['gender'].str.lower().str.strip()

# Step 7: Convert relevant columns to NumPy arrays

age_array = df['age'].to_numpy()

cholesterol_array = df['cholesterol'].to_numpy()
# Step 8: Calculate basic statistics

mean_age = np.mean(age_array)

median_cholesterol = np.median(cholesterol_array)

print(f"Mean Age: {mean_age}, Median Cholesterol: {median_cholesterol}")

# Step 9: Define features and target variable

X = df.drop(columns=['target']) # Assuming 'target' is the column to predict

y = df['target']

# Step 10: Split the dataset into training and testing sets (80% train, 20% test)

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

print(f"Training set size: {X_train.shape[0]}, Testing set size: {X_test.shape[0]}")

# Step 11: Initialize and train the model

model = LogisticRegression(max_iter=200)

model.fit(X_train, y_train)

# Step 12: Make predictions on the test set

y_pred = model.predict(X_test)

# Step 13: Evaluate the model's performance

accuracy = accuracy_score(y_test, y_pred)

print(f"Accuracy of the model: {accuracy:.2f}")

# Step 14: Save the report to a text file

with open("heart_disease_analysis_report.txt", "w") as file:

file.write("Heart Disease Analysis Report\n")

file.write("Objective: Analyze the dataset to predict heart disease.\n")

file.write("Data Loading and Cleaning: Loaded and cleaned the dataset, finding no missing
values.\n")

file.write("Statistical Analysis: Mean Age: {}, Median Cholesterol: {}.\n".format(mean_age,

median_cholesterol))

file.write("Model Accuracy: {}.\n".format(accuracy))

Report Summary

Objective: The goal was to analyze the Heart Disease UCI dataset to predict heart disease using
machine learning techniques.
Data Loading and Cleaning: The dataset was loaded using Pandas. No missing values were found,
ensuring a clean dataset for analysis.

String Manipulation: Though the dataset primarily contains numerical data, string manipulation
techniques were demonstrated. In datasets with categorical string data, operations such as
lowercasing and stripping spaces are crucial for uniformity.

Statistical Analysis: Basic statistics were computed using NumPy, revealing a mean age of
approximately X and a median cholesterol level of Y.

Data Splitting: The dataset was split into training (80%) and testing (20%) sets to validate the model's
performance.

Model Building: A Logistic Regression model was chosen for binary classification. The model was
trained on the training set and achieved an accuracy of Z on the test set, indicating a good predictive
capability.

Conclusion: This analysis demonstrated effective data manipulation, cleaning, and the successful
application of machine learning to predict heart disease. Future work could involve exploring other
algorithms and tuning model parameters for improved accuracy.

Odoo 17 Development Tutorials - Cybrosys
No ratings yet
Odoo 17 Development Tutorials - Cybrosys
5 pages
AIML Practical 05 22105A2021
No ratings yet
AIML Practical 05 22105A2021
9 pages
Ml Projects Part c
No ratings yet
Ml Projects Part c
8 pages
SUMMARY
No ratings yet
SUMMARY
16 pages
A.I Lab Report
No ratings yet
A.I Lab Report
24 pages
PROJECTS
No ratings yet
PROJECTS
6 pages
Project_Report
No ratings yet
Project_Report
18 pages
Ai ML Exp1
No ratings yet
Ai ML Exp1
8 pages
AI Mini Project
No ratings yet
AI Mini Project
6 pages
INFX 499 Milestone 1
No ratings yet
INFX 499 Milestone 1
8 pages
Heart Disease Predictive Analysis
No ratings yet
Heart Disease Predictive Analysis
4 pages
Cardiovascular_Disease_Prediction
No ratings yet
Cardiovascular_Disease_Prediction
2 pages
Lab Report Content - 15marks(1) (2)
No ratings yet
Lab Report Content - 15marks(1) (2)
10 pages
Ai in HC - 2
No ratings yet
Ai in HC - 2
9 pages
Heart Disease Prediction - Jupyter Notebook
100% (1)
Heart Disease Prediction - Jupyter Notebook
9 pages
Case Study
No ratings yet
Case Study
21 pages
Second Progres Report
No ratings yet
Second Progres Report
10 pages
Title Name School Supervisor SRN Date Progress Report Number Duration of The Project
No ratings yet
Title Name School Supervisor SRN Date Progress Report Number Duration of The Project
8 pages
Diabetic Prediction Using LogicalRegression
No ratings yet
Diabetic Prediction Using LogicalRegression
9 pages
Final Year Project Report
No ratings yet
Final Year Project Report
20 pages
Heart Disease Pre
No ratings yet
Heart Disease Pre
23 pages
Web Application
No ratings yet
Web Application
13 pages
4-10 Aiml
No ratings yet
4-10 Aiml
25 pages
ML (Lab 8) Tasks Bilal Habib (5th Semester)
No ratings yet
ML (Lab 8) Tasks Bilal Habib (5th Semester)
16 pages
Heart Disease Prediction PPT
No ratings yet
Heart Disease Prediction PPT
11 pages
Final PPT Heart Disease
100% (2)
Final PPT Heart Disease
23 pages
Heart disease
No ratings yet
Heart disease
5 pages
Batch-2 (Review 2)
No ratings yet
Batch-2 (Review 2)
19 pages
Edited Version of Cardiovascular Diseases Risk Prediction Dataset Report
No ratings yet
Edited Version of Cardiovascular Diseases Risk Prediction Dataset Report
25 pages
HDD New Report
No ratings yet
HDD New Report
95 pages
Logistic Regression 205
No ratings yet
Logistic Regression 205
8 pages
Ml practicals
No ratings yet
Ml practicals
21 pages
Heart Disease Detection - Newreport
No ratings yet
Heart Disease Detection - Newreport
57 pages
Project - Predicting Heart Disease
No ratings yet
Project - Predicting Heart Disease
2 pages
A SUMMER INTERNSHIP REPORT
No ratings yet
A SUMMER INTERNSHIP REPORT
27 pages
review 2
No ratings yet
review 2
23 pages
ML Report Edited
No ratings yet
ML Report Edited
10 pages
Heart Disease Prediction Theory PPT
No ratings yet
Heart Disease Prediction Theory PPT
10 pages
Abstract 1
No ratings yet
Abstract 1
1 page
Heart Disease Prediction Professional PPT
No ratings yet
Heart Disease Prediction Professional PPT
10 pages
ML Report
No ratings yet
ML Report
12 pages
Heart Disease Prediction System Using Machine Learning 1
No ratings yet
Heart Disease Prediction System Using Machine Learning 1
17 pages
Heart Disease Predictor
No ratings yet
Heart Disease Predictor
3 pages
Deep Learning Project Report
No ratings yet
Deep Learning Project Report
7 pages
Decision Support
No ratings yet
Decision Support
21 pages
PythonHeartDisease FirstReview
No ratings yet
PythonHeartDisease FirstReview
20 pages
python 1
No ratings yet
python 1
3 pages
Ilovepdf Merged
No ratings yet
Ilovepdf Merged
8 pages
Prediction of Cardio-Vascular Disease Using Machine Learning Algorithms and Flask Api
No ratings yet
Prediction of Cardio-Vascular Disease Using Machine Learning Algorithms and Flask Api
23 pages
Google Docs
No ratings yet
Google Docs
26 pages
Final Research Paper
No ratings yet
Final Research Paper
3 pages
BIBA Enhancing Heart Disease Prediction With A Hybrid Model Combining Decision Tree, Logistic Regres
No ratings yet
BIBA Enhancing Heart Disease Prediction With A Hybrid Model Combining Decision Tree, Logistic Regres
12 pages
Dissertation
No ratings yet
Dissertation
41 pages
Prediction of Heart Diseases Using Machine Learning
No ratings yet
Prediction of Heart Diseases Using Machine Learning
49 pages
Heart Disease Prediction Documentation
No ratings yet
Heart Disease Prediction Documentation
4 pages
Heart Disease Prediction Using Machine Learning and Data Analytics Approach
No ratings yet
Heart Disease Prediction Using Machine Learning and Data Analytics Approach
4 pages
Report - Mini ProjectFINAL
No ratings yet
Report - Mini ProjectFINAL
22 pages
Lab Manual - MachineLearningLaboratory-DR.vaishnavi (1)
No ratings yet
Lab Manual - MachineLearningLaboratory-DR.vaishnavi (1)
71 pages
BCSP241006_BCS221016_BCS221023_REPORT
No ratings yet
BCSP241006_BCS221016_BCS221023_REPORT
38 pages
Quick Python Guide
From Everand
Quick Python Guide
Coder1
No ratings yet
Algorithms and Data Structures: An Easy Guide to Programming Skills
From Everand
Algorithms and Data Structures: An Easy Guide to Programming Skills
Rigdon Jonathan
No ratings yet
6596-Article Text-7064-1-10-20230514
No ratings yet
6596-Article Text-7064-1-10-20230514
6 pages
21EC71_AVLSI_Answers
No ratings yet
21EC71_AVLSI_Answers
31 pages
ADVLSI_Model QP Solution
No ratings yet
ADVLSI_Model QP Solution
17 pages
Cns Solutions Set2
No ratings yet
Cns Solutions Set2
35 pages
Cns Solutions Set1
No ratings yet
Cns Solutions Set1
37 pages
Python Code
No ratings yet
Python Code
2 pages
Atlas Copco Pulse Tools 2013-2014
No ratings yet
Atlas Copco Pulse Tools 2013-2014
12 pages
Es 26 Ts 1
No ratings yet
Es 26 Ts 1
3 pages
Log
No ratings yet
Log
2,484 pages
Propsim User Reference
No ratings yet
Propsim User Reference
365 pages
PI Application Form - 20240929
No ratings yet
PI Application Form - 20240929
2 pages
Lab Report4
No ratings yet
Lab Report4
8 pages
Candidate Evaluation Details: Hidayath Ali Mokula
No ratings yet
Candidate Evaluation Details: Hidayath Ali Mokula
2 pages
Manual DD50
No ratings yet
Manual DD50
34 pages
UI UX Design Brochur NEW654916
0% (1)
UI UX Design Brochur NEW654916
14 pages
[Ebooks PDF] download Clinical Anatomy and Physiology of the Visual System, 4e (Aug 9, 2021)_(0323711685)_(Elsevier) 4th Edition Remington Od Ms Faao full chapters
100% (4)
[Ebooks PDF] download Clinical Anatomy and Physiology of the Visual System, 4e (Aug 9, 2021)_(0323711685)_(Elsevier) 4th Edition Remington Od Ms Faao full chapters
37 pages
ECOPS Electronics Police Record Management System
0% (1)
ECOPS Electronics Police Record Management System
4 pages
CS QP - CLASS XI ANNUAL EXAM APRIL 30TH (1)
No ratings yet
CS QP - CLASS XI ANNUAL EXAM APRIL 30TH (1)
5 pages
Combined Footings 06
No ratings yet
Combined Footings 06
6 pages
DATA PROCESSING FULL No805673418
No ratings yet
DATA PROCESSING FULL No805673418
48 pages
FIT - 1047 - Part 1 - Report - Assignment - 4
No ratings yet
FIT - 1047 - Part 1 - Report - Assignment - 4
6 pages
4th - AI
No ratings yet
4th - AI
4 pages
Booking.com_ Confirmation vietnam
No ratings yet
Booking.com_ Confirmation vietnam
1 page
Neccesity of Interrupts
No ratings yet
Neccesity of Interrupts
8 pages
Grade 12 SG
No ratings yet
Grade 12 SG
87 pages
Advance Web Technology
100% (3)
Advance Web Technology
32 pages
Y7 Term 1 Maths Revision Sheet
No ratings yet
Y7 Term 1 Maths Revision Sheet
22 pages
Point of Sale (POS) Data From A Supermarket: Transactions and Cashier Operations
No ratings yet
Point of Sale (POS) Data From A Supermarket: Transactions and Cashier Operations
4 pages
From Writings On The Wall To Signals Travelling in The Airwaves
100% (1)
From Writings On The Wall To Signals Travelling in The Airwaves
13 pages
Lesson 4 Data, Data Analysis, Database, Database Management
No ratings yet
Lesson 4 Data, Data Analysis, Database, Database Management
42 pages
DevOps 3RITech PDF
No ratings yet
DevOps 3RITech PDF
2 pages
Imb 530
No ratings yet
Imb 530
2 pages
Multivariate Statistical Analysis Using The R Package Chemometrics
No ratings yet
Multivariate Statistical Analysis Using The R Package Chemometrics
71 pages
First Resume
No ratings yet
First Resume
2 pages
Sales and Distribution Management Pingali Venugopal (Pingali@xlri - Ac.in)
No ratings yet
Sales and Distribution Management Pingali Venugopal (Pingali@xlri - Ac.in)
7 pages

Python Cod1

Uploaded by

Python Cod1

Uploaded by

Python Code:

from sklearn.model_selection import train_test_split

from sklearn.linear_model import LogisticRegression

from sklearn.metrics import accuracy_score

# Step 1: Load the dataset

# Step 2: Display the first few rows of the dataset

print("Initial Data:\n", df.head())

# Step 3: Check for missing values

print("Missing Values:\n", df.isnull().sum())

# Step 4: Handle missing values (if any)

# df.fillna(method='ffill', inplace=True) # Forward fill or drop missing values

# Step 5: Display the data types

print("Data Types:\n", df.dtypes)

# Step 6: String manipulation example (if needed)

# Example: Clean a string column (if applicable)

# Step 7: Convert relevant columns to NumPy arrays

print(f"Mean Age: {mean_age}, Median Cholesterol: {median_cholesterol}")

# Step 9: Define features and target variable

X = df.drop(columns=['target']) # Assuming 'target' is the column to predict

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

print(f"Training set size: {X_train.shape[0]}, Testing set size: {X_test.shape[0]}")

# Step 11: Initialize and train the model

# Step 12: Make predictions on the test set

# Step 13: Evaluate the model's performance

accuracy = accuracy_score(y_test, y_pred)

print(f"Accuracy of the model: {accuracy:.2f}")

# Step 14: Save the report to a text file

with open("heart_disease_analysis_report.txt", "w") as file:

file.write("Heart Disease Analysis Report\n")

file.write("Objective: Analyze the dataset to predict heart disease.\n")

file.write("Statistical Analysis: Mean Age: {}, Median Cholesterol: {}.\n".format(mean_age,

file.write("Model Accuracy: {}.\n".format(accuracy))

You might also like