Data Science Basics Cheatsheet

This document provides a summary of common functionality from Pandas, NumPy, and Scikit-Learn for data science basics. It covers topics such as importing and exploring data, cleaning data, filtering and grouping data, joining data, and writing data out. The full cheatsheet can be found online at elitedatascience.com.

Uploaded by

acutotu

Available Formats

Download as PDF, TXT or read online on Scribd

67% found this document useful (3 votes)

518 views

Data Science Basics Cheatsheet

Uploaded by

acutotu

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 1

Python Cheatsheet:

Data Science Basics

In this cheat sheet, we summarize common and useful functionality from Pandas, NumPy, and Scikit-Learn. To see
the most up-to-date full version, visit the online cheatsheet at elitedatascience.com.

SETUP Data Cleaning

First, make sure you have the following installed on your computer: df.columns = ['a','b','c']
pd.isnull()
• Python 2.7+ or Python 3
• Pandas pd.notnull()
• Jupyter Notebook (optional, but recommended)
df.dropna()
*note: We strongly recommend installing the Anaconda Distribution, which
df.dropna(axis=1)
comes with all of those packages.
df.dropna(axis=1,thresh=n)
df.fillna(x)
Importing Data
s.fillna(s.mean())
pd.read_csv(filename)
s.astype(float)
pd.read_table(filename)
s.replace(1,'one')
pd.read_excel(filename)
s.replace([1,3],['one','three'])
pd.read_sql(query, connection_object)
df.rename(columns=lambda x: x + 1)
pd.read_json(json_string)
df.rename(columns={'old_name': 'new_ name'})
pd.read_html(url)
df.set_index('column_one')
pd.read_clipboard()
df.rename(index=lambda x: x + 1)
pd.DataFrame(dict)

Exploring Data Filter, Sort and Group By

df[df[col] > 0.5]
df.shape()
df[(df[col] > 0.5) & (df[col] < 0.7)]
df.head(n)
df.sort_values(col1)
df.tail(n)
df.sort_values(col2,ascending=False)
df.info()
df.sort_values([col1,col2], ascending=[True,False])
df.describe()
df.groupby(col)
s.value_counts(dropna=False)
df.groupby([col1,col2])
df.apply(pd.Series.value_counts)
df.groupby(col1)[col2].mean()
df.describe()
df.pivot_table(index=col1, values= col2,col3], aggfunc=mean)
df.mean()
df.groupby(col1).agg(np.mean)
df.corr()
df.apply(np.mean)
df.count()
df.apply(np.max, axis=1)
df.max()
df.min()
df.median()
Joining and Combining
df1.append(df2)
df.std()
pd.concat([df1, df2],axis=1)
df1.join(df2,on=col1,how='inner')
Selecting
df[col]
df[[col1, col2]]
Writing Data
df.to_csv(filename)
s.iloc[0]
df.to_excel(filename)
s.loc[0]
df.to_sql(table_name, connection_object)
df.iloc[0,:]
df.to_json(filename)
df.iloc[0,0]
df.to_html(filename)
df.to_clipboard()

ELITEDATASCIENCE.COM

Solid Starts - First 100 Days
94% (18)
Solid Starts - First 100 Days
287 pages
Hourglass Workout Program by Luisagiuliet 2
76% (21)
Hourglass Workout Program by Luisagiuliet 2
51 pages
12 Week Program: Summer Body Starts Now
89% (45)
12 Week Program: Summer Body Starts Now
70 pages
The Hold Me Tight Workbook - Dr. Sue Johnson
100% (16)
The Hold Me Tight Workbook - Dr. Sue Johnson
187 pages
Read People Like A Book by Patrick King-Edited
62% (66)
Read People Like A Book by Patrick King-Edited
12 pages
Livingood, Blake - Livingood Daily Your 21-Day Guide To Experience Real Health
77% (13)
Livingood, Blake - Livingood Daily Your 21-Day Guide To Experience Real Health
260 pages
Facial Gains Guide (001 081)
91% (45)
Facial Gains Guide (001 081)
81 pages
Cheat Code To The Universe
94% (77)
Cheat Code To The Universe
34 pages
Curse of Strahd
95% (467)
Curse of Strahd
258 pages
The Psychiatric Interview - Daniel Carlat
91% (34)
The Psychiatric Interview - Daniel Carlat
473 pages
The Borax Conspiracy
91% (57)
The Borax Conspiracy
14 pages
COSMIC CONSCIOUSNESS OF HUMANITY - PROBLEMS OF NEW COSMOGONY (V.P.Kaznacheev,. Л. V. Trofimov.)
94% (212)
COSMIC CONSCIOUSNESS OF HUMANITY - PROBLEMS OF NEW COSMOGONY (V.P.Kaznacheev,. Л. V. Trofimov.)
212 pages
The Secret Language of Attraction
86% (107)
The Secret Language of Attraction
278 pages
How To Develop and Write A Grant Proposal
83% (541)
How To Develop and Write A Grant Proposal
17 pages
Workbook For The Body Keeps The Score
88% (52)
Workbook For The Body Keeps The Score
111 pages
Donald Trump & Jeffrey Epstein Rape Lawsuit and Affidavits
83% (1016)
Donald Trump & Jeffrey Epstein Rape Lawsuit and Affidavits
13 pages
KamaSutra Positions
78% (69)
KamaSutra Positions
55 pages
7 Hermetic Principles
93% (28)
7 Hermetic Principles
3 pages
27 Feedback Mechanisms Pogil Key
75% (12)
27 Feedback Mechanisms Pogil Key
6 pages
Frank Hammond - List of Demons
92% (92)
Frank Hammond - List of Demons
3 pages
36 Questions That Lead To Love
91% (35)
36 Questions That Lead To Love
3 pages
36 Questions To Fall in Love 1
97% (31)
36 Questions To Fall in Love 1
2 pages
The 36 Questions That Lead To Love - The New York Times
94% (34)
The 36 Questions That Lead To Love - The New York Times
3 pages
100 Questions To Ask Your Partner
80% (35)
100 Questions To Ask Your Partner
2 pages
The 36 Questions That Lead To Love - The New York Times
95% (21)
The 36 Questions That Lead To Love - The New York Times
3 pages
Jeffrey Epstein39s Little Black Book Unredacted PDF
75% (12)
Jeffrey Epstein39s Little Black Book Unredacted PDF
95 pages
ALCHEMIST
64% (14)
ALCHEMIST
4 pages
1001 Songs
71% (69)
1001 Songs
1,798 pages
Zodiac Sign & Their Most Common Addictions
63% (30)
Zodiac Sign & Their Most Common Addictions
9 pages
The 4 Hour Workweek, Expanded and Updated by Timothy Ferriss - Excerpt
23% (954)
The 4 Hour Workweek, Expanded and Updated by Timothy Ferriss - Excerpt
38 pages
Principles of Data Science
From Everand
Principles of Data Science
Sinan Ozdemir
4/5 (3)
7 Data Science / Machine Learning Cheat Sheets in One
100% (1)
7 Data Science / Machine Learning Cheat Sheets in One
9 pages
65 Free Data Science Resources For Beginners PDF
No ratings yet
65 Free Data Science Resources For Beginners PDF
19 pages
Python For Data Analytics
67% (3)
Python For Data Analytics
69 pages
Introduction Data Science
100% (1)
Introduction Data Science
23 pages
Data Science Cheat Sheet
100% (1)
Data Science Cheat Sheet
2 pages
Machine Learning Cheat Sheet PDF
100% (1)
Machine Learning Cheat Sheet PDF
21 pages
Machine Learning Cheat Sheet
100% (1)
Machine Learning Cheat Sheet
211 pages
Machine Learning with Clustering: A Visual Guide for Beginners with Examples in Python
From Everand
Machine Learning with Clustering: A Visual Guide for Beginners with Examples in Python
Artem Kovera
No ratings yet
Build a Career in Data Science
From Everand
Build a Career in Data Science
Emily Robinson
5/5 (1)
Python Data Science Essentials - Second Edition
From Everand
Python Data Science Essentials - Second Edition
Alberto Boschetti
4.5/5 (3)
20 Are There Too Many Lawyers in The Philippines
No ratings yet
20 Are There Too Many Lawyers in The Philippines
5 pages
17 Free Data Science Projects To Boost Your Knowledge & Skills
No ratings yet
17 Free Data Science Projects To Boost Your Knowledge & Skills
18 pages
Cheat Sheets For AI, Neural Networks, Machine Learning, Deep Learning & Big Data PDF
100% (1)
Cheat Sheets For AI, Neural Networks, Machine Learning, Deep Learning & Big Data PDF
30 pages
Great Collection of Data Science Resources
100% (1)
Great Collection of Data Science Resources
2 pages
Data Visualization in Data Science
100% (6)
Data Visualization in Data Science
34 pages
Data Science Crash Course SharpSight
100% (6)
Data Science Crash Course SharpSight
107 pages
Python For Data Science
100% (1)
Python For Data Science
4 pages
DataScienceHandbook PDF
100% (3)
DataScienceHandbook PDF
322 pages
Introduction & Data Science Platforms
No ratings yet
Introduction & Data Science Platforms
31 pages
Data Science Solutions Sample
100% (6)
Data Science Solutions Sample
53 pages
Ai Cheat Sheet Machine Learning With Python Cheat Sheet
100% (3)
Ai Cheat Sheet Machine Learning With Python Cheat Sheet
2 pages
Data Science in Practice
No ratings yet
Data Science in Practice
34 pages
Data Science With Python
100% (3)
Data Science With Python
725 pages
Data Science Guide
No ratings yet
Data Science Guide
35 pages
Python Data Science Cookbook - Sample Chapter
100% (4)
Python Data Science Cookbook - Sample Chapter
48 pages
NumPy, SciPy, Pandas, Quandl Cheat Sheet
100% (3)
NumPy, SciPy, Pandas, Quandl Cheat Sheet
4 pages
Introduction To Data Science
75% (4)
Introduction To Data Science
74 pages
Data Science With Python - Lesson 01 - Data Science Overview
100% (5)
Data Science With Python - Lesson 01 - Data Science Overview
35 pages
Statistics For Data Science
100% (1)
Statistics For Data Science
27 pages
Introduction To Data Science
94% (16)
Introduction To Data Science
530 pages
100 Data Science Interview Questions and Answers (General)
100% (1)
100 Data Science Interview Questions and Answers (General)
11 pages
What Is Data Science GDI
0% (1)
What Is Data Science GDI
24 pages
Jupyter Notebook Cheat Sheet
No ratings yet
Jupyter Notebook Cheat Sheet
1 page
Keras
100% (1)
Keras
2 pages
AI Deep Learning Cheat Sheets-From BecomingHuman - Ai PDF
100% (3)
AI Deep Learning Cheat Sheets-From BecomingHuman - Ai PDF
25 pages
Intelligent Techniques For Data Science
100% (12)
Intelligent Techniques For Data Science
282 pages
Machine Learning Projects in Python
100% (14)
Machine Learning Projects in Python
135 pages
Guide Python Data Science
100% (2)
Guide Python Data Science
13 pages
Scikit Learn Cheat Sheet
No ratings yet
Scikit Learn Cheat Sheet
9 pages
Full Course of Machine Learning
100% (12)
Full Course of Machine Learning
660 pages
KDnuggets The Complete Collection of Data Science Cheatsheets
No ratings yet
KDnuggets The Complete Collection of Data Science Cheatsheets
17 pages
Python Data Science
100% (1)
Python Data Science
173 pages
Python For Data Science PDF
100% (3)
Python For Data Science PDF
15 pages
Matplotlib Cheat Sheet
100% (6)
Matplotlib Cheat Sheet
8 pages
Lesson 5 Data Wrangling in Data Science.
100% (1)
Lesson 5 Data Wrangling in Data Science.
11 pages
Data Analysis With PANDAS: Cheat Sheet
80% (5)
Data Analysis With PANDAS: Cheat Sheet
4 pages
Python Quick Reference Card
94% (17)
Python Quick Reference Card
17 pages
Top 9 Feature Engineering Techniques With Python: Dataset & Prerequisites
No ratings yet
Top 9 Feature Engineering Techniques With Python: Dataset & Prerequisites
27 pages
Python For Data Science
From Everand
Python For Data Science
Kevin Clark
No ratings yet
Practical Data Cleaning: Bite-Size Stats, #5
From Everand
Practical Data Cleaning: Bite-Size Stats, #5
Lee Baker
No ratings yet
Practical Data Science with Jupyter: Explore Data Cleaning, Pre-processing, Data Wrangling, Feature Engineering and Machine Learning using Python and Jupyter (English Edition)
From Everand
Practical Data Science with Jupyter: Explore Data Cleaning, Pre-processing, Data Wrangling, Feature Engineering and Machine Learning using Python and Jupyter (English Edition)
Prateek Gupta
No ratings yet
Data Science from Scratch: The #1 Data Science Guide for Everything A Data Scientist Needs to Know: Python, Linear Algebra, Statistics, Coding, Applications, Neural Networks, and Decision Trees
From Everand
Data Science from Scratch: The #1 Data Science Guide for Everything A Data Scientist Needs to Know: Python, Linear Algebra, Statistics, Coding, Applications, Neural Networks, and Decision Trees
Steven Cooper
3.5/5 (9)
NumPy Cookbook
From Everand
NumPy Cookbook
Ivan Idris
5/5 (2)
Hands-on Supervised Learning with Python
From Everand
Hands-on Supervised Learning with Python
Madeleine Shang
No ratings yet
Hands-On Data Analysis with Pandas: Efficiently perform data collection, wrangling, analysis, and visualization using Python
From Everand
Hands-On Data Analysis with Pandas: Efficiently perform data collection, wrangling, analysis, and visualization using Python
Stefanie Molin
No ratings yet
R Data Science Essentials
From Everand
R Data Science Essentials
Sharan Kumar Ravindran
2/5 (1)
Hands-on Data Analysis and Visualization with Pandas: Engineer, Analyse and Visualize Data, Using Powerful Python Libraries
From Everand
Hands-on Data Analysis and Visualization with Pandas: Engineer, Analyse and Visualize Data, Using Powerful Python Libraries
PURNA CHANDER RAO. KATHULA
5/5 (1)
Data Analytics with Python: Data Analytics in Python Using Pandas
From Everand
Data Analytics with Python: Data Analytics in Python Using Pandas
Frank Millstein
3/5 (1)
Mastering Python Regular Expressions
From Everand
Mastering Python Regular Expressions
Victor Romero
4.5/5 (2)
Machine Learning - A Comprehensive, Step-by-Step Guide to Learning and Applying Advanced Concepts and Techniques in Machine Learning: 3
From Everand
Machine Learning - A Comprehensive, Step-by-Step Guide to Learning and Applying Advanced Concepts and Techniques in Machine Learning: 3
Peter Bradley
No ratings yet
Product Recommendation Eaton Fuller Heavy-Duty Transmissions 13 - 18 Speed RT-6613
No ratings yet
Product Recommendation Eaton Fuller Heavy-Duty Transmissions 13 - 18 Speed RT-6613
2 pages
Isolation Precautions and Use of Personal Protective Equipments1
No ratings yet
Isolation Precautions and Use of Personal Protective Equipments1
65 pages
Time Value of Money
No ratings yet
Time Value of Money
73 pages
LTC2-A2001NV4 User Manual
No ratings yet
LTC2-A2001NV4 User Manual
52 pages
CCNA Descovery 2 Lacture 1&2
No ratings yet
CCNA Descovery 2 Lacture 1&2
5 pages
Chapter 5 Internal Enviroment Analysis PDF
No ratings yet
Chapter 5 Internal Enviroment Analysis PDF
31 pages
Untitled2.ipynb - Colab-Exp2
No ratings yet
Untitled2.ipynb - Colab-Exp2
2 pages
30 Day Challenge Meal-Plan - Week 1
No ratings yet
30 Day Challenge Meal-Plan - Week 1
39 pages
Brochure Sikaproof Bentonite v0514 NZ
No ratings yet
Brochure Sikaproof Bentonite v0514 NZ
4 pages
Service Ai-2301l 3010l
No ratings yet
Service Ai-2301l 3010l
478 pages
CIAC Arbitration EO 1008 Ceniza Lecture - 38 Pages
No ratings yet
CIAC Arbitration EO 1008 Ceniza Lecture - 38 Pages
38 pages
Catalogue Wonil - KOREA (English)
No ratings yet
Catalogue Wonil - KOREA (English)
68 pages
Mystery School Code Review
No ratings yet
Mystery School Code Review
4 pages
Advanced Fired Boilers: Oil and Gas
No ratings yet
Advanced Fired Boilers: Oil and Gas
12 pages
Compass Maritime Services, LLC: Valuing Ships Courseware 9-211-702
No ratings yet
Compass Maritime Services, LLC: Valuing Ships Courseware 9-211-702
5 pages
The Ceecec Handbook
No ratings yet
The Ceecec Handbook
533 pages
Kinetic Midterm Notes 1 PDF
No ratings yet
Kinetic Midterm Notes 1 PDF
16 pages
EVE_ESS_HVI-60.0_User manual_P_V1
No ratings yet
EVE_ESS_HVI-60.0_User manual_P_V1
79 pages
Instant download (Ebook) Structural Concrete: Strut-and-Tie Models for Unified Design by Chen, Wai-Fah; El-Metwally, Salah El-Din E ISBN 9781498783842, 1498783848 pdf all chapter
100% (10)
Instant download (Ebook) Structural Concrete: Strut-and-Tie Models for Unified Design by Chen, Wai-Fah; El-Metwally, Salah El-Din E ISBN 9781498783842, 1498783848 pdf all chapter
65 pages
Operating Manual: Electric Dust Catcher
No ratings yet
Operating Manual: Electric Dust Catcher
25 pages
Wins For The Week 5 February 2016
No ratings yet
Wins For The Week 5 February 2016
3 pages
5CO02 - 20s-Khalid Qasem - AR1 Report - V2
No ratings yet
5CO02 - 20s-Khalid Qasem - AR1 Report - V2
9 pages
Contacts No
No ratings yet
Contacts No
105 pages
Alternative Wall Technologies
No ratings yet
Alternative Wall Technologies
9 pages
Personality Types Predicting Social Media Behavior Spredfast Smart Social Report
No ratings yet
Personality Types Predicting Social Media Behavior Spredfast Smart Social Report
15 pages
Big Blue CuZn Fast Flow Cartridges
No ratings yet
Big Blue CuZn Fast Flow Cartridges
2 pages
R&D Update - Edge Fracture in Hole Extrusion and Flanging, Part II - The Fabricator
No ratings yet
R&D Update - Edge Fracture in Hole Extrusion and Flanging, Part II - The Fabricator
9 pages
A. LP-In-Matter (Properties of Matter)
No ratings yet
A. LP-In-Matter (Properties of Matter)
13 pages
Bplo PDF
No ratings yet
Bplo PDF
5 pages

Data Science Basics Cheatsheet

Uploaded by

Data Science Basics Cheatsheet

Uploaded by

Python Cheatsheet:

Data Science Basics

SETUP Data Cleaning

Exploring Data Filter, Sort and Group By

You might also like