M56. Basic Data Analytics Using Python
Anaconda
• An application for programming in Python and R
• Facilitates data analytics and machine learning
• Open source, free distribution
Python Libraries for Data Analytics
No Library Functions
1 Pandas (Python Data Analysis)
• Indexing, manipulating, renaming, sorting, and merging data frames
• Update, add, or delete columns in a data frame
• Impute missing values and handle missing data (NaNs)
• Plot data with a histogram or box plot
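The Pandas operations listed above can be sketched as follows. This is a minimal illustration, not a prescribed workflow: the data frame, its column names, and its values are made up for the example.

```python
import numpy as np
import pandas as pd

# Hypothetical sample data; names and values are illustrative only.
df = pd.DataFrame(
    {"name": ["Ana", "Budi", "Citra", "Dewi"],
     "score": [82.0, np.nan, 91.0, 77.0]}
)

# Rename a column and sort by it (rows with NaN are placed last by default).
df = df.rename(columns={"score": "exam_score"}).sort_values("exam_score")

# Impute the missing value with the mean of the observed values.
mean_score = df["exam_score"].mean()
df["exam_score"] = df["exam_score"].fillna(mean_score)

# Add a derived column, then build a copy without it.
df["passed"] = df["exam_score"] >= 80
df_without_flag = df.drop(columns=["passed"])

# Merge with a second data frame on the shared "name" key.
cities = pd.DataFrame({"name": ["Ana", "Budi"], "city": ["Bandung", "Medan"]})
merged = df.merge(cities, on="name", how="left")
print(merged)
```

With Matplotlib installed, `df["exam_score"].plot.hist()` or `df.boxplot(column="exam_score")` would draw the histogram or box plot mentioned in the last bullet.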
2 NumPy
• Basic array operations: add, multiply, slice, flatten, reshape, and index arrays
• Advanced array operations: stack arrays, split them into sections, broadcast arrays
• Work with date-time values or linear algebra
• Basic slicing and advanced indexing
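A short sketch of the NumPy operations named above. The arrays are arbitrary small examples chosen so the results are easy to check by hand.

```python
import numpy as np

a = np.arange(6).reshape(2, 3)          # [[0, 1, 2], [3, 4, 5]]
b = np.ones((2, 3), dtype=int)

total = a + b                           # element-wise addition
scaled = a * 2                          # element-wise multiplication
flat = a.flatten()                      # 1-D copy: [0, 1, 2, 3, 4, 5]
first_col = a[:, 0]                     # basic slicing: [0, 3]
evens = a[a % 2 == 0]                   # boolean (advanced) indexing: [0, 2, 4]

stacked = np.vstack([a, b])             # stack row-wise, shape (4, 3)
left, right = np.hsplit(stacked, [1])   # split columns into widths 1 and 2

# Broadcasting: the 1-D row is stretched across both rows of `a`.
shifted = a + np.array([10, 20, 30])

# Linear algebra: solve the system M @ x = y.
M = np.array([[2.0, 1.0], [1.0, 3.0]])
y = np.array([3.0, 5.0])
x = np.linalg.solve(M, y)

# Date-time: difference between two dates, in days.
days = np.datetime64("2024-03-01") - np.datetime64("2024-02-01")
```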
3 SciPy
• Perform common scientific programming tasks such as linear algebra, integration, calculus, ordinary differential equations, and signal processing
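Three of the SciPy tasks listed above, sketched on textbook problems with known answers. The specific functions chosen (`quad`, `det`, `solve_ivp`) are one reasonable selection from the library, not the only option.

```python
import numpy as np
from scipy import integrate, linalg

# Integration: area under sin(x) from 0 to pi (the exact value is 2).
area, abs_err = integrate.quad(np.sin, 0.0, np.pi)

# Linear algebra: determinant of a 2x2 matrix (1*4 - 2*3 = -2).
det = linalg.det(np.array([[1.0, 2.0], [3.0, 4.0]]))

# Ordinary differential equation: dy/dt = -y with y(0) = 1,
# whose exact solution is exp(-t).
sol = integrate.solve_ivp(
    lambda t, y: -y, t_span=(0.0, 1.0), y0=[1.0], rtol=1e-8, atol=1e-10
)
y_end = sol.y[0, -1]                    # numerical value of y at t = 1

print(area, det, y_end)
```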
4 Matplotlib
• Line plots
• Scatter plots
• Area plots
• Bar charts and Histograms
• Pie charts
• Stem plots
• Contour plots
• Quiver plots
• Spectrograms
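Four of the plot types above, drawn on one figure as a rough sketch. The data are synthetic, and the `Agg` backend is selected only so the script runs without a display.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend; no window is opened
import matplotlib.pyplot as plt
import numpy as np

x = np.linspace(0, 2 * np.pi, 50)
rng = np.random.default_rng(0)          # fixed seed; synthetic data
samples = rng.normal(size=200)

fig, axes = plt.subplots(2, 2, figsize=(8, 6))
axes[0, 0].plot(x, np.sin(x))                    # line plot
axes[0, 1].scatter(x, np.cos(x), s=10)           # scatter plot
axes[1, 0].bar(["A", "B", "C"], [3, 5, 2])       # bar chart
axes[1, 1].hist(samples, bins=20)                # histogram

for ax, title in zip(axes.flat, ["Line", "Scatter", "Bar", "Histogram"]):
    ax.set_title(title)

fig.tight_layout()
```

In an interactive session `plt.show()` displays the figure, and `fig.savefig(...)` writes it to a file.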
5 Seaborn
• Determine relationships between multiple variables (correlation)
• Observe categorical variables for aggregate statistics
• Analyze univariate or bivariate distributions and compare them across different data subsets
• Plot linear regression models for dependent variables
• Provide high-level abstractions for multi-plot grids
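A small sketch of three Seaborn uses from the list above: a correlation heat map, a categorical box plot, and a regression plot. The data set is generated on the spot; the column names `hours`, `score`, and `group` are hypothetical.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend; no window is opened
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd
import seaborn as sns

# Synthetic data: score rises roughly linearly with hours, plus noise.
rng = np.random.default_rng(1)
n = 100
df = pd.DataFrame({
    "hours": rng.uniform(0, 10, n),
    "group": rng.choice(["A", "B"], n),
})
df["score"] = 50 + 4 * df["hours"] + rng.normal(0, 5, n)

corr = df[["hours", "score"]].corr()    # pairwise correlation matrix

fig, (ax1, ax2, ax3) = plt.subplots(1, 3, figsize=(12, 4))
sns.heatmap(corr, annot=True, ax=ax1)                    # correlation
sns.boxplot(data=df, x="group", y="score", ax=ax2)       # categorical summary
sns.regplot(data=df, x="hours", y="score", ax=ax3)       # linear regression fit
fig.tight_layout()
```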
6 Scikit-learn
• Classification: spam detection, image recognition
• Regression: drug response, stock prices
• Clustering: customer segmentation, grouping experiment outcomes
• Dimensionality reduction: visualization, increased efficiency
• Model selection: improved accuracy via parameter tuning
• Pre-processing: transforming input data such as text for use with machine learning algorithms
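Several of the Scikit-learn task types above can be combined in one pipeline. The sketch below chains pre-processing, dimensionality reduction, and classification, then tunes one parameter by cross-validation. The bundled iris data set, the choice of logistic regression, and the grid of `C` values are assumptions made for the example.

```python
from sklearn.datasets import load_iris
from sklearn.decomposition import PCA
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)

# Pre-processing -> dimensionality reduction -> classification.
pipe = make_pipeline(
    StandardScaler(),
    PCA(n_components=2),
    LogisticRegression(max_iter=1000),
)

# Model selection: pick the regularization strength C by 5-fold cross-validation.
search = GridSearchCV(pipe, {"logisticregression__C": [0.1, 1.0, 10.0]}, cv=5)
search.fit(X_train, y_train)

accuracy = search.score(X_test, y_test)  # accuracy on the held-out test set
print(search.best_params_, accuracy)
```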
7 Statsmodels
• Linear regression
• Correlation
• Ordinary Least Squares (OLS)
• Survival analysis
• Generalized linear models and Bayesian models
• Univariate and bivariate analysis, hypothesis testing
8 Plotly
• Basic charts: line, pie, scatter, bubble, dot, Gantt, sunburst, treemap, Sankey, filled area charts
• Statistical and Seaborn-style charts: error bars, box plots, histograms, facet and trellis plots, tree plots, violin plots, trend lines
• Scientific charts: contour, ternary, log, quiver, carpet, radar, heat maps, wind rose and polar plots
• Financial Charts
• Maps
• Subplots
• Transforms
• Jupyter Widgets Interaction
9 TensorFlow
• Voice/sound recognition: IoT, automotive, security, UX/UI, telecom
• Sentiment analysis: CRM or CX
• Text-based apps: threat detection, Google Translate, Gmail Smart Reply
• Face recognition: Facebook's DeepFace, photo tagging, smart unlock
• Time series: recommendations from Amazon, Google, and Netflix
• Video detection: motion detection, real-time threat detection in gaming, security, and airports
10 Keras
• Determine percentage accuracy
• Compute the loss function
• Create custom function layers
• Built-in data and image processing
• Build models from repeated code blocks: 20, 50, or 100 layers deep
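The Keras points about repeated blocks, loss, and accuracy can be illustrated with a loop that stacks identical layers. This sketch assumes Keras is used through TensorFlow (`from tensorflow import keras`); `build_model` is a hypothetical helper, and the training data are random numbers, so the reported accuracy has no meaning beyond showing the API.

```python
import numpy as np
from tensorflow import keras


def build_model(n_hidden_blocks: int) -> keras.Model:
    """Stack `n_hidden_blocks` identical Dense layers using a loop."""
    inputs = keras.Input(shape=(4,))
    x = inputs
    for _ in range(n_hidden_blocks):
        x = keras.layers.Dense(16, activation="relu")(x)
    outputs = keras.layers.Dense(3, activation="softmax")(x)
    return keras.Model(inputs, outputs)


model = build_model(n_hidden_blocks=20)   # 20 hidden layers plus the output layer
model.compile(
    optimizer="adam",
    loss="sparse_categorical_crossentropy",
    metrics=["accuracy"],
)

# Random placeholder data: 64 samples, 4 features, 3 classes.
rng = np.random.default_rng(0)
X = rng.normal(size=(64, 4)).astype("float32")
y = rng.integers(0, 3, size=64)

model.fit(X, y, epochs=1, batch_size=16, verbose=0)
loss, accuracy = model.evaluate(X, y, verbose=0)   # loss value and accuracy fraction
```

Changing `n_hidden_blocks` to 50 or 100 gives the deeper stacks mentioned in the last bullet without writing each layer by hand.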
Tutorial 1: Console