Time Series Using Python

Time series

Uploaded by

graduation


What is time series analysis and what are the benefits?
• A time series analysis focuses on a series of data points ordered in
time. This is one of the most widely used data science analyses and is
applied in a variety of industries.
• The dataset contains two years of historical daily sales data for a global
retail widget company, with dates falling between 2017 and 2019.
• The dummy dataset was first prepared with the 'pandas' package, so it is
slightly cleaner than most real-life datasets.
• Aggregate the daily data into weeks before starting the analysis.
• Index the data by time so that each row is identified by a date rather
than a standard integer. Since the data is weekly, the values in the first
column are in YYYY-MM-DD date format and show the Monday of each week.
Data Import and Aggregation

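import pandas as pd

# Import the data and parse the date column
df = pd.read_csv("Blog_Orders.csv")
df['Date'] = pd.to_datetime(df['Date'])

# Set the date as the index (sorted, so that date slicing works)
df = df.set_index('Date').sort_index()

# Select the proper time period and sum into Monday-to-Sunday weeks,
# labelling each week by its Monday
df = df.loc['2017-01-02':'2019-12-29']
df = df.resample('W-MON', closed='left', label='left').sum()
df.head()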
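Because Blog_Orders.csv is not included here, the aggregation step can be tried on a synthetic stand-in. The sketch below is a minimal example under stated assumptions: the column names Date and Orders mirror the code above, while the generated values and the helper name to_weekly are hypothetical. It sums daily rows into Monday-to-Sunday weeks and labels each week by its Monday.

```python
import numpy as np
import pandas as pd

def to_weekly(daily: pd.DataFrame) -> pd.DataFrame:
    """Sum daily rows into Monday-to-Sunday weeks, labelled by the Monday."""
    return daily.resample("W-MON", closed="left", label="left").sum()

# Hypothetical daily data: four full weeks starting on Monday 2017-01-02
dates = pd.date_range("2017-01-02", periods=28, freq="D")
daily = pd.DataFrame({"Orders": np.arange(1, 29)}, index=dates)
daily.index.name = "Date"

weekly = to_weekly(daily)
print(weekly)
```

Checking weekly.index.dayofweek (0 means Monday) is a quick way to confirm which day labels each week, since a plain resample('W') labels weeks by their closing Sunday.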
Examine and Prepare Your Dataset for Modeling
Check the Data for Common Time Series Patterns:
• It’s important to check any time series data for patterns that can affect the results and
inform which forecasting model to use. Some common time series data patterns are:

Pattern: Description
Level: The average value in the series
Trend: Increases, decreases, or stays the same over time
Seasonal or Periodic: Pattern repeats periodically over time
Cyclical: Pattern that increases and decreases but is usually related to non-seasonal activity, like business cycles
Random or Irregular Variations: Increases and decreases that don’t have any apparent pattern

• Most time-series data will contain one or more, but probably not all of these patterns. It’s
still a good idea to check for them since they can affect the performance of the model and
may even require different modeling approaches.
• Two great methods for finding these data patterns are visualization and decomposition.
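To make the patterns listed above concrete, here is a small sketch that builds a synthetic weekly series from a level, a trend, a seasonal component, and random noise. All numbers (level of 100, slope of 0.5, 52-week period, noise scale) are assumptions chosen for illustration and are not taken from the sales dataset; a cyclical component is left out for brevity.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)                  # fixed seed so the sketch is repeatable
weeks = pd.date_range("2017-01-02", periods=156, freq="W-MON")
t = np.arange(len(weeks))

level = 100.0                                   # average value of the series
trend = 0.5 * t                                 # steady increase over time
seasonal = 10.0 * np.sin(2 * np.pi * t / 52)    # repeats every 52 weeks
noise = rng.normal(0.0, 3.0, size=len(weeks))   # irregular variation

synthetic = pd.Series(level + trend + seasonal + noise, index=weeks, name="Orders")
print(synthetic.head())
```

A series built this way is handy for trying out the checks that follow, because the true components are known in advance.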
Visualize the Data
• The first step is simply to plot the dataset. The example uses the matplotlib package. Since a
general trend is easier to see in the mean, the plot shows both the original weekly data (blue line)
and the monthly average of the resampled data (orange line).
• By changing the 'M' (month) alias within y.resample('M'), you can plot the mean for different
aggregation periods. For example, if you have a very long history of data, you might plot the yearly
average by changing 'M' to 'Y'.

import matplotlib.pyplot as plt

y = df['Orders']

# Plot the weekly series together with its monthly mean
# (pandas 2.2 and later prefer the alias 'ME' over 'M')
fig, ax = plt.subplots(figsize=(20, 6))
ax.plot(y, marker='.', linestyle='-', linewidth=0.5, label='Weekly')
ax.plot(y.resample('M').mean(), marker='o', markersize=8, linestyle='-', label='Monthly Mean Resample')
ax.set_ylabel('Orders')
ax.legend()
plt.show()
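The monthly and yearly averages can also be computed without plotting, which makes it easier to inspect the numbers. The sketch below is one possible alternative to resample, not the method used in the slides: it groups by calendar period with to_period, which avoids the resample alias rename introduced in pandas 2.2. The series y_demo is a hypothetical stand-in for df['Orders'].

```python
import numpy as np
import pandas as pd

# Hypothetical weekly series standing in for df['Orders']
weeks = pd.date_range("2017-01-02", periods=104, freq="W-MON")
y_demo = pd.Series(np.arange(104, dtype=float), index=weeks, name="Orders")

# Group by calendar period instead of calling resample with a frequency alias
monthly_mean = y_demo.groupby(y_demo.index.to_period("M")).mean()
yearly_mean = y_demo.groupby(y_demo.index.to_period("Y")).mean()

print(monthly_mean.head())
print(yearly_mean)
```

The result is indexed by period (for example 2017-01) rather than by a timestamp, so it may need converting with to_timestamp() before being drawn on the same axes as the weekly data.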
Decompose the Data
• By looking at the graph of sales data above, we can see a general
increasing trend with no clear pattern of seasonal or cyclical changes.
• The next step is to decompose the data to view more of the complexity
behind the linear visualization.
• A useful Python function called seasonal_decompose within the
'statsmodels' package can help us to decompose the data into four
different components:
• Observed
• Trend
• Seasonal
• Residual

import statsmodels.api as sm

# Plot the observed, trend, seasonal and residual components
def seasonal_decompose(y):
    decomposition = sm.tsa.seasonal_decompose(y, model='additive', extrapolate_trend='freq')
    fig = decomposition.plot()
    fig.set_size_inches(14, 7)
    plt.show()
seasonal_decompose(y)

After looking at the four decomposed graphs, we can tell that our
sales dataset has an overall increasing trend as well as a yearly seasonality.
Your choice of model will depend on which components, such as trend,
seasonality, or cycles, are present in your dataset.
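To show roughly what an additive decomposition computes, here is a simplified sketch. It is not the statsmodels algorithm (which, among other things, handles even periods and trend extrapolation differently); the function name naive_additive_decompose and the synthetic input are hypothetical.

```python
import numpy as np
import pandas as pd

def naive_additive_decompose(series: pd.Series, period: int):
    """Split a series into trend + seasonal + residual (additive model)."""
    # Trend: centred moving average over one full seasonal period
    trend = series.rolling(window=period, center=True).mean()
    detrended = series - trend
    # Seasonal: average detrended value at each position within the period
    position = np.arange(len(series)) % period
    seasonal_means = detrended.groupby(position).mean()
    seasonal_means -= seasonal_means.mean()   # centre the seasonal effect on zero
    seasonal = pd.Series(seasonal_means.loc[position].to_numpy(), index=series.index)
    residual = series - trend - seasonal
    return trend, seasonal, residual

# Hypothetical input: linear trend plus a sine pattern repeating every 12 points
t = np.arange(120)
demo = pd.Series(50.0 + 0.3 * t + 5.0 * np.sin(2 * np.pi * t / 12))
trend, seasonal, residual = naive_additive_decompose(demo, period=12)
print(seasonal.head(12).round(3))
```

Wherever the trend is defined, the three parts add back up to the observed series, which is what "additive" means here. The trend is missing at both ends because the centred window is incomplete there.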
Check for Stationarity
• Next, we need to check whether the dataset is stationary or not. A dataset
is stationary if its statistical properties like mean, variance, and
autocorrelation do not change over time.
• Most time series datasets related to business activity are not stationary
since there are usually all sorts of non-stationary elements like trends and
economic cycles.
• But since most time series forecasting models rely on stationarity, and on
mathematical transformations related to it, to make predictions, we need
to ‘stationarize’ the time series as part of the process of fitting a model.
• Two common methods to check for stationarity are Visualization and the
Augmented Dickey-Fuller (ADF) Test. Python makes both approaches
easy:
Visualization - Check for Stationarity
This method graphs the rolling statistics (mean and standard deviation) to show at a glance
whether they change substantially over time:

# Plot rolling statistics for testing stationarity
def test_stationarity(timeseries, title):
    # Determine rolling statistics over a 12-period window
    rolmean = pd.Series(timeseries).rolling(window=12).mean()
    rolstd = pd.Series(timeseries).rolling(window=12).std()

    fig, ax = plt.subplots(figsize=(16, 4))
    ax.plot(timeseries, label=title)
    ax.plot(rolmean, label='rolling mean')
    ax.plot(rolstd, label='rolling std')
    ax.legend()
    plt.show()

pd.options.display.float_format = '{:.8f}'.format
test_stationarity(y, 'raw data')

Both the mean and the standard deviation of stationary data do not change much
over time. But in this case, since the y-axis has such a large scale, we cannot
confidently conclude whether our data is stationary simply by viewing the above
graph. Therefore, we should do another test of stationarity.
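One way around the axis-scale problem is to summarise the rolling statistics as numbers instead of reading them off a chart. The sketch below is an informal heuristic, not a formal test; the function name rolling_drift, the normalisation by the median rolling standard deviation, and the synthetic series are all assumptions made for illustration.

```python
import numpy as np
import pandas as pd

def rolling_drift(series: pd.Series, window: int = 12) -> dict:
    """Measure how far the rolling mean and std move, in units of typical local spread."""
    rolmean = series.rolling(window=window).mean().dropna()
    rolstd = series.rolling(window=window).std().dropna()
    local_spread = rolstd.median()
    return {
        "mean_range": (rolmean.max() - rolmean.min()) / local_spread,
        "std_range": (rolstd.max() - rolstd.min()) / local_spread,
    }

rng = np.random.default_rng(1)
noise = pd.Series(rng.normal(0.0, 1.0, size=200))   # no trend
trending = noise + 0.5 * np.arange(200)             # same noise plus a linear trend

noise_drift = rolling_drift(noise)
trending_drift = rolling_drift(trending)
print(noise_drift)
print(trending_drift)
```

A much larger mean_range for one series than for the other suggests that its mean is drifting over time, which is a hint of non-stationarity rather than proof of it.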
Augmented Dickey-Fuller Test
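• The ADF approach is essentially a statistical significance test: it computes a test statistic,
compares it with critical values (or, equivalently, compares the p-value with a chosen
significance level), and tests the null hypothesis that the series has a unit root, that is,
that it is non-stationary.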
• Using this test, we can determine whether the processed data is stationary or not
with different levels of confidence.
# Augmented Dickey-Fuller Test
from statsmodels.tsa.stattools import adfuller

def ADF_test(timeseries, dataDesc):
    print(' > Is the {} stationary ?'.format(dataDesc))
    dftest = adfuller(timeseries.dropna(), autolag='AIC')
    print('Test statistic = {:.3f}'.format(dftest[0]))
    print('P-value = {:.3f}'.format(dftest[1]))
    print('Critical values :')
    # The series is treated as stationary at a given level when the
    # test statistic is below that level's critical value
    for k, v in dftest[4].items():
        verdict = 'not ' if v < dftest[0] else ''
        print('\t{}: {} - The data is {}stationary with {}% confidence'.format(
            k, v, verdict, 100 - int(k[:-1])))
ADF_test(y,'raw data')

Looking at both the visualization and ADF test, we can tell that our
sample sales data is non-stationary.
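The decision rule behind the printed ADF output can be isolated in a few lines. The sketch below is illustrative only: the helper name adf_verdicts is hypothetical, and the test statistic and critical values are made-up numbers, not results from the sales data.

```python
def adf_verdicts(test_statistic: float, critical_values: dict) -> dict:
    """Map each significance level to True when the unit-root null is rejected."""
    # The null hypothesis of the ADF test is that the series has a unit root
    # (is non-stationary). It is rejected when the test statistic is more
    # negative than the critical value for that level.
    return {level: test_statistic < crit for level, crit in critical_values.items()}

# Hypothetical numbers for illustration only, not results from the sales data
example_critical_values = {"1%": -3.47, "5%": -2.88, "10%": -2.58}
verdicts = adf_verdicts(-3.10, example_critical_values)
print(verdicts)
```

With these example numbers the null is rejected at the 5% and 10% levels but not at the 1% level, so the conclusion depends on how strict a level is chosen.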
Make the Data Stationary - Detrending
• To proceed with our time series analysis, we need to stationarize the dataset. There
are many approaches to stationarize data, but we’ll use de-trending, differencing,
and then a combination of the two.

• This detrending method removes the underlying trend in the time series by subtracting
the 12-period rolling mean and dividing by the 12-period rolling standard deviation:
# Detrending
y_detrend = (y - y.rolling(window=12).mean()) / y.rolling(window=12).std()

test_stationarity(y_detrend, 'de-trended data')
ADF_test(y_detrend, 'de-trended data')

The results show that the data is now stationary, as indicated by the relative smoothness of
the rolling mean and rolling standard deviation and by running the ADF test again.
Differencing
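This method removes the underlying seasonal or cyclical patterns in the time
series. The example uses a 12-lag difference, which subtracts from each value the
value 12 periods earlier. Note that on weekly data 12 lags span 12 weeks, so a
yearly seasonal pattern would correspond to a lag of about 52: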
# Differencing
y_12lag = y - y.shift(12)
test_stationarity(y_12lag,'12 lag differenced data')
ADF_test(y_12lag,'12 lag differenced data')
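A small synthetic example shows why lagged differencing removes a repeating pattern. The series below is hypothetical: a linear trend plus a pattern that repeats exactly every 12 periods.

```python
import numpy as np
import pandas as pd

t = np.arange(60)
pattern = np.array([5, 3, 0, -2, -4, -5, -4, -2, 0, 2, 4, 3], dtype=float)
# Hypothetical series: linear trend plus a pattern repeating every 12 periods
y_demo = pd.Series(100.0 + 1.5 * t + pattern[t % 12])

y_demo_12lag = y_demo - y_demo.shift(12)   # same result as y_demo.diff(12)

print(y_demo_12lag.dropna().unique())
```

The repeating pattern cancels because each value is compared with the value at the same position in the previous cycle, and the linear trend collapses to a constant (the slope times the lag). The first 12 values are lost because they have no earlier value to subtract.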
