Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
0% found this document useful (0 votes)
3 views

AIC3 - Python for data analysis - Scheda

The 'Python for Data Analysis' course at Bocconi University, taught by Ivan Renesto, focuses on teaching participants how to manipulate, process, and clean data using Python's libraries. It is aimed at students interested in data analysis and requires basic knowledge of Python. The course consists of 16 hours of synchronous remote learning, culminating in a final exam that must be taken in person.

Uploaded by

felipe b
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
3 views

AIC3 - Python for data analysis - Scheda

The 'Python for Data Analysis' course at Bocconi University, taught by Ivan Renesto, focuses on teaching participants how to manipulate, process, and clean data using Python's libraries. It is aimed at students interested in data analysis and requires basic knowledge of Python. The course consists of 16 hours of synchronous remote learning, culminating in a final exam that must be taken in person.

Uploaded by

felipe b
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

unibocconi.

eu

Python for Data Analysis


Lecturer: Ivan Renesto

Course language
English

Course description and objectives


Python is a widely used high-level, general-purpose, interpreted, dynamic programming
language.
Through this course you will learn how to manipulate, process, and clean data with
Python, using its data-oriented library ecosystem and tools that will lay the foundations
to let you become an effective data analyst.

At the end of the course, participants will be able to:


• work with arrays and vectorized computation
• work with tabular or heterogeneous data
• plot and visualize data

Audience
The course is open to all students of Bocconi University. It is aimed at:
• those who want to approach the world of data analysis;
• students who want to acquire the basic knowledge to develop future expertise
in the area of Data Science;
• those who are interested in facing advanced topics in Python or are planning
to be part of projects where to extract information from a data set.

Prerequisites
Knowledge of Python basics, having attended the curricular course 30424 Computer
Science, or the extracurricular course: Python start, or having equivalent knowledge and
skills.
unibocconi.eu

Duration
16 hours

Teaching mode
Distance learning. Lessons will take place exclusively in synchronous remote mode.

The final test on the last day of class, however, can only be taken in physical presence.
Online mode will not be provided.

Calendar

Lecture Date Time Room


1 Thu 07/11/2024 18.15 - 19.45 Virtual room
2 Tue 12/11/2024 18.15 - 19.45 Virtual room
3 Thu 14/11/2024 18.15 - 19.45 Virtual room
4 Tue 19/11/2024 18.15 - 19.45 Virtual room
5 Thu 21/11/2024 18.15 - 19.45 Virtual room
6 Tue 26/11/2024 18.15 - 19.45 Virtual room
7 Thu 28/11/2024 18.15 - 19.45 Virtual room
8 Tue 03/12/2024 18.15 - 19.45 InfoAS04/05

Syllabus of the course

Lecture Topics Book reference

1 Introduction to Visual Studio Code Ch. 1, 2, and 3


- Preliminaries
- Install Visual Studio Code
- Walk through the development environment
- Built-in data structures and sequences.
Exercises
2 Arrays and vectorized computation Ch. 4
- NumPy basics
- Working with multidimensional array objects
- Indexing, slicing, and transposing arrays
- Array-Oriented Programming
- Mathematical and statistical methods.
unibocconi.eu

Lecture Topics Book reference

3 Plotting and visualization Ch. 9


- Data visualization using matplotlib
- Figures and Axes
- Saving figures to file
- Sub-plots
- Multiple line plots
- Colors, line styles, axes limits, labels plot title,
legend and other chart elements
- Histograms.
4 Data manipulation with pandas Ch. 5
- Pandas basics
- Introduction to Series, DataFrame, Index objects
- Essential functionalities of pandas library
- Summary statistics methods
- Data visualization using pandas.
Exercises
5 Problem requiring data analysis Ch. 6
- Data loading, storage and file formats
- Dataset analysis
- Reading and writing data in text format
- Interacting with Web APIs
- Interacting with Databases via pyodbc.
Exercises
6 Data Cleaning and Preparation Ch. 7
- Handling missing data
- Data formatting and string manipulation
- Data transformation (normalization and binning)
- Categorical values
Exercises
7 Exploratory Data Analysis Ch. 8, 10, 12
- Descriptive statistics
- GroupBy mechanics
- The analysis of variance
- Correlation between different variables
- Pearson correlation and correlation heatmaps.
Exercises
8 Final Exam

Software used
Python, version 3.9+. Current version is 3.12.6.
unibocconi.eu

Python interpreter can be downloaded for free from here:


https://www.python.org/downloads/.

Microsoft Visual Studio Code (VS Code). Current version is 1.93.


Visual Studio Code is a free coding editor that helps to start coding quickly. It supports
multiple programming languages, and the use of a Python web-based interactive
computing platform (Jupyter Notebook).
Supported in: Windows 10 and 11 (64-bit and Arm64), macOS 10.15+ versions with
Apple security update support (Intel chip, or Apple silicon), Linux Ubuntu, Debian, Red
Hat, Fedora, or SUSE.

VS Code can be downloaded from here: https://code.visualstudio.com/download.

Suggested bibliography
McKinney W., Python for Data Analysis, second edition. Data Wrangling with Pandas,
NumPy and IPython, O’Reilly Media, 2017

Available seats
This activity is limited to 110 participants and reserved to students of the Master of
Science Programs. Registrations cannot be carried out once this number has been
reached or after closing of the registration period.

You might also like