Python Advanced for Data Practitioners (2-day class)

April 23, 2018 - 12:00am to April 24, 2018 - 12:00am

https://uit.stanford.edu/service/techtraining/class/python-advanced-data-practitioners-2-day-class

Date: Mon Apr 23, 9:00 am to 4:00 pm; Tue Apr 24, 9:00 am to 4:00 pm
Cost: $650
Class Code: ITS-2556

Class Description

Already know Python? Now it is time to acquire the knowledge needed to start using Python in data analytics tasks to scale up and automate processes.

Prerequisites: Basic knowledge of Python or another programming language.

Taught by Arafat Mokhtar, a Business Intel Analyst at Stanford School of Medicine, this class will help you leverage your Python skills and venture into the field of data analytics. We will introduce concepts of data manipulation and analysis, such as DataFrames and visualization techniques, using the Python data science libraries Pandas, Numpy, and Matplotlib. We will also briefly discuss machine learning libraries such as Sklearn.

By the end of this course, you will be able to accomplish common data analytics tasks such as preparing, aggregating, and summarizing data, finding patterns, and developing ways to automate manual processes, all in Python code.

Planned topics

Day 1 - Morning:
1. Review common Python functionality and data structures used in data science
2. Learn the most important Python libraries in data science (Pandas, Numpy, Matplotlib)
3. Hands-on: Python functionality and dataframes

Day 1 - Afternoon:
1. Read and write data from/to different formats (Excel, CSV, text, JSON, etc.)
2. Cleanse and select important records from dataframes
3. Deal with missing data: identify, replace, and eliminate records
4. Sort dataframes by multiple columns
5. Hands-on: Data manipulation with Pandas

Day 2 - Morning:
1. Use apply, lambda, filter, and map
2. Merge/join dataframes by foreign keys
3. Learn pivot tables in Pandas
4. Hands-on: Data aggregation and summarization

Day 2 - Afternoon:
1. Learn data visualization with the libraries Matplotlib and Seaborn
2. Introduction to the machine learning library Sklearn
3. Apply linear and logistic regression with Sklearn
4. Hands-on: Data predictions

Instructor information

Arafat Mokhtar is a Business Intel Analyst at Stanford School of Medicine. He supports the Human Resources Group with data collection, validation, cleansing, and analytics, providing actionable insights that leadership uses to make data-driven decisions about the organization's workforce. He develops code in the Python scientific stack (Pandas, Numpy, Matplotlib, Sklearn, etc.) to automate data analytics processes. He also proposes data solutions and develops measurable business metrics. He has a Ph.D. in Particle Physics from Tel Aviv University, along with several years of Python and R work experience.