Data Wrangling with Python - Intermediate - DAY 3
Description
This course builds on the skills learned in Foundation and Foundation Plus to help attendees use core data science techniques to deliver rapid statistics and analysis. This Intermediate course will teach attendees how to deliver scripted data wrangling with provable results. Attendees will be able to demonstrate how repeatable data wrangling pipelines increase speed, efficiency and accuracy, and will learn more advanced scripting techniques to expand their skill portfolio.
2023 Online Platform and Software Requirements
Computer with video camera, speakers and microphone
Stable internet connection
Mains power
Zoom - This course can only be accessed via the desktop version of Zoom. If necessary, please ask your IT team to install Zoom in advance. Alternatively, use a personal computer to attend this course.
Attendees will need access to:
- A PDF Reader
- MS Excel
- Modern web browser (latest Chrome or Firefox)
- Miniconda (minimal installer for the conda package manager and Python)
- Thonny (minimal Python script editor and debugger)
2023 Prior knowledge
All delegates attending this course MUST have completed Python Data Wrangling 2: Foundation Plus, as this course builds on the Python knowledge and skills gained there. Delegates who do not practise their Python skills at least monthly, and who have had a significant gap between the Foundation Plus and Intermediate courses, are recommended to review the Introduction exercises to refresh their familiarity before attending.
2023 Topics
One-day course, with four key parts:
Part I – Modules, Python’s free lunch
- How to add proven libraries for extra power
- Where is it? Scopes and namespaces
- Exercises: Explore module imports and library code reuse
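As a rough illustration of the Part I topics, the sketch below imports two standard-library modules and shows how a local name can shadow a global one. It is not taken from the course materials; the names scale, rescale and readings are hypothetical and chosen only for this example.

```python
import math                   # binds the name "math" in this module's namespace
from statistics import mean   # binds only the name "mean"

scale = 10                    # a global (module-level) name


def rescale(values):
    scale = 2                 # a local name; it shadows the global inside this function only
    return [v * scale for v in values]


readings = [1.5, 2.5, 3.5]    # hypothetical sample data
doubled = rescale(readings)   # uses the local scale (2), not the global one (10)
average = mean(readings)      # library code reused instead of hand-written
root = math.sqrt(16)          # accessed through the module's namespace
```

After the call to rescale, the global scale is unchanged, which is the kind of scope behaviour the "Where is it?" topic refers to.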
Part II – Data arrays
- Using NumPy for fast n-dimensional arrays
- Simplifying operations with vectors
- Exercises: Using array objects to handle data
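To give a flavour of the Part II topics, here is a minimal sketch of vectorised arithmetic with NumPy, assuming NumPy is installed (for example through conda). The arrays and their values are invented for illustration and are not part of the course exercises.

```python
import numpy as np

# Hypothetical sample data: unit prices and quantities for three items.
prices = np.array([2.50, 4.00, 1.25])
quantities = np.array([4, 1, 8])

# Element-wise multiplication replaces an explicit Python loop.
line_totals = prices * quantities
grand_total = line_totals.sum()

# A two-dimensional array: 2 rows by 3 columns holding the values 0 to 5.
table = np.arange(6).reshape(2, 3)
column_means = table.mean(axis=0)   # one mean per column
```

Operating on whole arrays at once is what "simplifying operations with vectors" typically looks like in practice.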
Part III – Manipulation and analysis
- Using pandas for real-world data representation
- Leveraging data reshape and combine capabilities
- Exercises: Practical data wrangling
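The reshape and combine capabilities listed under Part III could look something like the following sketch, assuming pandas is installed. The sales and managers tables, their column names and their values are all hypothetical.

```python
import pandas as pd

# Hypothetical long-format data: one row per region and quarter.
sales = pd.DataFrame({
    "region": ["North", "North", "South", "South"],
    "quarter": ["Q1", "Q2", "Q1", "Q2"],
    "units": [10, 12, 7, 9],
})

# Hypothetical lookup table keyed on region.
managers = pd.DataFrame({
    "region": ["North", "South"],
    "manager": ["Asha", "Ben"],
})

# Reshape: one row per region, one column per quarter.
wide = sales.pivot(index="region", columns="quarter", values="units")

# Combine: attach the manager to every sales row via a left join on region.
combined = sales.merge(managers, on="region", how="left")
```

A left join keeps every row of sales even if a region had no matching manager, which is often the safer default when wrangling real-world data.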
Part IV – Data analysis pipelines
- Reusability and reproducibility with pipelines
- Designing data analysis pipelines
- Exercises: Simple two-step data analysis pipeline
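One possible shape for a simple two-step pipeline is sketched below in pure Python, using only the standard library. This is an assumption about what such a pipeline might look like, not the course's own exercise; the function names clean, summarise and run_pipeline and the sample records are hypothetical.

```python
from statistics import mean


def clean(rows):
    """Step 1: drop records with a missing value and convert text to numbers."""
    return [
        {"site": row["site"], "value": float(row["value"])}
        for row in rows
        if row["value"].strip() != ""
    ]


def summarise(rows):
    """Step 2: average the cleaned values for each site."""
    by_site = {}
    for row in rows:
        by_site.setdefault(row["site"], []).append(row["value"])
    return {site: mean(values) for site, values in by_site.items()}


def run_pipeline(rows, steps):
    """Apply each step in order, feeding the output of one into the next."""
    result = rows
    for step in steps:
        result = step(result)
    return result


# Hypothetical raw input, as it might arrive from a CSV file (all values are text).
raw = [
    {"site": "A", "value": "1.0"},
    {"site": "A", "value": "3.0"},
    {"site": "B", "value": ""},
    {"site": "B", "value": "4.0"},
]

summary = run_pipeline(raw, [clean, summarise])
```

Because each step is an ordinary function, the same pipeline can be rerun on new data and produce the same result for the same input, which is the reusability and reproducibility point made above.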
2023 Course goals and outcomes
Those who attend this course will learn how to extend Python with proven libraries. They will gain intermediate data manipulation and analysis techniques and learn how data analysis libraries represent real-world data. The course will also help delegates understand how to design pure-Python data pipelines and know where to find further resources to continue growing their skills.
2023 Audience
This course is suitable for OR practitioners, analysts, and SAS, SPSS and R users who want to add advanced Python skills to their set of data wrangling and analysis tools.
2023 Other related courses to continue your development...
Data Wrangling with Python - Introduction - DAY 1
Data Wrangling with Python - Foundation Plus - DAY 2