Data Wrangling with Python - Intermediate - DAY 3

Description

This course builds on the skills learned in Foundation and Foundation Plus to help attendees use core data science to deliver rapid statistics and analysis. This Intermediate course will teach attendees how to deliver scripted data wrangling with provable results. Attendees will be able to demonstrate how repeatable data wrangling pipelines increase speed, efficiency and accuracy and learn more advanced scripting techniques to expand their skill portfolio. 

2023 Online Platform and Software Requirements

Computer with video camera, speakers and microphone
Stable internet connection
Mains power

Zoom - This course can only be accessed via the desktop version of Zoom, so, if necessary, please ask your IT team to install Zoom in advance. Alternatively, use a personal computer to attend this course.

Attendees will need access to:  

  • A PDF Reader 
  • MS Excel 
  • Modern web browser (latest Chrome or Firefox) 
  • Miniconda (minimal Python data science package) 
  • Thonny (minimal Python script editor and debugger).

2023 Prior knowledge

All delegates attending this course MUST have completed Python Data Wrangling 2: Foundation Plus. This course builds on the Python knowledge and skills obtained in the Foundation Plus course. Any delegate who does not practise their Python skills at least monthly, and who has a significant time gap between the Foundation Plus and Intermediate courses, is recommended to review the Introduction exercises to refresh their familiarity before attending an Intermediate course.

2023 Topics

One-day course with four key parts:

Part I – Modules, Python’s free lunch 

- How to add proven libraries for extra power 

- Where is it? Scopes and namespaces 

- Exercises: Explore module imports and library code reuse 
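
As a flavour of the Part I topics, the minimal sketch below shows two ways of importing from the standard library and how a local name can shadow a global one. The function rescale and the values used are hypothetical, invented for illustration; they are not taken from the course materials.

```python
import math                   # binds the module name; members are reached as math.<name>
from statistics import mean   # binds a single function directly into this namespace

scale = 10                    # a module-level (global) name


def rescale(values):
    """Multiply each value by a scale defined inside the function."""
    scale = 2                 # local name: shadows the global only inside this function
    return [v * scale for v in values]


readings = [1.5, 2.5, 3.5]    # hypothetical data
doubled = rescale(readings)   # uses the local scale
average = mean(readings)      # library code reused instead of hand-written
root = math.sqrt(16)          # attribute lookup through the module namespace

print(doubled, scale, average, root)
```

The global scale is untouched after the call, which is the point of the scoping rules: names defined inside a function do not leak out of it.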

  

Part II – Data arrays 

- Using NumPy for fast n-dimensional arrays 

- Simplifying operations with vectors 

- Exercises: Using array objects to handle data 
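
To give an idea of what "simplifying operations with vectors" can look like, here is a minimal sketch assuming NumPy is installed. The temperature figures are hypothetical, and the example is not drawn from the course exercises.

```python
import numpy as np

# Hypothetical data: temperatures in Celsius for two sites (rows) over three days (columns).
celsius = np.array([[10.0, 20.0, 30.0],
                    [0.0, 5.0, 10.0]])

# One expression converts every element; no explicit loop is needed.
fahrenheit = celsius * 9 / 5 + 32

# Aggregate along an axis: axis=1 gives one mean per row (per site).
site_means = celsius.mean(axis=1)

print(celsius.shape)
print(fahrenheit)
print(site_means)
```

The same arithmetic written with nested Python lists would need two loops; the array form keeps the code close to the formula.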

  

Part III – Manipulation and analysis 

- Using pandas for real-world data representation 

- Leveraging data reshape and combine capabilities 

- Exercises: Practical data wrangling 
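
The reshape and combine capabilities mentioned above might be sketched as follows, assuming pandas is installed. The tables, column names and figures are hypothetical and serve only to illustrate one reshape (wide to long) and one combine (a left join).

```python
import pandas as pd

# Hypothetical wide table: one row per site, one column per month.
wide = pd.DataFrame({"site": ["A", "B"], "jan": [10, 20], "feb": [12, 18]})

# Hypothetical lookup table giving each site's region.
sites = pd.DataFrame({"site": ["A", "B"], "region": ["North", "South"]})

# Reshape: turn the month columns into rows (one row per site and month).
tidy = wide.melt(id_vars="site", var_name="month", value_name="sales")

# Combine: attach the region to every row by joining on the shared key.
combined = tidy.merge(sites, on="site", how="left")

# Analyse: total sales per region.
totals = combined.groupby("region")["sales"].sum()

print(combined)
print(totals)
```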

  

Part IV – Data analysis pipelines 

- Reusability and reproducibility with pipelines 

- Designing data analysis pipelines 

- Exercises: Simple two-step data analysis pipeline 
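
A simple two-step pipeline of the kind named in the Part IV exercise could look like the sketch below. It uses only the standard library; the function names clean, summarise and run_pipeline and the sample data are assumptions made for illustration, not the course's own solution.

```python
import csv
import io
from statistics import mean

# Hypothetical raw input; in practice this would usually be read from a file.
RAW = """site,reading
A,10.5
B,
A,11.5
B,20.0
"""


def clean(text):
    """Step 1: parse CSV text and drop rows with a missing reading."""
    rows = csv.DictReader(io.StringIO(text))
    return [
        {"site": row["site"], "reading": float(row["reading"])}
        for row in rows
        if row["reading"].strip()
    ]


def summarise(records):
    """Step 2: compute the mean reading for each site."""
    by_site = {}
    for record in records:
        by_site.setdefault(record["site"], []).append(record["reading"])
    return {site: mean(values) for site, values in by_site.items()}


def run_pipeline(text):
    """Chain the steps so the same input always yields the same output."""
    return summarise(clean(text))


result = run_pipeline(RAW)
print(result)
```

Keeping each step as a small function with plain inputs and outputs makes the pipeline easy to rerun and to check step by step, which is what makes the results reproducible.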

2023 Course goals and outcomes

Those who attend this course will learn how to extend Python with proven libraries. They will gain intermediate data manipulation and analysis techniques and learn how data analysis libraries seamlessly represent real-world data. The course will help delegates understand how to design pure-Python data pipelines and show them where to find more resources to continue growing their skills.

 

2023 Audience

This course is suitable for OR practitioners, analysts, SAS, SPSS and R users who want to add advanced Python skills to their arsenal of data wrangling and analysis tools. 

 

2023 Other related courses to continue your development...

Data Wrangling with Python - Introduction - DAY 1
Data Wrangling with Python - Foundation Plus - DAY 2

Similar courses

While many analysts use Microsoft Excel daily, few use VBA (Visual Basic for Applications). This course will provide delegates with the skills to use VBA and achieve the efficiency in Excel modelling necessary for professional analytics practice.

In this course you will learn how to use version control. This tool lets you look back over all your previous versions, removing the need to store multiple copies of CSV files. You will learn how to integrate version control into your project using RStudio.

This course will teach you how to update your reports at the click of a button using integrated R Markdown, so your data reporting will be efficient and reproducible.

As spatial data sets grow ever larger, this course will teach you how to harness the capabilities of R for your analysis.

An introductory course to equip you with a tool bag of core quantitative forecasting techniques that are adaptable to almost any organisational setting. Appreciate the qualitative methods of forecasting by making use of expert opinion, judgement and scenarios.

This course is for those who conduct modelling and analysis using Microsoft Excel and are now looking to expand their knowledge. In this course you will learn tips and shortcuts to help you achieve efficient and effective analysis while improving your proficiency in Excel.

This course will provide you with a systematic introduction to the nature and advanced functionality of Microsoft Excel. It will arm you with the skills needed for planning, formatting and formulas that will take your proficiency to an improved level. 

This course will teach you how to use advanced analytics techniques to solve complex optimisation problems. You will then be in a position to provide recommendations for the best solution or action from a number of possibilities.

This course offers a set of frameworks and methods based on behavioural science to support Operational Research (OR) Practice. The course can be useful for developing behavioural OR models, understanding behavioural issues with the implementation of OR models and the behavioural aspects of interventions.

Developed in response to the new age of data-centric activities, and with data visualisation as its core method, this course teaches you how to get your message across and leverage your data assets. OR practitioners, managers and policy makers will benefit from the data journalism approach, leading to increased audience engagement.

Do you work in data analytics, BI/MI, data science, or machine learning? Do you want to make an immediate impact? This course is designed to equip delegates with a versatile skillset and deliver measurable results within your organisation.

Develop your communication skills to present your data in a clear and impactful way allowing your organisation to reap the full benefits of your work.

Letting the Facts Speak for Themselves. The purpose of this course is to teach delegates how to present their facts in the most compelling manner. Familiarity with key principles borrowed from the worlds of art and modern design will allow delegates to present results in more compelling ways, thus influencing their audience more effectively.

Data wrangling is the name given to the process of extracting useful information from large quantities of data. It generally involves: Discovering, Structuring, Cleaning, Enriching, Validating, and Publishing data to make it available for analysis. This necessarily involves processing large amounts of data and this course will teach you how to do that with the Python programming language. 

In this course you will learn how to build a culture of creativity across the team, one in which it is ‘safe’ to suggest ideas and where good ideas can be recognised. This will enable your team to creatively solve difficult problems.

Gain an understanding of the wide spectrum of skills used in Operational Research, including: problem structuring; data collection; analytics; modelling; and simulation. Learn and understand a typical problem cycle.

This Foundation Plus Python course will teach attendees how to deliver scripted data wrangling with provable results. Attendees will be able to demonstrate how repeatable data wrangling pipelines increase speed, efficiency and accuracy and learn more advanced scripting techniques to expand their skill portfolio.

This course will give you robust processes for when you need to have innovative ideas ‘on demand’. You will have the chance to practise applying these tools and techniques, which should be as much a part of the OR practitioner’s tool kit as simulation and optimisation.

This course is designed to provide you with all the information you need to leverage data to design, build, train, deploy, and manage Artificial Intelligence (AI) models through Cloud Computing based on Machine Learning techniques.

Using case studies and the Public Sector Scorecard (PSS), you will create an integrated strategy map, service improvement plan and performance measurement framework for your OR projects.

Improve your personal productivity for generating rapid and insightful results from large data sets using R.

This course will provide OR practitioners with the competency to navigate the various legal frameworks regarding data protection while becoming fluent in the language of data for effective communications.

This course will aid your understanding of how visual analysis improves the speed and accuracy of decision-making. You will learn how data mining complements statistical analysis and how to integrate geospatial data into your analysis.

Just how scientific is your pricing? Could you improve profits by increasing prices or even by reducing prices? On this course, you will learn how to use the latest scientific analyses to assess consumers’ willingness to pay for products and services and thus be able to better price your offerings.

This course is a natural follow-on for delegates who have completed the Foundation course in forecasting and now wish to develop their toolbox further with Autoregressive Integrated Moving Average (ARIMA).

This two-day workshop will present the state of the art in the use of big data to improve service operations and will introduce several useful big data and machine learning techniques to help analysts, managers and other stakeholders enhance operational performance.

Learn to use visualisations to summarise insights and communicate information effectively to stakeholders.

Learn six simple steps in the visualisation cycle and why some visuals are more pleasing to the eye than others, and how to apply this to your data results to transform your interactions with decision makers.

Learn how scripted data wrangling can deliver reproducible, provable results. This course will show you how repeatable data wrangling pipelines increase speed, efficiency and accuracy and how simple scripting techniques can expand an analyst’s skill portfolio.

This course builds on the skills learned in Foundation and Foundation Plus to help attendees use core data science to deliver rapid statistics and analysis. This Intermediate course will teach attendees how to deliver scripted data wrangling with provable results.

In this intermediate course you will learn how to design agent-based simulation models using a co-creation approach, for analysing systems where human behaviour plays a key role. We also take a quick look at how to implement such models using the simulation toolkit AnyLogic PLE.

This one-day course will offer several ways to approach transformation, taking into account both its hard and soft elements. It will incorporate the creation and evaluation of learning journeys and the pathways to transformation.

Gain a theoretical and practical understanding of how to process and model time series data in your analytical and forecasting workflows. This course will provide you with the essential knowledge for wrangling, processing, analysing and forecasting time series data in the R programming language.
