Python has emerged as a popular and effective language in the world of data science. The dynamic nature of the language, the relative simplicity of the syntax, and the abundance of fast and powerful libraries have all been important contributory factors in this growth.
Course description:
This course takes a detailed look at the most popular Python libraries for numeric processing, statistical analysis, machine learning, and visualization. We also show how to make use of common Python data types and algorithms to achieve real-world tasks.
What you will learn:
• Using NumPy and Pandas for efficient data manipulation
• Using Matplotlib and Seaborn for visualization
• Working with time series data
• Machine learning concepts
• Using Scikit-Learn for machine learning
Course outline:
Module 1 - Python Quick Start: Python Essentials; Language Fundamentals:
• Python Essentials; Language Fundamentals; Functions; Data Structures
Module 2 - Getting Started with NumPy:
• Setting the Scene; NumPy Arrays; Manipulating Array Elements; Manipulating Array Shape
Module 3 - NumPy Techniques:
• NumPy Universal Functions; Aggregations; Broadcasting; Manipulating Arrays using Boolean Logic; Additional Techniques
Module 4 - Getting Started with Pandas:
• Introduction to Pandas; Creating a Series; Using a Series; Creating a DataFrame; Using a DataFrame
Module 5 - Pandas Techniques:
• Universal Functions; Merging and Joining Datasets; A Closer Look at Joins
Module 6 - Working with Time Series Data:
• Introduction to Time Series Data; Indexing and Plotting Time Series Data; Testing Data for Stationarity; Making Data Stationary; Forecasting Time Series Data; Scaling Back the ARIMA Results.
Module 7 - Introduction to Machine Learning:
• Machine Learning Concepts; Classification; Clustering
Module 8 - Getting Started with Scikit-Learn:
• Scikit-Learn Essentials; A Closer Look at Datasets
Module 9 - Understanding the Scikit-Learn API:
• Introduction; Scikit-Learn API Essentials; Performing Linear Regression
Module 10 - Going Further with Scikit-Learn:
• Introduction; Understanding Naïve Bayes Classification; Naïve Bayes Example using Scikit-Learn
Module 11 - Case Study:
• Worked example of a real-world data science problem
Course instructor: Andy Olsen
Andy is a freelance consultant and instructor based in the UK, working mostly in the City of London and Oxford. Andy has been working with .NET since the Beta 1 days and has extensive experience in many facets of .NET development including WCF, WPF, WF, ASP.NET MVC Web development, and mobile applications. Andy has designed and written many Microsoft Official Curriculum courses in the last decade, and has been engaged as author and technical reviewer with Apress on some of their biggest selling books.
Target audience:
Anyone who wants course about Python Data Science
Prerequisites:
• Some familiarity with Python or another contemporary language would be beneficial
Language:
• The course is given in english