Big Data Track – Microsoft Professional Program

Big Data Track Microsoft Professional Program DataChangersMicrosoft Big Data Track

Designing systems that capture, process, and analyze data is critical for companies in order to have a competitive advantage. This Microsoft Big Data Track curriculum takes you from your first select statement to orchestrating big data workflows in the cloud. With these online courses it is possible to start a course wherever and whenever you want. All the courses are part of the Microsoft Professional Program.

Learn how to build big data solutions for batch and real-time stream processing using Azure managed services and open source systems like Hadoop and Spark. This curriculum will teach you the skills required to capture, process, and analyse data for today’s data-driven world.

This Microsoft Big Data Track exists of 10 steps. You don’t have to follow them in the specific order, but some courses are related. Some steps offer your various options. For every course you can obtain an official Microsoft Professional Program certificate, issued by Microsoft, for which you can buy a voucher from us (in collaboration with MD2C). All you need is your Windows LiveID to register with on the DataChangers Academy, and you will be ready to start your journey!

Steps of the Big Data Track:

  1. Introduction to Big Data
    To prepare yourself for step 2, you can follow Introduction to Data Analysis using Excel
  2. Analyze and Visualize Data with Excel or Analyze and Visualize Data with PowerBI
  3. Introduction to NoSQL Solutions
  4. Query Relation Data
  5. Delivering a Warehouse in the Cloud
  6. Processing Big Data with Azure Data Lake Analytics or Processing Big Data with Hadoop in Azure HDInsight
  7. Processing Real-Time Data Streams in Azure or Implementing Real-Time Analytics with Hadoop in Azure HDInsight
  8. Orchestrating Big Data with Azure Data Factory
  9. Developing Big Data Solutions with Azure Machine Learning or Analyze Big Data with Microsoft R or Implementing Predictive Analytics with Spark in Azure HDInsight
  10. Microsoft Professional Capstone – Big Data

You can also download these steps in a pdf: Big Data Track - Microsoft Professional Program (41 downloads)

Explore the Big Data Courses

Introduction to Big Data Microsoft Professional Program

Introduction to Big Data

Get started on your journey to building Big Data solutions.

About this course
Learn what it takes to build Big Data analytics solutions.
This is the first stop in the Big Data curriculum from Microsoft. It will help you get started with the curriculum, plan your learning schedule, and connect with fellow students and teaching assistants. Along the way, you’ll get an introduction to working with data and some fundamental concepts and technologies for Big Data scenarios.

Learn more…


Introduction to Data Analysis using Excel

Introduction to Data Analysis using Excel

Learn the basics of Excel, one of the most popular data analysis tools, to help visualize and gain insights from your data.

About This Course
The ability to analyze data is a powerful skill that helps you make better decisions. Microsoft Excel is one of the top tools for data analysis and the built-in pivot tables are arguably the most popular analytic tool.

In this course, you will learn how to perform data analysis using Excel’s most popular features. You will learn how to create pivot tables from a range with rows and columns in Excel. You will see the power of Excel pivots in action and their ability to summarize data in flexible ways, enabling quick exploration of data and producing valuable insights from the accumulated data.

Pivots are used in many different industries by millions of users who share the goal of reporting the performance of companies and organizations. In addition, Excel formulas can be used to aggregate data to create meaningful reports. To complement, pivot charts and slicers can be used together to visualize data and create easy to use dashboards.

You should have a basic understanding of creating formulas and how cells are referenced by rows and columns within Excel to take this course. If required, you can find many help topics on Excel at the Microsoft Office Support Site. You are welcome to use any supported version of Excel you have installed in your computer, however, the instructions are based on Excel 2016. You may not be able to complete all exercises as demonstrated in the lectures but workarounds are provided in the lab instructions or Discussion forum. Please note that Excel for Mac does not support many of the features demonstrated in this course.

After taking this course you’ll be ready to continue to our more advanced Excel course, Analyzing and Visualizing Data with Excel.

Learn more…


Analyzing and Visualizing Data with Excel Microsoft Professional Program

Analyzing and Visualizing Data with Excel

Develop your skills with Excel, one of the common tools that data scientists depend on to gather, transform, analyze, and visualize data.

About This Course
Excel is one of the most widely used solutions for analyzing and visualizing data. It now includes tools that enable the analysis of more data, with improved visualizations and more sophisticated business logics. In this data science course, you will get an introduction to the latest versions of these new tools in Excel 2016 from an expert on the Excel Product Team at Microsoft.

Learn how to import data from different sources, create mashups between data sources, and prepare data for analysis. After preparing the data, find out how business calculations can be expressed using the DAX calculation engine. See how the data can be visualized and shared to the Power BI cloud service, after which it can be used in dashboards, queried using plain English sentences, and even consumed on mobile devices.

Do you feel that the contents of this course is a bit too advanced for you and you need to fill some gaps in your Excel knowledge? Do you need a better understanding of how pivot tables, pivot charts and slicers work together, and help in creating dashboards? If so, check out Introduction to Data Analysis using Excel.

Learn more…


Analyzing and Visualizing Data with Power BI Microsoft Professional Program

Analyzing and Visualizing Data with Power BI

Learn Power BI, a powerful cloud-based service that helps data scientists visualize and share insights from their organizations’ data.

About This Course
Power BI is quickly gaining popularity among professionals in data science as a cloud-based service that helps them easily visualize and share insights from their organizations’ data.

In this data science course, you will learn from the Power BI product team at Microsoft with a series of short, lecture-based videos, complete with demos, quizzes, and hands-on labs. You’ll walk through Power BI, end to end, starting from how to connect to and import your data, author reports using Power BI Desktop, and publish those reports to the Power BI service. Plus, learn to create dashboards and share with business users—on the web and on mobile devices.

Learn more…


Big Data courses - build NoSQL solutions on AzureIntroduction to NoSQL Data Solutions

Learn the fundamentals of NoSQL and explore several non-relational data storage options in Microsoft Azure.

About This Course
As a data pro, you know that some scenarios—particularly those involving real-time analytics, site personalization, IoT, and mobile apps—are better addressed with NoSQL storage and compute solutions than they are with relational databases. Microsoft Azure has several NoSQL (or “Not Only SQL”) non-relational data storage options to choose from. NoSQL databases are generally built to be distributed and partitioned across many servers. And they’re built to scale out for high availability and to be flexible enough to handle semi-structured and unstructured data. If you have a data model that is constantly evolving and you want to move fast, that’s what these databases are about.

In this practical course, complete with labs, assessments, and a final exam, join the experts to learn how NoSQL has evolved over time. Explore non-relational data storage options in Azure, and see how to use them in your applications. Find out how to create, store, manage, and access data in these different storage options. Get an in-depth look at Azure Table Storage, DocumentDB, MongoDB, and more. Learn about the “three Vs”—variety (schemas or scenarios that evolve quickly), volume (scale in terms of data storage), and velocity (throughput needs to support a large user base). Take this opportunity to get hands-on with NoSQL options in Azure.

Learn more…


Querying Transact-SQL Microsoft Professional Program

Querying Data with Transact-SQL

From querying and modifying data in SQL Server or Azure SQL to programming with Transact-SQL, learn essential skills that employers need.

About This Course
Transact-SQL is an essential skill for data professionals and developers working with SQL databases. With this combination of expert instruction, demonstrations, and practical labs, step from your first SELECT statement through to implementing transactional programmatic logic.Work through multiple modules, each of which explore a key area of the Transact-SQL language, with a focus on querying and modifying data in Microsoft SQL Server or Azure SQL Database. The labs in this course use a sample database that can be deployed easily in Azure SQL Database, so you get hands-on experience with Transact-SQL without installing or configuring a database server.

Learn more…


Microsoft Professional Program - Delivering a Data Warehouse in the Cloud

Delivering a Warehouse in the Cloud

This Delivering a Warehouse in the Cloud course teaches you how to deploy, design, and load data using Microsoft’s Azure SQL Data Warehouse.

About this course

This course is part of the Microsoft Professional Program Certificate in Big Data .

When you need to scale your data warehouse’s storage and processing capabilities in minutes, not months, you need a cloud-based massively parallel processing solution.

In this computer science course, you will learn how to deploy, design, and load data using Microsoft’s Azure SQL Data Warehouse, or SQL DW. You’ll learn about data distribution, compressed in-memory indexes, PolyBase for Big Data, and elastic scale.

Note: To complete the hands-on elements in this course, you will require an Azure subscription. You can sign up for a free Azure trial subscription (a valid credit card is required for verification, but you will not be charged for Azure services). Note that the free trial is not available in all regions. It is possible to complete the course and earn a certificate without completing the hands-on practices.

Learn more…


Microsoft Professional Program - Processing Big Data with Azure Data Lake Analytics

Processing Big Data with Azure Data Lake Analytics

This Processing Big Data with Azure Data Lake Analytics course teaches you how to use Azure Data Lake technologies to store and process big data in the cloud.

About this course

This course is part of the Microsoft Professional Program Certificate in Big Data .

Want to store and process data at scale? This data analysis course teaches you how to apply the power of the Azure cloud to big data using Azure Data Lake technologies.

Learn how to manage data in Azure Data Lake Store and run U-SQL jobs in Azure Data Lake Analytics to generate insights from structured and unstructured data sources.

Note: To complete this course, you will need a Microsoft Azure subscription. You can sign up for a free trial subscription at https://azure.microsoft.com, or you can use your existing subscription. The labs have been designed to minimize the resource costs required to complete the hands-on activities.

Learn more…


Processing Big Data with Hadoop in Azure HDInsight Microsoft Professional Program

Processing Big Data with Hadoop in Azure HDInsight

Learn how to use Hadoop technologies in Microsoft Azure HDInsight to process big data in this five week, hands-on course.

About This Course
More and more organizations are taking on the challenge of analyzing big data. This course teaches you how to use the Hadoop technologies in Microsoft Azure HDInsight to build batch processing solutions that cleanse and reshape data for analysis. In this five-week course, you’ll learn how to use technologies like Hive, Pig, Oozie, and Sqoop with Hadoop in HDInsight; and how to work with HDInsight clusters from Windows, Linux, and Mac OSX client computers.

NOTE: To complete the hands-on elements in this course, you will require an Azure subscription and a Windows, Linux, or Mac OS X client computer. You can sign up for a free Azure trial subscription (a valid credit card is required for verification, but you will not be charged for Azure services). Note that the free trial is not available in all regions. It is possible to complete the course and earn a certificate without completing the hands-on practices.

Learn more…


Microsoft Professional Program - Processing Real-Time Data Streams in Azure

Processing Real-Time Data Streams in Azure

This Processing Real-Time Data Streams in Azure course teaches you how to use Microsoft Azure technologies to process real-time data in the cloud.

About this course

This course is part of the Microsoft Professional Program Certificate in Big Data.

Want to capture and process real-time data in the cloud?

This data analysis course teaches you how to use Microsoft Azure technologies like Event Hubs, IoT Hubs, and Stream Analytics to build real-time Internet-of-Things (IoT) solutions at scale.

Note: To complete this course, you will need a Microsoft Azure subscription. You can sign up for a free trial subscription at https://azure.microsoft.com, or you can use your existing subscription. The labs have been designed to minimize the resource costs required to complete the hands-on activities.

Learn more…


Implementing Real-Time Analytics with Hadoop in Azure HDInsight

Implementing Real-Time Analytics with Hadoop in Azure HDInsight

This Implementing Real-Time Analytics with Hadoop in Azure HDInsight course is about learning how to use Hadoop technologies to create real-time analytical solutions.

About This Course
In this four week course, you’ll learn how to implement low-latency and streaming Big Data solutions using Hadoop technologies like HBase, Storm, and Spark on Microsoft Azure HDInsight.Note: To complete the hands-on elements in this course, you will require an Azure subscription and a Windows, Linux, or Mac OS X client computer. You can sign up for a free Azure trial subscription (a valid credit card is required for verification, but you will not be charged for Azure services). Note that the free trial is not available in all regions. It is possible to complete the course and earn a certificate without completing the hands-on practices.This course is the second in a series that explores big data and advanced analytics techniques with HDInsight; and builds on the batch processing techniques learned in Processing Big Data with Hadoop in Azure HDInsight.
Learn more…

Implementing Predictive Analytics with Spark in Azure HDInsight Microsoft Professional Program

Implementing Predictive Analytics with Spark in Azure HDInsight

Learn how to use Spark in Microsoft Azure HDInsight to create predictive analytics and machine learning solutions.

About This Course
Are you ready for big data science? In this course, learn how to implement predictive analytics solutions for big data using Apache Spark in Microsoft Azure HDInsight. See how to work with Scala or Python to cleanse and transform data and build machine learning models with Spark ML (the machine learning library in Spark),

Note: To complete the hands-on elements in this course, you will require an Azure subscription and a Windows client computer. You can sign up for a free Azure trial subscription (a valid credit card is required for verification, but you will not be charged for Azure services). Note that the free trial is not available in all regions.

Learn more…


Microsoft Professional Program - Orchestrating Big Data with Azure Data Factory

Orchestrating Big Data with Azure Data Factory

This Orchestrating Big Data with Azure Data Factory course teaches you how to use Microsoft Azure Data Factory to orchestrate big data workflows in the cloud.

About this course

This course is part of the Microsoft Professional Program Certificate in Big Data.

Need to schedule and manage big data workflows?

This data analysis course teaches you how to use Azure Data Factory to coordinate data movement and transformation using technologies such as Hadoop, SQL, and Azure Data Lake Analytics. You will learn how to create data pipelines that will allow you to group activities to perform a certain task.

Note: To complete this course, you will need a Microsoft Azure subscription. You can sign up for a free trial subscription at https://azure.microsoft.com, or you can use your existing subscription. The labs have been designed to minimize the resource costs required to complete the hands-on activities.

Learn more…


Microsoft Professional Program - Developing Big Data Solutions with Azure Machine Learning

Developing Big Data Solutions with Azure Machine Learning

This Developing Big Data Solutions with Azure Machine Learning course teaches you how to build predictive solutions for big data using Microsoft Azure Machine Learning.

About this course

This course is part of the Microsoft Professional Program Certificate in Big Data.

The past can often be the key to predicting the future. Big data from historical sources is a valuable resource for identifying trends and building machine learning models that apply statistical patterns and predict future outcomes.

This course introduces Azure Machine Learning, and explores techniques and considerations for using it to build models from big data sources, and to integrate predictive insights into big data processing workflows.

Learn more…


Analyzing Big Data with Microsoft R

Analyzing Big Data with Microsoft R

Learn how to use Microsoft R Server to analyze large datasets using R, one of the most powerful programming languages.

About This Course
The open-source programming language R has for a long time been popular (particularly in academia) for data processing and statistical analysis. Among R’s strengths are that it’s a succinct programming language and has an extensive repository of third party libraries for performing all kinds of analyses. Together, these two features make it possible for a data scientist to very quickly go from raw data to summaries, charts, and even full-blown reports. However, one deficiency with R is that traditionally it uses a lot of memory, both because it needs to load a copy of the data in its entirety as a data.frame object, and also because processing the data often involves making further copies (sometimes referred to as copy-on-modify). This is one of the reasons R has been more reluctantly received by industry compared to academia.

The main component of Microsoft R Server (MRS) is the RevoScaleR package, which is an R library that offers a set of functionalities for processing large datasets without having to load them all at once in the memory. RevoScaleR offers a rich set of distributed statistical and machine learning algorithms, which get added to over time. Finally, RevoScaleR also offers a mechanism by which we can take code that we developed on our laptop and deploy it on a remote server such as SQL Server or Spark (where the infrastructure is very different under the hood), with minimal effort.

In this course, we will show you how to use MRS to run an analysis on a large dataset and provide some examples of how to deploy it on a Spark cluster or a SQL Server database. Upon completion, you will know how to use R for big-data problems.

Since RevoScaleR is an R package, we assume that the course participants are familiar with R. A solid understanding of R data structures (vectors, matrices, lists, data frames, environments) is required. Familiarity with 3rd party packages such as dplyr is also helpful.

Learn more…


Microsoft Professional Capstone - Big Data

Microsoft Professional Capstone – Big Data

Validate the skills you learned in the Microsoft Professional Program for Big Data with this Microsoft Professional Capstone – Big Data.

About this course

This course is part of the Microsoft Professional Program Certificate in Big Data.

The Microsoft Professional Program for Big Data is a comprehensive curriculum that teaches you how to build big data solutions.

In this capstone project, you will undertake challenges to design, implement, and document a big data solution based on what you have learned.

Learn more…


Explore also our other tracks, like Data Science, Entry Level Software Development, Artificial Intelligence, Cloud Administration, IT-Suport and DevOps!