

Our Courses

Big Data, Genes, and Medicine
This course distills for you expert knowledge and skills mastered by professionals in Health Big Data Science and Bioinformatics. You will learn exciting facts about the human body biology and chemistry, genetics, and medicine that will be intertwined with the science of Big Data and skills to harness the avalanche of data openly available at your fingertips and which we are just starting to make sense of.
-
Course by
-
Self Paced
-
40 hours
-
English

Probability Theory: Foundation for Data Science
Understand the foundations of probability and its relationship to statistics and data science. We’ll learn what it means to calculate a probability, independent and dependent outcomes, and conditional events. We’ll study discrete and continuous random variables and see how this fits with data collection. We’ll end the course with Gaussian (normal) random variables and the Central Limit Theorem and understand its fundamental importance for all of statistics and data science. This course can be taken for academic credit as part of CU Boulder’s Master of Science in Data Science
-
Course by
-
Self Paced
-
48 hours
-
English

Scalable Machine Learning on Big Data using Apache Spark
This course will empower you with the skills to scale data science and machine learning (ML) tasks on Big Data sets using Apache Spark. Most real world machine learning work involves very large data sets that go beyond the CPU, memory and storage limitations of a single computer. Apache Spark is an open source framework that leverages cluster computing and distributed storage to process extremely large data sets in an efficient and cost effective manner.
-
Course by
-
Self Paced
-
7 hours
-
English

The exposome: cracking the science about what makes us sick
What are the causes of disease? We know that most diseases result from a combination of genes and environment (nature and nurture). Our genes alone do not determine our fate. For most complex diseases, externalities - environmental factors in the broad sense - are more important. This includes our living and working environments, diet, social support and stress, pollution, and exposure to infectious agents.
-
Course by
-
Self Paced
-
15 hours
-
English

AI Workflow: Data Analysis and Hypothesis Testing
This is the second course in the IBM AI Enterprise Workflow Certification specialization. You are STRONGLY encouraged to complete these courses in order as they are not individual independent courses, but part of a workflow where each course builds on the previous ones. In this course you will begin your work for a hypothetical streaming media company by doing exploratory data analysis (EDA). Best practices for data visualization, handling missing data, and hypothesis testing will be introduced to you as part of your work. You will learn techniques of estimation
-
Course by
-
Self Paced
-
11 hours
-
English

Statistical Inference for Estimation in Data Science
This course introduces statistical inference, sampling distributions, and confidence intervals. Students will learn how to define and construct good estimators, method of moments estimation, maximum likelihood estimation, and methods of constructing confidence intervals that will extend to more general settings. This course can be taken for academic credit as part of CU Boulder’s Master of Science in Data Science (MS-DS) degree offered on the Coursera platform.
-
Course by
-
Self Paced
-
28 hours
-
English

Computer Vision in Microsoft Azure
In Microsoft Azure, the Computer Vision cognitive service uses pre-trained models to analyze images, enabling software developers to easily build applications"see" the world and make sense of it. This ability to process images is the key to creating software that can emulate human visual perception. In this course, you'll explore some of these capabilities as you learn how to use the Computer Vision service to analyze images. This course will help you prepare for Exam AI-900: Microsoft Azure AI Fundamentals.
-
Course by
-
Self Paced
-
8 hours
-
English

Calculus through Data & Modelling: Vector Calculus
This course continues your study of calculus by focusing on the applications of integration to vector valued functions, or vector fields. These are functions that assign vectors to points in space, allowing us to develop advanced theories to then apply to real-world problems. We define line integrals, which can be used to fund the work done by a vector field. We culminate this course with Green's Theorem, which describes the relationship between certain kinds of line integrals on closed paths and double integrals.
-
Course by
-
Self Paced
-
5 hours
-
English

Introduction to the Tidyverse
This course introduces a powerful set of data science tools known as the Tidyverse. The Tidyverse has revolutionized the way in which data scientists do almost every aspect of their job. We will cover the simple idea of "tidy data" and how this idea serves to organize data for analysis and modeling. We will also cover how non-tidy can be transformed to tidy data, the data science project life cycle, and the ecosystem of Tidyverse R packages that can be used to execute a data science project.
-
Course by
-
Self Paced
-
English

Google Advanced Data Analytics Capstone
You’re almost there! This is the seventh and final course of the Google Advanced Data Analytics Certificate. In this course, you have the opportunity to complete an optional capstone project that includes key concepts from each of the six preceding courses.
-
Course by
-
Self Paced
-
10 hours
-
English

Introduction to High-Performance and Parallel Computing
This course introduces the fundamentals of high-performance and parallel computing. It is targeted to scientists, engineers, scholars, really everyone seeking to develop the software skills necessary for work in parallel software environments. These skills include big-data analysis, machine learning, parallel programming, and optimization. We will cover the basics of Linux environments and bash scripting all the way to high throughput computing and parallelizing code. We recommend you are familiar with either Fortran 90, C++, or Python to complete some of the programming assignments.
-
Course by
-
Self Paced
-
14 hours
-
English

Data Scientist Career Guide and Interview Preparation
Data science professionals are in high demand around the world, and the trend shows no sign of slowing. There are lots of great jobs available, but lots of great candidates too. How can you get the edge in such a competitive field? This course will prepare you to enter the job market as a great candidate for a data scientist position. It provides practical techniques for creating essential job-seeking materials such as a resume and a portfolio, as well as auxiliary tools like a cover letter and an elevator pitch.
-
Course by
-
Self Paced
-
9 hours
-
English

What is Data Science?
Do you want to know why data science has been labeled the sexiest profession of the 21st century? After taking this course, you will be able to answer this question, understand what data science is and what data scientists do, and learn about career paths in the field. The art of uncovering insights and trends in data has been around since ancient times. The ancient Egyptians used census data to increase efficiency in tax collection and accurately predicted the Nile River's flooding every year. Since then, people have continued to use data to derive insights and predict outcomes.
-
Course by
-
Self Paced
-
19 hours
-
English

Julia for Beginners in Data Science
This guided project is for those who want to learn how to use Julia for data cleaning as well as exploratory analysis.
-
Course by
-
Self Paced
-
3 hours
-
English

Inventory Management
Inventory is a strategic asset for organizations. The effective management of inventory can minimize a company’s spending while dramatically increasing its profit. In this course, we will explore how to use data science to manage inventory in uncertain environments, how to set inventory levels based on customer service requirements, and how to calculate inventory for products that have short sales cycles.
-
Course by
-
Self Paced
-
6 hours
-
English

AI For Everyone
AI is not only for engineers. If you want your organization to become better at using AI, this is the course to tell everyone--especially your non-technical colleagues--to take.
-
Course by
-
Self Paced
-
11 hours
-
English

Bioconductor for Genomic Data Science
Learn to use tools from the Bioconductor project to perform analysis of genomic data. This is the fifth course in the Genomic Big Data Specialization from Johns Hopkins University.
-
Course by
-
Self Paced
-
9 hours
-
English

Applied Data Science for Data Analysts
In this course, you will develop your data science skills while solving real-world problems. You'll work through the data science process to and use unsupervised learning to explore data, engineer and select meaningful features, and solve complex supervised learning problems using tree-based models. You will also learn to apply hyperparameter tuning and cross-validation strategies to improve model performance.
-
Course by
-
Self Paced
-
17 hours
-
English

Data Science Companion
The Data Science Companion provides an introduction to data science. You will gain a quick background in data science and core machine learning concepts, such as regression and classification. You’ll be introduced to the practical knowledge of data processing and visualization using low-code solutions, as well as an overview of the ways to integrate multiple tools effectively to solve data science problems. You will then leverage cloud resources from Amazon Web Services to scale data processing and accelerate machine learning model training.
-
Course by
-
Self Paced
-
2 hours
-
English

Making Data Science Work for Clinical Reporting
This course is aimed to demonstate how principles and methods from data science can be applied in clinical reporting. By the end of the course, learners will understand what requirements there are in reporting clinical trials, and how they impact on how data science is used. The learner will see how they can work efficiently and effectively while still ensuring that they meet the needed standards.
-
Course by
-
Self Paced
-
11 hours
-
English

Machine Learning Rapid Prototyping with IBM Watson Studio
An emerging trend in AI is the availability of technologies in which automation is used to select a best-fit model, perform feature engineering and improve model performance via hyperparameter optimization. This automation will provide rapid-prototyping of models and allow the Data Scientist to focus their efforts on applying domain knowledge to fine-tune models. This course will take the learner through the creation of an end-to-end automated pipeline built by Watson Studio’s AutoAI experiment tool, explaining the underlying technology at work as developed by IBM Research.
-
Course by
-
Self Paced
-
9 hours
-
English

Python Essentials for MLOps
Python Essentials for MLOps (Machine Learning Operations) is a course designed to provide learners with the fundamental Python skills needed to succeed in an MLOps role. This course covers the basics of the Python programming language, including data types, functions, modules and testing techniques. It also covers how to work effectively with data sets and other data science tasks with Pandas and NumPy. Through a series of hands-on exercises, learners will gain practical experience working with Python in the context of an MLOps workflow.
-
Course by
-
Self Paced
-
23 hours
-
English

Integral Calculus and Numerical Analysis for Data Science
Are you interested in Data Science but lack the math background for it? Has math always been a tough subject that you tend to avoid? This course will provide an intuitive understanding of foundational integral calculus, including integration by parts, area under a curve, and integral computation. It will also cover root-finding methods, matrix decomposition, and partial derivatives. This course is designed to prepare learners to successfully complete Statistical Modeling for Data Science Application, which is part of CU Boulder's Master of Science in Data Science (MS-DS) program.
-
Course by
-
Self Paced
-
4 hours
-
English

Advanced Linear Models for Data Science 2: Statistical Linear Models
Welcome to the Advanced Linear Models for Data Science Class 2: Statistical Linear Models. This class is an introduction to least squares from a linear algebraic and mathematical perspective.
-
Course by
-
Self Paced
-
6 hours
-
English

Prepare for DP-100: Data Science on Microsoft Azure Exam
Microsoft certifications give you a professional advantage by providing globally recognized and industry-endorsed evidence of mastering skills in digital and cloud businesses. In this course, you will prepare to take the DP-100 Azure Data Scientist Associate certification exam. You will refresh your knowledge of how to plan and create a suitable working environment for data science workloads on Azure, run data experiments, and train predictive models.
-
Course by
-
Self Paced
-
9 hours
-
English