

Our Courses

Introduction to Designing Data Lakes on AWS
In this class, we will help you understand how to create and operate a data lake in a secure and scalable way, without previous knowledge of data science!
-
Course by
-
Self Paced
-
6
-
English

BI Foundations with SQL, ETL and Data Warehousing
The job market for business intelligence (BI) analysts is expected to grow by23 percent from 2021 to 2031 (US Bureau of Labor Statistics). This IBM specialization gives you sought-after skills employers look for when recruiting for a BI analyst. BI analysts gather, clean, and analyze key business data to find patterns and insights that aid business decision-making. During this specialization, you’ll learn the basics of SQL, focusing on how to query relational databases using this popular and powerful language. You’ll use essential Linux commands to create basic shell scripts.
-
Course by
-
Self Paced
-
English

Introduction to Designing Data Lakes on AWS
In this class, Introduction to Designing Data Lakes on AWS, we will help you understand how to create and operate a data lake in a secure and scalable way, without previous knowledge of data science! Starting with the "WHY" you may want a data lake, we will look at the Data-Lake value proposition, characteristics and components.
-
Course by
-
Self Paced
-
13 hours
-
English

Innovating with Data and Google Cloud
Cloud technology on its own only provides a fraction of the true value to a business; When combined with data–lots and lots of it–it has the power to truly unlock value and create new experiences for customers.
In this course, you'll learn what data is, historical ways companies have used it to make decisions, and why it is so critical for machine learning. This course also introduces learners to technical concepts such as structured and unstructured data. database, data warehouse, and data lakes. It then covers the most common and fastest growing Google Cloud products around data.
-
Course by
-
Self Paced
-
3 hours
-
English

Apache Spark (TM) SQL for Data Analysts
Apache Spark is one of the most widely used technologies in big data analytics. In this course, you will learn how to leverage your existing SQL skills to start working with Spark immediately. You will also learn how to work with Delta Lake, a highly performant, open-source storage layer that brings reliability to data lakes. By the end of this course, you will be able to use Spark SQL and Delta Lake to ingest, transform, and query data to extract valuable insights that can be shared with your team.
-
Course by
-
Self Paced
-
14 hours
-
English

ETL and Data Pipelines with Shell, Airflow and Kafka
Delve into the two different approaches to converting raw data into analytics-ready data. One approach is the Extract, Transform, Load (ETL) process. The other contrasting approach is the Extract, Load, and Transform (ELT) process. ETL processes apply to data warehouses and data marts. ELT processes apply to data lakes, where the data is transformed on demand by the requesting/calling application. In this course, you will learn about the different tools and techniques that are used with ETL and Data pipelines.
-
Course by
-
Self Paced
-
17 hours
-
English

Getting Started with Data Warehousing and BI Analytics
Kickstart your Data Warehousing and Business Intelligence (BI) Analytics journey with this self-paced course. You will learn how to design, deploy, load, manage, and query data warehouses and data marts. You will also work with BI tools to analyze data in these repositories.
-
Course by
-
Self Paced
-
17 hours
-
English

Modernizing Data Lakes and Data Warehouses with GCP em Português Brasileiro
Os dois principais componentes de um pipeline de dados são data lakes e warehouses. Neste curso, destacamos os casos de uso para cada tipo de armazenamento e as soluções de data lake e warehouse disponíveis no Google Cloud de forma detalhada e técnica. Além disso, também descrevemos o papel de um engenheiro de dados, os benefícios de um pipeline de dados funcional para operações comerciais e analisamos por que a engenharia de dados deve ser feita em um ambiente de nuvem. Este é o primeiro curso da série ""Data Engineering on Google Cloud"".
-
Course by
-
Self Paced
-
Portuguese

Modernizing Data Lakes and Data Warehouses with GCP en Español
Los dos componentes clave de cualquier canalización de datos son los data lakes y los almacenes de datos. En este curso, se destacan los casos de uso de cada tipo de almacenamiento y se analizan en profundidad las soluciones de data lakes y almacenes disponibles en Google Cloud con detalles técnicos.
-
Course by
-
Self Paced
-
Spanish

Introducción a los Data Lakes con Azure
En este proyecto de 1 hora, aprenderás a usar la tecnología de Azure Data Lake Gen2, entender la diferencia con los otros servicios, gestionar los sistemas de archivos y manejar el ciclo de vida de la información para facilitar la gestión automática y eliminación de los datos según regulaciones. Además, podrás entender cuáles aplicaciones se integran nativamente a Azure Data Lake Gen2.
-
Course by
-
Self Paced
-
3 hours
-
Spanish

Modernizing Data Lakes and Data Warehouses with GCP 日本語版
"すべてのデータ パイプラインには、データレイクとデータ ウェアハウスという 2 つの主要コンポーネントがあります。このコースでは、各ストレージ タイプのユースケースを紹介し、Google Cloud で利用可能なデータレイクとデータ ウェアハウスのソリューションを技術的に詳しく説明します。また、データ エンジニアの役割や、効果的なデータ パイプラインが事業運営にもたらすメリットについて確認し、クラウド環境でデータ エンジニアリングを行うべき理由を説明します。 これは「Data Engineering on Google Cloud」シリーズの最初のコースです。このコースを修了したら、「Building Batch Data Pipelines on Google Cloud」コースに登録してください。"
-
Course by
-
Self Paced
-
Japanese

Business Analytics Executive Overview
Businesses run on data, and data offers little value without analytics. The ability to process data to make predictions about the behavior of individuals or markets, to diagnose systems or situations, or to prescribe actions for people or processes drives business today. Increasingly many businesses are striving to become “data-driven”, proactively relying more on cold hard information and sophisticated algorithms than upon the gut instinct or slow reactions of humans. This course will focus on understanding key analytics concepts and the breadth of analytic possibilities.
-
Course by
-
Self Paced
-
17 hours
-
English

Distributed Computing with Spark SQL
This course is all about big data. It’s for students with SQL experience that want to take the next step on their data journey by learning distributed computing using Apache Spark. Students will gain a thorough understanding of this open-source standard for working with large datasets. Students will gain an understanding of the fundamentals of data analysis using SQL on Spark, setting the foundation for how to combine data with advanced analytics at scale and in production environments.
-
Course by
-
14 hours
-
English

Modernizing Data Lakes and Data Warehouses with Google Cloud
The two key components of any data pipeline are data lakes and warehouses. This course highlights use-cases for each type of storage and dives into the available data lake and warehouse solutions on Google Cloud in technical detail. Also, this course describes the role of a data engineer, the benefits of a successful data pipeline to business operations, and examines why data engineering should be done in a cloud environment. This is the first course of the Data Engineering on Google Cloud series.
-
Course by
-
Self Paced
-
7 hours
-
English

Introduction to Data Engineering
Start your journey in one of the fastest growing professions today with this beginner-friendly Data Engineering course! You will be introduced to the core concepts, processes, and tools you need to know in order to get a foundational knowledge of data engineering. as well as the roles that Data Engineers, Data Scientists, and Data Analysts play in the ecosystem. You will begin this course by understanding what is data engineering as well as the roles that Data Engineers, Data Scientists, and Data Analysts play in this exciting field.
-
Course by
-
Self Paced
-
13 hours
-
English