- Level Professional
- Duration 16 hours
- Course by University of California, Davis
-
Offered by
About
This course allows you to apply the SQL skills taught in "SQL for Data Science" to four increasingly complex and authentic data science inquiry case studies. We'll learn how to convert timestamps of all types to common formats and perform date/time calculations. We'll select and perform the optimal JOIN for a data science inquiry and clean data within an analysis dataset by deduping, running quality checks, backfilling, and handling nulls. We'll learn how to segment and analyze data per segment using windowing functions and use case statements to execute conditional logic to address a data science inquiry. We'll also describe how to convert a query into a scheduled job and how to insert data into a date partition. Finally, given a predictive analysis need, we'll engineer a feature from raw data using the tools and skills we've built over the course. The real-world application of these skills will give you the framework for performing the analysis of an AB test.Modules
Getting Started
1
Discussions
- Learning Goals
1
Videos
- Course Introduction
1
Readings
- A Note From UC Davis
Course Dataset
1
Assignment
- Are You Connected to Mode Analytics?
2
Videos
- Introduction of Data of Unknown Quality
- Introduction to the Course Dataset
1
Readings
- Connecting to Mode Analytics
Data of Unknown Quality
5
Discussions
- Error Codes (Practice Exercise)
- What Should a Message Table Look Like?
- Flexible Data Formats (Practice Exercise)
- Identifying Unreliable Data + Nulls (Practice Exercise)
- Counting Users (Practice Exercise)
9
Videos
- Error Codes (Solution)
- Flexible Data Formats
- Flexible Data Formats (Solution)
- Identifying Unreliable Data + Nulls
- Unreliable Data - Part 1 (Solution)
- Unreliable Data -Part 2 (Solution)
- Unreliable Data - Part 3 (Solution)
- Answering Ambiguous Questions
- Users Table (Solution)
Review
1
Assignment
- Module 1 Quiz
1
Readings
- Not Dumb Questions
Creating Clean Datasets
3
Assignment
- Data Types (Quiz)
- Dependencies (Quiz)
- Turn a Clean Query Into a Table (Quiz)
10
Videos
- Creating Clean Datasets Introduction
- Tools of the Trade: Coding Guide with Sublime Text (Optional)
- Data Types
- What is a Dependency?
- Create a View-Item Table
- Turn a Clean Query Into a Table (Activity/Solution)
- Hierarchy of Data
- Create User Info Table
- Create a User Snapshot Table (Activity/Solution)
- Partitions in Hive (Optional)
Review
1
Assignment
- Module 2 Quiz
SQL Problem Solving
1
Assignment
- Reorder and Connect Tables
4
Discussions
- Create a Rollup Table (Practice Exercise)
- Promo Email (Practice Exercise)
- Email Campaign
- Product Analysis (Practice Exercise)
11
Videos
- Introduction to SQL Problem Solving
- Map Out Your Joins
- Test Queries vs Final Queries
- Example: Create an Aggregate Table with a Rolling Date Period
- Rolling Orders (Solution)
- Find Each User's Most Recently Viewed Page for an Email Campaign
- Review Windowing Functions
- Promo Email (Solution)
- Product Analysis
- Product Analysis (Solution)
- Coding with Style
Review
1
Assignment
- Module 3 Quiz
Case Study: AB Testing
3
Discussions
- Test Assignments (Practice Exercise)
- Create a New Metric (Practice Exercise)
- Analyzing Results (Practice Exercise)
8
Videos
- How is AB Testing Used?
- Statistics Refresher (Optional)
- Test Assignments
- Test Assignments (Solution)
- Create a New Metric
- Creating a New Metric (Solution)
- Analyze Results
- Analyzing Results (Solution)
Final Project
1
Assignment
- Prepared for the Final Project?
1
Peer Review
- AB Testing
1
Videos
- Peer Review Overview
1
Readings
- Additional Thoughts for the Final Project
Course Summary
1
Discussions
- Self-Reflection
1
Videos
- Course Summary
1
Readings
- Additional Practice and Reading (Optional)
Auto Summary
Unlock the full potential of your SQL skills with "Data Wrangling, Analysis and AB Testing with SQL." Focused on Data Science and AI, this professional-level course, led by Coursera, offers practical case studies to enhance your SQL proficiency. Learn to convert timestamps, perform JOINs, clean data, analyze segments, and engineer features for predictive analysis. With a duration of 960 minutes, the course provides comprehensive training through Starter, Professional, and Paid subscription options. Ideal for data science enthusiasts aiming to master complex data analysis and AB testing.

Katrina Glaeser Poole