- Level Foundation
- Duration 10 hours
- Course by University of California San Diego
-
Offered by
About
How do we infer which genes orchestrate various processes in the cell? How did humans migrate out of Africa and spread around the world? In this class, we will see that these two seemingly different questions can be addressed using similar algorithmic and machine learning techniques arising from the general problem of dividing data points into distinct clusters. In the first half of the course, we will introduce algorithms for clustering a group of objects into a collection of clusters based on their similarity, a classic problem in data science, and see how these algorithms can be applied to gene expression data. In the second half of the course, we will introduce another classic tool in data science called principal components analysis that can be used to preprocess multidimensional data before clustering in an effort to greatly reduce the number dimensions without losing much of the "signal" in the data. Finally, you will learn how to apply popular bioinformatics software tools to solve a real problem in clustering.Modules
Welcome!
1
Videos
- (Check Out Our Wacky Course Intro Video!)
1
Readings
- Course Details
Week 1: How Did Yeast Become a Wine Maker? (Week 1/2)
1
Assignment
- Week 1 Quiz
1
External Tool
- Interactive Text for Week 1
4
Videos
- Which Yeast Genes are Responsible for Wine Making?
- Gene Expression Matrices
- Clustering as an Optimization Problem
- The Lloyd Algorithm for k-Means Clustering
1
Readings
- Week 1 FAQs (Optional)
Complete the Code Challenges!
1
External Tool
- Open in order to Sync Your Progress: Interactive Text for Week 1
Week 2: How Did Yeast Become a Wine Maker? (Week 2/2)
1
Assignment
- Week 2 Quiz
1
External Tool
- Interactive Text for Week 2
5
Videos
- From Hard to Soft Clustering
- From Coin Flipping to k-Means Clustering
- Expectation Maximization
- Soft k-Means Clustering
- Hierarchical Clustering
1
Readings
- Week 2 FAQs (Optional)
Complete the Code Challenges!
1
External Tool
- Open in order to Sync Your Progress: Interactive Text for Week 2
Week 3: How Have Humans Populated the Earth? (Week 1/1)
1
Assignment
- Week 3 Quiz
2
Readings
- Statement on This Week's Material
- How Have Humans Populated the Earth?
Auto Summary
Explore the Genomic Data Science and Clustering course, focusing on bioinformatics within the Health & Fitness domain. Led by Coursera, this foundational course covers clustering algorithms, gene expression data analysis, and principal components analysis. Over 600 minutes, learners will engage with practical bioinformatics tools and solve real clustering problems. Ideal for beginners, it offers a starter subscription option.

Pavel Pevzner

Phillip Compeau