- Level Foundation
- المدة 8 ساعات hours
- الطبع بواسطة University of Washington
-
Offered by
عن
Important note: The second assignment in this course covers the topic of Graph Analysis in the Cloud, in which you will use Elastic MapReduce and the Pig language to perform graph analysis over a moderately large dataset, about 600GB. In order to complete this assignment, you will need to make use of Amazon Web Services (AWS). Amazon has generously offered to provide up to $50 in free AWS credit to each learner in this course to allow you to complete the assignment. Further details regarding the process of receiving this credit are available in the welcome message for the course, as well as in the assignment itself. Please note that Amazon, University of Washington, and Coursera cannot reimburse you for any charges if you exhaust your credit. While we believe that this assignment contributes an excellent learning experience in this course, we understand that some learners may be unable or unwilling to use AWS. We are unable to issue Course Certificates for learners who do not complete the assignment that requires use of AWS. As such, you should not pay for a Course Certificate in Communicating Data Results if you are unable or unwilling to use AWS, as you will not be able to successfully complete the course without doing so. Making predictions is not enough! Effective data scientists know how to explain and interpret their results, and communicate findings accurately to stakeholders to inform business decisions. Visualization is the field of research in computer science that studies effective communication of quantitative results by linking perception, cognition, and algorithms to exploit the enormous bandwidth of the human visual cortex. In this course you will learn to recognize, design, and use effective visualizations. Just because you can make a prediction and convince others to act on it doesn’t mean you should. In this course you will explore the ethical considerations around big data and how these considerations are beginning to influence policy and practice. You will learn the foundational limitations of using technology to protect privacy and the codes of conduct emerging to guide the behavior of data scientists. You will also learn the importance of reproducibility in data science and how the commercial cloud can help support reproducible research even for experiments involving massive datasets, complex computational infrastructures, or both. Learning Goals: After completing this course, you will be able to: 1. Design and critique visualizations 2. Explain the state-of-the-art in privacy, ethics, governance around big data and data science 3. Use cloud computing to analyze large datasets in a reproducible way.الوحدات
Data Types and Visual Mappings
6
Videos
- 01 Introduction: What and Why
- 02 Introduction: Motivating Examples
- 03 Data Types: Definitions
- 04 Mapping Data Types to Visual Attributes
- 05 Data Types Exercise
- 06 Data Types and Visual Mappings Exercises
Visual Encoding
4
Videos
- 07 Data Dimensions
- 08 Effective Visual Encoding
- 09 Effective Visual Encoding Exercise
- 10 Design Criteria for Visual Encoding
Visual Perception
4
Videos
- 11 The Eye is not a Camera
- 12 Preattentive Processing
- 13 Estimating Magnitude
- 14 Evaluating Visualizations
Peer Assessment: Crime Analytics: Visualization of Incident Reports
1
Peer Review
- Crime Analytics: Visualization of Incident Reports
Ethics
6
Videos
- Motivation: Barrow Alcohol Study
- Barrow Study Problems
- Reifying Ethics: Codes of Conduct
- ASA Code of Conduct: Responsibilities to Stakeholders
- Other Codes of Conduct
- Examples of Codified Rules: HIPAA
Privacy
3
Videos
- Privacy Guarantees: First Attempts
- Examples of Privacy Leaks
- Formalizing the Privacy Problem
Differential Privacy
5
Videos
- Differential Privacy Defined
- Global Sensitivity
- Laplacian Noise
- Adding Laplacian Noise and Proving Differential Privacy
- Weaknesses of Differential Privacy
Introduction
2
Videos
- Reproducibility and Data Science
- Reproducibility Gold Standard
Cloud Computing
4
Videos
- Anecdote: The Ocean Appliance
- Code + Data + Environment
- Cloud Computing Introduction
- Cloud Computing History
Reproducibility in the Cloud
5
Videos
- Code + Data + Environment + Platform
- Cloud Computing for Reproducible Research
- Advantages of Virtualization for Reproducibility
- Complex Virtualization Scenarios
- Shared Laboratories
Costs and Cost Sharing
6
Videos
- Economies of Scale
- Provisioning for Peak Load
- Elasticity and Price Reductions
- Server Costs vs. Power Costs
- Reproducibility for Big Data
- Counter-Arguments and Summary
Large-Scale Computation in the Cloud
- Graph Analysis in the Cloud
1
Assignment
- AWS Credit Opt-in Consent Form
Auto Summary
"Communicating Data Science Results" is a foundational course in Big Data and Analytics, led by Coursera. It focuses on teaching effective visualization techniques for data communication, ethical considerations in big data, and using cloud computing for reproducible research. The course includes hands-on assignments with AWS, supported by $50 in free AWS credit. Over 480 minutes, learners will master designing visualizations, understanding data privacy and ethics, and analyzing large datasets. Subscription options include Starter and Professional, making it ideal for aspiring data scientists and professionals looking to enhance their data communication skills.

Bill Howe