- Level Foundation
- Duration 5 hours
- Course by Korea Advanced Institute of Science and Technology (KAIST)
About
In this course, students explore the characteristics of language through big data. They learn how to collect and analyze big data and how to identify linguistic features in it. The course discusses a number of approaches to the linguistic analysis of written and spoken texts.
Modules
Week 1
1 Assignment
- Scientific approaches
5 Videos
- Lemmas
- Keyness of Word
- Collocations
- N-grams
- POS Parser
Week 2
1 Assignment
- BNC
5 Videos
- BNC - Information
- BNC - Function I
- BNC - Function II
- BNC - Task
- Student Presentation Samples: BNC
Week 3
1 Assignment
- COCA, TagAnt and AntConc
5 Videos
- COCA - Information
- COCA - Task
- AntConc
- TagAnt
- PM and Ratio
Week 4
1 Assignment
- Considerations of Big Data and Language
5 Videos
- Big Data for Different Languages
- Limitations of Big Data
- Future directions of Big Data
- Project Guideline and Peer Review
- Student Presentation Samples
Week 5
1 Peer Review
- Final project peer review
Auto Summary
The "Big Data and Language 2" course is designed to deepen your understanding of linguistic characteristics through big data. Offered by KAIST on Coursera, this foundation-level course teaches learners to collect and analyze large datasets and to find linguistic features in them. It covers several approaches to analyzing written and spoken texts, working with the BNC and COCA corpora and with the AntConc and TagAnt tools. The course takes about 5 hours. It suits language enthusiasts, data analysts, and anyone interested in combining language study with data science.

Seonmin Park