Web Science/Part2: Emerging Web Properties/Simple statistical descriptive Models for the Web/Counting Words And Documents
Counting Words And Documents
- Understand why we selected simple English Wikipedia as a toy example for modeling the web
- Understand that a task already as simple as counting words includes modeling choices
- Be familiar with the term “unique word token”
- Know some basic tools to count words and documents
Find the slide deck at File:Counting_Words_And_Documents.pdf
