Contents
1.2 Text Mining and Related Fields
1.3 Advice for Reading this Book
2.4 Decomposing Poe’s “The Tell-Tale Heart” into Words
2.6 First Attempt at Extracting Sentences
3.2 Scalars, Interpolation, and Context in Perl
3.3 Arrays and Context in Perl
3.4 Word Lengths in Poe’s “The Tell-Tale Heart”
4 Probability and Text Sampling
4.4 Mean and Variance of Random Variables
4.5 The Bag-of-Words Model for Poe’s “The Black Cat”
5 Applying Information Retrieval to Text Mining
5.2 Counting Letters and Words
5.4 The Term-Document Matrix Applied to Poe
6 Concordance Lines and Corpus Linguistics
6.5 Collocations and Concordance Lines
6.6 Applications with References
7 Multivariate Techniques with Text
7.4 Principal Components Analysis
7.6 Applications and References
9 A Sample of Additional Topics
9.3 Other Languages: Analyzing Goethe in German
Appendix A: Overview of Perl for Text Mining
A.5 Introduction to Regular Expressions