Reading

The material on the following sites is important and should be read either before or after studying the text in this section.

The LDA Buffet: A Topic Modeling Fable

Matthew L. Jockers

This web blog provides a succinct, clear, and humorous introduction to topic modeling, sans mathematical details.  Topic modeling is illustrated with a vignette describing an imaginary conversation between Jane Austen and Herman Melville.

 

Topic Modeling Made Just Simple Enough

Ted Underwood

April 7, 2012

Humanities scholar Ted Underwood presents the main concepts of topic modeling qualitatively in a clear manner, illustrated with simple examples.  There is some mathematical treatment, but it is kept on a basic level.

Read: Topic Modeling Made Just Simple Enough

 

The following material is optional.  However, interested readers are encouraged to peruse it.

Topic Modeling and Latent Dirichlet Allocation (LDA) in Python

Susan Lee

May 31, 2018

This short web article discusses topic modeling using LDA in Python.  Data preprocessing steps are reviewed.  An example using LDA and term frequency-inverse document frequency (tfidf) and corresponding Python code are presented.

 

Topic Modelling and Dynamic Topic Modelling: A Technical Review

By Statistics Canada

This web article provides a technical and mathematical overview of topic modeling and dynamic topic modeling

 

A Beginner’s Guide to Latent Dirichlet Allocation (LDA)

Ria Kulshrestha

Jul 19, 2019

This web article describes the basic concept of LDA in a straightforward manner, and illustrates the main points with intuitive examples.

 

R Code

 

This section uses the R code Topic_Modeling_Example.R and the data file subjectData.csv.

[NEXT]

License

Icon for the Creative Commons Attribution-ShareAlike 4.0 International License

Digital Humanities Tools and Techniques II Copyright © 2022 by Mark Wachowiak, Ph.D. is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License, except where otherwise noted.

Share This Book