Reading

The following website may be used for reference.

udpipe: Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing with the ‘UDPipe’ ‘NLP’ Toolkit

This cran-r website describes the UDPipe package in R and its functionality.

 

The following website contains data for experimenting with text analysis.

ABC News Corpora

News headlines from the Australian Broadcasting Corporation (ABC) from February 2003 – June 2017 can be found on this site.

 

The following material is optional.  However, interested readers are encouraged to peruse it.

Text Analysis in R made easy with Udpipe

R.S. Abdul Majed Raja

Feb 26, 2018

This article describes the UDPipe text analysis R package and illustrates some of its functionality with a large corpus of articles from the ABC News website.

Read:  Text Analysis in R made easy with Udpipe

 

Universal Parts of Speech (UPOS) Tagging

UPOS is described on this site.

 

RAKE (Rapid Automatic Keyword Extraction) Algorithm

This cran.r-project website provides a description of the R language RAKE package, which implements the Rapid Automatic Keyword Extraction algorithm (Rose et al., 2010).

View: RAKE

 

R Code

 

This section uses the R code Advanced_Text_Analysis_Example.R.

[NEXT]

 

License

Icon for the Creative Commons Attribution-ShareAlike 4.0 International License

Digital Humanities Tools and Techniques II Copyright © 2022 by Mark Wachowiak, Ph.D. is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License, except where otherwise noted.

Share This Book