embedplyr/coha.lemma_sgns

By-decade word2vec (SGNS) embeddings trained on Corpus of Historical American English (Genre-Balanced American English, word lemmas, 1830s-2000s).

rds files are R numeric matrices with tokens as rownames.

References:

William L. Hamilton, Jure Leskovec, and Dan Jurafsky. ACL 2016. Diachronic Word Embeddings Reveal Statistical Laws of Semantic Change. https://nlp.stanford.edu/projects/histwords/