-
10,000 most frequent words in Greek and Latin canon
While working on the latest release for the CLTK, which now includes stopword builders, I discovered Python’s built-in
Tf-idf pairwise similarity in the Greek canon
I may write more about this later, but I have been sitting on this Greek tf-idf pairwise similarity comparison, run...
-
Tf-idf pairwise similarity in the Latin canon
Several weeks ago a journalist asked me some interesting questions about tf-idf, which got me thinking about related tasks...
-
Lexical diversity in the Latin canon
Since I had the code laying around from my previous post on Greek lexical diversity, I turned it to...
-
Lexical diversity in the Greek canon
This post is just a quick summary of a little calculation I did with the help of the CLTK’s improved...