Kyle P. Johnson PhD
  • 10,000 most frequent words in Greek and Latin canon

    While working on the latest release for the CLTK, which now includes stopword builders, I discovered Python’s built-in...

  • Tf-idf pairwise similarity in the Greek canon

    I may write more about this later, but I have been sitting on this Greek tf-idf pairwise similarity comparison, run...

  • Tf-idf pairwise similarity in the Latin canon

    Several weeks ago a journalist asked me some interesting questions about tf-idf, which got me thinking about related tasks...

  • Lexical diversity in the Latin canon

    Since I had the code lying around from my previous post on Greek lexical diversity, I turned it to...

  • Lexical diversity in the Greek canon

    This post is just a quick summary of a little calculation I did with the help of the CLTK’s improved...
