Natural Language Processing
My primary project is the Classical Language Toolkit (CLTK), a decentralized platform for natural language processing (NLP) for the languages of Ancient, Classical, and Medieval Eurasia. I am largely responsible for organizing the project's developers and for the quality of its NLP. See here for materials from my public talks on the CLTK.
Outside of work, most of my academic research has involved lecturing and writing about NLP and philology. In particular, I advocate for the vital importance of free and open, decentralized platforms for research into an interconnected ancient world. While in academia, I published a few articles and wrote a dissertation. My dissertation is a network-theoretical study of Julius Caesar's organization and leadership of the Roman army. Of my publications, two of the more interesting are one on what comic book art has to offer the study of literature (from Oxford University Press) and a short piece on Etruscan medicine, which to my utter surprise took on a life of its own as a foundation for contemporary pharmaceutical research (example).
For the Pema Ts'al Orthographic System (for which I am the lead developer), I have customized fonts and a keyboard, which introduce several new punctuation characters to Tibetan orthography in order to aid beginners in reading the language.