Projects

Natural Language Processing

My primary project is the Classical Language Toolkit (CLTK), a decentralized platform for natural language processing (NLP) for the languages of Ancient, Classical, and Medieval Eurasia. I am largely responsible for the project's organization of developers and the quality of its NLP. See here for materials from my public talks on the CLTK.

Research

Outside of work, most of my academic research has involved lecturing and writing about NLP and philology. In particular, I advocate for the vital importance of free and open, decentralized platforms for the research into an interconnected ancient world. While in academia, I published a few articles and wrote a dissertation. My dissertation is a network-theoretical study of Julius Caesar's organization and leadership of the Roman army. Of my publications, two of the more interesting are one on what comic book art has to offer the study of literature (from Oxford University Press) and a short piece on Etruscan medicine, which to my utter surprise took on a life of its own as a foundation for contemporary pharmaceutical research (example).

For the Pema Ts'al Orthographic System (for which I am the lead developer) I have customized fonts and a keyboard which introduce several new punctuation characters to Tibetan orthography, in order to aid beginners in reading the language.

Recent posts

About

I born and raised in the State of Washington. I now reside in the Bay Area, where I work as a research scientist specializing in NLP and machine learning. My formal education was in Classics (BA, Reed College; PhD, NYU).