Research Interests

  • Computational Linguistics.
  • Data-driven natural language processing.
  • Automatic analysis of child language.
  • Human Language Technology applications (e.g. dialogue systems, information extraction and speech recognition).
  • Multimodal processing and human communication dynamics.

For more details, see my research page and my list of publications.

Brief Bio

University of California, Davis (2016-present)

I am director of the Linguistics Graduate Program and vice-chair of the Linguistics Department at UC Davis, with additional affiliations with Cognitive Science and Computer Science. I also run the Computational Linguistics Laboratory.

KITT.AI (2015-2016)

I was a co-founder of KITT.AI, a startup acquired by Baidu.

University of Southern California (2008-2015)

Until October of 2015 I was a Research Assistant Professor at the USC Computer Science Department, and a Research Scientist and Project Leader at the USC Institute for Creative Technologies. I taught CSCI 544 Applied Natural Language Processing, and supervised a research group on computational models of natural language structure consisting of a few students and a senior research associate.

University of Tokyo (2006-2008)

Before joining USC I was a member of Tsujii Laboratory at the University of Tokyo. At Tsujii Lab, I worked on combining discriminative dependency parsing with HPSG, and on applying syntactic parsing in bioinformatics.

Carnegie Mellon University (PhD, 2006)

I got my PhD at Carnegie Mellon University in 2006. My thesis advisors were Alon Lavie (LTI) and Brian MacWhinney (Psychology). The other members of my thesis committee were Lori Levin (LTI), Jaime Carbonell (LTI), and John Carroll (University of Sussex, Department of Informatics).

My research at CMU involved the identification of grammatical relations, or GRs, (such as subjects, objects and adjuncts) in corpora of transcribed dialogues between children and parents. Most of these transcripts came from the CHILDES Database, but I also worked with transcripts from other sources.