Julian Michael
Research Scientist
Bio: I work on a variety of topics, mostly centered on AI alignment and the formal semantics of natural language. In alignment, I focus on scalable oversight and agent alignment through the lens of task formulation, data collection, and evaluation methodology. I’m especially interested in using debate as a training and evaluation paradigm. In language, I work on ways to design, annotate, and model semantics in a scalable, data-driven way while taking advantage of our understanding of linguistic structure.
Research Areas: AI alignment, truthfulness, ethics, natural language semantics