Julian Michael

Research Scientist

Bio: I work on a variety of things, mostly focused on AI alignment or formal semantics of natural language. In alignment, I focus on scalable oversight and agent alignment, from the lens of task formulation, data collection, and evaluation methodology. I’m especially interested in using debate as a training and evaluation paradigm. In language, I work on ways to design, annotate, and model semantics in a scalable, data-driven way while taking advantage of our understanding of linguistic structure.

Research Areas: AI alignment, truthfulness, ethics, natural language semantics

Scroll to Top