I am an AI safety researcher at Koç University, working on the gap between what we ask AI systems to do and what they actually optimize for. My research spans both empirical and theoretical directions — from cataloging specification failures in the wild to developing RL algorithms with formal pessimism guarantees.

Publications

For a full list of publications, see my Google Scholar profile.

Blog Posts

The Bestiary of Unintended Solutions: Specification Gaming in AI
Can You Share Your Network Configurations?