agentic-ai

PSA: Almost nobody is working on alignment

LessWrong · Jun 12, 2026, 5:17 AM

People often assume that a large fraction of the AI safety community works on alignment. As far as we're aware, this is not true. Most people are not working on making sure superintelligent AIs are aligned with human values or follow human instructions.Currently, the people who we know of that work on alignment are roughly:The Alignment Research Center who work on a research bet by Paul Christiano Probably Sequent who just got announced yesterday Parts of GDM (agent foundations work, some debate work)Some scattered people who work at universities or independently, some of whom hang around Berkeley??A lot of the remainder of the AI safety community does indirect work like capability evaluations, risk assessments, control, policy, AI science, understanding misalignment (which maybe should partially count as alignment work), demos and so on.Some production alignment work (i.e., making current models behave well) might help with more ambitious alignment, too (e.g., some COT-monitoring). Many people also work on aligning current/next-generation models so that these models help with aligning future models, and hope this scales to superintelligence.We are not necessarily saying this is bad and that people are making a big mistake (e.g., neither of us work on alignment) but it's a notable fact that seems good to make known to those who don't know about it.Discuss

Article preview — originally published by LessWrong. Full story at the source.

Read full story on LessWrong → More top stories

Aggregated and edited by the Scoop newsroom. We surface news from LessWrong alongside other reporting so you can compare coverage in one place. Editorial policy · Corrections · About Scoop

PSA: Almost nobody is working on alignment

More in agentic-ai