[Linkpost] Community polls on alignment controversies
Planning where we focus at Ca ML requires forming views on many controversial questions. In many cases, people we've talked to have wildly different perceptions of the balance of opinions, so we thought this would be a great way for us and others to know where we're out of step on these issues. Also feel free to tell us if you think the questions are ambiguous or embed false assumptions.The questions (voting in comments):Robust alignment requires alignment-relevant intervention during pretrainingAI alignment to humans will in practice avoid moral catastrophes to animalsAI alignment to humans will in practice avoid moral catastrophes to digital mindsResearch into digital mind suffering is sufficiently tractable to work onPartially aligned transformative AIs are likely to be stable under reflectionAlignment to specific values is underrated in research relative to controlMultipolar worlds will compete away >90% of net value that would otherwise be preservedDiscuss