Discussion about this post

AlgorithmicPeacebuilding

This is fascinating! I would be curious about your thoughts on the “dignity protocol” as a safety model… https://substack.com/@algorithmicpeacebuilding/note/p-185975208?r=ql6co&utm_medium=ios&utm_source=notes-share-action

Synthetic Civilization

The more models become evaluation-aware, the more static safety benchmarks risk selecting for “passing the test” rather than reducing risk.

Preemption + fixed dashboards could unintentionally accelerate that dynamic.
