Side by Side Comparison of RSP Versions
With all of the discussion about changes to Anthropic’s Responsible Scaling Policy, I figured actually reading through all of the versions in one go would be helpful. I wanted to easily compare sections side by side, so I made a quick website which you can find here.

It took me a little over an hour to read through all of them, and it seemed worth it on net. It’s really clear how the tone has shifted, going from commitments (many of which have definitely been broken) to something more along the lines of a reporting policy. To some extent this seems inevitable, given that they would basically have to define RSP-4 as AGI[1] and cop to considering how to stop nation-state-level security breaches, as well as the question of “should you even be releasing that?”

Anyways, not trying to get too much into my personal commentary. Preview of what it looks like below, hope it’s helpful to some of you!

[1] Yes, I am using this in an underdefined way.
