Anthropic Updates Responsible Scaling Policy with Claude Opus 4.6 Sabotage Risk Report
Anthropic publishes comprehensive sabotage risk assessment for Claude Opus 4.6, advancing AI safety standards and transparency in frontier model deployment.
Anthropic publishes comprehensive sabotage risk assessment for Claude Opus 4.6, advancing AI safety standards and transparency in frontier model deployment.
Meta is temporarily blocking teenagers from accessing its AI character chatbots globally as it works to build a safer experience and give parents more control, following reports of risky interactions.