Anthropic's latest interpretability research maps 171 emotion concepts inside Claude using sparse autoencoders. The findings reveal that emotion vecto
"Anthropic 开展了一项为期一周的实验,让 Claude 在 4 个平行市场中自主交易。Opus Agent 卖出商品的价格比 Haiku Agent 高出 70%。无论是 Agent 还是人类,都未察觉其中存在的问题。"
"Anthropic ran a week-long experiment where Claude autonomously traded items across 4 parallel markets. Opus agents sold items for 70% more than Haiku
2026 年 3 月 27 日,Anthropic 因 CMS 配置错误泄露了约 3,000 份未发布资产。其中一份草稿描述了代号 Mythos(内部称 Capybara)的下一代模型,声称在 coding、reasoning 和 cybersecurity 上有显著进展,且在网络安全能力上"far
"Anthropic 联合12家科技巨头,构建了一个因进攻能力太强而拒绝公开发布的网络安全AI模型。Project Glasswing 在关键基础设施中发现了数千个零日漏洞。这对软件行业意味着什么。"
"Anthropic assembled 12 tech giants and built a cybersecurity AI model too dangerous to release publicly. Project Glasswing found thousands of zero-da
"Anthropic 在159个国家、70种语言中对80,508人进行了深度访谈,这是史上最大规模的多语言AI用户定性研究。核心发现:人们想要的不是更强大的AI,而是更可靠的AI。数据背后的真相。"
"Anthropic interviewed 80,508 people across 159 countries in 70 languages — the largest qualitative AI study ever conducted. The top finding: people w
Claude Opus 4.7 全面技术解析:87.6% SWE-bench Verified、+14.6 MCP-Atlas、+44 XBOW、自验证行为、高分辨率视觉、xhigh effort level、迁移指南、多模型路由策略。
Claude Opus 4.7 analysis: 87.6% on SWE-bench Verified, +10.9 on SWE-bench Pro, +44 on XBOW Vision. The most comprehensive technical breakdown availabl