A comprehensive technical analysis of Google DeepMind's five-model ecosystem spanning language, video, music, interactive worlds, and physical robotic
"GPT-Rosalind 是 OpenAI 首个垂直领域前沿模型,专为生命科学研究打造。BixBench 得分 0.751,合作方包括安进、Moderna、诺和诺德。它做什么、怎么工作、为什么标志着从通用到专用的战略转向。"
"GPT-Rosalind is OpenAI's first domain-specific frontier model, purpose-built for life sciences research. It scores 0.751 on BixBench and partners wit
"OpenAI discovered that GPT-5 developed a 3,881% surge in 'goblin' references. The root cause traces to a personality feature and a reward signal that
2026 年 3 月 27 日,Anthropic 因 CMS 配置错误泄露了约 3,000 份未发布资产。其中一份草稿描述了代号 Mythos(内部称 Capybara)的下一代模型,声称在 coding、reasoning 和 cybersecurity 上有显著进展,且在网络安全能力上"far
"GPT-5.5 是 OpenAI 自 GPT-4.5 以来首个完全重新训练的基础模型。SWE-bench Verified 88.7%、Terminal-Bench 2.0 82.7%、1M 上下文检索质量从 36.6% 跃升至 74.0%。本文完整拆解 benchmark 数据、定价策略,以及
"GPT-5.5 is OpenAI's first fully retrained foundation model since GPT-4.5. It delivers 88.7% on SWE-bench Verified, 82.7% on Terminal-Bench 2.0, and m
"Anthropic 在159个国家、70种语言中对80,508人进行了深度访谈,这是史上最大规模的多语言AI用户定性研究。核心发现:人们想要的不是更强大的AI,而是更可靠的AI。数据背后的真相。"
"Anthropic interviewed 80,508 people across 159 countries in 70 languages — the largest qualitative AI study ever conducted. The top finding: people w