"DeepMind's 2026 Model Ecosystem: The Complete Technical Architecture of Gemini 3.1 Pro, Veo 3.1, Lyria 3, Genie 3, and Gemini Robotics-ER"

A comprehensive technical analysis of Google DeepMind's five-model ecosystem spanning language, video, music, interactive worlds, and physical robotic

Administrator Administrator Published on 2026-05-09

"GPT-Rosalind:OpenAI 生命科学前沿推理模型深度解析"

"GPT-Rosalind 是 OpenAI 首个垂直领域前沿模型,专为生命科学研究打造。BixBench 得分 0.751,合作方包括安进、Moderna、诺和诺德。它做什么、怎么工作、为什么标志着从通用到专用的战略转向。"

Administrator Administrator Published on 2026-05-05

"GPT-Rosalind: OpenAI's Frontier Reasoning Model for Life Sciences"

"GPT-Rosalind is OpenAI's first domain-specific frontier model, purpose-built for life sciences research. It scores 0.751 on BixBench and partners wit

Administrator Administrator Published on 2026-05-05

"GPT-5 的哥布林之谜:OpenAI 对模型行为漂移的深度调查与 RL 训练启示"

2025年11月,OpenAI的生产流量日志里出现了一个异常。一个特定的词在模型输出中出现的频率高得离谱。这个词是"哥布林"(goblin)。到2026年1月,GPT-5提及哥布林的频率比两个月前高出了**3881%**。没有任何提示词在引导这种行为。没有用户询问过奇幻生物。模型出于无人预料的原因,

Administrator Administrator Published on 2026-05-03

"Where the Goblins Came From: OpenAI's Deep Dive into GPT-5 Behavioral Quirks and What It Means for AI Training"

"OpenAI discovered that GPT-5 developed a 3,881% surge in 'goblin' references. The root cause traces to a personality feature and a reward signal that

Administrator Administrator Published on 2026-05-03

Anthropic Mythos 泄露与运行时治理的崛起——Agent 安全范式转移

2026 年 3 月 27 日,Anthropic 因 CMS 配置错误泄露了约 3,000 份未发布资产。其中一份草稿描述了代号 Mythos(内部称 Capybara)的下一代模型,声称在 coding、reasoning 和 cybersecurity 上有显著进展,且在网络安全能力上"far

Administrator Administrator Published on 2026-05-03

"GPT-5.5 技术深度解析:OpenAI 最新模型如何在编程与推理领域实现新突破"

"GPT-5.5 是 OpenAI 自 GPT-4.5 以来首个完全重新训练的基础模型。SWE-bench Verified 88.7%、Terminal-Bench 2.0 82.7%、1M 上下文检索质量从 36.6% 跃升至 74.0%。本文完整拆解 benchmark 数据、定价策略,以及

Administrator Administrator Published on 2026-04-26

"GPT-5.5 Technical Deep Dive: How OpenAI's Latest Model Achieves New Frontiers in Coding and Reasoning"

"GPT-5.5 is OpenAI's first fully retrained foundation model since GPT-4.5. It delivers 88.7% on SWE-bench Verified, 82.7% on Terminal-Bench 2.0, and m

Administrator Administrator Published on 2026-04-26

"8.1万人想要什么?史上最大规模AI用户期望与担忧研究"

"Anthropic 在159个国家、70种语言中对80,508人进行了深度访谈,这是史上最大规模的多语言AI用户定性研究。核心发现:人们想要的不是更强大的AI,而是更可靠的AI。数据背后的真相。"

Administrator Administrator Published on 2026-04-21

"What 81,000 People Actually Want from AI: Inside Anthropic's Largest Multilingual User Study"

"Anthropic interviewed 80,508 people across 159 countries in 70 languages — the largest qualitative AI study ever conducted. The top finding: people w

Administrator Administrator Published on 2026-04-21
Previous Next