"Gemini Robotics: When AI Finally Learns to Touch the Physical World"

"Google DeepMind's Gemini Robotics is the first AI model family that can see, reason about, and physically interact with the real world. Here is how i

Administrator Administrator Published on 2026-05-05

"智能时代的网络安全:OpenAI 五支柱行动计划深度解析与 AI 防御民主化"

"OpenAI 发布了五支柱网络安全行动计划,附带 1000 万美元 API 额度与专用 GPT-5.4-Cyber 模型。本文逐条解析支柱含义,评估落地成效与现存缺口。"

Administrator Administrator Published on 2026-05-03

"Cybersecurity in the Intelligence Age: Decoding OpenAI's Five-Pillar Action Plan for Democratizing AI-Powered Defense"

"OpenAI published a five-pillar cybersecurity action plan with $10M in API credits and a dedicated GPT-5.4-Cyber model. Here's what each pillar means

Administrator Administrator Published on 2026-05-03

"OpenAI 隐私过滤器:开源 PII 检测模型技术解析与企业 AI 合规实践"

OpenAI 发布了首个开源权重 PII 检测模型,采用 Apache 2.0 许可证。一家以专有 API 为核心业务的公司,发布可下载、微调且不按 token 收费的模型——这本身就是一个信号。该模型在标准测试集上达到 **96% F1**,可通过 WebGPU 在浏览器中运行,还附带 `opf`

Administrator Administrator Published on 2026-05-03

"OpenAI Privacy Filter: How Open-Weight PII Detection Works and Why It Matters for Enterprise AI"

"OpenAI released its first open-weight PII detection model under Apache 2.0. Here's how the 50M active-parameter model achieves 96% F1, runs in browse

Administrator Administrator Published on 2026-05-03

"Project Deal:Anthropic 让 Claude 代替 69 名员工自主交易的实验全记录"

"Anthropic 开展了一项为期一周的实验,让 Claude 在 4 个平行市场中自主交易。Opus Agent 卖出商品的价格比 Haiku Agent 高出 70%。无论是 Agent 还是人类,都未察觉其中存在的问题。"

Administrator Administrator Published on 2026-05-03

"Project Deal: How Anthropic Let Claude Buy, Sell, and Negotiate on Behalf of 69 Employees"

"Anthropic ran a week-long experiment where Claude autonomously traded items across 4 parallel markets. Opus agents sold items for 70% more than Haiku

Administrator Administrator Published on 2026-05-03

"GPT-5 的哥布林之谜:OpenAI 对模型行为漂移的深度调查与 RL 训练启示"

2025年11月,OpenAI的生产流量日志里出现了一个异常。一个特定的词在模型输出中出现的频率高得离谱。这个词是"哥布林"(goblin)。到2026年1月,GPT-5提及哥布林的频率比两个月前高出了**3881%**。没有任何提示词在引导这种行为。没有用户询问过奇幻生物。模型出于无人预料的原因,

Administrator Administrator Published on 2026-05-03

"Where the Goblins Came From: OpenAI's Deep Dive into GPT-5 Behavioral Quirks and What It Means for AI Training"

"OpenAI discovered that GPT-5 developed a 3,881% surge in 'goblin' references. The root cause traces to a personality feature and a reward signal that

Administrator Administrator Published on 2026-05-03

Anthropic Mythos 泄露与运行时治理的崛起——Agent 安全范式转移

2026 年 3 月 27 日,Anthropic 因 CMS 配置错误泄露了约 3,000 份未发布资产。其中一份草稿描述了代号 Mythos(内部称 Capybara)的下一代模型,声称在 coding、reasoning 和 cybersecurity 上有显著进展,且在网络安全能力上"far

Administrator Administrator Published on 2026-05-03
Previous Next