AI 日报 — 2026-06-13

2026年6月13日约 15 分钟阅读

今日要闻

今日最大事件是美国政府以国家安全为由，要求 Anthropic 立即全球禁用 Claude Fable 5 与 Mythos 5。Anthropic 已切断所有公开访问，包括付费企业用户和海外员工。这一动作与 Claude Fable 5 在 FrontierMath 最难题目上以 88% 准确率领先 GPT-5.5 13 个百分点形成刺眼对照：能力越强，监管来得越快。与此同时，Moonshot AI 开源 Kimi K2.7 Code，以 1/12 价格挑战闭源模型；LangGraph 曝出 RCE 漏洞链，则让 Agent 安全问题从模型层下沉到框架层。

评级	来源	标题	摘要
★★★	VentureBeat	Anthropic blocks all public access to Claude Fable 5, Mythos 5 following US government order	美国政府以出口管制指令要求 Anthropic 暂停所有外国公民访问 Fable 5 / Mythos 5，Anthropic 已全球下架两者
★★☆	The Decoder	US government forces Anthropic to disable Claude Fable 5 and Mythos 5 for all customers worldwide	Anthropic 公开反驳称漏洞轻微且竞品同样存在，警告此举可能开创 Frontier 模型全面停摆先例
★★☆	The Decoder	Microsoft CEO Satya Nadella admits he’s a token-maxer, too: “It’s addictive”	Nadella 警告不要把 Frontier 模型浪费在日常任务上，边际生产力收益必须匹配 token 成本
★★☆	The Decoder	Meta shifts from “tokenmaxxing” to token managing as internal AI costs reportedly hit billions	Meta 内部 AI 使用成本将达数十亿美元，2027 年起通过 AI Gateway 统一管控 token 消耗
★★☆	Reddit / r/artificial	OpenAI Faces Multi-State Probe as US Attorneys General Demand Records on Safety and User Impact	美国多州检察长对 OpenAI 发起调查，要求提供安全与用户影响记录

评级	来源	标题	摘要
★★★	The Decoder	Claude Fable 5 outpaces GPT-5.5 by 13 points on FrontierMath’s toughest problems	Fable 5 在 FrontierMath 最难层级达到 88% 准确率，而 Opus 4.5 年初还不到 10%，数学推理能力跃进明显
★★★	MarkTechPost	Moonshot AI Releases Kimi K2.7-Code: a Coding Model Reporting +21.8% on Kimi Code Bench v2 Over K2.6	Moonshot 开源 Kimi K2.7-Code，256K 上下文、推理 token 降低约 30%，对 GPT-5.5 / Claude 有最高 12 倍价格优势
★★☆	The Decoder	Google Research’s Gemini-SQL2 tops text-to-SQL benchmarks by a wide margin	基于 Gemini 3.1 Pro 的 Gemini-SQL2 在 BIRD 基准达到 80.04%，大幅领先 OpenAI 与 Anthropic

评级	来源	标题	摘要
★★☆	The Decoder	Microsoft’s SkillOpt boosts GPT-5.5 by using nothing but a trained Markdown file	微软与中科大等提出 SkillOpt，仅用一个训练过的 Markdown 指令文件就在程序化任务上提升 GPT-5.5 约 23 分，且可跨模型与 Agent 环境迁移
★★☆	Dev.to	LangGraph RCE Chain: How Malicious Tool Calls Escalate to Full Host Compromise	LangGraph 自托管部署曝出漏洞链，攻击者可通过恶意 tool call 实现完整主机 RCE，Agent 框架安全需引起重视
★★☆	Towards Data Science	Parse PDFs for RAG Locally with Docling: Rich Tables, No Cloud Upload	使用 Docling 在本地解析 PDF 表格、OCR、标题等结构，无需云端上传即可为 RAG 提供高质量文档理解

评级	来源	标题	摘要
★★☆	新智元	刚刚，GPT-5.5被中国纯血AI反超了！	讯飞星火医疗大模型 V3.5 发布，医生采纳率 91%、病历书写时间缩短 52%，在医疗场景落地指标上超越 GPT-5.5