🤖 AI News Daily, 2026-06-16 (Tuesday)

🕐 Generated: 2026-06-16 08:32 UTC+8 📊 49 items collected | T1: 38 | T1.5: 7 | T2: 4

🔥 T1 Official first-party sources

OpenAI

Built to benefit everyone: our plan

A vision for the future of AI, focusing on access, safety, and shared prosperity as OpenAI works to ensure AGI benefits everyone.
Codex for every role, tool, and workflow

Discover new Codex plugins, sites, and annotations that help analysts, marketers, designers, investors, and other teams get more done with
Introducing GPT-5.5

Introducing GPT-5.5, our smartest model yet—faster, more capable, and built for complex tasks like coding, research, and data analysis
An OpenAI model has disproved a central conjecture in discrete geometry

An OpenAI model solved the 80-year-old unit distance problem, disproving a major conjecture in discrete geometry and marking a milestone in

Anthropic

Claude Fable 5 and Claude Mythos 5

Today we're launching Claude Fable 5: a Mythos-class model that we've made safe for general use. It is state-of-the-art on nearly all tested benchmarks of...
Policy on the AI Exponential

We are sharing two policy proposals to prepare for AI progress. The first, our Advanced AI Framework, offers a roadmap for governing increasingly capable systems, from...

Hugging Face

The Open Source Community is backing OpenEnv for Agentic RL

OpenEnv is a tool for creating an agentic execution environment like terminals, browsers, or anything an agent can interact with.
Designing the hf CLI as an agent-optimized way to work with the Hub

hf is the official command-line entrypoint to the Hugging Face Hub. Anything you can do on the Hub from the Python SDK, you can do from your
google/diffusiongemma-26B-A4B-it

License: Apache 2.0 | Authors: Google DeepMind. DiffusionGemma is a generative model
olmo-eval: An evaluation workbench for the model development loop

A Blog post by Ai2 on Hugging Face.
Daily Papers

Your daily dose of AI research from AK.

GitHub

GitHub Agentic Workflows is now in public preview

"With GitHub Agentic Workflows, we're able to expand how we apply agents to real engineering work at scale, including changes that span
Updates to GitHub Copilot billing and plans

As announced in our recent blog post, usage-based billing for GitHub Copilot is now live for all users and Copilot code review consumes
GitHub Universe is back: All together now, in the agentic era

GitHub Universe is back: returning to the historic Fort Mason Center in San Francisco on October 28–29, 2026.
How we made GitHub Copilot CLI more selective about delegation

In agentic systems, more delegation isn't always better. Imagine asking Copilot CLI to make a simple change. Instead of handling it directly
GitHub Copilot is moving to usage-based billing

Starting June 1, your Copilot usage will consume GitHub AI Credits.

Apple

Introducing the Third Generation of Apple's Foundation Models

Our next generation of Apple Intelligence is centered around our users, integrated deeply into our operating systems, and powered by a bold…
GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models

Recent advancements in Large Language Models (LLMs) have sparked interest in their formal reasoning capabilities, particularly in…
Instruction-Following Pruning for Large Language Models

Our approach, termed "instruction-following pruning", introduces a sparse mask predictor that takes the user instruction as input and dynamically selects the
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2026

Apple is presenting new research at the annual IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), which takes place in…
Announcing the 2022 Apple Scholars in AIML

The Apple Scholars in AIML PhD fellowship recognizes the contributions of emerging leaders in computer science and engineering at the graduate and postgraduate

NVIDIA

NVIDIA Blackwell Leads on First Agentic AI Infrastructure Benchmark

New AgentPerf results from Artificial Analysis show how accelerated computing systems handle real-world agentic workloads, with NVIDIA GB300
Save Big and Play Bigger: GeForce NOW Summer Sale Brings Major Membership Savings

Level up with the best value in gaming, 'Guild Wars' and rewards, plus eight new games this week.
Forecast: Fun Ahead — 18 Games Join in June to Stream on GeForce NOW

Kick off the summer with 10 games this week, including the highly requested surreal open-world adventure, 'Neverness to Everness.'
What Are AI Tokens? The Language and Currency Powering Modern AI

Tokens are units of data processed by AI models during training and inference, enabling prediction, generation and reasoning. March 17, 2025 by Dave
NVIDIA Accelerates Google DeepMind's DiffusionGemma for Local AI

The new DiffusionGemma open model generates text in parallel — not one token at a time — and is optimized to run on the NVIDIA RTX PRO

xAI

Use Grok in Warp

You can now use your Grok or X Premium subscription inside Warp, an agentic development environment built on the terminal and used by almost
Agent Dashboard in Grok Build

Manage many coding sessions at once. See what each is doing, reply to the ones that need you, and dispatch new work.
Grok Build Plugin Marketplace

Today we're launching the Grok Build Plugin Marketplace - a set of built-in plugins for Grok Build. A plugin bundles skills, slash commands,
Bringing real-time market sentiment to Tori, from eToro

Tori, eToro's AI agent, uses models from SpaceXAI to embed real-time market sentiment directly into Tori's investing workflow.
Powering Gopuff's Go agent

Gopuff and SpaceXAI launched Go, an AI-powered shopping assistant built into the Gopuff app and powered by Grok text, audio, and image

Simon Willison

Claude Fable is relentlessly proactive

11th June 2026. After two days of experience with Claude Fable 5 I think the best way to describe it
Initial impressions of Claude Fable 5

9th June 2026. I didn't have early access to today's Claude Fable 5 release, but I've spent the past
Setting a custom price for a model in AgentsView

I'm a recent convert to AgentsView, Wes McKinney's
Statement on the US government directive to suspend access to Fable 5 and Mythos 5. Well this is nuts

The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos
The lethal trifecta for AI agents: private data, untrusted content, and external communication

If your agent combines these three features, an attacker can easily trick it into accessing your private data and sending it to that attacker.

📰 T1.5 Media coverage

HackerNews

the-decoder.com

Landmark German ruling declares Google's AI Overviews are Google's own words and makes it liable for false answers

A German regional court has ruled that Google is directly liable for the content of its AI search overviews.
Google Research's Gemini-SQL2 tops text-to-SQL benchmarks by a wide margin

Google Research's Gemini-SQL2 turns natural language into executable SQL queries. Built on Gemini 3.1 Pro, it tops the BIRD benchmark.
Microsoft's SkillOpt boosts GPT-5.5 by using nothing but a trained Markdown file

Microsoft and three Chinese universities have developed SkillOpt, a method that optimizes instruction documents for AI agents.
Anthropic releases Claude Fable 5 and Mythos 5 with major gains in coding and science

Anthropic ships two new models, Claude Fable 5 and Mythos 5, that claim to blow past the current Opus generation.

🔬 T2 Community & academia

HuggingFace Daily Papers

No Hidden Prompts Needed! You Can Game AI Peer Review with Presentation-Only Revisions

As AI-generated reviews move from experimental tools into peer-review infrastructure, most robustness concerns have focused on explicit attacks such as hidden instructions and prompt injection. We stu
Pythagoras-Prover: Advancing Efficient Formal Proving via Augmented Lean Formalisation

Modern Lean theorem provers achieve strong performance only with substantial training and inference compute, driven in part by scarce verified proof data and the long reasoning traces of formal proof
iMaC: Translating Actions into Motion and Contact Images for Embodied World Models

Embodied world models have emerged as a pivotal paradigm for visual robotic decision-making and interactive environment simulation. However, conventional embodied frameworks rely on low-dimensional st
The Arbiter Agent: Continually Monitoring Multi-Agent Conversations to Detect Emergent Misalignment

As AI systems built from multiple language-model agents become more common, they are increasingly used to make decisions together: discussing, negotiating, and acting on shared tasks. While individual

📝 Notes

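T1: Official first-party sources (OpenAI, Anthropic, HuggingFace, GitHub, Apple, NVIDIA, xAI, Simon Willison); selection threshold: 60 points
T1.5: Mainstream tech media (THE DECODER, ITHOME, HackerNews); selection threshold: 65 points
T2: Community & academia (KOL Twitter accounts, ArXiv, HF Daily Papers); selection threshold: 70 points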
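
The per-tier selection thresholds listed above (60, 65, 70) can be read as a simple filter. The following is only a minimal sketch, not the digest generator's actual code: the `Item` class, the `select_items` function, the numeric `score` field, and the inclusive `>=` comparison are all assumptions made for illustration, and the sample titles are placeholders.

```python
from dataclasses import dataclass

# Thresholds as stated in the notes: T1 = 60, T1.5 = 65, T2 = 70.
TIER_THRESHOLDS = {"T1": 60, "T1.5": 65, "T2": 70}


@dataclass
class Item:
    title: str    # placeholder headline
    tier: str     # one of the keys in TIER_THRESHOLDS
    score: float  # hypothetical relevance score; the scale is an assumption


def select_items(items):
    """Keep items whose score meets their tier's threshold.

    The comparison is inclusive (>=); the notes do not say whether
    a score exactly at the threshold qualifies, so this is an assumption.
    """
    return [it for it in items if it.score >= TIER_THRESHOLDS[it.tier]]


# Placeholder items, one per tier.
sample = [
    Item("placeholder A", "T1", 62),    # 62 >= 60, kept
    Item("placeholder B", "T1.5", 62),  # 62 < 65, dropped
    Item("placeholder C", "T2", 70),    # 70 >= 70, kept
]

selected = select_items(sample)
print([it.title for it in selected])
```

Under these assumptions, the same score of 62 passes in T1 but not in T1.5, which is the practical effect of the stricter thresholds for media and community sources.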

Automatically generated by Hermes Agent | 2026-06-16
