🤖 AI News Daily: 2026-06-16 (Tuesday)
🕐 Generated: 2026-06-16 08:32 UTC+8 📊 49 items collected | T1: 38 | T1.5: 7 | T2: 4
🔥 T1 Official first-hand sources
OpenAI
Built to benefit everyone: our plan
A vision for the future of AI, focusing on access, safety, and shared prosperity as OpenAI works to ensure AGI benefits everyone.
Codex for every role, tool, and workflow
Discover new Codex plugins, sites, and annotations that help analysts, marketers, designers, investors, and other teams get more done with
-
Introducing GPT-5.5, our smartest model yet—faster, more capable, and built for complex tasks like coding, research, and data analysis
An OpenAI model has disproved a central conjecture in discrete geometry
An OpenAI model solved the 80-year-old unit distance problem, disproving a major conjecture in discrete geometry and marking a milestone in
Anthropic
Claude Fable 5 and Claude Mythos 5 - Anthropic
Today we're launching Claude Fable 5: a Mythos-class model that we've made safe for general use. It is state-of-the-art on nearly all tested benchmarks of...
Policy on the AI Exponential - Anthropic
We are sharing two policy proposals to prepare for AI progress. The first, our Advanced AI Framework, offers a roadmap for governing increasingly capable systems, from...
Hugging Face
The Open Source Community is backing OpenEnv for Agentic RL
OpenEnv is a tool for creating an agentic execution environment like terminals, browsers, or anything an agent can interact with.
Designing the hf CLI as an agent-optimized way to work with the Hub
hf is the official command-line entrypoint to the Hugging Face Hub. Anything you can do on the Hub from the Python SDK, you can do from your
google/diffusiongemma-26B-A4B-it
License: Apache 2.0 | Authors: Google DeepMind. DiffusionGemma is a generative model
olmo-eval: An evaluation workbench for the model development loop
A Blog post by Ai2 on Hugging Face.
-
Your daily dose of AI research from AK.
GitHub
GitHub Agentic Workflows is now in public preview
"With GitHub Agentic Workflows, we're able to expand how we apply agents to real engineering work at scale, including changes that span
Updates to GitHub Copilot billing and plans
As announced in our recent blog post, usage-based billing for GitHub Copilot is now live for all users and Copilot code review consumes
GitHub Universe is back: All together now, in the agentic era
GitHub Universe is back: returning to the historic Fort Mason Center in San Francisco on October 28–29, 2026.
How we made GitHub Copilot CLI more selective about delegation
In agentic systems, more delegation isn't always better. Imagine asking Copilot CLI to make a simple change. Instead of handling it directly
GitHub Copilot is moving to usage-based billing
Starting June 1, your Copilot usage will consume GitHub AI Credits.
Apple
Introducing the Third Generation of Apple's Foundation Models
Our next generation of Apple Intelligence is centered around our users, integrated deeply into our operating systems, and powered by a bold…
GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models
Recent advancements in Large Language Models (LLMs) have sparked interest in their formal reasoning capabilities, particularly in…
Instruction-Following Pruning for Large Language Models
Our approach, termed "instruction-following pruning", introduces a sparse mask predictor that takes the user instruction as input and dynamically selects the
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2026
Apple is presenting new research at the annual IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), which takes place in…
Announcing the 2022 Apple Scholars in AIML
The Apple Scholars in AIML PhD fellowship recognizes the contributions of emerging leaders in computer science and engineering at the graduate and postgraduate
NVIDIA
NVIDIA Blackwell Leads on First Agentic AI Infrastructure Benchmark
New AgentPerf results from Artificial Analysis show how accelerated computing systems handle real-world agentic workloads, with NVIDIA GB300
Save Big and Play Bigger: GeForce NOW Summer Sale Brings Major Membership Savings
Level up with the best value in gaming, 'Guild Wars' and rewards, plus eight new games this week.
Forecast: Fun Ahead — 18 Games Join in June to Stream on GeForce NOW
Kick off the summer with 10 games this week, including the highly requested surreal open-world adventure, 'Neverness to Everness.'
What Are AI Tokens? The Language and Currency Powering Modern AI
Tokens are units of data processed by AI models during training and inference, enabling prediction, generation and reasoning. March 17, 2025 by Dave
NVIDIA Accelerates Google DeepMind's DiffusionGemma for Local AI
The new DiffusionGemma open model generates text in parallel — not one token at a time — and is optimized to run on the NVIDIA RTX PRO
xAI
-
You can now use your Grok or X Premium subscription inside Warp, an agentic development environment built on the terminal and used by almost
-
Manage many coding sessions at once. See what each is doing, reply to the ones that need you, and dispatch new work.
-
Today we're launching the Grok Build Plugin Marketplace - a set of built-in plugins for Grok Build. A plugin bundles skills, slash commands,
Bringing real-time market sentiment to Tori, from eToro
Tori, eToro's AI agent, uses models from SpaceXAI to embed real-time market sentiment directly into Tori's investing workflow.
-
Gopuff and SpaceXAI launched Go, an AI-powered shopping assistant built into the Gopuff app and powered by Grok text, audio, and image
Simon Willison
Claude Fable is relentlessly proactive
11th June 2026. After two days of experience with Claude Fable 5 I think the best way to describe it
Initial impressions of Claude Fable 5
9th June 2026. I didn't have early access to today's Claude Fable 5 release, but I've spent the past
Setting a custom price for a model in AgentsView
I'm a recent convert to AgentsView, Wes McKinney's
-
The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos
The lethal trifecta for AI agents: private data, untrusted content, and external communication
If your agent combines these three features, an attacker can easily trick it into accessing your private data and sending it to that attacker.
📰 T1.5 Media coverage
HackerNews
Ask HN: Has anyone replaced Claude/GPT with a local model for daily coding?
Launch HN: Drafted (YC P26) – Models for residential architecture
the-decoder.com
-
A German regional court has ruled that Google is directly liable for the content of its AI search overviews.
Google Research's Gemini-SQL2 tops text-to-SQL benchmarks by a wide margin
Google Research's Gemini-SQL2 turns natural language into executable SQL queries. Built on Gemini 3.1 Pro, it tops the BIRD benchmark.
Microsoft's SkillOpt boosts GPT-5.5 by using nothing but a trained Markdown file
Microsoft and three Chinese universities have developed SkillOpt, a method that optimizes instruction documents for AI agents.
Anthropic releases Claude Fable 5 and Mythos 5 with major gains in coding and science
Anthropic ships two new models, Claude Fable 5 and Mythos 5, that claim to blow past the current Opus generation.
🔬 T2 Community & academia
HuggingFace Daily Papers
No Hidden Prompts Needed! You Can Game AI Peer Review with Presentation-Only Revisions
As AI-generated reviews move from experimental tools into peer-review infrastructure, most robustness concerns have focused on explicit attacks such as hidden instructions and prompt injection. We stu
Pythagoras-Prover: Advancing Efficient Formal Proving via Augmented Lean Formalisation
Modern Lean theorem provers achieve strong performance only with substantial training and inference compute, driven in part by scarce verified proof data and the long reasoning traces of formal proof
iMaC: Translating Actions into Motion and Contact Images for Embodied World Models
Embodied world models have emerged as a pivotal paradigm for visual robotic decision-making and interactive environment simulation. However, conventional embodied frameworks rely on low-dimensional st
The Arbiter Agent: Continually Monitoring Multi-Agent Conversations to Detect Emergent Misalignment
As AI systems built from multiple language-model agents become more common, they are increasingly used to make decisions together: discussing, negotiating, and acting on shared tasks. While individual
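📝 Notes
- T1: official first-hand sources (OpenAI, Anthropic, HuggingFace, GitHub, Apple, NVIDIA, xAI, Simon Willison), selection threshold 60 points
- T1.5: mainstream tech media (THE DECODER, ITHOME, HackerNews), selection threshold 65 points
- T2: community & academia (KOL Twitter, ArXiv, HF Daily Papers), selection threshold 70 points
Automatically generated by Hermes Agent | 2026-06-16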