🤖 AI News Daily - 2026-06-29

Data collected: 2026-06-29 08:32 UTC+8

90 items collected

📡 T1 Official first-party sources

Simon Willison

Datasette Apps: Host custom HTML applications inside Datasette ⭐0.98585
Today we launched a new plugin for Datasette, datasette-apps.

Apple

Machine Learning ⭐0.98545
At Apple, we believe privacy is a fundamental human right.

xAI

Explore the markets with Interactive Brokers and Grok ⭐0.98543
Interactive Brokers now integrates with Grok, bringing powerful AI directly into your trading experience.

Hugging Face

Accelerating Transformers Fine-Tuning with NVIDIA NeMo AutoModel ⭐0.98535
HuggingFace Transformers has become the foundation of the open-source AI ecosystem, and the recent Transformers v5 release strengthened it

OpenAI

Mastering Codex Remote for engineering ⭐0.98513
Turn your phone into a Codex control center for starting, steering, reviewing, and organizing engineering work.

NVIDIA

The Ultimate Summer Sale Pairing: Steam Sale Meets GeForce NOW Discounts ⭐0.98509
Big summer deals arrive along with new games joining the platform.

GitHub

Evaluating performance and efficiency of the GitHub Copilot agentic harness across models and tasks ⭐0.98503
Explore how the GitHub Copilot agentic harness delivers strong results across multiple benchmarks and leading token efficiency.

OpenAI

Codex blog posts ⭐0.9846

Hugging Face

PP-OCRv6 on Hugging Face: 50-Language OCR from 1.5M to 34.5M Parameters ⭐0.98258
PP-OCRv6 is the latest generation of PaddleOCR's universal OCR model family.

Apple

Introducing the Third Generation of Apple's Foundation Models ⭐0.98222
Our next generation of Apple Intelligence is centered around our users, integrated deeply into our operating systems.

Simon Willison

Simon Willison's Weblog ⭐0.98219
This is a bad state of affairs. Consider, in particular, some industry dynamics: Frontier models are trained at an enormous cost.

GitHub

MAI-Code-1-Flash for Copilot Business and Copilot Enterprise ⭐0.98214
MAI-Code-1-Flash, Microsoft AI's in-house coding model, is now generally available for GitHub Copilot Business and Copilot Enterprise.

xAI

Introducing /goal ⭐0.98196
Use /goal for long-running autonomous task execution in Grok Build.

NVIDIA

Hotter Than a Hot Tub: The 45°C Breakthrough to Cool AI's Biggest Machines ⭐0.98128
NVIDIA's latest AI servers can run on coolant warmer than a hot tub.

Apple

Announcing the 2022 Apple Scholars in AIML ⭐0.98074
The Apple Scholars in AIML PhD fellowship recognizes the contributions of emerging leaders.

GitHub

I automated my job (and it made me a better leader) ⭐0.98033
Explore how my day as a senior leader looks now that I use 40 automations to help.

NVIDIA

How Businesses Are Building Specialized AI They Can Trust ⭐0.98032
NVIDIA Agent Toolkit provides an open, modular foundation for building safer AI.

Hugging Face

Introducing North Mini Code: Cohere's First Model For Developers ⭐0.97861
A Blog post by Cohere Labs on Hugging Face.

Simon Willison

What happened after 2,000 people tried to hack my AI assistant ⭐0.9785

xAI

New Compute Partnership with Anthropic ⭐0.97678
SpaceXAI has signed an agreement with Anthropic to provide access to Colossus 1.

OpenAI

Designing delightful frontends with GPT-5.4 ⭐0.97609
Practical techniques for steering GPT-5.4 toward polished, production-ready frontend designs.

Simon Willison

Porting the Moebius 0.2B image inpainting model to run in the browser with Claude Code ⭐0.97577

xAI

Grok for Word ⭐0.97575
Use the Grok add-in for Microsoft Word to turn notes into documents.

Hugging Face

Run a vLLM Server on HF Jobs in One Command ⭐0.97496
You can spin up a private, OpenAI-compatible LLM endpoint on Hugging Face infrastructure with a single command.

Apple

The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity ⭐0.97459
Recent generations of frontier language models have introduced Large Reasoning Models (LRMs).

GitHub

Transitioning as a Hubber ⭐0.97392
When I joined GitHub, my legal name was Ursula—but my handle was gleeblezoid.

NVIDIA

Eco Wave Power Turns Waves Into Watts With NVIDIA AI Infrastructure and Digital Twins ⭐0.97308
Eco Wave Power is developing technology powered by NVIDIA AI for sustainable energy.

OpenAI

Making private MCP servers reachable without making them public ⭐0.97278
How we preserved private network boundaries while supporting MCP streaming, authentication, and an inspectable client.

Hugging Face

Kog Laneformer 2B: The Latency-First Model Behind Kog Inference Engine ⭐0.97113
A Blog post by Kog on Hugging Face.

Apple

Visatronic: A Multimodal Decoder-Only Model for Speech Synthesis ⭐0.97038
The rapid progress of foundation models and large language models has fueled significant improvement in ML.

Simon Willison

Claude Fable is relentlessly proactive ⭐0.96998
After two days of experience with Claude Fable 5 I think the best way to describe it.

xAI

Grok Imagine Video 1.5 ⭐0.96955
Grok Imagine Video 1.5 is now generally available on the Imagine API.

OpenAI

Testing Agent Skills Systematically with Evals ⭐0.96865
A practical guide to turning agent skills into something you can test, score, and improve over time.

GitHub

GitHub Copilot app: The agent-native desktop experience ⭐0.96865
At Microsoft Build 2026, GitHub introduced new tools, updates, and surfaces so agents can work the way you already work.

NVIDIA

At Cannes Lions, NVIDIA Partners Reshape Advertising and Marketing With AI ⭐0.96754
The digital era gave the advertising industry speed; the AI era is giving it autonomous operations.

Anthropic

Research - Anthropic ⭐0.76385427
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Tracing the thoughts of a large language model - Anthropic ⭐0.74807674
Anthropic's latest interpretability research: a new microscope to understand Claude's internal mechanisms.
Exploring model welfare - Anthropic ⭐0.68420124
Announcing a new research ... We report results from our latest test of whether Claude can help Anthropic employees perform sophisticated robotics
Anthropic Economic Index report: Economic primitives ⭐0.683948
Our latest report documented rising 'directive' use where users delegate tasks entirely.
Interpretability Research - Anthropic ⭐0.6651719

📰 T1.5 Media coverage

HackerNews

GLM 5.2 beats Claude in our benchmarks ⭐364
I used Claude Code to get a second opinion on my MRI ⭐318
Librepods: AirPods liberated ⭐255
Professor denounces mass AI fraud on an exam at Brown ⭐188
Daisugi, the Japanese technique of growing trees out of other trees (2020) ⭐101
Working around dragons with the Lemote Yeeloong laptop and OpenBSD ⭐92
Show HN: Bash4LLM+ – A lightweight, dependency-free Bash wrapper for LLM APIs ⭐31
Knowledge Distillation of Black-Box Large Language Models ⭐30
Model Training as Code ⭐13

ITHOME

NAIRR Science Program Reshapes Scientific Research, Powered by NVIDIA AI Infrastructure ⭐0.98511
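Backed by NVIDIA AI infrastructure, the U.S. National Science Foundation's NAIRR pilot program has advanced more than 700 innovative research projects across multiple fields.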
NVIDIA Announces BioNeMo Agent Toolkit — Tools for Agents to Accelerate Scientific Discovery ⭐0.98104
ITHome, June 23: NVIDIA Announces BioNeMo Agent Toolkit.
NVIDIA Vera Rubin Delivers World-Class Supercomputers for Science ⭐0.97872
ITHome, June 22: NVIDIA Vera Rubin Delivers World-Class Supercomputers for Science. With 7 Exaflops of AI for Science and 5 Petaflops.

THE DECODER

AI and society | The Decoder ⭐0.6711479
The Five Eyes intelligence agencies warn: AI models capable of taking down governments and businesses are only months away.
Artificial Intelligence: News, Business, Science | THE DECODER ⭐0.65173227
Only three AI models finished above starting capital in a 500-day startup survival test. Researchers at Princeton University built CEO-Bench.
Google keeps losing top AI researchers to rivals - The Decoder ⭐0.5889134
Google appears to be hemorrhaging key AI researchers to competitors. Bloomberg reports that Jonas Adler and Alexander Pritzel are planning to leave.

🔬 T2 Community and academia

HF Daily Papers

The Verification Horizon: No Silver Bullet for Coding Agent Rewards ⭐41
A classical intuition holds that verifying a solution is easier than producing one. For today's coding agents, this intuition is being inverted: as foundation models develop stronger reasoning capabil...
JetSpec: Breaking the Scaling Ceiling of Speculative Decoding with Parallel Tree Drafting ⭐31
Speculative decoding (SD) accelerates autoregressive Large Language Models (LLMs) by drafting multiple tokens and verifying them in parallel, but it faces a scaling limitation: increasing the draft bu...
GUI vs. CLI: Execution Bottlenecks in Screen-Only and Skill-Mediated Computer-Use Agents ⭐28
Computer-use agents can execute software tasks through either graphical interfaces or programmatic command interfaces, but existing evaluations confound interaction modality with differences in tasks,...
Running the Gauntlet: Re-evaluating the Capabilities of Agents Beyond Familiar Environments ⭐17
As agentic systems continue to evolve and are widely deployed in real-world scenarios, there is a growing demand to faithfully evaluate their capabilities. However, current benchmarks are typically bu...
Why Multi-Step Tool-Use Reinforcement Learning Collapses and How Supervisory Signals Fix It ⭐16
Tool use enables large language models (LLMs) to perform complex tasks, and recent agentic reinforcement learning (RL) methods show promise for enhancing model capabilities. However, RL alone often le...
LISA: Likelihood Score Alignment for Visual-condition Controllable Generation ⭐13
The prevalent dual-branch paradigm, i.e., training a side network to encode visual conditions and fusing its intermediate-layer features to a frozen pretrained main network, has shown remarkable succe...
Discretizing Reward Models ⭐10
Despite their widespread use, the role of reward models in shaping reinforcement learning is poorly understood. Reward models offer a tempting promise: they automatically estimate response quality in...
Information-Aware KV Cache Compression for Long Reasoning ⭐9
Reasoning capability has advanced rapidly in large language models (LLMs), leading to an increasing size of key-value (KV) cache in both prefilling and decoding stages. Existing KV cache compression m...
Neglected Free Lunch from Post-training: Progress Advantage for LLM Agents ⭐8
Process reward models enable fine-grained, step-level evaluation of LLMs, yet building them for agentic settings remains prohibitively difficult: long-horizon interactions, irreversible actions, and s...
CoffeeBench: Benchmarking Long-Horizon LLM Agents in Heterogeneous Multi-Agent Economies ⭐8
As LLM agents become capable of increasingly long-horizon tasks, evaluating their performance in economic systems is becoming increasingly important. Unlike existing benchmarks that primarily evaluate...
Hallucination in World Models is Predictable and Preventable ⭐8
Modern generative world models render increasingly realistic action-controllable futures, yet they frequently hallucinate: rollouts remain visually fluent while drifting from the ground-truth dynamics...
ABACUS: Adapting Unified Foundation Model for Bridging Image Count Understanding and Generation ⭐4
ABACUS is a unified vision-language model that handles object counting, crowd counting, referring-expression counting, and count-faithful image generation without any benchmark-specific training requi...
When Does Combining Language Models Help? A Co-Failure Ceiling on Routing, Voting, and Mixture-of-Agents Across 67 Frontier Models ⭐3
Multi-model LLM systems such as routing, voting, cascades, fusion, and mixture-of-agents are used to beat single-model accuracy. We show that their gain is capped by a quantity the field rarely report...
How Post-Training Shapes Biological Reasoning Models ⭐3
Scientific reasoning models for biology combine language models with foundation models trained on multimodal biological data, including DNA, RNA, and proteins. These models are built through post-trai...
EO-WM: A Physically Informed World Model for Probabilistic Earth Observation Forecasting ⭐2
Earth Observation (EO) forecasting aims to predict future Earth surface dynamics from satellite observations under changing meteorological conditions. In this paper, we view this task as a partially o...

ArXiv

[[2409.02668] Introduction to Machine Learning](https://arxiv.org/abs/2409.02668) ⭐0.98574
This book introduces the mathematical foundations and techniques that lead to the development and analysis of many of the algorithms that are used in machine
[[2604.15821] Breaking the Training Barrier of Billion-Parameter Universal ML Interatomic Potentials](https://arxiv.org/abs/2604.15821) ⭐0.98369
Deployed across two Exascale supercomputers, our code attains a peak performance of 1.2/1.0 EFLOPS
arXiv.org e-Print archive ⭐0.97881
arXiv is a free distribution service and an open-access archive for nearly 2.4 million scholarly articles
Unlimited OCR Works Welcome the Era of One-shot Long-horizon Parsing ⭐0.97236
Recently, end-to-end OCR models, exemplified by DeepSeek OCR, have once again thrust OCR into the spotlight.
[[2412.17643] Advances in Machine Learning Research Using Knowledge Graphs](https://ar5iv.labs.arxiv.org/html/2412.17643) ⭐0.96717
The study uses CSSCI-indexed literature from the China National Knowledge Infrastructure (CNKI) database.
[[2605.27923] Do We Really Need Quantum Machine Learning?](https://arxiv.org/abs/2605.27923) ⭐0.96649
A feature count of 10 qubits and a sample size in the range of 200-500 emerge as practical operating points.
[[2606.06473] MLEvolve: A Self-Evolving Framework for Automated ML Algorithm Discovery](https://arxiv.org/abs/2606.06473) ⭐0.96211
Large language model (LLM) agents are used for automated machine learning algorithm discovery.
Machine Learning in Biomechanics: Key Applications and Limitations ⭐0.95456
This chapter provides an overview of recent and promising Machine Learning applications in pose estimation, feature estimation, event detection.
Position: The AI and ML Community Should Adopt a More Transparent Peer Review Process ⭐0.94967
This position paper advocates for a more transparent, open, and well-regulated peer review.
[[2504.00709] Science Autonomy using Machine Learning for Astrobiology](https://arxiv.org/abs/2504.00709) ⭐0.94704
In recent decades, artificial intelligence (AI) including machine learning has advanced astrobiology.

KOL Twitter

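[spam title removed] ⭐0.48323447
Real-time posts from X ・ meng@shao__meng: focuses on AI tool adoption and product experience; practical content aimed at ordinary users and tool users. thinkingjimmy@thinkingjimmy ・ suited to readers following tool ...
[spam title removed] ⭐0.44869256
One-line pitch: a personal AI system that lives in your home and grows with you. Oki Home works as soon as you plug it in and scan a code.
[spam title removed] ⭐0.44637465
ByteDance Seed: the official information source of ByteDance's AI research team, useful for tracking frontier model directions.
[spam title removed] ⭐0.42076674
It reveals the two camps in current AI development: on one side, the centralized model led by tech giants; on the other, the decentralized revolution enabled by blockchain.
[spam title removed] ⭐0.41563448
Sahara AI is an AI chain that provides infrastructure for AI development and for turning AI data and models into assets.
[spam title removed] ⭐0.40500212
Sahara AI is an AI chain that provides infrastructure for AI development and for turning AI data and models into assets.
[spam title removed] ⭐0.3983836
This is the key to AI going from a chat tool to infrastructure. n8n: the best choice after testing every platform.
[spam title removed] ⭐0.38131228
Claude Code is currently the strongest AI coding assistant, but every step of signing up, subscribing, and using it hides risk-control pitfalls.
[spam title removed] ⭐0.36772484
Claude Code is currently the strongest AI coding assistant.
[spam title removed] ⭐0.358174
Claude Code is currently the strongest AI coding assistant.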

Auto-generated by Hermes Agent | 2026-06-29