🤖 AI 资讯日报 — 2026-06-28（周日）

数据采集时间：2026-06-28 08:33 UTC+8

来源：OpenAI · Anthropic · HuggingFace · GitHub · Apple · NVIDIA · xAI · Simon Willison · THE DECODER · ITHOME · HackerNews · ArXiv · HF Daily Papers

🏆 T1 官方一手（最高权重）

📌 OpenAI Blog

Mastering Codex Remote for engineering

Codex blog posts

Designing delightful frontends with GPT-5.4

Testing Agent Skills Systematically with Evals

Blog

📌 Anthropic Research

Anthropic Economic Index report: Cadences

Project Glasswing: An initial update

An update on our model deprecation commitments for Claude Opus 3

Anthropic Education Report: The AI Fluency Index

Alignment faking in large language models

📌 GitHub Blog

MAI-Code-1-Flash for Copilot Business and Copilot Enterprise

Transitioning as a Hubber

Evaluating performance and efficiency of the GitHub Copilot agentic harness across models and tasks

I automated my job (and it made me a better leader)

How we built an internal data analytics agent

📌 Apple ML

Introducing the Third Generation of Apple's Foundation Models

Announcing the 2022 Apple Scholars in AIML

The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity

Visatronic: A Multimodal Decoder-Only Model for Speech Synthesis

Evaluating Evaluation Metrics — The Mirage of Hallucination Detection

📌 NVIDIA Blog

The Ultimate Summer Sale Pairing: Steam Sale Meets GeForce NOW Discounts

Hotter Than a Hot Tub: The 45°C Breakthrough to Cool AI's Biggest Machines

How Businesses Are Building Specialized AI They Can Trust

Eco Wave Power Turns Waves Into Watts With NVIDIA AI Infrastructure and Digital Twins

Hands Free, AIs Forward: NVIDIA XR AI Brings Agents to AR Glasses

📌 xAI

Explore the markets with Interactive Brokers and Grok

Introducing /goal

Grok for Word

New Compute Partnership with Anthropic

Grok Imagine Video 1.5

📌 Simon Willison

What happened after 2,000 people tried to hack my AI assistant

A quote from Dean W. Ball

AI and Liability

Claude Fable is relentlessly proactive

A quote from Tom MacWright

📌 Hugging Face Blog

Run a vLLM Server on HF Jobs in One Command

Accelerating Transformers Fine-Tuning with NVIDIA NeMo AutoModel

PP-OCRv6 on Hugging Face: 50-Language OCR from 1.5M to 34.5M Parameters

Beyond LoRA: Can you beat the most popular fine-tuning technique?

Introducing North Mini Code: Cohere's First Model For Developers

📰 T1.5 媒体 + 社区

Landmark German ruling declares Google's AI Overviews are Google's own words and makes it liable for false answers [THE DECODER]

Microsoft researcher builds a working neural network out of goats in Age of Empires II to critique AI science [THE DECODER]

OpenAI's GPT 5.6 rollout now requires US government approval on a customer by customer basis [THE DECODER]

AI is inflating student grades, and the effect points to outsourced work, not better learning [THE DECODER]

NAIRR Science Program Reshapes Scientific Research, Powered by NVIDIA AI Infrastructure [ITHOME]

NVIDIA Announces BioNeMo Agent Toolkit — Tools for Agents to Accelerate Scientific Discovery [ITHOME]

Ask HN: Is it time to fork HN into AI/LLM and "Everything else/other?" [HackerNews ⭐553]

Coconut by Meta AI – Better LLM Reasoning with Chain of Continuous Thought? [HackerNews ⭐362]

Show HN: Countless.dev – A website to compare every AI model: LLMs, TTSs, STTs [HackerNews ⭐361]

Taste in the age of AI and LLMs [HackerNews ⭐265]

Wikimedia Enterprise – APIs for LLMs, AI Training, and More [HackerNews ⭐222]

Anti-AI Hype LLM Reading List [HackerNews ⭐208]

Jellyfin LLM/"AI" Development Policy [HackerNews ⭐207]

Ask HN: Go deep into AI/LLMs or just use them as tools? [HackerNews ⭐195]

Advent of Code 2023's new AI/LLM Policy [HackerNews ⭐174]

🔬 T2 学术 + 社交

📄 HF Daily Papers（热门论文）

Are We Ready For An Agent-Native Memory System? 👍104

Memory for large language model (LLM) agents has rapidly evolved from simple retrieval-augmented mechanisms into a data management system that support

DanceOPD: On-Policy Generative Field Distillation 👍64

Modern image generation demands a single model that unifies diverse capabilities, including text-to-image (T2I), local editing, and global editing. Ho

DomainShuttle: Freeform Open Domain Subject-driven Text-to-video Generation 👍62

Open domain subject-driven text-to-video (S2V) generation has drawn significant interest in academia and industry. Open domain S2V mainly involves two

ShutterMuse: Capture-Time Photography Guidance with MLLMs 👍44

Real-world photography requires capture-time guidance for both camera framing and subject pose. Yet existing aesthetic cropping benchmarks mainly eval

In-Context World Modeling for Robotic Control 👍42

Modern Vision-Language-Action (VLA) models often fail to generalize to novel setups, such as altered camera viewpoints or robot morphologies, because

OPID: On-Policy Skill Distillation for Agentic Reinforcement Learning 👍40

Outcome-based reinforcement learning provides a stable optimization backbone for language agents, but its sparse trajectory-level rewards provide litt

Qwen-Image-Agent: Bridging the Context Gap in Real-World Image Generation 👍40

While text-to-image (T2I) models have achieved remarkable progress, they struggle with real-world requests that are often underspecified, implicit, or

The Verification Horizon: No Silver Bullet for Coding Agent Rewards 👍38

A classical intuition holds that verifying a solution is easier than producing one. For today's coding agents, this intuition is being inverted: as fo

ViQ: Text-Aligned Visual Quantized Representations at Any Resolution 👍37

A unified representation for text and vision is a natural pursuit, as it enables simpler multimodal modeling and more efficient training. However, rep

MVTrack4Gen: Multi-View Point Tracking as Geometric Supervision for 4D Video Generation 👍34

Synthesizing a novel-view video from a monocular reference video along a target camera trajectory requires both geometric consistency and motion fidel

📑 ArXiv 论文

Artificial Intelligence - arXiv recent submissions

Attention Is All You Need (Transformer)

Language Models are Few-Shot Learners (GPT-3)

Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection

Geospatial Representation Learning: A Survey from Deep Learning to The LLM Era

Act Only When It Pays: Efficient Reinforcement Learning for LLM Reasoning via Selective Rollouts

AI and Supercomputing are Powering the Next Wave of Breakthrough Science – But at What Cost?

A Comprehensive Review of AI Agents: Transforming Possibilities in Technology and Beyond

🐦 KOL 观点

OpenAI, Anthropic, Google DeepMind jointly paper on LLM safety fragility

OpenAI, Anthropic, Google DeepMind killed product design interview rounds

Jack Clark built Anthropic to compete with OpenAI in centralized AI

Ethan Perez - Anthropic Inverse Scaling Prize $100k

Google AI Researchers Depart for Anthropic Amid Talent Shifts

Beyond the Agent Hype: Anthropic testing agentic security workflows

Harper Carroll: AI Reasoning & Reasoning Models Explained

Security Challenges in AI Agent Deployment

📊 本期统计：T1 40 条 · T1.5 21 条 · T2 70 条 · 合计 131 条

>

由 Hermes Agent 自动生成 · 2026-06-28