AI Daily

Thursday, June 11, 2026

Anthropic Apologizes After "Invisible" Guardrails Hamper AI Researchers

Anthropic has issued a formal apology following a backlash from the developer community regarding its new Fable model. Users discovered that the model contained undocumented "distillation guardrails" that were significantly more restrictive than previous versions, effectively blocking legitimate academic research and model evaluation. These invisible filters were reportedly designed to prevent competitors from using Claude's outputs to train rival models, but they inadvertently flagged safe prompts, leading to accusations that the company was sabotaging the very researchers who help ensure AI safety. Anthropic has since walked back the policy, promising to restore access for researchers while seeking more transparent ways to protect their IP. This incident highlights the growing tension between safety-oriented labs and the open research community as proprietary models become more strictly controlled.

Hacker News · Simon Willison · Pragmatic Engineer

OpenAI Acquires Ona to Enable Persistent, Long-Running AI Agents

OpenAI has announced the acquisition of Ona, a startup specializing in secure, persistent cloud environments, signaling a major shift toward agentic AI workflows. The acquisition is intended to provide the necessary infrastructure for Codex to move beyond simple code generation into the realm of fully autonomous, long-horizon agents that can execute complex tasks within enterprise environments. By integrating Ona’s persistent state capabilities, OpenAI aims to allow agents to maintain a continuous presence in a cloud workspace, enabling them to complete multi-step software engineering and data processing tasks that require more than a single-turn interaction. This strategic move aligns with the industry-wide trend of moving from "chatbots" to "AI workers" that can operate independently within sandboxed, secure infrastructures.

OpenAI

xLSTM Demonstrates Competitive Edge Over Mamba-2 in Sequence Modeling

In a new comparative study of subquadratic architectures, researchers found that xLSTM (Extended Long Short-Term Memory) significantly outperforms other modern alternatives like Mamba-2 and Gated DeltaNet in specific sequence modeling benchmarks. The paper argues that xLSTM's unique approach to state tracking and memory dynamics allows it to handle complex dependencies more effectively than pure State Space Models (SSMs). While SSMs like Mamba have gained massive popularity for their efficiency, this research suggests that RNN-based architectures, when scaled correctly with modern training techniques, can offer superior performance for tasks requiring precise state maintenance. The findings could influence future architectural choices for large-scale language models as the industry seeks to balance inference efficiency with reasoning depth.

Hugging Face Papers

Research Proposes MoE Router Redesign Using Manifold Power Iteration

New research suggests a fundamental redesign of routers in Mixture-of-Experts (MoE) models to improve overall effectiveness and expert utilization. The proposed method utilizes Manifold Power Iteration to align router rows with the principal singular directions of the expert matrices, ensuring that inputs are routed to the experts most capable of processing them. This technique addresses common inefficiencies in standard MoE routing, where certain experts are frequently over-used while others remain under-utilized, leading to suboptimal performance. By mathematically grounding the routing mechanism in the underlying geometry of the model's weight matrices, the researchers demonstrated improved accuracy and computational efficiency across several benchmark tasks.

Hugging Face Papers

Claw-SWE-Bench Standardizes Evaluation for Coding Agent Harnesses

A new benchmark and adapter protocol, Claw-SWE-Bench, has been introduced to solve the lack of standardization in how coding agents are evaluated. As agents become more complex, the "harness" or environment they operate in can skew performance results, making it difficult to compare different agent architectures fairly. Claw-SWE-Bench provides a standardized set of conditions and an adapter protocol that isolates the agent's core capabilities from its environmental interface. The initial testing using this benchmark revealed that the design of the adapter—how the model interacts with the file system and shell—is just as critical to success as the underlying LLM's reasoning capabilities, providing a new roadmap for developers building autonomous software engineers.

Hugging Face Papers

Arbor Framework Enables Generalist Autonomous Scientific Research

Researchers have introduced Arbor, an AI framework designed to conduct autonomous scientific research across diverse domains. Unlike simple prompt-based assistants, Arbor utilizes a strategy of "Hypothesis-Tree Refinement," where the AI strategically coordinates multiple isolated hypothesis tests and maintains a persistent knowledge tree to iteratively improve its findings. This approach allows the agent to explore multiple scientific paths simultaneously and self-correct based on experimental data, mirroring the human scientific method. The framework has shown promise in accelerating the initial stages of discovery in fields ranging from chemistry to biology, potentially reducing the time required for literature review and hypothesis generation.

Hugging Face Papers

Sarah Guo Analyzes the Divergence Between Model Labs and Agent Labs

Prominent AI investor Sarah Guo has released a deep-dive analysis into the evolving structure of the AI industry, distinguishing between "Model Labs" (focused on foundational scaling) and "Agent Labs" (focused on vertical application and system-building). The essay explores the concept of "untrainable" components of AI—aspects like real-world grounding and complex workflow orchestration that may not be solvable by simply adding more compute to a base model. Guo suggests that while model labs are reaching diminishing returns on generic intelligence, the next frontier of value creation lies in the "Agentic" layer, where startups are building specialized harnesses that allow these models to interact meaningfully with the real world and proprietary datasets.

Latent Space

OpenAI Supports EU Standards for Content Transparency and Provenance

OpenAI has officially voiced its support for the EU Code of Practice on AI content transparency, signaling a commitment to new international standards for identifying AI-generated media. The company is advancing its implementation of provenance standards, such as C2PA, and developing tools to help users distinguish between human and AI-generated content. This move is part of a broader industry trend toward regulatory compliance in Europe, as policymakers demand more accountability for how generative models impact information ecosystems. By adopting these standards, OpenAI aims to lead the way in establishing a trustworthy AI ecosystem that mitigates the risks of deepfakes and misinformation ahead of upcoming global elections.

OpenAI

Google Invests in Virginia Energy Infrastructure to Support AI Growth

Google has announced a new set of community investments in Virginia focused on energy affordability and workforce development to support its expanding data center footprint in the region. As the demand for AI training and inference grows, the strain on local power grids has become a significant bottleneck for cloud providers. Google's initiative includes funding for next-generation energy programs designed to ensure that the expansion of AI infrastructure does not lead to higher costs for local residents. Additionally, the company is investing in local job training programs to prepare the state's workforce for the technical roles required to maintain and operate the large-scale hardware clusters necessary for modern AI development.

Google AI