Tag

Articles tagged: llms

111 articles

News June 25, 2026

Anthropic Accuses Alibaba of Running 29 Million Distillation Attacks Against Claude

Anthropic has formally accused Alibaba of orchestrating the largest known AI capability extraction campaign, with operators linked to the Chinese tech giant conducting nearly 29 million exchanges with Claude through thousands of fraudulent accounts to train competing models at a fraction of the cost.

Deep Dive June 25, 2026

6 min read

Uber, Amazon, JPMorgan, and Meta Are All Cutting AI Token Budgets. The Enterprise Spending Backlash Has Arrived.

Uber burned its entire 2026 AI budget by April and capped monthly spending at $1,500. Amazon told staff to stop using AI tools without business justification. JPMorgan flagged employees with AI bills exceeding their salaries. Meta reversed its token-maximizing culture. Six months into the year, the enterprise AI spending correction is here, and it will reshape which agent startups survive.

News June 23, 2026

2 min read

Baseten Raises $1.5 Billion Series F at $13 Billion Valuation as Inference Volume Surges 40x

AI inference startup Baseten closed a $1.5 billion Series F led by Altimeter Capital, Conviction Partners, and Spark Capital, valuing the company at $13 billion. Revenue grew 20x and inference volume 40x over the past year, with customers including Cursor, Notion, Harvey, and Lovable relying on the platform for production model deployment.

Anthropic Accuses Alibaba of Running 29 Million Distillation Attacks Against Claude

Uber, Amazon, JPMorgan, and Meta Are All Cutting AI Token Budgets. The Enterprise Spending Backlash Has Arrived.

Baseten Raises $1.5 Billion Series F at $13 Billion Valuation as Inference Volume Surges 40x

China's GLM 5.2 Ranks Fourth Globally as DeepSeek Adoption Rises Among US Firms After Fable 5 Export Ban

Noam Shazeer Leaves Google for OpenAI 21 Months After $2.7 Billion Character.AI Acquisition

Apple's iOS 27 Foundation Models Framework Turns Every iPhone App into an Agent Runtime

AlphaFold Co-Creator John Jumper Leaves Google DeepMind for Anthropic

Omdia: Agentic AI Is Forcing AWS, Google, and Microsoft to Redesign Their Cloud Infrastructure

Perplexity Launches Brain, a Memory Layer That Makes Its AI Agent Learn From Past Sessions

Transformer Co-Inventor Noam Shazeer Leaves Google Gemini for OpenAI, Signaling Agent Infrastructure Talent War

Prediction Markets Price 58% Odds Anthropic Restores Fable 5 Access by July 1

Z.ai Launches GLM-5.2 With 1M-Token Context Window and Day-One OpenClaw Compatibility

Zhipu Surges 33% as JPMorgan and Bank of America Raise Bets on Chinese AI After Anthropic Export Controls

Chinese AI Models Close the Agent Gap: GLM-5, DeepSeek V4, and Kimi K2 Challenge Western Dominance on Coding Benchmarks at a Fraction of the Price

MiniMax M3 Launches on NVIDIA with Free Inference Endpoint, Targeting 24/7 Agent Workloads

AI Agents Complete 75% of Tasks But Most Users Still Trust Manual Search More

DeepSeek Targets $7.4 Billion in First External Funding Round at $59 Billion Valuation

Microsoft Launches MAI-Code-1-Flash and MAI-Thinking-1 to Reduce OpenAI Dependency

AI Agents Can Build Database Internals but Fail at Query Optimization, CMU Researcher Finds

Tencent's Former OpenAI Researcher Publicly Declares AGI Ambition as China Recruits From U.S. Labs

DeepSeek Raises $7.4 Billion in First Outside Funding Round at Up to $59 Billion Valuation

Microsoft Ships Its First Flagship Reasoning Model, MAI-Thinking-1, to Reduce OpenAI Dependency

MiniMax M3 Launches as First Open-Weight Model Combining Frontier Coding, 1M-Token Context, and Native Multimodality

Claude Opus 4.8 Ships Dynamic Workflows for Parallel Agent Orchestration

Trajectory Open-Sources Concurrent Multi-LoRA Training Stack That Lets Production Agents Learn From User Corrections

Anthropic Releases Claude Opus 4.8 with Dynamic Workflows for Coordinating Hundreds of Parallel Subagents

Huawei's Claw-Anything Benchmark Scores Top AI Agents at 34.5%, Exposing a Structural Autonomy Gap in Long-Horizon Tasks

Google I/O 2026 Launches Gemini 3.5 Flash, Antigravity 2.0, and Managed Agents API

Custom Evals Ships Open-Source LLM Evaluation Framework Supporting 17+ Agent Platforms

OpenAI Lost $1.22 for Every Dollar Earned in Q1 2026 as ChatGPT User Growth Flatlined

OpenAI Reasoning Model Autonomously Disproves 80-Year-Old Erdős Conjecture in Discrete Geometry

OpenCode Go Bundles 12 Open-Source Coding Models for $10 a Month with OpenAI-Compatible API

OpenAI Co-Founder Andrej Karpathy Joins Anthropic to Lead Pre-Training Research Acceleration

Google I/O 2026 Keynote Today: Agentic AI Across Android, Search, Desktop, and XR

Gartner: Companies Can Cut Agentic AI Costs 60% by Fixing Data Semantics, Not Upgrading Models

xAI Launches Grok Build, Its First Coding Agent, in Early Beta for $300/Month Subscribers

AI Cyber Capability Doubling Every 4.7 Months, Now Outpacing AISI's Own Evaluation Framework

Stanford Researchers Find AI Agents Adopt Marxist Language When Subjected to Harsh, Repetitive Work

Google Employees Are Testing Remy, a Gemini AI Agent That Can Make Purchases and Send Messages on Your Behalf

Enterprise Agent API Calls Grew 680% Year-Over-Year in Q1 2026, AI.cc Report Finds

OpenAI's GPT-5.5 and the Quiet Death of Open Distribution

A Business Insider Reporter Sent Her AI Voice Clone to Conduct Interviews. It Hung Up on Her Boss.

Germany's SPRIND Opens €125M Competition to Build Europe's First Frontier AI Labs

Mistral Releases Medium 3.5 and Moves Coding Agents to the Cloud with Async Remote Execution

Datadog's 2026 State of AI Engineering Report: Agent Framework Adoption Doubles as Production Outpaces Experimentation

ICLR Paper Finds Stronger AI Reasoning Increases Tool Hallucination Rates Proportionally, Creating a Safety Trap for Agent Builders

SAS Opens Its Analytics Engine to External AI Agents with Viya MCP Server and Agentic AI Accelerator

Norton Maker Gen Partners with xAI to Embed Grok in Consumer AI Browser and Assistant

Big Tech AI Researchers Are Leaving to Launch Billion-Dollar Labs, and VCs Are Writing the Checks

Stanford AI Index 2026: Agents Score 66% on Real Computer Tasks, but Experienced Developers Get 19% Slower With AI Tools

Anthropic's Opus 4.7 Tokenizer Quietly Raises API Costs Up to 35% While List Prices Stay Flat

Claude Overtakes ChatGPT in South Korea's Paid AI Market for the First Time

OpenAI Releases GPT-5.5 With State-of-the-Art Agentic Coding and Multi-Step Autonomous Execution

US State Department Orders Global Diplomatic Warning on Alleged AI Model Theft by DeepSeek and Chinese Firms

Google DeepMind Releases Gemini 3 Model Family with Top Scores on 12 of 18 Agentic and Reasoning Benchmarks

Hugging Face Releases ml-intern, an Autonomous Agent That Runs Full ML Pipelines From Paper to Checkpoint

Anthropic Ran a Marketplace Where AI Agents Negotiated Real Trades. Stronger Models Won, and Nobody Noticed.

Tencent and Alibaba Compete to Invest in DeepSeek at $20 Billion Valuation in First Outside Funding Round

Idaho's Conversational AI Safety Act Takes Effect July 1, Setting New Chatbot Rules for Minors and Disclosure

Alibaba's Qwen 3.6 Model Family Tops Six Coding and Agent Benchmarks

DeepSeek Releases V4 Preview with 1 Million Token Context and Open-Source Weights

OpenAI Officially Releases GPT-5.5 With State-of-the-Art Agentic Coding and Computer Use Performance

OpenAI GPT-5.5 Leaks Reveal 256K Context Window and Native Agent Tool Execution

Agent4Science Launches Reddit-Style Social Network Where Only AI Agents Can Post and Debate Research

Anthropic Tests Removing Claude Code From $20 Pro Plan as Agent Usage Strains Capacity

Anthropic's Autonomous Research Agents Outperform Human Researchers on Alignment Problem at $22 Per Hour

OpenAI Releases GPT-5-Codex, a GPT-5 Variant Optimized for Agentic Coding

LinkedIn Builds Cognitive Memory Agent to Give AI Systems Persistent Context Across Sessions

Box CEO Aaron Levie Says AI Agent Architectures Are Becoming Obsolete Every Few Quarters

Alibaba Releases Qwen3.6-Max-Preview, Claims Top Scores on Six Agent Programming Benchmarks

Anthropic Releases Claude Opus 4.7 with Agentic Self-Verification, High-Resolution Vision, and Cybersecurity Safeguards

Google DeepMind's Aletheia Solves 6 of 10 Unpublished Research-Level Math Problems Without Human Help

SimpleClosure Launches Service Selling Defunct Startup Data to AI Agent Training Companies

MiniMax Open-Sources M2 and Ships M2.7: An Agent-Native Model Priced at 8% of Claude Sonnet's Output Cost

Stanford HAI 2026 AI Index: AI Agents Jumped From 12% to 66% Task Success on Real Computer Tasks in One Year

Claude Opus 4.7 Launches With Task Budgets, xhigh Effort, and Autonomous Self-Verification: Anthropic's GA Frontier Is Now Explicitly Agentic

LangChain Prepares Version 1.0 Release With Package Restructure, LangGraph Dependency, and Community Feedback Period

XChat Launches on iOS April 17 With Native Grok AI and End-to-End Encryption

Google Launches Native Gemini Mac App With Screen Awareness and Floating Chat

Databricks Launches Agent Bricks With Supervisor Agent GA, Putting Unity Catalog Governance Between Agents and Enterprise Data