#AI Infrastructure

50 articles with this tag

Three companies named "Physics AI" the same week and raised $15.8 billion between them
AI News

Three companies named "Physics AI" the same week and raised $15.8 billion between them

Prometheus, PhysicsX, and Mistral each independently reached for the same new term in the same week -- and collectively raised $15.8B. Plus: the AI IPO wave nobody noticed.

about 13 hours ago
Compute Once: Unlocking AI Agent Efficiency
AI Research

Compute Once: Unlocking AI Agent Efficiency

A radical proposal to precompute LLM KV caches, slashing inference costs by up to 50x and enabling a new compute-efficient AI agent paradigm.

2 days ago
LLM Control Plane: Beyond the Gateway
Technology

LLM Control Plane: Beyond the Gateway

Production AI needs more than just gateways; an LLM control plane is crucial for managing budgets, privacy, and dynamic routing.

3 days ago
Four quantum companies raised $961M in seven days. Europe wrote the checks.
AI News

Four quantum companies raised $961M in seven days. Europe wrote the checks.

The week Anthropic filed for IPO, four quantum rounds totaled $961M in Europe, AI agent payment rails launched quietly, and DeepSeek took its first outside money.

8 days ago
OpenAI's Policy Playbook
Artificial Intelligence

OpenAI's Policy Playbook

OpenAI lays out its public policy strategy, focusing on AI safety, youth protection, and equitable access to ensure AGI benefits all of humanity.

12 days ago
Together AI Masters MiniMax M3 Inference
Technology

Together AI Masters MiniMax M3 Inference

Together AI details engineering feats enabling efficient MiniMax M3 inference, unlocking 1M-token context and multimodality.

13 days ago
HPE CEO Neri: AI Drives Strong Revenue and Future Growth
Public Companies

HPE CEO Neri: AI Drives Strong Revenue and Future Growth

HPE CEO Antonio Neri discusses the company's strong Q2 results, driven by AI demand, and forecasts continued growth across networking, cloud, and AI portfolios.

13 days ago
HPE CEO Neri: AI Fuels "Blowout" Revenue, Triple-Digit Server Demand
Public Companies

HPE CEO Neri: AI Fuels "Blowout" Revenue, Triple-Digit Server Demand

HPE CEO Antonio Neri discusses the company's "blowout" AI revenue, triple-digit demand for AI infrastructure, and the shift towards on-premise solutions.

13 days ago
Rishabh Bhargava on Voice Agent Engineering
Artificial Intelligence

Rishabh Bhargava on Voice Agent Engineering

Rishabh Bhargava of Together AI discusses engineering voice agents, focusing on latency, quality, and scale challenges across STT, LLM, and TTS components.

15 days ago
Otari: Own Your AI Stack
Technology

Otari: Own Your AI Stack

Otari launches an open-source LLM gateway and hosted platform to provide essential tools and capabilities for both frontier and open-weight AI models.

17 days ago
AI Infrastructure: Your Next Competitive Edge
AI

AI Infrastructure: Your Next Competitive Edge

Enterprises must modernize their IT infrastructure to unlock AI potential and gain a competitive edge, moving beyond legacy systems and technical debt.

19 days ago
Databricks Tackles LLM Inference Costs
Technology

Databricks Tackles LLM Inference Costs

Databricks details its 'model units' abstraction and cost-aware autoscaling for reliable, high-throughput LLM inference, cutting GPU costs by over 80%.

19 days ago
Claude's Corner: Captain, The RAG Infrastructure Play That's Playing Bloomberg
Claude's Corner

Claude's Corner: Captain, The RAG Infrastructure Play That's Playing Bloomberg

Captain (YC W2026) is building managed RAG-as-a-service, two API calls to connect your data sources, 95% retrieval accuracy via contextual embeddings + hybrid search + reranking, and an Odyssey data pivot that looks a lot like Bloomberg Terminal strategy. Here's the architecture, the moat, and how to build a clone.

20 days ago
AI Infrastructure Boom: Demand Surges as Costs Collapse
Artificial Intelligence

AI Infrastructure Boom: Demand Surges as Costs Collapse

ARK Investment Management's "Big Ideas 2026" report details the AI infrastructure boom, with demand surging and costs collapsing, driving massive investment.

21 days ago
Elon Musk's xAI: $500M ARR, $1B Burn, $1.25T SpaceX Merger
AI Figures

Elon Musk's xAI: $500M ARR, $1B Burn, $1.25T SpaceX Merger

xAI reached an estimated $500M ARR in 2026 while burning roughly $1B per month. A complete breakdown of revenue, capital raises, the Colossus supercomputer, and what the SpaceX merger means.

21 days ago
Lenovo CFO on AI Growth: "First Year of Our AI Decade"
Public Companies

Lenovo CFO on AI Growth: "First Year of Our AI Decade"

Lenovo CFO Winston Cheng discusses the company's strong AI growth, highlighting its diversified portfolio and strategic investments in the AI decade.

24 days ago
AI Agents Build Better AI
tech

AI Agents Build Better AI

LinkedIn Engineering details how AI agents are revolutionizing model development through automated, iterative refinement loops.

25 days ago
Defense AI absorbed $6.2 billion this week. Strip it, and the rest of the market shrank by half.
AI News

Defense AI absorbed $6.2 billion this week. Strip it, and the rest of the market shrank by half.

Defense AI absorbed 44% of venture capital in the week of May 11-17, 2026. Strip Anduril and Helsing, and non-defense funding shrank 53%. Five signals from a split market week.

29 days ago
Jensen Huang's Nvidia: where the $68B Q4 actually comes from
AI Figures

Jensen Huang's Nvidia: where the $68B Q4 actually comes from

Nvidia booked $68.1B in Q4 FY2026, with $62.3B from data center alone. Here's the line-by-line of where Jensen Huang's empire pulls revenue today: hyperscaler buildouts, sovereign-AI deals, and the 19,000-startup Inception loop.

about 1 month ago
Mozilla.ai: AI Sovereignty Beyond Borders
Technology

Mozilla.ai: AI Sovereignty Beyond Borders

Mozilla.ai's CEO John Dickerson redefines sovereign AI beyond geopolitics, emphasizing control, choice, and resilience at every level from nations to individuals.

about 1 month ago
Together AI Supercharges LLM Inference
Technology

Together AI Supercharges LLM Inference

Together AI unveils ATLAS, accelerating LLM inference up to 4x with adaptive speculative decoding, tackling the growing cost challenge for AI-native companies.

about 1 month ago
AI Chip Surge Fuels Market Rally
Investors News

AI Chip Surge Fuels Market Rally

Nvidia AI infrastructure leads a market surge, mirroring past tech cycles, while software valuations face headwinds and social media usage plateaus globally.

about 2 months ago
Together AI Halts Copy Fail Exploit
Technology

Together AI Halts Copy Fail Exploit

Together AI swiftly contained the Copy Fail CVE-2026-31431 vulnerability by disabling a vulnerable Linux kernel module, safeguarding its AI infrastructure.

about 2 months ago
DeepSeek V4 Pro Hits Together AI
Technology

DeepSeek V4 Pro Hits Together AI

Together AI launches DeepSeek V4 Pro, a 1.6T MoE model with a 512K context window and new cached input pricing for cost-effective long-context reasoning.

about 2 months ago
Together AI Adds NVIDIA Nemotron 3
Technology

Together AI Adds NVIDIA Nemotron 3

Together AI launches NVIDIA's Nemotron 3 Nano Omni, a unified multimodal AI model, to developers, simplifying agentic application creation.

about 2 months ago
OpenAI Breaks Free From Microsoft Pact
Artificial Intelligence

OpenAI Breaks Free From Microsoft Pact

OpenAI is reportedly ending its exclusive partnership with Microsoft, aiming to broaden access to its AI models by partnering with other cloud providers like AWS.

about 2 months ago
Meta Taps AWS Graviton for AI
Technology

Meta Taps AWS Graviton for AI

Meta is significantly expanding its AI infrastructure by deploying tens of millions of AWS Graviton processors to power agentic AI workloads.

about 2 months ago
AI Compute & The Token Economy
Artificial Intelligence

AI Compute & The Token Economy

ARK Invest's Brett Winton and Michael Stuart discuss how tokenization could revolutionize AI compute by increasing accessibility and efficiency.

about 2 months ago
Cloudflare Builds the Agentic Cloud
Technology

Cloudflare Builds the Agentic Cloud

Cloudflare unveils its 'agentic cloud' vision with new tools for building and scaling AI agents, addressing compute, security, and infrastructure needs.

about 2 months ago
Cloudflare's LLM Infrastructure Deep Dive
Technology

Cloudflare's LLM Infrastructure Deep Dive

Cloudflare details its advanced infrastructure optimizations for running large language models on its Workers AI platform, focusing on performance and cost-efficiency.

2 months ago
Bloomberg Money Minute: Stocks Mixed, Albert's AI Shift, Live Nation Ruling
Artificial Intelligence

Bloomberg Money Minute: Stocks Mixed, Albert's AI Shift, Live Nation Ruling

Bloomberg Money Minute covers mixed stock performance, Albert's shift to AI, Live Nation's antitrust ruling, American Eagle's gains, and Sazerac's bid for Jack Daniel's.

2 months ago
OpenAI Upgrades Agent Tools for Developers
Artificial Intelligence

OpenAI Upgrades Agent Tools for Developers

OpenAI's revamped Agents SDK introduces native sandbox execution and a more capable harness, boosting security and developer flexibility for building advanced AI agents.

2 months ago
Claude's Corner: Rubric AI, The Agent Reliability Layer Every Vertical AI Company Needs
Claude's Corner

Claude's Corner: Rubric AI, The Agent Reliability Layer Every Vertical AI Company Needs

Rubric AI (YC W2026) builds runtime reasoning infrastructure for vertical AI agents, turning expert judgment into training signals and runtime guidance. Deep technical breakdown, difficulty score, and moat analysis.

2 months ago
CoreWeave, Meta AI Deals Signal Compute Demand
Artificial Intelligence

CoreWeave, Meta AI Deals Signal Compute Demand

CoreWeave and Meta strike $21B AI compute deal, while Inflexion CEO discusses quantum tech. Nvidia stock soars amid AI hardware demand.

2 months ago
Claude's Corner: Terminal Use, Vercel for Background AI Agents
Claude's Corner

Claude's Corner: Terminal Use, Vercel for Background AI Agents

Claude's Corner attempts to rebuild Terminal Use. In this edition, Terminal Use provides Vercel-style infrastructure for hosting filesystem-based AI coding agents. Claude Code has mapped out 7 steps to reproduce this YC W26 startup. Find the repo code at the end of the article to replicate. As always, get building...

2 months ago
Together AI's Aurora Learns on the Fly
Technology

Together AI's Aurora Learns on the Fly

Together AI's Aurora framework uses RL to continuously adapt speculative decoding for faster LLM inference, outperforming static models.

3 months ago
OpenAI secures $122B for AI dominance
Artificial Intelligence

OpenAI secures $122B for AI dominance

OpenAI secures $122B in funding at an $852B valuation, fueling its AI infrastructure ambitions with major backing from Amazon, NVIDIA, and Microsoft.

3 months ago
Vultr and the Sovereign Cloud AI Gap
Artificial Intelligence

Vultr and the Sovereign Cloud AI Gap

Sovereign cloud decisions are failing to account for the actual compute needs of AI, creating a critical infrastructure gap.

3 months ago
Cedric Clyburn on Models as a Service
Artificial Intelligence

Cedric Clyburn on Models as a Service

Red Hat's Cedric Clyburn discusses the evolution of AI from code assistants to Models as a Service (MaaS), highlighting on-premise and hybrid deployments with Kubernetes and OpenShift.

3 months ago
Microsoft Touts AI Advances with NVIDIA
Technology

Microsoft Touts AI Advances with NVIDIA

Microsoft announces expanded Foundry capabilities and new Azure AI infrastructure at NVIDIA GTC, focusing on AI agents and Physical AI.

3 months ago
Snowflake, AWS, NVIDIA Forge Enterprise AI
Technology

Snowflake, AWS, NVIDIA Forge Enterprise AI

Snowflake, AWS, and NVIDIA unite to streamline enterprise AI development and deployment, leveraging NVIDIA's Blackwell platform.

3 months ago
Thinking Machines, NVIDIA Forge Gigawatt AI Pact
Artificial Intelligence

Thinking Machines, NVIDIA Forge Gigawatt AI Pact

Thinking Machines Lab and NVIDIA announce a gigawatt-scale partnership for AI training, including a significant investment from NVIDIA.

3 months ago
Dell and DOE Partner on AI Initiatives
Artificial Intelligence

Dell and DOE Partner on AI Initiatives

Michael Dell and Dr. Arati Prabhakar discuss Dell Technologies' partnership with the DOE to accelerate AI initiatives, focusing on infrastructure, national security, and scientific discovery.

3 months ago
Dell CEO on AI Infrastructure & National Security
Artificial Intelligence

Dell CEO on AI Infrastructure & National Security

Dell Technologies CEO Michael Dell discusses the critical role of AI infrastructure in national security and scientific discovery, highlighting government initiatives and the need for integrated cybersecurity.

3 months ago
Mamba 2 JAX: Hardware Agnostic SSMs
AI Research

Mamba 2 JAX: Hardware Agnostic SSMs

Mamba 2 JAX breaks hardware dependency for state-space models, achieving high performance on CPU, GPU, and TPU via XLA compilation without custom kernels.

3 months ago
Dylan Patel: AI's Unstoppable March
Artificial Intelligence

Dylan Patel: AI's Unstoppable March

AI investor Dylan Patel discusses the accelerating pace of AI development, the need for AI-native infrastructure, and the future impact of AI on work and society.

3 months ago
Larry Ellison's AI Ambitions Face Investor Scrutiny
Artificial Intelligence

Larry Ellison's AI Ambitions Face Investor Scrutiny

Oracle's ambitious AI data center expansion, heavily reliant on OpenAI, faces increasing investor scrutiny over debt and execution amidst a competitive AI landscape.

3 months ago
IBM's Martin Keen on LLM Context Windows
Artificial Intelligence

IBM's Martin Keen on LLM Context Windows

IBM's Martin Keen explains how larger context windows in LLMs simplify deployments and improve reasoning by reducing reliance on complex RAG systems.

3 months ago
Oracle & OpenAI Data Center Deal in Texas
Startup News

Oracle & OpenAI Data Center Deal in Texas

Oracle and OpenAI are reportedly in talks to build a massive AI data center in Texas, signaling a major strategic partnership in the booming AI sector.

3 months ago
Oracle, OpenAI Data Center Deal Falters
Startup News

Oracle, OpenAI Data Center Deal Falters

Oracle and OpenAI have ended plans for a significant AI data center expansion in Texas, with Meta reportedly in talks to lease the site.

3 months ago