Inference
Made Easy

Join the engineers running inference at scale in production.

Infer Summit Logo
Pinterest Logo
Nvidia Logo
baseten
LinkedIn Logo
databricks logo

Why Attend?

Learn from the engineers building the inference stack

Understand what actually works in production

Explore emerging inference architectures

Learn how teams are controlling costs at scale

Hear From Experts

HOST

Hien Luu

Hien Luu

PLATFORM TEAMS

Salina Wu Headshot

Salina Wu

Sundara Ramachandran

INFERENCE PROVIDERS

Meryem Arik

Meryem Arik

Philip Keily

Emilio Andere

Ying Chen Headshot

Ying Chen

ENGINE CONTRIBUTORS

Chenyang Zhao headshot

Chenyang Zhao

Harry Kim

The Three Pillars of Modern Inference

Engines

Inside today’s inference engines: schedulers, KV caches, serving systems, and the code powering them.

Mission Critical

Operating inference at scale: reliability, observability, and performance.

Economics

Building faster, cheaper, and more efficient inference platforms.

One Day.
Three Pillars.
Real Engineering.

Let’s build the future of inference together.

Presented by

Momento