Episode 4: Scaling AI Infrastructure: From Backend Engineer to Platform & Infra
DESCRIPTION
In this episode of Making Software, Carla sits down with Matteo Ferrando, who works on platform and infrastructure at fal.ai, to explore the complex world of infrastructure and platform engineering.
Matteo shares his journey from backend engineering to architecting a leading generative media platform. The conversation dives deep into:
- The "Two-Way Door" Framework: How to decide between building a quick MVP and a long-term scalable solution.
- Latency-Sensitive AI: The infrastructure challenges of serving audio and video models with millisecond precision.
- Backend vs. Infra: Matteo’s "controversial" take on what backend engineering really means in a startup environment.
- The Future of Coding: Why computer science fundamentals are more important than ever in the age of AI-assisted development.
Whether you're an engineer looking to transition into platform roles or curious about the "how" behind generative AI platforms, this episode is packed with practical architectural insights.
Today's Host

Carla Urrea Stabile | Staff Developer Advocate
Today's Guests

Matteo Ferrando | Platform and Infra at fal.ai
Matteo is an Infrastructure and Platform Engineer at fal.ai, where he was one of the company's first hires. As the team has grown and changed direction several times, he has remained at the core of the infrastructure: managing thousands of GPUs, the routing and scheduling layers, and the platform that powers fal.ai's scale. He also works closely with customers to understand their needs and architects the infrastructure solutions that bring new features to life.
@chamini2 
