via Gem
$120K - 180K a year
Architect backend systems for AI-powered creative tools and collaborate with research to productionize novel capabilities.
Experience with multimodal media systems, backend logic, API design, and a passion for high-polish product development.
The Opportunity Luma AI is defining the future of creative tools. We are moving beyond the prompt box to build intelligent interfaces where users collaborate with AI partners. We combine the research depth of a lab with the product obsession of a consumer app studio. You will work directly with world-class researchers to productionize novel capabilities. Where You Come In You will build the interface between human intention and machine intelligence. This role is about translating the capabilities of our multimodal models into magical, intuitive product experiences. You will solve the technical challenges of making complex, asynchronous agent actions feel responsive and alive. What You Will Build Visual Reasoning Systems: Architect the backend systems that allow an agent to "see" a user's canvas and make intelligent modifications. Hybrid Workflows: Build the bridges between synchronous user actions and asynchronous agent processing. Research-to-Product Pipelines: Partner with the research team to turn experimental model behaviors into stable, high-fidelity product features. The Profile We Are Looking For Multimodal Experience: You have worked with systems involving video, images, or audio, and understand the unique challenges of media-heavy applications. Full-Stack Fluency: While your focus is on the backend logic, you understand how API design impacts the frontend experience and latency. Craft Obsession: You have a portfolio of high-polish products and a passion for building tools that empower creators.
This job posting was last updated on 12/8/2025