Lambda

17 open positions available

2 locations
1 employment type
Actively hiring
Full-time

Latest Positions

Showing 17 most recent jobs

Engineering Program Manager - Fleet Engineering

Lambda | Seattle, WA | Full-time
Compensation: $226K - $377K a year

Coordinate and lead cross-functional teams to deliver complex infrastructure projects, ensuring alignment, risk management, and process improvement.

Over 10 years of infrastructure experience, with at least 5 years managing major projects and leading engineering teams, along with a technical background in infrastructure technologies.

Lambda, The Superintelligence Cloud, is a leader in AI cloud infrastructure serving tens of thousands of customers. Our customers range from AI researchers to enterprises and hyperscalers. Lambda's mission is to make compute as ubiquitous as electricity and give everyone the power of superintelligence. One person, one GPU. If you'd like to build the world's best AI cloud, join us.

Note: This position requires presence in our San Francisco/San Jose or Bellevue office location 4 days per week; Lambda’s designated work from home day is currently Tuesday.

About the Team
The Fleet Engineering team is responsible for the logical deployment of cutting-edge NVIDIA GPU clusters, the reliability of the production fleet, and the tools and processes to support these outcomes.

About the Role
Reporting to the Director of Fleet Engineering, your role as an Engineering Program Manager is to coordinate across a set of cross-functional teams to ensure we deliver new GPU capacity on time and at 100% quality. You will be responsible for managing and coordinating the efforts of multiple teams, communicating progress, and actively managing risks and prioritization. You will work collaboratively with Product and Infrastructure engineering teams to improve transparency, metrics, automation, and overall efficiency for the team.

We value diverse backgrounds, experiences, and skills, and we are excited to hear from candidates who can bring unique perspectives to our team. If you do not exactly meet this description but believe you may be a good fit, please still apply and help us understand your readiness for this Manager role. Your application is not a waste of our time.

What You’ll Do
• Partner with Fleet Engineering Managers to ensure the teams are aligned on expectations, track progress toward deliverables, and provide repeatable, scalable programs.
• Identify opportunities for improvement, ensuring we capture the appropriate signals throughout the program and facilitate continuous improvement.
• Work with Fleet Engineering Deployments on executing against tight deadlines while improving process, tooling, and automation.
• Collaborate closely with a broad set of stakeholders, including Platform & Infrastructure engineering, Program Management, Product Management, DC Operations, and Finance.
• Lead cross-functional engineering teams to deliver complex infrastructure projects from concept to deployment. Define scope, goals, and deliverables; plan resources, timelines, and risks; and ensure execution aligns with organizational objectives.
• Demonstrate technical expertise in infrastructure technologies, including NVIDIA GPUs, hardware troubleshooting, lab methodologies, and automation tools.
• Drive risk management and stakeholder communication by proactively identifying issues, driving real-time, in-flight projects on tight timelines, and providing transparent updates on progress and milestones.
• Continuously refine project management processes to improve efficiency, collaboration, and cross-functional alignment with product, operations, and security teams. Maintain a customer-focused approach in defining and meeting technical requirements.

You
• 10+ years of infrastructure experience with 5+ years performing program management for major projects, including capital projects or hyperscaler infrastructure deployment.
• Demonstrated experience leading a team of engineers on complex, cross-functional projects in a fast-paced environment.
• Comfortable managing cross-functional teams and driving decisions and communications.
• Experience successfully designing and implementing simple, scalable processes that solve complex problems.
• Thrive in ambiguous, fast-paced environments; you bring clarity and order to the rest of the team.
• Bachelor's degree in Computer Science, Engineering, or a related technical field.
• Proven track record of successfully leading and delivering complex technical projects.
• Exceptional leadership, communication, and interpersonal skills.
• Ability to thrive in a fast-paced, high-pressure environment and manage multiple projects simultaneously.

Nice to Have
• Experience managing hybrid hardware deployment and software engineering projects.
• Experience in a hyperscaler (CSP), neocloud provider (NCP), or high-performance computing (HPC) production environment.
• Worked closely with product managers to deliver products to specification.
• Deep understanding of infrastructure technologies and software development best practices.

Salary Range Information
The annual salary range for this position has been set based on market data and other factors. However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description.

About Lambda
• Founded in 2012, with 500+ employees, and growing fast
• Our investors notably include TWG Global, US Innovative Technology Fund (USIT), Andra Capital, SGW, Andrej Karpathy, ARK Invest, Fincadia Advisors, G Squared, In-Q-Tel (IQT), KHK & Partners, NVIDIA, Pegatron, Supermicro, Wistron, Wiwynn, Gradient Ventures, Mercato Partners, SVB, 1517, and Crescent Cove
• We have research papers accepted at top machine learning and graphics conferences, including NeurIPS, ICCV, SIGGRAPH, and TOG
• Our values are publicly available: https://lambda.ai/careers
• We offer generous cash & equity compensation
• Health, dental, and vision coverage for you and your dependents
• Wellness and commuter stipends for select roles
• 401k Plan with 2% company match (USA employees)
• Flexible paid time off plan that we all actually use

A Final Note: You do not need to match all of the listed expectations to apply for this position. We are committed to building a team with a variety of backgrounds, experiences, and skills.

Equal Opportunity Employer
Lambda is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.

Compensation Range: $226K - $377K

Program Management
Cross-Functional Leadership
Strategic Planning
Verified Source
Posted 20 days ago

Assistant Controller

Lambda | Anywhere | Full-time
Compensation: Not specified

Oversee and manage the monthly, quarterly, and annual close process to ensure accuracy and timeliness. Lead, mentor, and develop accounting team members while participating in special projects and assisting with audits.

Bachelor’s degree in Accounting or Finance is required, along with a CPA and 8+ years of progressive accounting experience. Strong knowledge of US GAAP, SEC Reporting, and SOX compliance is essential.

Lambda, The Superintelligence Cloud, builds Gigawatt-scale AI Factories for Training and Inference. Lambda’s mission is to make compute as ubiquitous as electricity and give every person access to artificial intelligence. One person, one GPU. If you'd like to build the world's best deep learning cloud, join us.

Note: This position requires presence in our San Jose office location 4 days per week; Lambda’s designated work from home day is currently Tuesday.

What You’ll Do
• Oversee and manage the monthly, quarterly and annual close process to ensure accuracy and timeliness
• Review journal entries, reconciliations, and account analyses prepared by the accounting team
• Support preparation of SEC filings (10-K, 10-Q, 8-K, proxy statements) in compliance with US GAAP and SEC regulations
• Partner with Sales Operations, FP&A, Tax, HR and other functions to ensure proper accounting and reporting of transactions
• Assist with internal and external audits, providing documentation and analyses
• Assist in driving process improvements and automation initiatives to shorten the close cycle and enhance reporting accuracy
• Lead, mentor, and develop accounting team members, fostering a culture of collaboration, accountability, and continuous improvement
• Participate in special projects such as system implementations, accounting policy development and other projects as needed
• Assist in budgeting and forecasting activities, providing valuable insights and recommendations
• Collaborate with cross-functional teams to ensure proper recording and allocation of expenses

You
• Bachelor’s degree in Accounting or Finance, or a related field; CPA required
• 8+ years of progressive accounting experience, including Big 4 public accounting and corporate accounting leadership roles
• Strong knowledge of US GAAP, SEC Reporting and SOX compliance
• Proven track record of managing monthly, quarterly and annual close processes in a fast-paced environment
• Experience with NetSuite
• Excellent analytical, organizational, and problem-solving skills with a keen eye for detail
• Strong leadership and communication skills, with the ability to collaborate cross-functionally, influence at all levels of the organization and articulate accounting concepts to non-accounting stakeholders
• Maintain a high level of integrity and professionalism, with the ability to handle sensitive and confidential information
• Ability to thrive in a dynamic, high-growth, and deadline-driven environment

Nice to Have
• Experience working in a fast-paced, high-growth technology environment
• Experience with AI, SaaS, or consumption-based business models
• Demonstrated skill in process automation and system implementations
• Comfort handling ambiguity and working with minimal supervision
• Initiative to apply knowledge and recommend well-considered improvements

Salary Range Information
The annual salary range for this position has been set based on market data and other factors. However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description.

About Lambda
• Founded in 2012, ~400 employees (2025) and growing fast
• We offer generous cash & equity compensation
• Our investors include Andra Capital, SGW, Andrej Karpathy, ARK Invest, Fincadia Advisors, G Squared, In-Q-Tel (IQT), KHK & Partners, NVIDIA, Pegatron, Supermicro, Wistron, Wiwynn, US Innovative Technology, Gradient Ventures, Mercato Partners, SVB, 1517, Crescent Cove.
• We are experiencing extremely high demand for our systems, with quarter over quarter, year over year profitability
• Our research papers have been accepted into top machine learning and graphics conferences, including NeurIPS, ICCV, SIGGRAPH, and TOG
• Health, dental, and vision coverage for you and your dependents
• Wellness and Commuter stipends for select roles
• 401k Plan with 2% company match (USA employees)
• Flexible Paid Time Off Plan that we all actually use

A Final Note: You do not need to match all of the listed expectations to apply for this position. We are committed to building a team with a variety of backgrounds, experiences, and skills.

Equal Opportunity Employer
Lambda is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.

Accounting
Finance
US GAAP
SEC Reporting
SOX Compliance
Leadership
Analytical Skills
Organizational Skills
Problem-Solving
Communication Skills
Collaboration
Process Improvement
Budgeting
Forecasting
NetSuite
Automation
Direct Apply
Posted 4 months ago

Senior Investor Relations Analyst

Lambda | Anywhere | Full-time
Compensation: Not specified

Support the quarterly earnings process and collaborate with Finance and cross-functional teams to analyze financial performance for investor communications. Assist with the creation and maintenance of investor relations materials and plan investor relations events.

Candidates should have 2-5 years of experience in equity research, investment banking, or strategic finance, along with strong financial and analytical skills. A BS or BA in Business, Economics, or a related field is required, along with a passion for capital markets and technology.

Lambda, The Superintelligence Cloud, builds Gigawatt-scale AI Factories for Training and Inference. Lambda’s mission is to make compute as ubiquitous as electricity and give every person access to artificial intelligence. One person, one GPU. If you'd like to build the world's best deep learning cloud, join us.

Note: This position requires presence in our San Francisco location 4 days per week; Lambda’s designated work from home day is currently Tuesday.

What You’ll Do
• Support the quarterly earnings process, including logistics, financial analyses, and preparation of earnings materials
• Collaborate with Finance and cross-functional teams to analyze financial performance and key business developments for use in investor communications
• Perform financial, strategic, and competitive analyses, including peer benchmarking and ad hoc projects that support strategic and financial initiatives
• Review and analyze equity research, consensus estimates, and investor/analyst sentiment to assess valuation drivers and market perception
• Assist with the creation and ongoing maintenance of investor relations materials, including updates to the Investor Relations website
• Assist with the planning and execution of investor relations events, including non-deal roadshows and investor conferences
• Monitor peers’ and competitors’ earnings disclosures and equity research publications to inform internal messaging and market positioning
• Summarize and distribute key takeaways from investor interactions, equity research, and market developments to senior management

You
• Have 2–5 years of experience in equity research, investment banking, buy-side investing, or strategic finance
• Demonstrate mature financial and analytical skills with the ability to interpret complex information and financial concepts
• Possess strong business acumen with an understanding of technology and software business models, and a capacity to quickly learn about our industry, products, competitors, and key audiences
• Are proficient in Excel and Google Suite; experience with financial research tools is a plus
• Bring a detail-oriented mindset coupled with the ability to think strategically and see the broader picture
• Thrive in fast-paced, high-growth environments with a proactive, flexible, and collaborative mindset
• Bring excellent verbal and written communication skills, with the ability to work effectively across teams and functions
• Hold a BS or BA in Business, Economics, or a related field, or equivalent relevant work experience
• Bring a passion for capital markets, paired with a strong interest in understanding emerging trends and developments in the technology industry

Nice to Have
• Experience with AI, SaaS, or consumption-based business models
• Comfort handling ambiguity and working with minimal supervision
• Experience working with pre-IPO companies
• Familiarity with capital structure dynamics and the ability to evaluate debt instruments such as bonds and bank facilities

Salary Range Information
The annual salary range for this position has been set based on market data and other factors. However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description.

Financial Analysis
Analytical Skills
Business Acumen
Technology Understanding
Excel Proficiency
Google Suite Proficiency
Detail-Oriented
Strategic Thinking
Communication Skills
Investor Relations
Equity Research
Market Analysis
Competitive Analysis
Event Planning
Peer Benchmarking
Capital Markets
Direct Apply
Posted 4 months ago

Engineering Manager - Software Defined Networking

Lambda | Anywhere | Full-time
Compensation: Not specified

Lead the Software Defined Networking team and manage both internal and customer-facing projects to enhance the capabilities of the network. Ensure optimal customer experience while overseeing operational and development workloads.

Candidates should have over 10 years of industry experience in software engineering with a focus on networking and distributed systems. Proven leadership in building high-performance networking infrastructure and managing operational excellence is essential.

Lambda, The Superintelligence Cloud, builds Gigawatt-scale AI Factories for Training and Inference. Lambda’s mission is to make compute as ubiquitous as electricity and give every person access to artificial intelligence. One person, one GPU. If you'd like to build the world's best deep learning cloud, join us.

Note: This position requires presence in our San Francisco/San Jose or Seattle office location 4 days per week; Lambda’s designated work from home day is currently Tuesday.

What You’ll Do
• Lead our world-class Software Defined Networking team.
• Lead both internal and customer-facing projects to expand the capabilities of our multi-tenant, high-performance software-defined network.
• Work closely with peer Managers in the Networking, Control Plane, and HPC Architecture teams to set our future vision for software-defined networking.
• Ensure our customers have the best possible experience that meets their performance, feature, and reliability requirements.
• Manage both operational and development workloads, ensuring rigorous SLAs while also investing in automation and platform improvements to accelerate future growth.
• Own and assist with product-focused projects and strategies that keep Lambda at the cutting edge of GPU hosting, making us the best place to run any GPU, ML or AI workloads.
• Hire, grow and retain top-tier engineers, focusing on both systems reliability engineering and software engineering.
• Shape a culture of sustainable, empathetic, and high-velocity engineering, with a deep focus on cross-team collaboration, documentation, and data-driven decision-making.

You
• 6+ years in full-time engineering management roles at a hyperscaler/cloud, networking solutions provider, technology company dependent on on-prem data-center networking, or a networking software company where you led networking or networking-adjacent teams.
• 10+ years of industry experience in software engineering, with a focus on deploying networking, distributed systems engineering, and/or software-defined networking. This includes design and implementation of networking control planes and data planes; development and tuning of traffic engineering, routing protocols (e.g., BGP, OSPF), VPNs, load balancers, and distributed firewalls; and proficiency in low-level Linux networking, network namespaces, iptables, eBPF, and DPDK.
• Proven record of leading and building engineering teams that work on mission-critical, high-performance networking infrastructure and distributed-systems orchestration.
• Demonstrated operational excellence in running production-grade networking infrastructure with 99.99%+ availability SLAs, including defining SLIs and SLOs, incident management under high-pressure scenarios, postmortem and root cause analysis, and implementation of observability pipelines (Prometheus, Grafana, ELK, etc.).
• Experience deploying and operating next-generation networking technologies in high-performance computing (HPC) and AI datacenters; edge environments with strict latency and jitter constraints; and private cloud stacks such as OpenStack Neutron, Open vSwitch (OvS), Open Virtual Network (OVN) for Open vSwitch, or other software-defined networking stacks like Nutanix or VMware NSX.
• Exceptional leadership skills that encompass leading by trust, building empathy with your reports and other teams, and maintaining a sustainable but rapid velocity.
• Strong customer-facing skills, including pre-sales, general support, and incident management.
• Expertise with Kubernetes and container networking stacks like CNI plugins (Calico, Cilium, Flannel), service mesh implementations (Istio, Linkerd), ingress controllers, multi-tenant network policies, and network security enforcement.
• Demonstrated expertise in managing long-term projects alongside urgent, short-term priorities and incident resolution.
• Extensive experience collaborating with product, sales, and other engineering teams to build cohesive products with a focus on user experience and reliability.

Nice to Have
• Experience with cutting-edge networking technologies, specifically programmable superNICs, such as the NVIDIA BlueField DPUs.
• Experience with the NVIDIA Spectrum-X networking platform.
• Experience with deploying networking solutions on Kubernetes.
• Experience with implementing robust networking observability solutions.
• Experience managing a remote, globally-distributed team.
• Significant experience providing input into sales, customer service/success, and customer support functions.

Salary Range Information
The annual salary range for this position has been set based on market data and other factors. However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description.

Software Defined Networking
Engineering Management
Distributed Systems
Networking Infrastructure
Traffic Engineering
Routing Protocols
Linux Networking
Incident Management
Kubernetes
Container Networking
Observability Solutions
Customer Support
Team Leadership
Automation
Cloud Technologies
AI Datacenters
Direct Apply
Posted 5 months ago

Senior Financial Analyst

Lambda | Anywhere | Full-time
Compensation: Not specified

Develop and implement complex financial models for forecasting and budgeting. Collaborate with cross-functional teams to support strategic initiatives and drive profitable growth.

Candidates must have a bachelor's degree in Finance, Accounting, Economics, or a related field, along with 2-4 years of relevant experience. Strong analytical skills and the ability to thrive in a fast-paced environment are essential.

We're here to help the smartest minds on the planet build Superintelligence. The labs pushing the edge? They run on Lambda. Our gear trains and serves their models, our infrastructure scales with them, and we move fast to keep up. If you want to work on massive, world-changing AI deployments with people who love action and hard problems, we're the place to be. If you'd like to build the world's best deep learning cloud, join us.

Note: This position requires presence in our San Francisco office location 4 days per week; Lambda’s designated work from home day is currently Tuesday.

What You’ll Do
• Develop and implement complex financial models for forecasting, budgeting, and long-term strategic planning
• Conduct rigorous analysis of key performance indicators, operational metrics, and business trends, translating the data into actionable insights and recommendations
• Lead the monthly and quarterly financial reporting processes, ensuring accuracy and providing variance analysis to highlight key business drivers and financial risks
• Collaborate with cross-functional teams, including Sales, Marketing, Product, and Operations, to align on and support strategic initiatives, improve operational efficiency, provide deal support, and drive profitable growth
• Assist in preparing management presentations and reports for the Executive team, Board of Directors, and external stakeholders
• Participate in the financial due diligence for fundraising opportunities
• Monitor industry trends and the competitive landscape to identify potential business risks and opportunities
• Undertake interesting and impactful ad hoc analyses and special projects as directed by Leadership

You
• Have a bachelor's degree in Finance, Accounting, Economics, or a related field
• Have 2-4 years of experience in financial analysis, including FP&A, strategic finance, investment banking, private equity, or venture capital, ideally with a focus on software, AI, cloud infrastructure, and/or enterprise technology
• Possess extensive experience in financial modeling and analysis, with deep expertise in constructing complex financial models and interpreting financial statements to drive strategic decision-making
• Have excellent analytical, strategic thinking, and decision-making skills
• Possess strong Excel skills and experience with financial software systems
• Have the ability to thrive in a fast-paced, high-growth environment, balancing multiple complex projects
• Have excellent written and verbal communication skills with the ability to present complex data clearly and concisely
• Are a team player with a positive attitude, strong work ethic, and a commitment to continuous improvement
• Are able to work in an ambiguous environment with very little direction

Nice to Have
• Prior experience in a startup or high-growth organization, demonstrating adaptability and flexibility to thrive in such environments
• Experience in the machine learning, computer hardware, or cloud computing industry

Salary Range Information
The annual salary range for this position has been set based on market data and other factors. However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description.

Financial Modeling
Analytical Skills
Strategic Thinking
Decision-Making
Excel Skills
Financial Software
Communication Skills
Team Player
Adaptability
Problem Solving
Direct Apply
Posted 5 months ago

Senior Networking Engineer

Lambda | Anywhere | Full-time
Compensation: Not specified

Help to build Lambda’s cloud networking infrastructure and contribute to automation of network configuration. Work with internal and external customers to resolve network-related issues and maintain network monitoring tools.

Candidates should have 3+ years of experience in IT and 1+ years in managing networks. Familiarity with virtualization technologies, firewall policies, and understanding of complex networking topologies is essential.

We're here to help the smartest minds on the planet build Superintelligence. The labs pushing the edge? They run on Lambda. Our gear trains and serves their models, our infrastructure scales with them, and we move fast to keep up. If you want to work on massive, world-changing AI deployments with people who love action and hard problems, we're the place to be. If you'd like to build the world's best deep learning cloud, join us.

Note: This position requires presence in our San Francisco/San Jose/Seattle office location 4 days per week; Lambda’s designated work from home day is currently Tuesday.

What You’ll Do
• Help to build Lambda’s cloud networking infrastructure
• Contribute to automation of network configuration
• Take part in operations and on-call for networking
• Work with internal and external customers to resolve network-related issues
• Work on deploying and configuring networking hardware, switches, and firewalls for new clusters
• Help with deploying and maintaining network monitoring and management tools

You
• Have 3+ years of experience in the IT space, and 1+ years managing networks
• Have experience with virtualization technology, such as ESXi, KVM, and VM management
• Have experience with firewall policy configuration
• Have experience with multi-data center networks and hybrid cloud networks
• Understand BGP EVPN VXLAN networks and spine-and-leaf (Clos) network topology
• Are comfortable on the Linux command line, and have an understanding of the Linux networking stack and internals
• Have Python and/or Bash programming experience and have worked with Git or similar source control systems

Nice to Have
• Experience with monitoring/observability tools like Datadog, Splunk, Grafana, Prometheus
• Experience building and maintaining Software Defined Networks (SDN)
• Experience with HPC networking, such as InfiniBand or RoCE
• Experience automating network configuration within public clouds, with tools like Terraform/Ansible/Salt
• Experience with Next-Generation Firewalls (NGFW)

Salary Range Information
The annual salary range for this position has been set based on market data and other factors. However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description.

Networking Infrastructure
Automation
Linux
Python
Bash
Virtualization
Firewalls
BGP
EVPN
VXLAN
Multi-Data Center Networks
Hybrid Cloud Networks
Monitoring Tools
SDN
HPC Networking
Terraform
Ansible
Direct Apply
Posted 5 months ago
Lambda

Site Reliability Engineer - Managed Kubernetes (Senior)

Lambda | Anywhere | Full-time
Compensation: Not specified

Operate and maintain bare-metal Kubernetes clusters while handling cluster degradation and incident response. Collaborate with teams for low-level issues and develop automation for cluster lifecycle management. | Candidates must have 6+ years of experience in SRE or operations roles with strong programming skills in Go and Python. Proven experience operating Kubernetes clusters in production environments is essential. | We're here to help the smartest minds on the planet build Superintelligence. The labs pushing the edge? They run on Lambda. Our gear trains and serves their models, our infrastructure scales with them, and we move fast to keep up. If you want to work on massive, world-changing AI deployments with people who love action and hard problems, we're the place to be. If you'd like to build the world's best deep learning cloud, join us. What You’ll Do Operate and maintain bare-metal Kubernetes clusters, scaling up to thousands of nodes Handle cluster degradation, recovery, resizing, and incident response using fleet management tools Participate in a well-managed on-call rotation for critical incidents Assist customers with Kubernetes questions, workload integration, storage, and authentication Work closely with our HPC Ops and Datacenter Ops teams for low-level or cross-functional issues Use Python and Golang to create tooling and automate the validation of platform quality. Design, build, and maintain scalable control plane services, operators, and custom controllers for Kubernetes Develop automation for cluster lifecycle management: provisioning, upgrades, patching, and deletion. Define and implement SLOs and SLIs for Kubernetes services, workloads, and platform reliability. 
About You Must-Have 6+ years of experience in a SRE, operations engineer, or similar role, with a deep knowledge of running Linux clusters and systems Strong programming skills in Go and Python; experience with GitOps (e.g., ArgoCD), Helm, and Kubernetes operators Proven experience operating Kubernetes clusters in production environments (on-prem, EKS, GKE, or similar) Can work either independently with limited direction or as part of a team Can work with customers during incidents either via tickets, live messaging, or as part of a larger call. Familiarity with observability tools like Prometheus, Grafana, FluentBit, and CI/CD pipelines Proven experience provisioning Kubernetes using tools such as kubeadm, Cluster API, or similar Nice-to-Have Deep Kubernetes expertise: CRDs, CSI, CNI, Kubernetes Operator Coding experience Exposure to HPC clusters, AI/ML workloads, or large-scale GPU clusters Hybrid or multi-cloud Kubernetes environment experience Contributions to CNCF projects or Kubernetes SIGs Why Join Us Work on cutting-edge Managed Kubernetes platforms for AI/ML workloads Influence the platform roadmap and help shape operations and reliability best practices Collaborate with a highly skilled engineer Opportunity to mentor and grow within a fast-growing, technology-driven environment About Lambda Founded in 2012, ~400 employees (2025) and growing fast We offer generous cash & equity compensation Our investors include Andra Capital, SGW, Andrej Karpathy, ARK Invest, Fincadia Advisors, G Squared, In-Q-Tel (IQT), KHK & Partners, NVIDIA, Pegatron, Supermicro, Wistron, Wiwynn, US Innovative Technology, Gradient Ventures, Mercato Partners, SVB, 1517, Crescent Cove. 
We are experiencing extremely high demand for our systems, with quarter over quarter, year over year profitability Our research papers have been accepted into top machine learning and graphics conferences, including NeurIPS, ICCV, SIGGRAPH, and TOG Health, dental, and vision coverage for you and your dependents Wellness and Commuter stipends for select roles 401k Plan with 2% company match (USA employees) Flexible Paid Time Off Plan that we all actually use A Final Note: You do not need to match all of the listed expectations to apply for this position. We are committed to building a team with a variety of backgrounds, experiences, and skills. Equal Opportunity Employer Lambda is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.

Kubernetes
Linux
Python
Golang
GitOps
Helm
Observability
CI/CD
Cluster Management
SLOs
SLIs
HPC
AI/ML
GPU Clusters
CNCF
Kubernetes Operators
Direct Apply
Posted 5 months ago
Lambda

Staff Product Manager - Cloud Storage

Lambda | Anywhere | Full-time
Compensation: $150K - 250K a year

Define and execute the vision and strategy for Lambda’s cloud and hybrid storage platform, lead technology selection and integration, and ensure performance and scalability for AI workloads. | Bachelor’s degree in a technical field with 7+ years product management experience focused on cloud-scale storage or infrastructure platforms, including expertise in large-scale storage architectures and hybrid cloud integration. | We're here to help the smartest minds on the planet build Superintelligence. The labs pushing the edge? They run on Lambda. Our gear trains and serves their models, our infrastructure scales with them, and we move fast to keep up. If you want to work on massive, world-changing AI deployments with people who love action and hard problems, we're the place to be. If you'd like to build the world's best deep learning cloud, join us. *Note: This position requires presence in our San Francisco or Seattle office location 4 days per week; Lambda’s designated work from home day is currently Tuesday. About the role The Product Manager, Cloud Storage Platform is a senior technical leader responsible for setting the vision, strategy, and architecture for Lambda’s storage infrastructure across cloud and hybrid environments. You will own the complete lifecycle of our storage platform — from ultra–high-performance block and file systems to petabyte- and exabyte-scale object storage — ensuring it delivers unmatched performance, durability, scalability, and cost efficiency for the most demanding AI workloads in the world. This role demands deep expertise across both software-defined and cloud-native storage architectures, along with the ability to unify them into a seamless, high-performance platform. You will define how storage is delivered, managed, and scaled globally, influencing multi-billion-dollar infrastructure investments and guiding world-class engineering teams to deliver storage capabilities that set a new industry benchmark for AI infrastructure. 
Key Responsibilities Define and execute the long-term vision and strategic roadmap for Lambda’s storage platform across cloud and hybrid environments, ensuring it delivers uncompromising performance, scalability, durability, and cost efficiency for the world’s largest AI workloads. Lead the evaluation, selection, and seamless integration of advanced storage technologies — spanning block, file, and object architectures — using rigorous benchmarking to optimize IOPS, throughput, latency, and total cost of ownership. Translate complex infrastructure capabilities into clear product requirements, precise service-level objectives (SLOs), and measurable performance benchmarks that align with demanding AI and HPC use cases. Architect and implement intelligent data tiering strategies (hot, warm, cold) to maximize performance where it matters and drive significant cost savings at scale. Collaborate with infrastructure and operations leaders to forecast multi-year capacity growth, design for petabyte-to-exabyte scalability, and ensure consistent performance under peak workloads. Define and enforce lifecycle management, replication, and disaster recovery policies that guarantee data integrity, compliance, and near-zero downtime. Own the observability and optimization roadmap for the storage platform, deploying advanced telemetry, monitoring, and analytics to proactively detect and remediate bottlenecks before they impact customers. Partner closely with engineering to drive continuous performance tuning, eliminate systemic inefficiencies, and ensure the platform remains ahead of industry benchmarks. Minimum Qualifications Bachelor’s degree or foreign equivalent in Computer Science, Electrical Engineering, Computer Engineering, or a closely related technical field. Seven (7) years of progressive, post-baccalaureate experience in product management, including at least four (4) years focused specifically on cloud-scale storage or infrastructure platforms. 
Proven expertise in the following areas, demonstrated within the required seven (7) years of experience: Designing and delivering large-scale storage platforms, including block, file, and object architectures, for performance-critical workloads. Evaluating and selecting storage technologies through benchmarking of throughput, IOPS, latency, durability, and total cost of ownership. Architecting and managing storage solutions for petabyte- to exabyte-scale datasets, including intelligent tiering strategies. Defining lifecycle management, replication, and disaster recovery strategies to ensure data durability and high availability. Integrating storage services across hybrid and multi-cloud environments to deliver a unified, high-performance platform. Salary range information The annual salary range for this position has been set based on market data and other factors. However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description. About Lambda Founded in 2012, ~400 employees (2025) and growing fast We offer generous cash & equity compensation Our investors include Andra Capital, SGW, Andrej Karpathy, ARK Invest, Fincadia Advisors, G Squared, In-Q-Tel (IQT), KHK & Partners, NVIDIA, Pegatron, Supermicro, Wistron, Wiwynn, US Innovative Technology, Gradient Ventures, Mercato Partners, SVB, 1517, Crescent Cove. We are experiencing extremely high demand for our systems, with quarter over quarter, year over year profitability Our research papers have been accepted into top machine learning and graphics conferences, including NeurIPS, ICCV, SIGGRAPH, and TOG Health, dental, and vision coverage for you and your dependents Wellness and Commuter stipends for select roles 401k Plan with 2% company match (USA employees) Flexible Paid Time Off Plan that we all actually use A Final Note: You do not need to match all of the listed expectations to apply for this position. 
We are committed to building a team with a variety of backgrounds, experiences, and skills. Equal Opportunity Employer Lambda is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.

Cloud-scale storage platforms
Block, file, and object storage architectures
Performance benchmarking (IOPS, throughput, latency)
Petabyte to exabyte scale storage
Lifecycle management and disaster recovery
Hybrid and multi-cloud integration
Product management
Direct Apply
Posted 5 months ago
Lambda

Engineering Manager, AI Cloud Platform

Lambda | Anywhere | Full-time
Compensation: $180K - 250K a year

Lead and grow the AI Cloud Core Platform engineering team, drive roadmap execution, collaborate cross-functionally, and ensure scalable, reliable cloud platform delivery. | 10+ years software engineering with 5+ years management experience in high-growth tech, expertise in large-scale distributed backend systems, leadership in enterprise feature delivery, and strong collaboration skills. | We're here to help the smartest minds on the planet build Superintelligence. The labs pushing the edge? They run on Lambda. Our gear trains and serves their models, our infrastructure scales with them, and we move fast to keep up. If you want to work on massive, world-changing AI deployments with people who love action and hard problems, we're the place to be. If you'd like to build the world's best deep learning cloud, join us. Note: This position requires presence in our San Francisco office location 4 days per week; Lambda’s designated work from home day is currently Tuesday. Engineering at Lambda is responsible for building and scaling our cloud offering. Our scope includes the Lambda website, cloud APIs and systems as well as internal tooling for system deployment, management and maintenance. What you’ll do Lead the AI Cloud Core Platform team of ~6 engineers, with end-to-end ownership of Cloud Platform and governance capabilities. Drive execution of roadmap features including cluster lifecycle automation. Partner closely with Product and Design to ensure the user experience matches the needs of enterprise customers. Balance rapid feature delivery with longer-term investments in scalability, observability, and platform design. Hire, mentor, and grow a team of engineers, providing career development and feedback. Collaborate with other Lambda teams (Control Plane, Billing, Platform) to ensure smooth, integrated delivery across the stack. Contribute to a culture of high performance, documentation, humility, and curiosity. 
Be product-focused in your leadership and execution, always placing the needs of the customer first, with a particular focus on feature velocity, reliability and security. Shape a culture of sustainable, empathetic, and high-velocity engineering, with a deep focus on cross-team collaboration, documentation, and data-driven decision-making. You 5+ years in a full-time management role at a high-growth technology company 10+ years of industry experience in software engineering, with a focus on large-scale distributed systems and backend systems. Proven record of leading and building engineering teams that work on mission-critical, high performance systems. Proven track record leading teams that deliver enterprise features or governance platforms. Exceptional leadership skills that encompass leading by trust, building empathy with your reports and other teams, and maintaining a sustainable but rapid velocity. Demonstrated expertise in managing long-term projects alongside urgent, short-term priorities and incident resolution. Extensive experience collaborating with product, sales, and other engineering teams to build cohesive products with a focus on user experience and reliability. Ability to understand, review and structure Python and Go applications. Nice to Have Experience with IAM, authentication/authorization (SSO, RBAC, SCIM), governance tooling, or compliance features. Background building cloud application platforms. Experience managing a remote, distributed team Salary Range Information The annual salary range for this position has been set based on market data and other factors. However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description. 
About Lambda Founded in 2012, ~400 employees (2025) and growing fast We offer generous cash & equity compensation Our investors include Andra Capital, SGW, Andrej Karpathy, ARK Invest, Fincadia Advisors, G Squared, In-Q-Tel (IQT), KHK & Partners, NVIDIA, Pegatron, Supermicro, Wistron, Wiwynn, US Innovative Technology, Gradient Ventures, Mercato Partners, SVB, 1517, Crescent Cove. We are experiencing extremely high demand for our systems, with quarter over quarter, year over year profitability Our research papers have been accepted into top machine learning and graphics conferences, including NeurIPS, ICCV, SIGGRAPH, and TOG Health, dental, and vision coverage for you and your dependents Wellness and Commuter stipends for select roles 401k Plan with 2% company match (USA employees) Flexible Paid Time Off Plan that we all actually use A Final Note: You do not need to match all of the listed expectations to apply for this position. We are committed to building a team with a variety of backgrounds, experiences, and skills. Equal Opportunity Employer Lambda is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.

Leadership
Python
Go (desired)
Cloud platform
Distributed systems
IAM and governance tooling (nice to have)
User experience
Enterprise feature delivery
Direct Apply
Posted 5 months ago
Lambda

Software Engineering Manager - Storage

Lambda | Anywhere | Full-time
Compensation: $180K - 250K a year

Lead and manage a team developing high-performance distributed storage protocols and systems tailored for AI workloads, driving technical strategy and cross-functional collaboration. | 10+ years software development with 5+ years management in storage software engineering, expertise in distributed storage protocols and systems, programming in low-level languages, and experience with container orchestration and high-scale storage solutions. | We're here to help the smartest minds on the planet build Superintelligence. The labs pushing the edge? They run on Lambda. Our gear trains and serves their models, our infrastructure scales with them, and we move fast to keep up. If you want to work on massive, world-changing AI deployments with people who love action and hard problems, we're the place to be. If you'd like to build the world's best deep learning cloud, join us. • Note: This position requires presence in our San Francisco office location 4 days per week; Lambda’s designated work from home day is currently Tuesday. Engineering at Lambda is responsible for building and scaling our cloud offering. Our scope includes the Lambda website, cloud APIs and systems as well as internal tooling for system deployment, management and maintenance. In the world of distributed AI, raw GPU and CPU horsepower is just a part of the story. High-performance networking and storage are the critical components that enable and unite these systems, making groundbreaking AI training and inference possible. The Lambda Infrastructure Engineering organization forges the foundation of high-performance AI clusters by welding together the latest in AI storage, networking, GPU and CPU hardware. Our expertise lies at the intersection of: • High-Performance Distributed Storage Solutions and Protocols: We engineer the protocols and systems that serve massive datasets at the speeds demanded by modern clustered GPUs. 
• Dynamic Networking: We design advanced networks that provide multi-tenant security and intelligent routing without compromising performance, using the latest in AI networking hardware. • Compute Virtualization: We enable cutting-edge virtualization and clustering that allows AI researchers and engineers to focus on AI workloads, not AI infrastructure, unleashing the full compute bandwidth of clustered GPUs. About the Role: We are seeking an experienced Software Engineering Manager with a history in the development of storage protocols and distributed storage systems to lead a team of Storage Software Engineers and Distributed Systems Engineers in the design, development, and optimization of cutting-edge distributed storage solutions. Your team will be responsible for building high-performance, scalable, and reliable implementations of object, block, and file protocols, specifically tailored to serve performance demanding AI training and inference workloads. This is a unique opportunity to work at the intersection of large-scale distributed systems and the rapidly evolving field of artificial intelligence infrastructure. You will be building the foundational infrastructure that powers some of the most advanced AI research and products in the world. What You’ll Do • Team Leadership & Management: • Grow/Hire, lead, and mentor a top-talent team of high-performing software engineers focused on delivering distributed storage protocols. • Foster a high-velocity culture of innovation, technical excellence, and collaboration. • Conduct regular one-on-one meetings, provide constructive feedback, and support career development for team members. • Drive outcomes by managing project priorities, deadlines, and deliverables using Agile methodologies. • Technical Strategy & Execution: • Drive the technical vision and strategy for our distributed storage protocols (e.g., S3, NFS, iSCSI) and their underlying distributed systems. 
• Oversee the development of highly optimized storage solutions designed to meet the performance demands of AI/ML workloads (e.g., high throughput, low latency, optimization for AI workload access patterns).
• Lead the team in tackling complex distributed systems challenges, including concurrency, consistency, fault tolerance, and data durability across multiple data centers.
• Guide the engineering team in problem identification, requirements gathering, solution ideation, and stakeholder alignment on engineering RFCs.
• Deeply understand the performance bottlenecks of existing storage systems and guide the team in developing innovative solutions to overcome them.
• Lead the team in supporting customers.
• Cross-Functional Collaboration:
• Work closely with AI/ML research and product teams to understand customers’ storage needs and translate them into technical requirements.
• Work closely with the product engineering team to deliver high-quality products that meet customers’ unique needs.
• Collaborate with product management to define the product roadmap and prioritize features.
• Work closely with HPC Architecture, Networking, Compute, and Storage Engineering teams to deploy high-performance distributed storage protocols to serve AI/ML workloads.
• Partner with fleet engineering and platform teams to ensure seamless deployment, monitoring, and maintenance of the distributed storage protocols.
• Work in lock-step with the Storage Engineering team to provide reliable storage products on top of a variety of physical storage solutions.
• Innovation & Research:
• Stay current with the latest trends and research in distributed systems, storage technologies, and AI/ML hardware/software advancements.
• Work with the Lambda product team to uncover new trends in the AI inference and training product category.
• Encourage and support the team in exploring new technologies and approaches to improve system performance and efficiency.
You
• Experience:
• 10+ years of experience in software development, with at least 5 years in a management or lead role in storage software engineering.
• Demonstrated experience leading a team of software engineers on complex, cross-functional projects in a fast-paced startup environment.
• Extensive hands-on experience in designing and implementing distributed storage systems.
• Experience with storage protocols serving storage volumes at a scale greater than 20PB.
• Experience developing and tuning distributed storage protocols across scaling challenges using namespacing, sharding, and caching strategies.
• Familiarity with deploying and running applications on Kubernetes or other container orchestration systems (e.g., AWS ECS, HashiCorp Nomad).
• Strong project management skills, leading high-confidence planning, project execution, and delivery of team outcomes on schedule.
• Technical Skills:
• Knowledge in one or more of the following storage protocols: object storage (e.g., S3), block storage (e.g., iSCSI), or file storage (e.g., NFS, SMB, Lustre).
• Professional individual contributor experience in languages such as C++, Go, Rust, or Python.
• Familiarity with modern storage technologies (e.g., NVMe, RDMA) and their role in optimizing performance.
• Experience with containerization technologies (e.g., Docker, Kubernetes) and their integration with storage solutions.
• Distributed Systems Knowledge:
• Solid understanding of distributed systems concepts, including consensus algorithms (e.g., Raft, Paxos), distributed caching, failure recovery, consistency models (e.g., eventual consistency), fault tolerance, data replication, and load balancing.
• People Management:
• Experience building a high-performance team through deliberate hiring, upskilling, planned skills redundancy, performance management, and expectation setting.
Nice to Have • Experience: • Demonstrated delivery of distributed storage protocols in a CSP (Cloud Service Provider), NCP (Neo-Cloud provider), HPC-infrastructure integrator, or AI-infrastructure company. • Experience with storage protocols serving storage volumes at a scale greater than 100PB. • Implementation of distributed storage protocols backed by a variety of storage solutions, performance-tuned for AI/ML workloads. • Experience driving cross-functional engineering management initiatives (coordinating deployments, strategic planning, coordinating large projects). • Technical Skills: • Deep expertise in one or more of the following storage protocols: object storage (e.g., S3), block storage (e.g., iSCSI), or file storage (e.g., NFS, SMB, Lustre). • Strong programming skills in languages such as C++, Go, Rust, or Python. • In-depth knowledge of operating system internals, including file systems, caching, and I/O scheduling. • AI/ML Domain Knowledge: • Experience working with AI/ML training and inference frameworks (e.g., TensorFlow, PyTorch). • Understanding of the unique data access patterns and performance requirements of AI workloads. • Distributed Systems Knowledge: • Proven ability to design and debug highly concurrent and fault-tolerant systems. • People Management: • Experience driving organizational improvements (processes, systems, etc.) • Experience training, or managing managers. Salary Range Information The annual salary range for this position has been set based on market data and other factors. However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description. 
About Lambda • Founded in 2012, ~400 employees (2025) and growing fast • We offer generous cash & equity compensation • Our investors include Andra Capital, SGW, Andrej Karpathy, ARK Invest, Fincadia Advisors, G Squared, In-Q-Tel (IQT), KHK & Partners, NVIDIA, Pegatron, Supermicro, Wistron, Wiwynn, US Innovative Technology, Gradient Ventures, Mercato Partners, SVB, 1517, Crescent Cove. • We are experiencing extremely high demand for our systems, with quarter over quarter, year over year profitability • Our research papers have been accepted into top machine learning and graphics conferences, including NeurIPS, ICCV, SIGGRAPH, and TOG • Health, dental, and vision coverage for you and your dependents • Wellness and Commuter stipends for select roles • 401k Plan with 2% company match (USA employees) • Flexible Paid Time Off Plan that we all actually use A Final Note: You do not need to match all of the listed expectations to apply for this position. We are committed to building a team with a variety of backgrounds, experiences, and skills. Equal Opportunity Employer Lambda is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.

Distributed storage systems
Storage protocols (S3, NFS, iSCSI)
Software engineering management
High-performance networking and storage
Kubernetes and container orchestration
Programming in C++, Go, Rust, Python
Distributed systems concepts (consensus algorithms, fault tolerance)
Verified Source
Posted 6 months ago
Lambda

Senior Software Engineer - IAM

Lambda | Anywhere | Full-time
Compensation: $140K - 180K a year

Design and build IAM platform features including authentication, authorization, MFA, and identity lifecycle management for a scalable AI cloud environment. | 8+ years backend/platform engineering with 3+ years leading IAM/authentication projects, expertise in modern IAM protocols and systems design, programming in Python or Go, and experience with infrastructure as code. | We're here to help the smartest minds on the planet build Superintelligence. The labs pushing the edge? They run on Lambda. Our gear trains and serves their models, our infrastructure scales with them, and we move fast to keep up. If you want to work on massive, world-changing AI deployments with people who love action and hard problems, we're the place to be. If you'd like to build the world's best deep learning cloud, join us. *Note: This position requires presence in our San Francisco or Seattle office location 4 days per week; Lambda’s designated work from home day is currently Tuesday. Role Summary Help define and deliver Lambda’s next-generation Identity and Access Management platform - powering secure, intuitive, and scalable access control for customers ranging from early-stage startups to the most superintelligent AI teams in the world. You’ll design IAM systems that anticipate the needs of highly technical users who will stress-test every workflow, integration, and permission model. Your charter will be to lead the design and implementation of our IAM vision: Workspaces, enterprise-grade RBAC, MFA enhancements, and a unified identity platform across all Lambda products and services. What You’ll Do Design and build intuitive, beautiful web interfaces for ML/AI cloud users Integrate top-tier tooling, workflows, and models from the AI space Own features end-to-end—from design to deployment to monitoring You (Must-Haves) 8+ years of backend or platform engineering experience, with 3+ years leading IAM or authentication/authorization initiatives. 
Deep expertise in modern IAM patterns and technologies: Authentication (OIDC, OAuth2, SAML) Authorization (RBAC, ABAC, fine-grained permissions) MFA and advanced authentication factors SCIM and identity lifecycle management Experience integrating and customizing third-party identity platforms (e.g., Auth0, Okta, WorkOS) at scale. Strong architecture and systems design skills for distributed, multi-tenant SaaS environments. Proven track record of delivering IAM features in security-sensitive, high-uptime environments. Solid programming experience in Python, Go, or similar languages, plus comfort with IaC (Terraform, Atlantis, Crossplane). Nice-to-Haves Experience designing IAM systems for ML/AI or cloud infrastructure products. Hands-on experience with enterprise onboarding flows. Prior work on multi-cloud or hybrid identity solutions (AWS, GCP, Azure). Contributions to IAM standards or open-source identity projects. Leadership or mentorship experience Strong academic background (EECS, Math, Software Engineering, Physics) Salary Range The annual salary range for this position has been set based on market data and other factors. However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description. Final Note You don’t need to meet all qualifications to apply. Lambda values a variety of backgrounds, experiences, and skills. About Lambda Founded in 2012, ~400 employees (2025) and growing fast We offer generous cash & equity compensation Our investors include Andra Capital, SGW, Andrej Karpathy, ARK Invest, Fincadia Advisors, G Squared, In-Q-Tel (IQT), KHK & Partners, NVIDIA, Pegatron, Supermicro, Wistron, Wiwynn, US Innovative Technology, Gradient Ventures, Mercato Partners, SVB, 1517, Crescent Cove. 
We are experiencing extremely high demand for our systems, with quarter over quarter, year over year profitability Our research papers have been accepted into top machine learning and graphics conferences, including NeurIPS, ICCV, SIGGRAPH, and TOG Health, dental, and vision coverage for you and your dependents Wellness and Commuter stipends for select roles 401k Plan with 2% company match (USA employees) Flexible Paid Time Off Plan that we all actually use A Final Note: You do not need to match all of the listed expectations to apply for this position. We are committed to building a team with a variety of backgrounds, experiences, and skills. Equal Opportunity Employer Lambda is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.

IAM (OIDC, OAuth2, SAML, RBAC, ABAC, MFA)
Python
Go
Terraform
Distributed SaaS architecture
Authentication/Authorization
Security-sensitive environments
Direct Apply
Posted 6 months ago
Lambda

Senior Software Engineer - Managed Kubernetes

LambdaAnywhereFull-time
View Job
Compensation$140K - 180K a year

Design, build, and maintain scalable Kubernetes control plane services and automation tools, support production issues, and develop internal APIs and CLI tools. | 6+ years software engineering experience with 3+ years leading projects, 2+ years on orchestration/deployment systems, strong Go and Python skills, Kubernetes experience, and Linux/cloud infrastructure knowledge. | We're here to help the smartest minds on the planet build Superintelligence. The labs pushing the edge? They run on Lambda. Our gear trains and serves their models, our infrastructure scales with them, and we move fast to keep up. If you want to work on massive, world-changing AI deployments with people who love action and hard problems, we're the place to be. If you'd like to build the world's best deep learning cloud, join us. • Note: This position requires presence in our San Francisco office location 4 days per week; Lambda’s designated work from home day is currently Tuesday. Engineering at Lambda is responsible for building and scaling our cloud offering. Our scope includes the Lambda website, cloud APIs and systems as well as internal tooling for system deployment, management and maintenance. About the Role We are seeking a Senior Software Engineer to join our Managed Kubernetes (Mk8s) team. You will play a crucial role in shaping the architecture, reliability, and automation of our Kubernetes-based infrastructure, which powers mission-critical workloads across our global platform. What You’ll Do • Design, build, and maintain scalable control plane services, operators, and custom Kubernetes controllers, while developing automation in Python/Go for end-to-end cluster lifecycle management — including provisioning, upgrades, patching, and deletion. • Identify gaps and develop internal tools, APIs, and command-line interfaces (CLIs) that enable customers and ML/AI teams to deploy and effectively monitor inference services. 
• Write resilient systems that gracefully handle failure across large-scale distributed environments. • Develop automated tests to ensure quality and stability, and validate the clusters to identify and address hardware issues before delivery. • Support and debug production issues through on-call rotation. You • Have 6+ years of experience in software engineering, with 3+ years leading large-scale, complex projects or serving as a tech lead. • At least two years of experience working on orchestration and deployment systems. • Experience using Kubernetes and third-party operators (CRDs, CSI, CNI, etc.). • Strong programming skills in Go and Python; ability to collaborate effectively on shared codebases. • Take pride in owning and delivering core components of products and platforms. • Experience with infrastructure-as-code tools (e.g. Terraform, Pulumi). • Solid knowledge of Linux systems, networking, containers, and cloud infrastructure. Nice to Have • Deep Kubernetes and Linux expertise • Experience operating the control plane and low-level pieces of large-scale Kubernetes clusters • Experience with user-level restrictions and hardening (e.g. AppArmor) • Experience with HPC clusters, environments & tooling • Experience with machine learning/AI frameworks • Expertise with hybrid or multi-cloud Kubernetes environments. • Familiarity with GPU, InfiniBand, or high-performance computing on K8s. • Past contributions to CNCF projects or Kubernetes SIGs are a plus. If you don’t meet all of these requirements but believe you may be a good fit, please still apply and provide a cover letter that helps us understand your experience and readiness for this role. Salary Range Information The annual salary range for this position has been set based on market data and other factors. However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description. 
About Lambda • Founded in 2012, ~400 employees (2025) and growing fast • We offer generous cash & equity compensation • Our investors include Andra Capital, SGW, Andrej Karpathy, ARK Invest, Fincadia Advisors, G Squared, In-Q-Tel (IQT), KHK & Partners, NVIDIA, Pegatron, Supermicro, Wistron, Wiwynn, US Innovative Technology, Gradient Ventures, Mercato Partners, SVB, 1517, Crescent Cove. • We are experiencing extremely high demand for our systems, with quarter over quarter, year over year profitability • Our research papers have been accepted into top machine learning and graphics conferences, including NeurIPS, ICCV, SIGGRAPH, and TOG • Health, dental, and vision coverage for you and your dependents • Wellness and Commuter stipends for select roles • 401k Plan with 2% company match (USA employees) • Flexible Paid Time Off Plan that we all actually use A Final Note: You do not need to match all of the listed expectations to apply for this position. We are committed to building a team with a variety of backgrounds, experiences, and skills. Equal Opportunity Employer Lambda is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.

Kubernetes
Go
Python
Infrastructure-as-code (Terraform, Pulumi)
Linux systems
Networking
Containers
Cloud infrastructure
Verified Source
Posted 6 months ago
Lambda

Senior Software Engineer - IAM

LambdaAnywhereFull-time
View Job
Compensation$150K - 220K a year

Design and build next-generation IAM platform features including enterprise-grade RBAC, MFA enhancements, and unified identity management for AI cloud users. | 8+ years backend/platform engineering with 3+ years leading IAM initiatives, deep expertise in modern IAM technologies, strong architecture skills, and programming in Python or Go. | We're here to help the smartest minds on the planet build Superintelligence. The labs pushing the edge? They run on Lambda. Our gear trains and serves their models, our infrastructure scales with them, and we move fast to keep up. If you want to work on massive, world-changing AI deployments with people who love action and hard problems, we're the place to be. If you'd like to build the world's best deep learning cloud, join us. • Note: This position requires presence in our San Francisco or Seattle office location 4 days per week; Lambda’s designated work from home day is currently Tuesday. Role Summary Help define and deliver Lambda’s next-generation Identity and Access Management platform - powering secure, intuitive, and scalable access control for customers ranging from early-stage startups to the most superintelligent AI teams in the world. You’ll design IAM systems that anticipate the needs of highly technical users who will stress-test every workflow, integration, and permission model. Your charter will be to lead the design and implementation of our IAM vision: Workspaces, enterprise-grade RBAC, MFA enhancements, and a unified identity platform across all Lambda products and services. What You’ll Do • Design and build intuitive, beautiful web interfaces for ML/AI cloud users • Integrate top-tier tooling, workflows, and models from the AI space • Own features end-to-end—from design to deployment to monitoring You (Must-Haves) • 8+ years of backend or platform engineering experience, with 3+ years leading IAM or authentication/authorization initiatives. 
• Deep expertise in modern IAM patterns and technologies: • Authentication (OIDC, OAuth2, SAML) • Authorization (RBAC, ABAC, fine-grained permissions) • MFA and advanced authentication factors • SCIM and identity lifecycle management • Experience integrating and customizing third-party identity platforms (e.g., Auth0, Okta, WorkOS) at scale. • Strong architecture and systems design skills for distributed, multi-tenant SaaS environments. • Proven track record of delivering IAM features in security-sensitive, high-uptime environments. • Solid programming experience in Python, Go, or similar languages, plus comfort with IaC (Terraform, Atlantis, Crossplane). Nice-to-Haves • Experience designing IAM systems for ML/AI or cloud infrastructure products. • Hands-on experience with enterprise onboarding flows. • Prior work on multi-cloud or hybrid identity solutions (AWS, GCP, Azure). • Contributions to IAM standards or open-source identity projects. • Leadership or mentorship experience • Strong academic background (EECS, Math, Software Engineering, Physics) Salary Range The annual salary range for this position has been set based on market data and other factors. However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description. Final Note You don’t need to meet all qualifications to apply. Lambda values a variety of backgrounds, experiences, and skills. About Lambda • Founded in 2012, ~400 employees (2025) and growing fast • We offer generous cash & equity compensation • Our investors include Andra Capital, SGW, Andrej Karpathy, ARK Invest, Fincadia Advisors, G Squared, In-Q-Tel (IQT), KHK & Partners, NVIDIA, Pegatron, Supermicro, Wistron, Wiwynn, US Innovative Technology, Gradient Ventures, Mercato Partners, SVB, 1517, Crescent Cove. 
• We are experiencing extremely high demand for our systems, with quarter over quarter, year over year profitability • Our research papers have been accepted into top machine learning and graphics conferences, including NeurIPS, ICCV, SIGGRAPH, and TOG • Health, dental, and vision coverage for you and your dependents • Wellness and Commuter stipends for select roles • 401k Plan with 2% company match (USA employees) • Flexible Paid Time Off Plan that we all actually use A Final Note: You do not need to match all of the listed expectations to apply for this position. We are committed to building a team with a variety of backgrounds, experiences, and skills. Equal Opportunity Employer Lambda is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.

IAM (Authentication: OIDC, OAuth2, SAML)
Authorization (RBAC, ABAC, fine-grained permissions)
MFA and advanced authentication factors
SCIM and identity lifecycle management
Python, Go, Terraform, IaC
Distributed multi-tenant SaaS architecture
Leadership and mentorship
Verified Source
Posted 6 months ago
Lambda

Engineering Project Manager - Infrastructure

LambdaAnywhereFull-time
View Job
Compensation$140K - 180K a year

Lead cross-functional infrastructure software engineering projects, manage project lifecycles, ensure timely delivery, and communicate with stakeholders. | 10+ years in software engineering with 7+ years in project management, strong Agile knowledge, leadership skills, and experience with project management tools. | We're here to help the smartest minds on the planet build Superintelligence. The labs pushing the edge? They run on Lambda. Our gear trains and serves their models, our infrastructure scales with them, and we move fast to keep up. If you want to work on massive, world-changing AI deployments with people who love action and hard problems, we're the place to be. If you'd like to build the world's best deep learning cloud, join us. • Note: This position requires presence in our San Francisco office location 4 days per week; Lambda’s designated work from home day is currently Tuesday. In the world of distributed AI, raw GPU and CPU horsepower is just a part of the story. High-performance networking and storage are the critical components that enable and unite these systems, making groundbreaking AI training and inference possible. The Lambda Infrastructure Engineering organization forges the foundation of high-performance AI clusters by welding together the latest in AI storage, networking, GPU and CPU hardware. Our expertise lies at the intersection of: • High-Performance Distributed Storage Solutions and Protocols: We engineer the protocols and systems that serve massive datasets at the speeds demanded by modern clustered GPUs. • Dynamic Networking: We design advanced networks that provide multi-tenant security and intelligent routing without compromising performance, using the latest in AI networking hardware. • Compute Virtualization: We enable cutting-edge virtualization and clustering that allows AI researchers and engineers to focus on AI workloads, not AI infrastructure, unleashing the full compute bandwidth of clustered GPUs. 
We are seeking an experienced, technical, and outcome-focused Infrastructure Software Engineering Project Manager to help our team deliver cutting-edge, scalable, and reliable AI infrastructure solutions. The ideal candidate will have a deep understanding of software development lifecycles, familiarity with infrastructure technologies, and be an expert at application of project management methodologies. This role requires exceptional leadership skills, a passion for technology, and the ability to drive complex projects to successful completion in a fast-paced, dynamic environment. What You’ll Do • Project Leadership: Guide a cross-functional team of infrastructure software engineers, storage engineers, and networking engineers, fostering a culture of innovation, collaboration, and accountability. • Strategic Planning: Define project scope, goals, and deliverables in collaboration with engineering management and stakeholders. Develop comprehensive project plans, including timelines, resource allocation, and budget. • Execution and Delivery: Oversee the entire project lifecycle, from ideation to deployment. Ensure projects are delivered on time, within budget, and to the highest quality standards. • Technical Expertise: Possess a strong understanding of infrastructure technologies, including cloud platforms (e.g., AWS, Azure, GCP), containerization (e.g., Docker, Kubernetes), CI/CD pipelines, and automation tools. • Risk Management: Proactively identify, assess, and mitigate project risks and issues. Develop contingency plans and communicate status to stakeholders. • Stakeholder Communication: Serve as the primary point of contact for all project-related communications. Provide regular updates to stakeholders, including executive leadership, on project progress, risks, and key milestones. • Process Improvement: Continuously evaluate and improve project management processes and tools to enhance team efficiency and effectiveness. 
• Collaboration: Work closely with cross-functional teams, including product management, operations, and security, to ensure alignment and successful project integration. • Customer Focus: Work closely with engineering management to scope customer requirements. You • 10+ years of experience in the software engineering industry, with 7+ years in a project management role focused on software engineering and software infrastructure projects. • Demonstrated experience leading a team of software engineers on complex, cross-functional projects in a fast-paced startup environment. • Bachelor's degree in Computer Science, Engineering, or a related technical field. • Proven track record of successfully leading and delivering complex technical projects. • Strong knowledge of Agile and Scrum methodologies. • Exceptional leadership, communication, and interpersonal skills, and the ability to use them to lead teams from a disorganized starting point to orderly execution. • Ability to thrive in a fast-paced, high-pressure environment and manage multiple projects simultaneously. • Impeccable writing and documentation skills. • Experience with project management software (e.g., Jira, Asana, Trello). Nice to Have • Experience managing hybrid hardware deployment and software engineering projects. • Experience in hyperscaler (CSP), neocloud provider (NCP), or high-performance computing (HPC) production environments. • Experience working closely with product managers to deliver products to specification. • Deep understanding of infrastructure technologies and software development best practices. Salary Range Information The annual salary range for this position has been set based on market data and other factors. However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description. 
About Lambda • Founded in 2012, ~400 employees (2025) and growing fast • We offer generous cash & equity compensation • Our investors include Andra Capital, SGW, Andrej Karpathy, ARK Invest, Fincadia Advisors, G Squared, In-Q-Tel (IQT), KHK & Partners, NVIDIA, Pegatron, Supermicro, Wistron, Wiwynn, US Innovative Technology, Gradient Ventures, Mercato Partners, SVB, 1517, Crescent Cove. • We are experiencing extremely high demand for our systems, with quarter over quarter, year over year profitability • Our research papers have been accepted into top machine learning and graphics conferences, including NeurIPS, ICCV, SIGGRAPH, and TOG • Health, dental, and vision coverage for you and your dependents • Wellness and Commuter stipends for select roles • 401k Plan with 2% company match (USA employees) • Flexible Paid Time Off Plan that we all actually use A Final Note: You do not need to match all of the listed expectations to apply for this position. We are committed to building a team with a variety of backgrounds, experiences, and skills. Equal Opportunity Employer Lambda is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.

Project Management
Software Engineering
Infrastructure Technologies
Cloud Platforms (AWS, Azure, GCP)
Containerization (Docker, Kubernetes)
CI/CD Pipelines
Automation Tools
Agile and Scrum Methodologies
Leadership and Communication
Technical Documentation
Verified Source
Posted 6 months ago
Lambda

Data Center Systems Operations Engineer

LambdaAnywhereFull-time
View Job
Compensation$120K - 180K a year

Own availability and utilization analysis, coordinate cross-functional teams for data center infrastructure improvements, and lead strategic design and implementation of key programs. | 10+ years in data center infrastructure or HPC operations, deep familiarity with AI workloads, strong analytical and communication skills, and preferably hyperscale/cloud infrastructure experience. | We're here to help the smartest minds on the planet build Superintelligence. The labs pushing the edge? They run on Lambda. Our gear trains and serves their models, our infrastructure scales with them, and we move fast to keep up. If you want to work on massive, world-changing AI deployments with people who love action and hard problems, we're the place to be. If you'd like to build the world's best deep learning cloud, join us. *Note: This position prefers presence in our Bay Area office locations, but is open to remote presence for the right candidate. About the Job As Lambda continues to scale its AI platform and customer base, infrastructure decisions must be tightly aligned with product roadmaps, platform growth, and fiscal discipline. The Systems Operations Engineer will own availability analysis, long-term improvement of utilization, input into strategic design, and implementation of key programs across the entire Infrastructure Stack. This role sits within the Data Center Infrastructure (DC Infra) team and will work cross-functionally with Product, Platform Engineering, and Observability to understand overall health, analyze ongoing and potential issues, recommend and make changes to our overall design, and own key programs that improve the overall business. This position is a critical link between the HPC/HW systems and DC Infra, and will help ensure our designs and operations maximize availability and reliability across our entire Platform. 
What You’ll Do Availability Analysis Own end-to-end unification of availability (number of 9s) calculations across Lambda's data center products and various data center footprints, from the power/BMS/cooling and down into the rack/GPU level, and provide adequate telemetry back to facilities, site operations, and at the platform level Work with thermal/hardware team to understand AI workload characteristics on mechanical systems and need for different BMS control methodologies as Direct to Liquid Chip (DLC) Cooling technologies improve and densities increase Coordinate across DC Infra team to calculate estimated availabilities for new data center designs Work with product teams and capacity forecasting to understand how design decisions affecting availability impact time to market and satisfy customer needs Utilization Analysis and Oversubscription Strategy Own end-to-end utilization analysis across Lambda's entire data center infrastructure Analyze DC designs to understand peak possible capacity under varying conditions Build oversubscription strategy and lead/own company workstream to maximize available MW without impacting GPU reliability and customer experience Ensure appropriate availability considerations are included Observability and Analytics Coordinate with the observability team to ensure appropriate points are monitored to understand data center load characteristics, especially under AI workloads Help the team understand where approximate warning/danger levels are Use observations and warning/danger levels to inform BOD for future Data Centers and suggest upgrades/modifications to current Data Centers Develop strategy for a data center fleet health dashboard Help provide structure ensuring overall day-to-day and long-term health can be understood from a 20k-foot level with the ability to drill down into the details Power Capping Strategy and Implementation Coordinate with Site Operations team to strategize and build out power capping capabilities related to worst-case scenario response/protection as we start aggressively employing oversubscription Identify appropriate IT blocks where real-time data is monitored Analyze, propose, and implement a rigorous testing process that iteratively finds and eliminates stranded power and cooling capacity related to utilization 
Site Selection Technical Review Conduct end-to-end technical evaluations of prospective data center sites, including power sufficiency and stability, cooling infrastructure and mechanical systems, and network topology feasibility. Perform risk assessments and recommend sites based on infrastructure fit and growth capacity. Coordinate with DC Infra, Legal, and Business Strategy teams to ensure site selections align with workload and deployment timelines. Cluster-to-Facility Requirements Alignment Collaborate with HPC Architecture team and Capacity Manager to translate cluster-level hardware and workload requirements into facility-level specifications. Define infrastructure interface requirements (power, cooling, rack layouts, interconnects, monitoring) to ensure alignment between compute stack and facility capabilities. Support long-term infrastructure roadmap development to accommodate future hardware designs, density shifts, and workload patterns. Work with Capacity Manager to understand various levers that can be employed to accelerate growth during demand surges. 
You Self-starter with a proven ability to independently dive into the details to understand and solve hard problems across data center infrastructure and operations Ability to provide world-class analysis, boiling complex issues down to the root cause or a few key drivers 10+ years of experience working directly in or closely with data center infrastructure and HPC/HW operations Deep familiarity with AI or compute workload patterns, scaling dynamics, and infrastructure cost drivers Ability to synthesize complex technical and business inputs into clear, actionable strategic recommendations Excellent communication and collaboration skills across technical, operational, and financial stakeholders Preferred Experience Prior experience in hyperscale or cloud infrastructure environments Familiarity with GPU cluster sizing, workload forecasting, or energy-efficient compute architectures Working knowledge of typical Data Center Infrastructure designs, topologies, systems and associated reliability/availability calculations Knowledge of DCIM tools, telemetry systems, or utilization analytics platforms Engineering degree from a university, Master's preferred. Experience working across multi-disciplinary and non-technical teams to explain findings Salary Range Information The annual salary range for this position has been set based on market data and other factors. However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description. About Lambda Founded in 2012, ~400 employees (2025) and growing fast We offer generous cash & equity compensation Our investors include Andra Capital, SGW, Andrej Karpathy, ARK Invest, Fincadia Advisors, G Squared, In-Q-Tel (IQT), KHK & Partners, NVIDIA, Pegatron, Supermicro, Wistron, Wiwynn, US Innovative Technology, Gradient Ventures, Mercato Partners, SVB, 1517, Crescent Cove. 
We are experiencing extremely high demand for our systems, with quarter over quarter, year over year profitability Our research papers have been accepted into top machine learning and graphics conferences, including NeurIPS, ICCV, SIGGRAPH, and TOG Health, dental, and vision coverage for you and your dependents Wellness and Commuter stipends for select roles 401k Plan with 2% company match (USA employees) Flexible Paid Time Off Plan that we all actually use A Final Note: You do not need to match all of the listed expectations to apply for this position. We are committed to building a team with a variety of backgrounds, experiences, and skills. Equal Opportunity Employer Lambda is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.

Data Center Infrastructure
HPC Operations
Availability Analysis
Utilization Analysis
Telemetry and Observability
Power Capping Strategy
AWS Lambda
Cloud Architecture
Node.js
JavaScript
SQL
Python
Direct Apply
Posted 6 months ago
Lambda

Engineering Project Manager - Infrastructure

LambdaAnywhereFull-time
View Job
Compensation$140K - 180K a year

Lead and manage cross-functional infrastructure software engineering projects, ensuring delivery on time, within budget, and to quality standards while collaborating with stakeholders and improving processes. | 10+ years in software engineering with 7+ years in project management focused on software infrastructure, strong Agile knowledge, leadership skills, and experience with project management tools. | We're here to help the smartest minds on the planet build Superintelligence. The labs pushing the edge? They run on Lambda. Our gear trains and serves their models, our infrastructure scales with them, and we move fast to keep up. If you want to work on massive, world-changing AI deployments with people who love action and hard problems, we're the place to be. If you'd like to build the world's best deep learning cloud, join us. • Note: This position requires presence in our San Francisco office location 4 days per week; Lambda’s designated work from home day is currently Tuesday. In the world of distributed AI, raw GPU and CPU horsepower is just a part of the story. High-performance networking and storage are the critical components that enable and unite these systems, making groundbreaking AI training and inference possible. The Lambda Infrastructure Engineering organization forges the foundation of high-performance AI clusters by welding together the latest in AI storage, networking, GPU and CPU hardware. Our expertise lies at the intersection of: • High-Performance Distributed Storage Solutions and Protocols: We engineer the protocols and systems that serve massive datasets at the speeds demanded by modern clustered GPUs. • Dynamic Networking: We design advanced networks that provide multi-tenant security and intelligent routing without compromising performance, using the latest in AI networking hardware. 
• Compute Virtualization: We enable cutting-edge virtualization and clustering that allows AI researchers and engineers to focus on AI workloads, not AI infrastructure, unleashing the full compute bandwidth of clustered GPUs. We are seeking an experienced, technical, and outcome-focused Infrastructure Software Engineering Project Manager to help our team deliver cutting-edge, scalable, and reliable AI infrastructure solutions. The ideal candidate will have a deep understanding of software development lifecycles, familiarity with infrastructure technologies, and be an expert at application of project management methodologies. This role requires exceptional leadership skills, a passion for technology, and the ability to drive complex projects to successful completion in a fast-paced, dynamic environment. What You’ll Do • Project Leadership: Guide a cross-functional team of infrastructure software engineers, storage engineers, and networking engineers, fostering a culture of innovation, collaboration, and accountability. • Strategic Planning: Define project scope, goals, and deliverables in collaboration with engineering management and stakeholders. Develop comprehensive project plans, including timelines, resource allocation, and budget. • Execution and Delivery: Oversee the entire project lifecycle, from ideation to deployment. Ensure projects are delivered on time, within budget, and to the highest quality standards. • Technical Expertise: Possess a strong understanding of infrastructure technologies, including cloud platforms (e.g., AWS, Azure, GCP), containerization (e.g., Docker, Kubernetes), CI/CD pipelines, and automation tools. • Risk Management: Proactively identify, assess, and mitigate project risks and issues. Develop contingency plans and communicate status to stakeholders. • Stakeholder Communication: Serve as the primary point of contact for all project-related communications. 
Provide regular updates to stakeholders, including executive leadership, on project progress, risks, and key milestones. • Process Improvement: Continuously evaluate and improve project management processes and tools to enhance team efficiency and effectiveness. • Collaboration: Work closely with cross-functional teams, including product management, operations, and security, to ensure alignment and successful project integration. • Customer Focus: Work closely with engineering management to scope customer requirements. You • 10+ years of experience in the software engineering industry, with 7+ years in a project management role focused on software engineering and software infrastructure projects. • Demonstrated experience leading a team of software engineers on complex, cross-functional projects in a fast-paced startup environment. • Bachelor's degree in Computer Science, Engineering, or a related technical field. • Proven track record of successfully leading and delivering complex technical projects. • Strong knowledge of Agile and Scrum methodologies. • Exceptional leadership, communication, and interpersonal skills, with the ability to lead teams from a disorganized starting point to orderly execution. • Ability to thrive in a fast-paced, high-pressure environment and manage multiple projects simultaneously. • Impeccable writing and documentation skills. • Experience with project management software (e.g., Jira, Asana, Trello). Nice to Have • Experience managing hybrid hardware deployment and software engineering projects. • Experience in a hyperscaler (CSP), neocloud provider (NCP), or high-performance computing (HPC) production environment. • Experience working closely with product managers to deliver products to specification. • Deep understanding of infrastructure technologies and software development best practices. Salary Range Information The annual salary range for this position has been set based on market data and other factors.
However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description. About Lambda • Founded in 2012, ~400 employees (2025) and growing fast • We offer generous cash & equity compensation • Our investors include Andra Capital, SGW, Andrej Karpathy, ARK Invest, Fincadia Advisors, G Squared, In-Q-Tel (IQT), KHK & Partners, NVIDIA, Pegatron, Supermicro, Wistron, Wiwynn, US Innovative Technology, Gradient Ventures, Mercato Partners, SVB, 1517, Crescent Cove. • We are experiencing extremely high demand for our systems, with quarter over quarter, year over year profitability • Our research papers have been accepted into top machine learning and graphics conferences, including NeurIPS, ICCV, SIGGRAPH, and TOG • Health, dental, and vision coverage for you and your dependents • Wellness and Commuter stipends for select roles • 401k Plan with 2% company match (USA employees) • Flexible Paid Time Off Plan that we all actually use A Final Note: You do not need to match all of the listed expectations to apply for this position. We are committed to building a team with a variety of backgrounds, experiences, and skills. Equal Opportunity Employer Lambda is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.

Project Management
Infrastructure Technologies
Cloud Platforms (AWS, Azure, GCP)
Containerization (Docker, Kubernetes)
CI/CD Pipelines
Automation Tools
Leadership
Technical Documentation
Cross-functional Team Leadership
Verified Source
Posted 6 months ago
Lambda

Senior Tax Analyst

Lambda | Anywhere | Full-time
Compensation: Not specified

Perform month-end and year-end close activities, including journal entries, accruals, reconciliations, and analysis. Conduct analysis and review of tax data to identify discrepancies, variances, and trends. | Bachelor's degree in accounting, finance, or a related field is required, along with 3-5 years of related experience. Strong knowledge of US and International sales and use tax laws, regulations, and compliance requirements is essential. | Lambda is the #1 GPU Cloud for ML/AI teams training, fine-tuning and inferencing AI models, where engineers can easily, securely and affordably build, test and deploy AI products at scale. Lambda’s product portfolio includes on-prem GPU systems, hosted GPUs across public & private clouds and managed inference services – servicing government, researchers, startups and enterprises worldwide. If you'd like to build the world's best deep learning cloud, join us. • Note: This position requires presence in our San Francisco/San Jose office location 4 days per week; Lambda’s designated work from home day is currently Tuesday. What You’ll Do • Perform month-end and year-end close activities, including journal entries, accruals, reconciliations, and analysis • Conduct analysis and review of tax data to identify discrepancies, variances, and trends. Document findings for audit trail • Prepare workpapers and calculations for tax provision and financial reporting • Manage estimated tax payments and data for tax compliance • Assist in filing of sales tax, property tax and VAT returns across multiple states • Maintain organized records of filings, payments, and correspondence • Demonstrate ownership of tax engines, ERP systems, and underlying data. Support enhancements and implementations.
• Manage calendars and due dates for compliance and operations • Collaborate with cross-functional teams to promote best practices • Monitor changes in legislation and evaluate impact • Exercise discretion and maintain confidentiality of the Company’s financial information You • Bachelor's degree in accounting, finance, or a related field. • 3-5 years of related experience in public accounting or in the internal tax department of a US multi-state corporation • Proficient in using ERP systems and analyzing data • Advanced Excel skills for data wrangling and analysis • Strong knowledge of US and International sales and use tax laws, regulations, and compliance requirements. • Excellent attention to detail and organizational skills to manage multiple projects and deadlines simultaneously. • Possess intellectual curiosity and willingness to learn • Strong analytical and problem-solving skills, with the ability to apply rules to practical scenarios. • Effective communication and interpersonal skills to collaborate with internal teams and external stakeholders. • High ethical standards and the ability to handle confidential and sensitive information with integrity. Nice to Have • Experience in the machine learning or computer hardware industry. • NetSuite experience (or other large ERP system experience) and Avalara • CPA license or aspiration Salary Range Information The annual salary range for this position has been set based on market data and other factors. However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description. About Lambda • Founded in 2012, ~400 employees (2025) and growing fast • We offer generous cash & equity compensation • Our investors include Andra Capital, SGW, Andrej Karpathy, ARK Invest, Fincadia Advisors, G Squared, In-Q-Tel (IQT), KHK & Partners, NVIDIA, Pegatron, Supermicro, Wistron, Wiwynn, US Innovative Technology, Gradient Ventures, Mercato Partners, SVB, 1517, Crescent Cove.
• We are experiencing extremely high demand for our systems, with quarter over quarter, year over year profitability • Our research papers have been accepted into top machine learning and graphics conferences, including NeurIPS, ICCV, SIGGRAPH, and TOG • Health, dental, and vision coverage for you and your dependents • Wellness and Commuter stipends for select roles • 401k Plan with 2% company match (USA employees) • Flexible Paid Time Off Plan that we all actually use A Final Note: You do not need to match all of the listed expectations to apply for this position. We are committed to building a team with a variety of backgrounds, experiences, and skills. Equal Opportunity Employer Lambda is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.

Accounting
Finance
Tax Compliance
Data Analysis
ERP Systems
Excel
Sales Tax
Property Tax
VAT Returns
Analytical Skills
Problem-Solving
Communication
Interpersonal Skills
Confidentiality
Attention to Detail
Organizational Skills
Direct Apply
Posted 6 months ago
