Find your dream job faster with JobLogr
AI-powered job search, resume help, and more.
Try for Free
HC

Health Catalyst

via Workday

All our jobs are verified from trusted employers and sources. We connect to legitimate platforms only.

Lead Site Reliability Engineer

Anywhere
Full-time
Posted 12/8/2025
Direct Apply
Key Skills:
Google Cloud Platform (GCP)
Google Kubernetes Engine (GKE)
Kubernetes
Terraform
CI/CD (Jenkins, GitLab CI/CD)
Python scripting
Bash scripting
Cloud cost optimization
Networking fundamentals
Security and compliance (HIPAA, SOC II)

Compensation

Salary Range

$120K - 160K a year

Responsibilities

Design, implement, and operate scalable cloud infrastructure on GCP with a focus on Kubernetes, automate CI/CD pipelines, ensure system reliability and security, and mentor team members.

Requirements

5-7 years in DevOps/SRE or Cloud Infrastructure with deep GCP and Kubernetes experience, CI/CD pipeline management skills, scripting ability, and knowledge of security and compliance frameworks.

Full Description

Join one of the nation’s leading and most impactful health care performance improvement companies. Over the years, Health Catalyst has achieved and documented clinical, operational, and financial improvements for many of the nation’s leading healthcare organizations. We are also increasingly serving international markets. Our mission is to be the catalyst for massive, measurable, data-informed healthcare improvement through: Data: integrate data in a flexible, open & scalable platform to power healthcare’s digital transformation Analytics: deliver analytic applications & services that generate insight on how to measurably improve Expertise: provide clinical, financial & operational experts who enable & accelerate improvement Engagement: attract, develop and retain world-class team members by being a best place to work Role: Lead Site Reliability Engineer Team: Technology Location: US remote Travel: none anticipated **This position is currently not eligible for visa sponsorship** Job Summary As a DevOps / Site Reliability Engineer, you’ll help shape and sustain the infrastructure behind Armus, a core Health Catalyst platform that drives outcomes for clinicians and patients across the country. You’ll work closely with software engineers and product teams to design, automate, and operate the cloud environments that power our clinical registries and analytics solutions. This is a high-visibility, high-expectation role for someone who thrives on accountability, loves solving complex system problems, and wants to grow within a cross-product SRE group that spans multiple technologies and teams. You’ll ship improvements weekly, automate relentlessly, and end each day knowing your work improves healthcare outcomes. If You Love Building reliable, scalable systems that stay up and perform under pressure Taking a half-defined problem and driving it to a clean, measurable solution Balancing speed and safety through automation, testing, and disciplined process Mentoring others, reviewing code, and strengthening DevOps culture Working across application, infrastructure, and security boundaries to make systems better every week Then this role will fit you perfectly. What You’ll Own Cloud Infrastructure (Google Cloud Platform Focus) Design, implement, and operate scalable, secure, and resilient infrastructure on Google Cloud Platform (GCP), with a heavy focus on Google Kubernetes Engine (GKE) Apply best practices in container orchestration, networking, IAM, and workload identity Lead cloud cost optimization, capacity planning, and efficient scaling initiatives Manage infrastructure as code using Terraform or similar tools While GCP is the core environment, equivalent experience with AWS or Azure will be considered CI/CD and Automation Build and maintain CI/CD pipelines using Jenkins or GitLab CI/CD Ensure reliable deployment flows across development, staging, and production environments Implement automated checks and rollback mechanisms for safe, repeatable releases Reliability, Monitoring, and Incident Response Implement and refine observability using Sentry, Sumo Logic, and GCP Cloud Monitoring and Logging Participate in the on-call rotation, respond quickly to operational issues, and drive long-term fixes Collaborate with customer success and support teams to quantify and resolve production impact Identify reliability risks early, automate detection and recovery, and reduce manual toil Security and Compliance Apply and maintain least-privilege IAM policies and secure configuration baselines Partner with InfoSec to remediate vulnerabilities and support HIPAA and SOC II audit readiness Contribute to incident response readiness and disaster recovery testing Collaboration and Continuous Improvement Engage with the cross-product SRE squad to learn and contribute across multiple Health Catalyst platforms Help standardize SRE best practices, tooling, and documentation Mentor teammates and continuously raise the bar for reliability and automation What You Bring 5–7 years of experience in DevOps, SRE, or Cloud Infrastructure Engineering Deep expertise in GCP, especially GKE Experience with other major clouds (AWS or Azure) is a plus Strong working knowledge of Kubernetes and containerized deployments Proven experience with CI/CD tools such as Jenkins or GitLab Scripting experience in Python, Bash, or similar languages Solid understanding of networking, security, and performance fundamentals Hands-on experience with cloud cost management and optimization Calm under pressure with strong troubleshooting and communication skills Nice to Have Exposure to healthcare data or interoperability standards such as FHIR, HL7, or CDA Familiarity with healthcare security and compliance frameworks like HIPAA and SOC II Experience in Agile or Scrum software development environments Background supporting SaaS or multi-tenant systems What Success Looks Like Platform uptime consistently meets or exceeds SLA targets Deployments are automated, low-risk, and frequent Reliability metrics improve quarter over quarter Infrastructure costs are measured, optimized, and trending down The Armus platform is seen as a model for SRE practices across Health Catalyst Why This Role Matters This role is central to the continued growth and reliability of the Armus platform. The person in this seat won’t just maintain systems—they’ll shape how Health Catalyst operates cloud infrastructure at scale. You’ll drive uptime, automation, and cost efficiency while influencing SRE practices company-wide. If you want to do meaningful engineering work with real impact and high expectations, this is the opportunity. Information Security and Compliance Responsibilities: Maintain compliance with training directives required by the organization pertaining to Information Security, Acceptable Use Policy and HIPAA Privacy and Security. Adhere to and comply with the organizations Acceptable Use Policy. Safeguard information system assets by identifying and reporting potential and actual security events to the organizations Security and Compliance Officers. The above statements describe the general nature and level of work being performed in this job function. They are not intended to be an exhaustive list of all duties, and indeed additional responsibilities may be assigned by Health Catalyst. Studies show that candidates from underrepresented groups are less likely to apply for roles if they don’t have 100% of the qualifications shown in the job posting. While each of our roles have core requirements, please thoughtfully consider your skills and experience and decide if you are interested in the position. If you feel you may be a good fit for the role, even if you don’t meet all of the qualifications, we hope you will apply. If you feel you are lacking the core requirements for this position, we encourage you to continue exploring our careers page for other roles for which you may be a better fit. At Health Catalyst, we appreciate the opportunity to benefit from the diverse backgrounds and experiences of others. Because of our deep commitment to respect every individual, Health Catalyst is an equal opportunity employer. Health Catalyst has been named as one of the 30 Best Workplaces in Technology by Fortune Magazine and a winner of Gallup Great Workplace award. Health Catalyst earned the highest overall score in Healthcare BI by KLAS and, for the sixth year in a row, was named to the Best Places to Work in Healthcare list by Modern Healthcare. Health Catalyst is a leading provider of data and analytics technology and services to healthcare organizations, and is committed to being the catalyst for massive, measurable, data-informed healthcare improvement. As of December 31, 2019, Health Catalyst served greater than 125 customers including academic medical centers, integrated delivery networks, community hospitals, large physician practices, ACOs, health information exchanges, health insurers, and other risk-bearing entities. Its customers leverage the cloud-based data platform—powered by data from more than 100 million patient records and encompassing trillions of facts—as well as its analytics software and professional services expertise to make data-informed decisions and realize measurable clinical, financial and operational improvements. Health Catalyst envisions a future in which all healthcare decisions are data informed. Learn more about working at Health Catalyst here. At Health Catalyst, we celebrate and are committed to diversity and inclusion across all dimensions, including but not limited to race, gender, identity, age, and religion. Our four team member resource groups include: Shades, Queers & Allies, Women Empowered, InspireAsian, Neurodivergent, and Veterans. We offer best in class benefits that encourage ownership and inclusion: mentoring and sponsorship programs, remote-work friendliness, career development, company equity, and flexible PTO.

This job posting was last updated on 12/9/2025

Ready to have AI work for you in your job search?

Sign-up for free and start using JobLogr today!

Get Started »
JobLogr badgeTinyLaunch BadgeJobLogr - AI Job Search Tools to Land Your Next Job Faster than Ever | Product Hunt