via Lever.co
$0K - 0K a year
Design, build, and maintain resilient, high-performance distributed systems, develop automation tools, and improve operational efficiency.
5+ years in software engineering or DevOps, proficiency in Go and Python, experience with Kubernetes, cloud, distributed systems, observability tools, and troubleshooting.
This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Senior Site Reliability Engineer in Texas. The Senior Site Reliability Engineer will play a critical role in enhancing the reliability, scalability, and performance of large-scale distributed systems. This role blends infrastructure expertise with software development, focusing on automation, observability, and proactive risk mitigation. You will collaborate closely with engineering teams to optimize platforms, improve operational efficiency, and ensure high availability for millions of users. The ideal candidate thrives in high-traffic environments, contributes to open-source projects, and enjoys solving complex technical challenges while mentoring peers and improving team practices. This position offers the opportunity to make a tangible impact on highly visible systems and services. \n Accountabilities: Collaborate with engineering teams to design, build, and maintain resilient, high-performance systems. Enhance infrastructure and platform services to support deployment, observability, and operational excellence. Develop automation tools to reduce manual tasks, mitigate risks, and improve engineering efficiency. Monitor, troubleshoot, and optimize network, system, and service-level performance. Participate in sustainable incident response, conducting blameless postmortems and implementing improvements. Contribute upstream to open-source projects and implement best practices for scalability and reliability. Share on-call responsibilities to ensure continuous system availability and performance. Requirements: 5+ years of experience in Software Engineering, Site Reliability Engineering, or a development-focused DevOps role. Proficiency in one or more programming languages, preferably Go and Python. Experience with Kubernetes, cloud systems, and distributed systems development. Familiarity with observability and monitoring tools such as Prometheus, Thanos, Grafana, Vector, Clickhouse, Otel, and Loki. Strong skills in debugging, optimizing code, and troubleshooting across applications, networking (TCP/IP), and systems. Solid working knowledge of Linux and containerization technologies. Excellent collaboration, communication, and problem-solving abilities. Benefits: Comprehensive healthcare coverage including medical, dental, and vision. 401(k) program with employer matching. Home office setup support and remote workspace benefits. Personal and professional development funds. Flexible vacation policies and global wellness days. Paid parental leave and family planning support. Paid volunteer time off. Equity opportunities in the form of restricted stock units. \n Why Apply Through Jobgether? We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team. We appreciate your interest and wish you the best! Why Apply Through Jobgether? Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time. #LI-CL1
This job posting was last updated on 12/19/2025