Find your dream job faster with JobLogr
AI-powered job search, resume help, and more.
Try for Free
Tek Leaders Inc

Tek Leaders Inc

via LinkedIn

Apply Now
All our jobs are verified from trusted employers and sources. We connect to legitimate platforms only.

Site Reliability Engineer – SRE Lead

Anywhere
contractor
Posted 9/2/2025
Verified Source
Key Skills:
Site Reliability Engineering
DevOps
Python
Go
Bash
Linux/Unix system administration
Cloud platforms (AWS, Azure, GCP)
Terraform
Ansible
CloudFormation
Prometheus
Grafana
ELK
Datadog
Incident management
CI/CD pipelines
Docker
Kubernetes
Mentoring and leadership
APM tools (Dynatrace)
Device protocols and software decoding

Compensation

Salary Range

$140K - 180K a year

Responsibilities

Lead a team of SREs to ensure reliability, scalability, and performance of critical systems through automation, incident management, and strategic planning.

Requirements

Proven SRE or DevOps leadership experience with strong programming, Linux system administration, cloud architecture, infrastructure automation, monitoring, incident management, container orchestration, and mentoring skills.

Full Description

Role Name: Site Reliability Engineer – SRE Lead Location: Remote - OH Duration: Long Term Role Description: As a Site Reliability Engineer – Lead, you will drive the reliability, scalability, and performance of mission-critical systems and services while leading a team of SREs. This role combines deep technical expertise with leadership, mentoring, and strategic planning. You will set standards for operational excellence, guide incident response, and foster a culture of automation and continuous improvement. Collaboration with engineering, operations, and product teams is essential to align reliability initiatives with business objectives and ensure seamless service delivery. REQUIRED SKILL: • Proven experience in site reliability, DevOps, or systems engineering, with prior leadership or team lead responsibilities • Strong programming/scripting skills (e.g., Python, Go, Bash, or similar) • Deep expertise in Linux/Unix system administration and networking • Experience architecting and operating cloud platforms (AWS, Azure, GCP) • Proficiency with infrastructure-as-code and automation tools (e.g., Terraform, Ansible, CloudFormation) • Advanced knowledge of monitoring, logging, and alerting solutions (e.g., Prometheus, Grafana, ELK, Datadog) • Demonstrated incident management and root cause analysis skills • Experience designing and implementing CI/CD pipelines • Strong understanding of containerization and orchestration (Docker, Kubernetes) • Ability to define and enforce reliability, scalability, and security best practices • Excellent communication, stakeholder management, and collaboration skills • Experience mentoring, coaching, and developing SRE or engineering teams • Strong hands-on knowledge to define business process dashboards in APM tools like dynatrace with SLA, ALO and SLI definition, design and implementation as part of observability. • Experience with devices like Scanner, POS Devices, Peripheral devices (includes On device memory based devices) • Experience with Hardcoded protocols and software for devices and should be able to decode and run.

This job posting was last updated on 9/3/2025

Ready to have AI work for you in your job search?

Sign-up for free and start using JobLogr today!

Get Started »
JobLogr badgeTinyLaunch BadgeJobLogr - AI Job Search Tools to Land Your Next Job Faster than Ever | Product Hunt