via LinkedIn
$200K - $250K a year
Design, implement, and optimize large-scale AWS-based systems, automate operations, and ensure high availability and security.
8+ years of SRE experience, strong AWS, Python, SQL skills, and experience with CI/CD, monitoring, and security in large-scale environments.
Are you a Site Reliability Engineer working at a Large Financial Institution and being told by your leadership that you are too hands-on, too detail-oriented, or that you think and work like a start-up? Imagine working at Intellibus to engineer platforms that impact billions of lives around the world. With your passion and focus, we will accomplish great things together!

We are looking forward to you joining our Platform Engineering Team. Our Platform Engineering Team is working to solve the Multiplicity Problem. We are trusted by some of the most reputable and established FinTech Firms. Recently, our team has spearheaded the Conversion & Go Live of apps which support the backbone of the Financial Trading Industry.

We are looking for Engineers who can:
• Design, implement, and optimize AWS-based systems to ensure high availability, scalability, and fault tolerance.
• Develop automation scripts and tools in Python to reduce manual intervention and improve operational efficiency.
• Design and optimize SQL databases, ensuring performance, scalability, and data integrity.
• Build and maintain infrastructure as code (IaC) with Terraform/CloudFormation for consistent deployments.
• Collaborate with development teams to establish best practices for monitoring, logging, and observability (CloudWatch, Splunk, ELK, Datadog, New Relic).
• Participate in on-call rotations, troubleshoot incidents, conduct root cause analysis, and document post-incident reviews.
• Implement and maintain security best practices and compliance requirements in the AWS environment.
• Support CI/CD pipelines (Jenkins, PCF, ECS, Lambda) to improve software delivery reliability.
• Contribute to capacity planning and performance optimization for growing workloads.
• Stay current with emerging technologies, recommending new tools and practices.
• Document designs and work thoroughly, create roadmaps with milestones, and manage priorities in Jira.

Key Skills & Qualifications
• 8+ years of SRE experience in large-scale environments.
• 8+ years of hands-on Python, SQL, and AWS experience.
• Strong expertise with REST APIs and service integration.
• Proven experience in:
  • Database Design & Data Integration
  • AWS Services: S3, VPC, ECS, Lambda, CloudWatch
  • CI/CD Pipelines: Jenkins, PCF
  • Monitoring & Logging: Splunk, ELK, Datadog, New Relic, Wavefront
  • Infrastructure as Code (Terraform, CloudFormation)
  • Bash scripting & automation
• Strong communication, collaboration, and problem-solving skills.

We work closely with
• AWS S3
• Database Design
• Data Integration
• Jenkins
• Splunk / ELK
• Amazon VPC
• Datadog / New Relic / Wavefront
• PCF
• CI/CD
• ECS
• Lambda

Our Process
• Schedule a 15 min Video Call with someone from our Team
• 4 Proctored GQ Tests (< 2 hours)
• 30-45 min Final Video Interview
• Receive Job Offer

If you are interested in reaching out to us, please apply and our team will contact you within the hour.
This job posting was last updated on 12/23/2025