$90K - 130K a year
Troubleshoot and optimize Python scripts and SQL queries, maintain and enhance internal monitoring tools, and collaborate cross-functionally to ensure system stability and performance.
5+ years experience in Python and SQL development, expertise in Grafana, strong debugging skills, experience with CI/CD and Agile, and willingness to participate in incident response.
We are partnered with our direct client in search of a SQL/Python Developer (III) to join their Operations team. Direct Hire (No C2C or third-party submissions) Location: Addison 75001 Schedule: Fully remote but would prefer someone who lives in the DFW area so that they come in for meetings occasionally Interview process: 2 rounds of virtual interviews The SQL Developer III is a senior-level engineer responsible for ensuring the stability, performance, and reliability of internal applications, automations, and data infrastructure. This role focuses on diagnosing system failures, deploying production fixes, and reinforcing the integrity of code and data pipelines that support core operations. Working cross-functionally with incident managers, DevOps, analysts, and QA teams, the SQL Developer delivers rapid solutions to urgent issues while contributing to long-term system resilience and scalability. This position demands a strong technical foundation, a problem-solving mindset, and the ability to thrive in a fast-paced, collaborative environment. Needs: • 5+ years of hands-on experience in Python, SQL, and automation development, with exposure to both application and data workflows. • Proven expertise in Grafana, including building dashboards, setting up alerts, and monitoring system performance. • Strong debugging and troubleshooting capabilities across the technology stack—Python scripts, SQL queries, stored procedures, data pipelines, and system integrations. • Advanced proficiency in relational databases, including writing complex queries, optimizing stored procedures, and tuning performance. • Experience supporting production systems and automation in dynamic, real-time environments. • Familiarity with observability and log analysis tools such as Grafana or the ELK stack. • Solid understanding of ETL/ELT pipelines, internal tool development, and scripting business logic. • Experience with CI/CD pipelines, version control systems (e.g., Git), Agile methodologies, and standard software development lifecycle practices. • Strong communication and documentation skills, including the creation of runbooks and postmortem reports. • Experience with Snowflake or similar cloud-based data platforms is a plus. • Exposure to Azure environments is a plus. • Willingness to participate in after-hours incident response when necessary. • Ability and motivation to learn and take ownership of end-to-end business processes critical to the role’s success. Preferred: • Revenue Cycle Management (RCM) experience Duties: Production Support & Break/Fix Engineering • Troubleshoot and resolve issues in production automations, data workflows, application services, and ETL pipelines. • Debug Python scripts, SQL queries, stored procedures, and internal tools to identify and address root causes. • Participate in live incident response, triage sessions, and post-incident reviews to ensure timely resolution and continuous improvement. Script & Query Remediation • Modify and optimize Python scripts, SQL queries, and data transformations to fix failures or improve performance. • Validate and deploy hotfixes through peer-reviewed, version-controlled CI/CD pipelines. • Ensure all code changes meet standards for accuracy, integrity, and rollback safety. Tool & System Maintenance • Maintain and enhance internal tools for monitoring job health, detecting data anomalies, tracking exceptions, and routing business logic. • Implement logging, alerting, retry mechanisms, and metadata tracking to improve system observability and reliability. • Reduce technical debt by refactoring error-prone components and improving code quality. Cross-Team Diagnostics • Collaborate with analysts and incident managers to trace failures, reproduce bugs, and investigate data discrepancies. • Provide root cause analysis for both application and data-related incidents, supporting teams in understanding system behavior. Documentation & Knowledge Sharing • Record root causes, resolution details, and response timelines for major incidents. • Develop and maintain shared documentation, runbooks, and remediation guides in Confluence. • Standardize knowledge transfer processes for recurring failure patterns and resolution workflows. Deployment Support & Compliance • Coordinate with DevOps and QA teams to safely deploy hotfixes and data corrections. • Ensure all deployments adhere to CI/CD protocols, change control procedures, and rollback safeguards.
This job posting was last updated on 8/29/2025