Site Reliability Engineer

Charlotte, North Carolina, United States
Up to $170,000/year
Site Reliability Engineer, DevOps Engineer, Infrastructure Engineer, Platform Engineer
Actively hiring

hackajob is partnering with mthree to fill this position. Create a profile to be automatically considered for this role—and others that match your experience.

 

**Looking for local candidates**

Want to work in technology in the financial industry?

Our client is seeking a highly motivated Site Reliability Engineer responsible for ensuring reliability, scalability, and performance of large-scale systems and applications. The role blends software engineering, infrastructure engineering, and production support, with a strong focus on automation and observability.

About mthree:

Since 2010, mthree has been helping clients solve their business and technological challenges. We are a technology and business consultancy with a global workforce delivering significant business and IT projects in some of the largest financial services organizations worldwide.

Core Services:

  • Consulting and Advisory
  • Managed Services
  • Alumni Graduate Program
  • Alumni Pro Program

We have a global presence and are experts in delivering exceptional quality to our client base, providing consulting services across Risk, Regulation & Compliance; Vendor Products; Application Support; Application Development; Cyber & Information Security; Data Science and DevOps areas.

Our Expert program offers experienced professionals access to top roles in tech, finance, aviation and insurance. Join us to work on groundbreaking technology projects, from international trading platforms to critical applications for leading airlines. We recruit professionals who are eager to fast-track their careers in technology or operations within prestigious global organizations.

Key Responsibilities

1. Reliability & Production Ownership

  • Define and track service reliability goals (SLIs/SLOs) across applications
  • Ensure high availability, scalability, and performance of systems
  • Own production issues end-to-end and ensure problems do not recur

2. Observability & Monitoring

  • Design monitoring, logging, and tracing systems (dashboards, alerts)
  • Enhance operational visibility into platform performance
  • Evaluate and improve monitoring coverage for new releases

3. Automation & Efficiency (Toil Reduction)

  • Automate manual operational tasks and workflows
  • Build tools/software to reduce “toil” and improve efficiency
  • Implement CI/CD pipelines and automation frameworks

4. Incident Management & Root Cause Analysis

  • Participate in major incident triage and troubleshooting
  • Identify and resolve root causes of complex outages
  • Collaborate with problem management teams to prevent recurrence

5. Collaboration Across Teams

  • Work closely with software engineering, infrastructure, and architecture teams
  • Influence adoption of reliable design patterns and best practices
  • Drive early integration of non-functional requirements (reliability, scalability) 

6. Performance & Capacity Planning

  • Identify bottlenecks, capacity constraints, and vulnerabilities 
  • Optimize system performance and cost efficiency
  • Plan for growth and scaling needs

Required Qualifications

  • Approximately 10–15+ years of experience in SRE, software engineering, or infrastructure engineering
  • Strong experience with cloud platforms (AWS/Azure) 
  • Proven experience supporting large-scale distributed systems
  • Programming: Python, Java, or .NET 
  • DevOps: CI/CD tools (Jenkins, Git), GitOps
  • Observability: Splunk, Prometheus, Grafana, Dynatrace 
  • Systems: Linux/Unix, networking, load balancing, DNS 
  • Service Level Indicators (SLIs) and Service Level Objectives (SLOs)
  • Error budgets and reliability engineering practices 
  • Incident response and resiliency engineering
  • Strong collaboration and stakeholder management
  • Ability to lead initiatives and influence engineering culture
  • Problem-solving in high-pressure production environments

At mthree, our values support courageous teammates, needle movers, and learning champions, all while striving to support the health and well-being of all employees. We take great pride in celebrating the diversity of each individual who contributes to making mthree the company it is today and will be in the future. We value diversity both within mthree and with our partner companies, and we're proud to provide an environment where all our colleagues can flourish. That means promoting a strong culture of equality and, most importantly, inclusion.

We are committed to fair, transparent pay, and we strive to provide competitive compensation in addition to a comprehensive benefits package. The base pay rate for this position is $140,000 to $170,000 USD per year.

This pay rate represents mthree's good faith and reasonable estimate of the base pay for this role at the time of posting, based on the locations listed in the job advertisement. It is anticipated that qualified candidates selected for a placement will receive this pay rate as a starting salary once onsite with the mthree client; however, the ultimate salary offered for this role may be higher or lower and will be set based on a variety of non-discriminatory factors, including but not limited to geographic location, skills, and competencies.

Applicants must be currently authorized to work in the United States on a full-time basis. The Company will not sponsor applicants for work visas. 
