JOB DESCRIPTIONWe have an exciting and rewarding opportunity for you to advance your site reliability engineering career. Join us to design and deliver trusted technology solutions that support the worldâs most complex systems. You will be part of a collaborative team focused on reliability, scalability, and automation. Experience career growth and the benefits of working with industry leaders.
As a Lead Site Reliability Engineer at JPMorgan Chase within the Chief technology Office team, you apply your skills to solve complex business problems with simple, scalable solutions. You configure, maintain, monitor, and optimize applications and infrastructure, driving improvements through code and automation. You contribute to team knowledge on end-to-end operations, reliability, and scalability. You play a key role in supporting the firmâs business objectives.
Job responsibilities
- Guide and assist others in building designs and gaining consensus from peers
- Collaborate with software engineers and teams to implement deployment approaches using automated CI/CD pipelines
- Design, develop, test, and implement solutions for availability, reliability, and scalability in applications
- Implement infrastructure, configuration, and network as code for applications and platforms
- Resolve complex problems by collaborating with technical experts, stakeholders, and team members
- Utilize service level indicators and objectives to proactively resolve issues before customer impact
- Design and implement AI-based and automation solutions to optimize processes and reduce manual effort
- Collaborate with cross-functional teams to embed AI capabilities into workflows and drive automation initiatives
Required qualifications, capabilities, and skills
- Formal training or certification on software engineering concepts and five years applied experience
- Experience in SRE, DevOps, or application support roles with knowledge of SLIs/SLOs, incident response, and troubleshooting
- Hands-on experience with CI/CD pipelines, infrastructure as code, version control, containerization, and orchestration
- Familiarity with monitoring and observability tools such as Grafana, Prometheus, Splunk, or Open Telemetry
- Exposure to cloud platforms including AWS, GCP, or Azure and automating infrastructure and deployments
- Willingness to participate in on-call rotation and respond to production incidents
- Ability to break down issues, document solutions, and communicate effectively with team members and customers
- Overall knowledge of the Software Development Life Cycle
- Solid understanding of agile methodologies such as CI/CD, resiliency, and security
- Demonstrated knowledge of software applications and technical processes within a technical discipline
ABOUT US
hackajob is partnering with JPMorganChase to fill this position. Create a profile to be automatically considered for this role—and others that match your experience.