Site Reliability Engineer III

Hyderabad, Telangana, India
Site Reliability Engineer
Actively hiring

JPMorganChase

hackajob is partnering with JPMorganChase to fill this position. Create a profile to be automatically considered for this role—and others that match your experience.

 
JOB DESCRIPTION

There’s nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems.

As a Site Reliability Engineer III at JPMorganChase within Consumer & Community Banking, you will solve complex and broad business problems with simple and straightforward solutions. Through code and cloud infrastructure, you will configure, maintain, monitor, and optimize applications and their associated infrastructure, independently decomposing and iteratively improving on existing solutions. You will be a significant contributor to your team, sharing your knowledge of the end-to-end operations, availability, reliability, and scalability of your application or platform.

Job responsibilities

  • Guides and assists others in the areas of building appropriate level designs and gaining consensus from peers where appropriate
  • Collaborates with other software engineers and teams to design and implement deployment approaches using automated continuous integration and continuous delivery pipelines
  • Collaborates with other software engineers and teams to design, develop, test, and implement availability, reliability, and scalability solutions in their applications
  • Implements infrastructure, configuration, and network as code for the applications and platforms in your remit
  • Collaborates with technical experts, key stakeholders, and team members to resolve complex problems
  • Understands service level indicators and utilizes service level objectives to proactively resolve issues before they impact customers
  • Supports the adoption of site reliability engineering best practices within your team

Required qualifications, capabilities, and skills

  • Formal training or certification on software engineering concepts and 5+ years applied experience
  • Minimum 9 years of overall experience, with at least 7 years as a software engineer and/or site reliability engineer focused on Data Warehousing (Oracle/Snowflake), SQL/PLSQL, and large data movement in cloud environments.
  • Advanced proficiency in Python and/or Java for large-scale data handling and migration.
  • Hands-on experience with platforms and applications hosted on public, private, or hybrid cloud infrastructures.
  • Formal training or certification in site reliability engineering (SRE) concepts, with at least 3 years of applied SRE experience.
  • Expertise in observability practices, including white-box and black-box monitoring, SLO alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, and Splunk.
  • Strong understanding of site reliability culture and principles, with practical experience implementing SRE within applications or platforms.
  • Proficient knowledge of software applications and technical processes in areas such as Cloud, Artificial Intelligence, and Machine Learning.
  • Experience with continuous integration and continuous delivery (CI/CD) tools, including Jenkins, GitLab, and Terraform.
  • Skilled in containerization and orchestration technologies such as Docker, Kubernetes, and ECS.
  • Familiarity with troubleshooting common networking technologies and issues.
  • Demonstrated ability to work collaboratively in large teams, communicate effectively, address roadblocks proactively, and implement innovative solutions while staying current with emerging technologies.

Preferred qualifications, capabilities, and skills

  • Proficiency in one or more technology domains; may be a cross-domain expert able to solve complex and mission-critical problems within a business or across the firm
  • Adept in the development of automated tools, systems, and services in multiple technology domains
  • Working knowledge of infrastructure components (e.g., routers, load balancers, cloud products, container systems, compute, storage, and networks)
  • Excellent debugging and troubleshooting skills
  • Proficiency in service-level changes to a system and troubleshooting components
  • Experience using monitoring and log analysis tools to manage operations
