Save time and effort sourcing top tech talent

Senior Production Reliability Engineer / Sr. Associate Director, Service Management

Pune, Maharashtra, India
DevOps Engineer Operations Engineer Site Reliability Engineer Production Analyst Platform Engineer
Actively hiring

Senior Production Reliability Engineer / Sr. Associate Director, Service Management

Be part of something bigger
Pune, Maharashtra, India
DevOps Engineer Operations Engineer Site Reliability Engineer Production Analyst Platform Engineer
Be part of something bigger
Actively hiring

hackajob is partnering with Be part of something bigger to fill this position. Create a profile to be automatically considered for this role—and others that match your experience.

 

Some careers shine brighter than others.

If you’re looking for a career that will help you stand out, join HSBC and fulfil your potential. Whether you want a career that could take you to the top, or simply take you in an exciting new direction, HSBC offers opportunities, support and rewards that will take you further.

HSBC is one of the largest banking and financial services organisations in the world, with operations in 64 countries and territories. We aim to be where the growth is, enabling businesses to thrive and economies to prosper, and, ultimately, helping people to fulfil their hopes and realise their ambitions.

We are currently seeking an experienced professional to join our team in the role of a  Sr. Associate Director, Service Management

In this role, you will:

  • Design and implement automation solutions to reduce manual intervention and repetitive tasks, improving operational efficiency and system reliability.
  • Develop self-healing mechanisms and automated recovery processes to reduce Mean Time to Recovery (MTTR).
  • Build and maintain observability, monitoring and alerting systems using tools such as Open Telemetry, Grafana, Prometheus, InfluxDB, PostgreSQL, and AppDynamics.
  • Participate in post-incident reviews, providing technical insights to identify root causes and implement long-term fixes.
  • Collaborate with cross-functional teams to ensure lessons learned are translated into actionable improvements.
  • Review existing resilience architectures and implementations to identify gaps and recommend improvements.
  • Coach teams on reliability patterns and embed PRE thinking into design reviews.
  • Drive initiatives to improve operational efficiency, reduce technical debt, and enhance overall reliability.

To be successful in this role, you should meet the following requirements:

  • Strong experience in production reliability engineering or site reliability engineering roles.
  • Expertise in production automation and tools such as Ansible, Terraform, or similar.
  • Proficiency in monitoring and observability tools (Open Telemetry, Grafana, Prometheus, InfluxDB, PostgreSQL, AppDynamics).
  • Strong understanding of resilience engineering principles and experience in reviewing and improving system architectures.
  • Experience participating in post-incident reviews and implementing long-term solutions.
  • Proficiency in scripting and programming languages (Python, Bash, etc.).
  • Strong understanding of Linux systems, networking, and cloud platforms (GCP, AWS, Azure).Excellent collaboration and communication skills.
  • Proactive mindset focused on reducing operational toil and improving system reliability.Experience with log aggregation platforms such as Splunk and standardising log patterns.

You’ll achieve more when you join HSBC.

www.hsbc.com/careers

HSBC is committed to building a culture where all employees are valued, respected and opinions count. We take pride in providing a workplace that fosters continuous professional development, flexible working and opportunities to grow within an inclusive and diverse environment. Personal data held by the Bank relating to employment applications will be used in accordance with our Privacy Statement, which is available on our website.

Issued by – HSBC Software Development India

hackajob is partnering with Be part of something bigger to fill this position. Create a profile to be automatically considered for this role—and others that match your experience.

 

Upskill

Level up the hackajob way. Verify your skills, learn brand new ones and test your ability with Pathways, our learning and development platform.

Ready to reach your potential?