
Site Reliability Engineer II

Gurgaon, Haryana, India
Cloud Engineer, DevOps Engineer, Platform Engineer, Site Reliability Engineer
Actively hiring

American Express

hackajob is partnering with American Express to fill this position. Create a profile to be automatically considered for this role—and others that match your experience.

 

At American Express, our culture is built on a 175-year history of innovation, shared values and Leadership Behaviors, and an unwavering commitment to back our customers, communities, and colleagues. As part of Team Amex, you'll experience this powerful backing with comprehensive support for your holistic well-being and many opportunities to learn new skills, develop as a leader, and grow your career.

Here, your voice and ideas matter, your work makes an impact, and together, you will help us define the future of American Express.

The Enterprise Data Management Technology Team brings together foundational strategic technology capabilities in data governance, data privacy, data retention and deletion, data quality, and automation, grounded in our data technology model that prioritizes data management. Its development responsibilities cover regulatory needs that deepen and expand the data strategy, as well as core technical capabilities that cut across business lines and customer segments.

 

Responsibilities:

  • Infrastructure Management: Design, implement, and manage scalable, reliable infrastructure using cloud-native technologies and Infrastructure as Code (IaC) tools.
  • Automation & CI/CD: Develop and maintain automated processes and Continuous Integration/Continuous Delivery (CI/CD) pipelines to streamline deployments and operational tasks.
  • Monitoring & Alerting: Implement and manage comprehensive monitoring and alerting systems to detect issues early and ensure system health.
  • Incident Management: Lead incident response efforts, perform root cause analysis (RCA) for outages, and implement measures to prevent future disruptions.
  • Performance Tuning & Optimization: Gather and analyze metrics from systems and applications to identify performance bottlenecks and conduct tuning.
  • Collaboration: Work closely with development teams to integrate reliability into software design and deployment processes.
  • Capacity Planning: Manage server capacity to ensure systems can handle current and future demand.
  • Site Reliability Engineering (SRE) Principles: Balance feature development speed with reliability, and establish and maintain Service Level Objectives (SLOs).

Minimum Qualifications:
     

  • Programming Languages: Python, Bash, Perl, or similar for automation.
  • Cloud Platforms: GCP, Hydra.
  • Containerization: Docker, Kubernetes.
  • Monitoring Tools: Prometheus, Grafana, Splunk.
  • IaC Tools: Terraform.
  • Operating Systems: Linux (proficient in command-line tools like strace, truss).
  • Networking: TCP/IP, firewalls, load balancers.

We back you with benefits that support your holistic well-being so you can be and deliver your best. This means caring for you and your loved ones' physical, financial, and mental health, as well as providing the flexibility you need to thrive personally and professionally:

  • Competitive base salaries 
  • Bonus incentives 
  • Support for financial well-being and retirement
  • Comprehensive medical, dental, vision, life insurance, and disability benefits (depending on location) 
  • Flexible working model with hybrid, onsite or virtual arrangements depending on role and business need 
  • Generous paid parental leave policies (depending on your location) 
  • Free access to global on-site wellness centers staffed with nurses and doctors (depending on location) 
  • Free and confidential counseling support through our Healthy Minds program 
  • Career development and training opportunities

American Express is an equal opportunity employer and makes employment decisions without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran status, disability status, age, or any other status protected by law.  

Offer of employment with American Express is conditioned upon the successful completion of a background verification check, subject to applicable laws and regulations.
