hackajob is partnering with American Express to fill this position. Create a profile to be automatically considered for this role—and others that match your experience.
As part of our diverse tech team, you can architect, code and ship software that makes us an essential part of our customers’ digital lives. Here, you can work alongside talented engineers in an open, supportive, inclusive environment where your voice is valued, and you make your own decisions on what tech to use to solve challenging problems. American Express offers a range of opportunities to work with the latest technologies and encourages you to back the broader engineering community through open source. And because we understand the importance of keeping your skills fresh and relevant, we give you dedicated time to invest in your professional development. Find your place in technology on #TeamAmex.
How will you make an impact in this role?
This person is responsible to provide consultation and strategic recommendations by quickly assessing and remediating complex platform availability issues. This is an opportunity to work in one of the best technology units to help improve customer experience for American Express and influence how millions of people interact with their cards, their merchants, and their money.
As a part of our tech team, we could work together to bring ground-breaking and diverse ideas to life that power the digital systems, services, products, and platforms that millions of customers around the world depend on. If you love to work with APIs, contribute to open source, or use the latest technologies, we’ll support you with an open environment and learning culture to grow your career.
Role Responsibilities:
Hands-on engineer with expertise in developing complex, large scale enterprise applications/tools.
Responsible for technical aspects of software engineering for assigned applications including design, developing prototypes, and coding assignments.
Empower teams to automate demand driven scalable application deployments in test or production environments.
Apply specialized knowledge of industry standards or practices to assigned initiatives to identify complex and or broad problems and issues and
formulate recommendations.
Collaborates with leadership across teams to define solutions, technical implementation to drive software maturity and practices.
Drive the technical roadmap for runtime systems, ensuring the reliability, scalability, and performance of platforms.
Establish and monitor key performance indicators (KPIs) for runtime and resiliency and drive continuous improvement efforts to meet or exceed these metrics.
Provide technical leadership and guidance to the team, fostering a culture of innovation, collaboration, and accountability.
Act as a technical contributor by participating in architecture design, code reviews, and troubleshooting complex technical issues.
Design and implement innovative solution/framework that will improve software engineering velocity, infrastructure resiliency and security, and data availability.
Develop common framework components (to be leveraged by enterprise applications), define standards for configuration, monitoring, reliability, and performance engineering.
As a Site Reliability Engineer (SRE) in a payment network, your primary responsibilities would include ensuring the reliability, availability, and performance of the payment infrastructure.
Contribute in proactive monitoring, incident response, and collaborating with development teams to implement robust, scalable systems.
You'll also play a crucial role in designing and optimizing the network's architecture for resilience and fault tolerance, while maintaining a strong focus on security and compliance with payment industry standards.
Additionally, you'll contribute to automation efforts to streamline operations and reduce manual intervention in the system's lifecycle.
24X7 production environment, including on-call responsibilities.
Minimum Qualifications:
Bachelor's Degree
Experience in Development Operations or SRE.
5+ years of progressive experience in distributed environment.
Experience in designing mission critical highly available enterprise applications coded in Java/goLang.
Experience with performance testing framework design, tuning Java applications.
Experience in managing relational and NoSQL databases.
Experience on enterprise tools set such as Splunk, Grafana, Dynatrace, AppDynamics, BMC, Prometheus etc.
Experience with ELF, Grafana, Prometheus and Splunk.
Experience with agile software development methodologies and practices such as Scrum/Kanban, iterations, user stories.
Experience with Service Oriented Architecture design principles, execution patterns and performance optimization.
Experience in design, data structures and algorithms, and analytical and debugging skills.
Experience in cloud infrastructure, distributed systems, and containerization technologies
Preferred Qualifications:
Masters Degree
Strong interpersonal communication skills and the ability to work well in a diverse team-focused environment.
Strong knowledge of Site Reliability Engineering best practices, including incident management, monitoring, and capacity planning.
Liaise between Site Reliability Engineering, development, Product Owners, and other partner teams to improve performance and availability.
Ability to build positive relationships with your team, business, and technology partners to achieve established goals.
Ability to effectively interpret technical/business objectives and challenges and articulate solutions.
Influence and lead team members with creative thought leadership with data driven changes and improvements by challenging status quo.
Demonstrate the ability to effectively communicate to internal business clients and leadership on each facet of issue handling including (but not limited to): issue identification, service restoration, solutions to permanently resolve to ensure high levels of ongoing service etc.
Strong background in payment processing.
hackajob is partnering with American Express to fill this position. Create a profile to be automatically considered for this role—and others that match your experience.
Level up the hackajob way. Verify your skills, learn brand new ones and test your ability with Pathways, our learning and development platform.