Save time and effort sourcing top tech talent

Consulting/Principal Site Reliability Engineer (Mumbai)

Mumbai, Maharashtra, India
DevOps Engineer Cloud Engineer Site Reliability Engineer
LexisNexis UK
Actively hiring

Sign up for the chance to get matched to this role, and similar opportunities.

Would you like to join our great reliability engineering team?
 

Would you like to be part of a rewarding project?
 

About the Business
 

At ICIS, our mission is to optimize the world’s resources. We help companies make strategic, sustainable decisions by bringing transparency to markets across the world. We create a comprehensive view of commodities markets, providing companies with the data and intelligence to successfully navigate across global value chains every day. Our customers benefit from instant access to price assessments, reports and forecasts, a dedicated news channel and supply and demand data. 
 

About our Team
 

Generally, the team is form in Squad where each squad consist of Squad Lead, Business Analyst, Dev lead, Developers and Testers. This Squad structure provides more connectivity among team members and allows us to deliver faster as all resources work as one group on dedicated tasks.
 

About the Role
Candidates with knowledge and skills to be able to operate in a role where they will need to be able to diagnose and debug issues beyond simple IAC or a github interface. Anyone can write terraform code and follow a youtube guide to create a pipeline. We need people with a deeper skillset, who have a good working knowledge of TCP, who have at least heard of the OSI stack. Knowledge of how a firewall works, what an ephemeral port, how to write a service unit file. We have 1000 virtual server instances so working knowledge of linux and windows in a server environment. Active Directory/Puppet/Ansible/LDAP etc

 

Responsibilities

  • Designing, implement, and maintain highly available and scalable container-based infrastructure using Docker and Kubernetes.
  • Monitoring, analyze, and optimize system performance, reliability, and scalability for containerized applications.
  • Collaborating with development, operations, and security teams to define best practices for containerization and deployment.
  • Troubleshooting and resolve issues related to container orchestration, networking, and storage.
  • Implementing and maintain automated deployment, scaling, and monitoring solutions for containerized environments.
  • Ensuring security best practices are followed, including vulnerability scanning, container hardening, and access controls.

 


Requirements

  • Have 8+ Years of experience as a Site Reliability Engineer or similar role, with a focus on containerization and orchestration.
  • Have expertise in Containers and Container Orchestration platforms such as Docker and Kubernetes.
  • Experience in Linux system administration and troubleshooting. Familiarity with cloud platforms, like Microsoft Azure, and related services.
  • Experience with automation and configuration management tools (e.g., Ansible, Terraform)
  • Experience with container monitoring and log management tools (e.g., Prometheus , Grafana).

Sign up for the chance to get matched to this role, and similar opportunities.

Upskill

Level up the hackajob way. Verify your skills, learn brand new ones and test your ability with Pathways, our learning and development platform.

Ready to reach your potential?