Your role as an Linux Engineering Specilist in the Team is to build and ensure the performance, availability and security of Enterprise Cloud/Network Cloud & Legacy computing infrastructure. Manage key stakeholder and customer relationships. Provide operational excellence and leadership across NC, EC & legacy platforms. Also play a critical role in managing complex Linux systems, as well as supporting and optimizing our infrastructure using technologies such as Ansible, Docker, Kubernetes, and Version Control Systems (VCS). This position requires strong technical expertise, excellent problem-solving skills, and a deep understanding of Linux operating systems
What you'll be doing?
- Undertake routine daily activities and be accountable for system design, build, testing, validation, maintenance, and ongoing support of components within Network Cloud
- Undertake the implementation of new and enhanced technology that aligns to customers’ strategic direction
- Performance Optimization: Identify and address performance bottlenecks in the Linux environment, including system, application, and network components, to ensure optimal system efficiency.
- System Security: Implement and maintain security measures and best practices to protect Linux servers from potential threats, vulnerabilities, and unauthorized access.
- Automation and Configuration Management: Develop and maintain Ansible playbooks and automation scripts to streamline administrative tasks, configuration management, and deployment processes.
- Containerization and Orchestration: Utilise containerisation and Kubernetes for orchestration to manage and scale containerised applications effectively.
- Participate in technology lifecycle planning, including contributing to the development of the technology roadmaps and Platform Health Assessments.
- Deliver the required documentation, knowledge transfer and training to successfully land new solutions into the support organisation.
- Drive the implementation of core and cloud infrastructure security to manage risks and exposure.
- Provide support for platform incidents
- Mentor other Engineering professionals, helping to improve the team’s abilities by acting as a technical resource.
- Continuously develop and share knowledge with the team on emerging trends and changes.
- Skills you'll need to have
- Strong interest in researching new technologies.
- Strong knowledge of Linux configuration management and automation including bash, awk, sed and Ansible.
- Knowledge of Linux filesystems and storage including LVM, iSCSI, file access control, partitioning and RAID.
- Strong Linux troubleshooting skills, covering both functional and performance issues.
- Strong knowledge of machine virtualisation, ideally KVM.
- Skills in integration of third party, Opensource and self developed solutions
- Understanding of the TSA and its implications to ways of working.
- Good knowledge of Linux container technologies including Docker.
- Familiarity with Kubernetes, VMware and OpenStack
- Good knowledge of Linux networking including bridging, routing, iptables and diagnostic tools such as tcpdump, ping etc.
- Ability to develop and maintain programs and scripts that aid in the operation and automation of tasks using various shell and scripting languages (Ansible/bash/ Python)
- An understanding of code review and familiarity with tools like GitLab & Jenkins
- Familiarity with enterprise grade server hardware (Dell/Cisco).
- Familiarity with data centre networking technologies.
- Familiarity with databases such as MySQL / PostgreSQL, particularly their storage requirements and backup models.
- Ability to self-manage priorities and take a flexible approach to implementation.
- · Ability to produce technical focused documentation/guides for a variety of audiences.
- General knowledge of change control and operational security processes and the importance of these processes to the integrity of services you operate
Experience you'll have
- · Experience administering Linux servers in virtualized settings using automation tools.
- · Experience researching and learning new technologies.
- · Researching and resolving problems both independently and as part of a team.
- · Experience responding to high priority incidents.
- · Documenting designs and ideas.
- · Actively participated in strategy setting sessions
- Has been part of a team that moves quickly whilst hosting critical services.
- Experience of using cloud platforms to develop solutions to problems
hackajob is partnering with BT to fill this position. Create a profile to be automatically considered for this role—and others that match your experience.