Senior Service Reliability Engineer, Network Security Engineer & Administrator, Emirates NBD

Job Description

Service Reliability Engineering (SRE) is an important endeavor for Emirates NBD Group IT to get more out IT investments. SRE’s are focused on optimizing new/existing services including the underlying technology, continual assessment of cloud infrastructure including its micro services and elimination of any manual work through automation.
Senior Service Reliability Engineers (SSREs) advocate and augment the reliability engineering principles, guidelines and standards. SRE’s partner with Product Owners, Platform and Engineering Teams to drive the Availability, Reliability, Scalability, Usability, Recoverability of application services and technologies in the production environment. They combine engineering and development experience and an innate drive to improve existing and new systems and processes. They collaborate with Development, Platform, Operations team to build and run scalable, sustainable production services which can advance and adapt to evolving business needs.
Essential Job Requirements include:
A bachelor's degree in Computer Engineering, Computer Science, Information Systems or other related field is highly preferred; however, equivalent work (6+ years) experience in Reliability Engineering will not be overlooked.
Passion for designing, building, and managing resilient applications and infrastructures
Experience with project management or lead technical role in large enterprise wide projects.
Extensive work experience with large sets of data and data analysis.
Ability to program (structured and OO) with one or more high level languages
Have clear understanding in dynamic resource management frameworks, cloud, server, distributed storage, networks, virtualized environments, applications, databases and associated tool sets.
An understanding and practical experience with containerization frameworks
Must be good at forecasting, statistical analysis and modeling are part of the job.
Java Spring Boot Experience
DevOps Experience/ Tools which helps to be a DevOps Engineer
AWS & AZURE
Cloud Transition Model – Waterfall/Agile - CI / CD DevOps /Dev Sec Ops
Chaos Testing Automation on the MicroServices
OPENSHIFT (PaaS Platform)
RHEL ,CENTOS & UBUNTU (OS)
VIRTUALBOX & VAGRANT (Virtualization)
DOCKER (Container RUNTIME Engine).
NGINX (Performing webserver for Containers)
Knowledge on ANSIBLE AUTOMATION
KUBERNETES (Container Orchestration), HELM (Kubernetes Package Management)
ENVOY & ISTIO (Service Mesh Data and Control Planes)
HARSHICORP (Securing Credentials)
Knowledge on MicroServices Fundamentals & Patterns, Monitoring the MicroServices , Custom Alerting
Understanding of monitoring/telemetry solutions (Icinga, ELK, AppDynamics) for data ingestion and analysis
PROMETHEUS (Container Infrastructure Monitoring), ELK (Log Monitoring), RUM (Real User Monitoring), GRAFANA Monitoring Dashboard Tool
Mongo DB, Postgres, Oracle
Experience with Atlassian suite of products

Qualifications
AS Mentioned in the JD
Primary Location: United Arab Emirates-Dubai-Dubai - Nadd Al Shiba, Meydan, Building M
Job: Professional Support
Organization: Technology Platforms
Schedule: Regular
Shift: Standard
Job Type: Full-time
Day Job

Quick response

Required Knowledge

K0001 Knowledge of computer networking concepts and protocols, and network security methodologies.
K0002 Knowledge of risk management processes (e.g., methods for assessing and mitigating risk).
K0004 Knowledge of cybersecurity and privacy principles.
K0050 Knowledge of local area and wide area networking principles and concepts including bandwidth management.
K0053 Knowledge of measures or indicators of system performance and availability.
K0088 Knowledge of systems administration concepts.
K0100 Knowledge of the enterprise information technology (IT) architecture.
K0103 Knowledge of the type and frequency of routine hardware maintenance.
K0104 Knowledge of Virtual Private Network (VPN) security.
K0130 Knowledge of virtualization technologies and virtual machine development and maintenance.
K0158 Knowledge of organizational information technology (IT) user security policies (e.g., account creation, password rules, access control).
K0167 Knowledge of system administration, network, and operating system hardening techniques.
K0179 Knowledge of network security architecture concepts including topology, protocols, components, and principles (e.g., application of defense-in-depth).
K0280 Knowledge of systems engineering theories, concepts, and methods.
K0289 Knowledge of system/server diagnostic tools and fault identification techniques.
K0332 Knowledge of network protocols such as TCP/IP, Dynamic Host Configuration, Domain Name System (DNS), and directory services.
K0346 Knowledge of principles and methods for integrating system components.

Required Skills

S0016 Skill in configuring and optimizing software.
S0073 Skill in using virtual machines. (e.g., Microsoft Hyper-V, VMWare vSphere, Citrix XenDesktop/Server, Amazon Elastic Compute Cloud, etc.).
S0143 Skill in conducting system/server planning, management, and maintenance.
S0144 Skill in correcting physical and technical problems that impact system/server performance.
S0151 Skill in troubleshooting failed system components (i.e., servers)
S0153 Skill in identifying and anticipating system/server performance, availability, capacity, or configuration problems.
S0158 Skill in operating system administration. (e.g., account maintenance, data backups, maintain system performance, install and configure new hardware/software).

Required Abilities

A0025 Ability to accurately define incidents, problems, and events in the trouble ticketing system.
A0074 Ability to collaborate effectively with others.
A0123 Ability to apply cybersecurity and privacy principles to organizational requirements (relevant to confidentiality, integrity, availability, authentication, non-repudiation).