Senior Service Reliability Engineer
  • United Arab Emirates Dubai
  • Emirates NBD
1 year before
31.12.2023
Operate and Maintain
Systems Administration
Job Description

Service Reliability Engineering (SRE) is an important endeavor for Emirates NBD Group IT to get more out IT investments. SRE’s are focused on optimizing new/existing services including the underlying technology, continual assessment of cloud infrastructure including its micro services and elimination of any manual work through automation.
Senior Service Reliability Engineers (SSREs) advocate and augment the reliability engineering principles, guidelines and standards. SRE’s partner with Product Owners, Platform and Engineering Teams to drive the Availability, Reliability, Scalability, Usability, Recoverability of application services and technologies in the production environment. They combine engineering and development experience and an innate drive to improve existing and new systems and processes. They collaborate with Development, Platform, Operations team to build and run scalable, sustainable production services which can advance and adapt to evolving business needs.
Essential Job Requirements include:
A bachelor's degree in Computer Engineering, Computer Science, Information Systems or other related field is highly preferred; however, equivalent work (6+ years) experience in Reliability Engineering will not be overlooked.
Passion for designing, building, and managing resilient applications and infrastructures
Experience with project management or lead technical role in large enterprise wide projects.
Extensive work experience with large sets of data and data analysis.
Ability to program (structured and OO) with one or more high level languages
Have clear understanding in dynamic resource management frameworks, cloud, server, distributed storage, networks, virtualized environments, applications, databases and associated tool sets.
An understanding and practical experience with containerization frameworks
Must be good at forecasting, statistical analysis and modeling are part of the job.
Java Spring Boot Experience
DevOps Experience/ Tools which helps to be a DevOps Engineer
AWS & AZURE
Cloud Transition Model – Waterfall/Agile - CI / CD DevOps /Dev Sec Ops
Chaos Testing Automation on the MicroServices
OPENSHIFT (PaaS Platform)
RHEL ,CENTOS & UBUNTU (OS)
VIRTUALBOX & VAGRANT (Virtualization)
DOCKER (Container RUNTIME Engine).
NGINX (Performing webserver for Containers)
Knowledge on ANSIBLE AUTOMATION
KUBERNETES (Container Orchestration), HELM (Kubernetes Package Management)
ENVOY & ISTIO (Service Mesh Data and Control Planes)
HARSHICORP (Securing Credentials)
Knowledge on MicroServices Fundamentals & Patterns, Monitoring the MicroServices , Custom Alerting
Understanding of monitoring/telemetry solutions (Icinga, ELK, AppDynamics) for data ingestion and analysis
PROMETHEUS (Container Infrastructure Monitoring), ELK (Log Monitoring), RUM (Real User Monitoring), GRAFANA Monitoring Dashboard Tool
Mongo DB, Postgres, Oracle
Experience with Atlassian suite of products

Qualifications
AS Mentioned in the JD
Primary Location: United Arab Emirates-Dubai-Dubai - Nadd Al Shiba, Meydan, Building M
Job: Professional Support
Organization: Technology Platforms
Schedule: Regular
Shift: Standard
Job Type: Full-time
Day Job


Quick response

Required Knowledge
  • K0001   Knowledge of computer networking concepts and protocols, and network security methodologies.
  • K0002   Knowledge of risk management processes (e.g., methods for assessing and mitigating risk).
  • K0004   Knowledge of cybersecurity and privacy principles.
  • K0050   Knowledge of local area and wide area networking principles and concepts including bandwidth management.
  • K0053   Knowledge of measures or indicators of system performance and availability.
  • K0088   Knowledge of systems administration concepts.
  • K0100   Knowledge of the enterprise information technology (IT) architecture.
  • K0103   Knowledge of the type and frequency of routine hardware maintenance.
  • K0104   Knowledge of Virtual Private Network (VPN) security.
  • K0130   Knowledge of virtualization technologies and virtual machine development and maintenance.
  • K0158   Knowledge of organizational information technology (IT) user security policies (e.g., account creation, password rules, access control).
  • K0167   Knowledge of system administration, network, and operating system hardening techniques.
  • K0179   Knowledge of network security architecture concepts including topology, protocols, components, and principles (e.g., application of defense-in-depth).
  • K0280   Knowledge of systems engineering theories, concepts, and methods.
  • K0289   Knowledge of system/server diagnostic tools and fault identification techniques.
  • K0332   Knowledge of network protocols such as TCP/IP, Dynamic Host Configuration, Domain Name System (DNS), and directory services.
  • K0346   Knowledge of principles and methods for integrating system components.

Required Skills
  • S0016   Skill in configuring and optimizing software.
  • S0073   Skill in using virtual machines. (e.g., Microsoft Hyper-V, VMWare vSphere, Citrix XenDesktop/Server, Amazon Elastic Compute Cloud, etc.).
  • S0143   Skill in conducting system/server planning, management, and maintenance.
  • S0144   Skill in correcting physical and technical problems that impact system/server performance.
  • S0151   Skill in troubleshooting failed system components (i.e., servers)
  • S0153   Skill in identifying and anticipating system/server performance, availability, capacity, or configuration problems.
  • S0158   Skill in operating system administration. (e.g., account maintenance, data backups, maintain system performance, install and configure new hardware/software).

Required Abilities
  • A0025  Ability to accurately define incidents, problems, and events in the trouble ticketing system.
  • A0074  Ability to collaborate effectively with others.
  • A0123  Ability to apply cybersecurity and privacy principles to organizational requirements (relevant to confidentiality, integrity, availability, authentication, non-repudiation).