JobsAisle
E

Site Reliability Engineer II - Real-Time and Big Data

Esri

Dubai, UAEAED 7,000-18,000/moToday
UAEIT & TechnologyFull Time

Skills Required

PythonAwsAzureDockerKubernetesGit

Job Description

OverviewJoin us to work collaboratively with our talented team of dynamic and passionate engineers to deliver capabilities that enable our customers to make a difference. Youll deploy and operate ArcGIS Velocity and ArcGIS Workflow Manager SaaS solutions. You will also have the opportunity to design deploy and operate nextgeneration realtime and big data GIS softwareasaservice (SaaS) capabilities for thousands of cloud users worldwide.Our teams have a broad mix of experience levels and tenures that support an environment that promotes professional development. We care about your career growth and strive to assign projects based on what will help each team member develop into a betterrounded engineer and enable them to take on more complex tasks in the future.Our team also puts a high value on worklife balance and we understand that striking a healthy balance between your personal and professional life is crucial to your happiness and success here. We offer a flexible hybrid schedule so you can have a more productive and wellbalanced life both in and outside of work.ResponsibilitiesCollaborate with a team of SRE engineers to operate SaaS capabilities across multiple regions on the cloud platformDesign implement configure and utilize monitoring systems to monitor the health of SaaS productsManage infrastructure used for ArcGIS Velocity and ArcGIS Workflow Manager respond to alerts and troubleshoot problems to resolutionDevelop implement and maintain automation solutions for repetitive operational tasks such as deployment pipelines incident resolution and scaling processesDesign and implement the deployment and upgrade containerized microservice components that when combined power Esris SaaS offeringsCreate and automate Git workflows to simplify code integration testing and infrastructure deploymentsParticipate in technical spike efforts bringing new innovative ideas to future versions of our softwareTroubleshoot the system incidents and provide root cause analysis reportsProvide rotational oncall technical supportRequirements5 years of experience managing Kubernetes (EKS) logging and monitoring (ELK Prometheus) and container technologies (Docker)Proficient in using Terraform for automating infrastructure provisioning and managementAbility to design and automate Git workflows for streamlined code integration testing and infrastructure deploymentAbility to write scripts to deploy infrastructure and/or applications (Bash Python Terraform)Expert level understanding and experience with cloud computing platforms (AWS or Microsoft Azure)Strong knowledge of Linux Operating system administration including troubleshooting performance tuning and shell scriptingProficient in cloud networking including VPCs subnets security groups and VPNs in platforms like AWS or AzureSkilled in identifying and resolving system and application issues through effective troubleshooting and root cause analysisWorking knowledge of a source control and issue management systemBachelors in computer science computer engineering GIS or information systemsRecommended QualificationsExperience designing administering and/or maintaining cloud environments such as AWS or Azure supporting 247 highavailability production environmentsInterest in working with GitOps principles to automate the deployment of applications on Kubernetes clustersCertifications: AWS Certified Solution Architect Associate CKA/CKAD or similarExperience managing OpenSearch (datastore or logstore) and Kafka for managing distributed data streams and ensuring high availability in largescale systemsAbility to work with continuous integration and delivery best practicesKnowledge of operating resilient highly available scalable and performance SaaS capabilitiesKnowledge of Esri ArcGIS or other web mapping technologiesWorking knowledge ofGitHub#J-18808-Ljbffr