JobsAisle

Senior Technology Engineer - Data Science Platform (DSP)

Global Software Solutions Group

Dubai, UAE | AED 10,000-16,667/mo | Today
UAE | IT & Technology | Full Time

Skills Required

Python, React, AWS, Azure, Kubernetes, Git, Excel, DevOps, ERP

Job Description

<div>
<h3>Role Overview</h3>
<p>We are looking for a <b>Senior Technology Engineer</b> to drive <b>platform stability, automation, and operational excellence</b> within the Data Science Platform (DSP).</p>
<p>This is not a support role; it is a <b>hands‑on engineering role</b> where you own automation, orchestration, and reliability across a hybrid cloud ecosystem (OpenShift + AWS/Azure/GCP).</p>
<p>You will be the backbone of DSP operations: if things break, scale poorly, or require manual intervention, that's your problem to eliminate permanently.</p>
<h3>Key Responsibilities</h3>
<h3>Platform Engineering &amp; Operations</h3>
<ul>
<li>Own end‑to‑end technical operations of DSP infrastructure</li>
<li>Ensure high availability, performance, and scalability of platform services</li>
<li>Monitor system health, troubleshoot issues, and implement permanent fixes (not patchwork)</li>
</ul>
<h3>Automation &amp; Orchestration</h3>
<ul>
<li>Design and implement automation frameworks to eliminate manual processes</li>
<li>Build CI/CD pipelines and automate deployment workflows</li>
<li>Drive infrastructure‑as‑code (IaC) adoption using tools such as Terraform and Ansible</li>
</ul>
<h3>Container &amp; Cloud Platform Management</h3>
<ul>
<li>Manage and optimize OpenShift / Kubernetes environments</li>
<li>Work across multi‑cloud (AWS, Azure, GCP) infrastructure</li>
<li>Ensure efficient resource utilization and cost optimisation</li>
</ul>
<h3>MLOps / Data Platform Support</h3>
<ul>
<li>Enable smooth ML model deployment and lifecycle management</li>
<li>Support tools like OpenShift AI, SageMaker, or similar platforms</li>
<li>Ensure reproducibility and reliability of data science workflows</li>
</ul>
<h3>Monitoring &amp; Reliability</h3>
<ul>
<li>Implement monitoring using Prometheus, Grafana, and the ELK stack</li>
<li>Define SLAs and SLOs, and ensure the platform meets reliability standards</li>
<li>Drive proactive incident prevention (not reactive firefighting)</li>
</ul>
<h3>Collaboration &amp; Governance</h3>
<ul>
<li>Work closely with Data Scientists, DevOps, and Platform teams</li>
<li>Ensure adherence to security, compliance, and governance standards</li>
<li>Act as a technical SME for DSP operations</li>
</ul>
<h3>Requirements</h3>
<h3>Mandatory Skills (Non‑Negotiable)</h3>
<ul>
<li>Strong experience in OpenShift / Kubernetes</li>
<li>Hands‑on experience in multi‑cloud environments (AWS/Azure/GCP)</li>
<li>Expertise in automation (Terraform, Ansible, Jenkins, GitOps)</li>
<li>Strong knowledge of CI/CD pipelines and DevOps practices</li>
<li>Experience in Python or scripting (Bash/Shell)</li>
<li>Experience with monitoring tools (Prometheus, Grafana, ELK)</li>
</ul>
<h3>Good to Have</h3>
<ul>
<li>Experience in MLOps / AI platforms (OpenShift AI, SageMaker, Bedrock)</li>
<li>Exposure to LLM deployment / inference platforms (vLLM, Triton, etc.)</li>
<li>Knowledge of data pipelines and big data ecosystems</li>
<li>Banking or financial services experience</li>
</ul>
<h3>Experience Required</h3>
<ul>
<li>6-10 years of relevant experience in Platform Engineering / DevOps / Cloud Engineering</li>
<li>Proven experience managing enterprise‑scale platforms</li>
</ul>
</div>