JobsAisle
O

Senior Delivery Engineer

Open Innovation AI

Abu Dhabi, UAEAED 7,000-18,000/moToday
UAEIT & TechnologyFull Time

Skills Required

KubernetesAccountingErpLeadership

Job Description

Open Innovation AI is a global technology company that specializes in developing advanced solutions for managing AI workloads. Its flagship product, the Open Innovation Cluster Manager (OICM), orchestrates complex AI tasks efficiently across diverse infrastructures. The platform is hardware-agnostic, optimized for various GPUs and accelerators hardware, and facilitates seamless integration and scalability for enterprise AI applications. Open Innovation AI focuses on optimizing and simplifying AI workload management and making AI technologies accessible to organizations of all sizes. With its innovative solutions, companies can reduce operational costs, accelerate time to value, and maximize their return on investment, ensuring that their AI strategies contribute directly to enhanced business outcomesRole Overview:The Senior Delivery Engineer is a hands‑on technical delivery role responsible for the physical and logical deployment of infrastructure and platform solutions at customer sites and data centers. This role owns the end‑to‑end planning and execution of projects including technical workshops with customers and other vendors, site survey, producing HLD/LLD rack‑and‑stack, cabling (for small deployments), Linux and Kubernetes installation, Open Innovation software rollout, and end‑to‑end commissioning.The role is accountable for translating design into a fully operational environment. The Senior Delivery Engineer is expected to be outcome‑oriented, deeply hands‑on, and comfortable operating in live customer environments with tight timelines and multiple stakeholders.Responsibilities:Participating and leading technical workshops with clients, system integrators and third party vendors to make technical decisions based on templates and best practices accounting for and accommodating customer specific requirements.Conduct site surveys and readiness assessments covering space, power, cooling, cabling, grounding, and access requirements at customer sites and data centers.Creating HLD and LLD documents based on the inputs gathered from workshops and project meetings.Execute hands‑on rack and stack activities, including installation of servers, storage, networking equipment, and supporting infrastructure.Perform structured cabling, labeling, and physical verification in line with approved designs and deployment standards.Deploy infrastructure components based on approved HLDs and LLDs and timely address any deviations from design.Install and configure Linux operating systems like RHEL, Ubuntu with our automation suite and provide feedback to the engineering team to make improvements.Basic networking, storage, and Virtualization knowledge required for integration with Open Innovation software stack.Deploy, configure, and validate Kubernetes clusters, including control plane, worker nodes, networking, and storage integration with our automation suite and provide feedback to the engineering team to make improvements.Install, configure, and commission Open Innovation software and platform components in Kubernetes with our automation suite and provide feedback to the engineering team to make improvements.Execute end‑to‑end integration, configuration, validation, and commissioning activities across compute, GPU, storage, network, and platform layers to deliver production‑ready environments.Lead go‑live readiness, health checks, and stabilization activities, troubleshooting and resolving deployment and integration issues through to operational handover.Coordinate on‑site delivery with data‑center operators, vendors, customer technical teams, and internal stakeholders, providing clear status updates, risks, and blockers to delivery leadership.Produce and maintain accurate as‑built documentation, deployment runbooks, and les‑sons learned, and support structured handover and knowledge transfer to Operations and Support teams.Required Qualification, Experience, Competence and Certifications:5–8+ years of hands‑on infrastructure and platform delivery experience across compute, storage, and network domains in customer data centers or colocation environments, including x86 servers, GPU platforms and enterprise‑grade networking.Strong practical experience with server platforms, distributed and vendor storage systems (PureStorage, CEPH, DDN, VAST, Dell), and data‑center networking (Ethernet, RoCE, InfiniBand), including rack & stack, cabling validation, and on‑site deployments.Proven experience deploying and commissioning Kubernetes and platform software on top of integrated compute, storage, and network infrastructure.Solid understanding of Linux system administration (RHEL, Ubuntu), and infrastructure integration across compute, storage, and network layers.Hands‑on troubleshooting experience across compute (servers/GPU), storage (CEPH or vendor storage), Kubernetes clusters, and data‑center networking during deployment, go‑live, and post‑deployment stabilization.Reporting To:Senior Manager – Systems Engineering#J-18808-Ljb