JobsAisle

Associate Principal Engineer Production Support Lead

TALENTMATE

Dubai, UAE | AED 12,000-25,000/mo | Yesterday
UAE | IT & Technology | Full Time

Skills Required

Java, AWS, Azure, Git, Agile, Communication

Job Description

Overview

Company Description

In a changing and evolving world, challenges are ever more unique and complex. Nagarro helps to transform, adapt, and build new ways into the future through a forward-thinking, agile, and caring mindset. Today, we are 18,000+ experts across 37+ countries, forming a Nation of Nagarrians, ready to help our customers succeed. The nature of IT & digital product engineering has reached an incredible state of velocity and transition. We must adapt and meet it with an agile mindset that isn’t afraid to iterate towards the perfect solution. If we only solve today's problems, it's not enough. We must do more. We must courageously embrace the future, with vision and clarity about where technology & business are heading. Thinking breakthroughs gets us there.

Job Description

Must have Skills: Incident management, Java (Expert), Cloud, Production Support, Stakeholder & Vendor Management

Responsibilities

Own and manage end-to-end production support for digital applications, ensuring timely resolution of incidents and service requests.
Lead the production support team, assigning tickets, overseeing issue analysis, and coordinating fixes and deployments.
Ensure production go-live handoffs are seamless, with proper documentation, readiness checks, and support coverage.
Act as the primary escalation point for major incidents, SLA breaches, and recurring issues. Lead incident triage, root cause analysis (RCA), and post-incident reviews to drive long-term resolution. Maintain clear and timely communication with IT management, business users, and stakeholders during incidents.
Proactively monitor applications to pre-empt issues, minimize downtime, and ensure performance against SLAs. Implement and maintain health check routines, dashboards, and alerting mechanisms for critical systems.
Identify and automate manual, repetitive tasks to improve operational efficiency.
Develop and maintain Standard Operating Procedures (SOPs) for production support activities and incident handling.
Provide hands-on troubleshooting for Java backend systems, web and mobile technologies, and various frameworks. Understand and resolve issues related to certificate exchanges, API integrations, cloud infrastructure, and system dependencies. Offer technical consultation to support teams and contribute to solution design reviews when needed.
Leverage hands-on experience with Azure (primary) and AWS (secondary) cloud platforms for troubleshooting and deployment support. Coordinate with network, firewall, SecOps, and infrastructure teams to resolve cross-domain issues.
Maintain strong relationships with business users, IT teams, and external vendors, ensuring alignment and timely resolution. Review vendor fixes and designs, and coordinate production deployments with third-party teams.
Own communication for major incidents, including status updates, impact assessments, and resolution timelines. Prepare and deliver incident reports, RCA documentation, and performance dashboards to senior stakeholders.
Mentor and coach production leads and support engineers, fostering a culture of ownership and continuous improvement. Ensure team members are equipped with the right tools, knowledge, and support to handle production issues effectively.
Work closely with application development teams, infrastructure teams, and business units to ensure smooth operations and issue resolution. Participate in project planning and go-live readiness to ensure production support requirements are met.
Ensure all production support activities conform to IT and bank policies, procedures, and security standards. Support Business Resumption Plan (BRP) testing and other compliance-related activities as required.
Provide coverage during team absences, and be prepared for temporary reassignment or promotion in case of business necessity.
Strive to understand peer and superior roles to ensure continuity of operations during emergencies or transitions.

Qualifications & Skills

Technical & Troubleshooting Skills – Strong troubleshooting skills in Java backend systems; understanding of web and mobile technologies and various frameworks; hands-on experience with Azure Cloud (primary) and AWS Cloud (secondary); knowledge of certificate management, including exchange and renewal processes; familiarity with API integrations, system interfaces, and backend services; ability to analyze and resolve complex production issues across distributed systems.
Production Support & Incident Management – Expertise in incident management, including triage, escalation handling, and RCA; experience managing high-volume, customer-facing applications; strong understanding of service request handling, SLAs, and operational KPIs; ability to lead war rooms, manage stress situations, and drive resolution under pressure.
Monitoring & Automation – Proactive monitoring and health check implementation; experience in automating manual tasks and improving operational efficiency; development and maintenance of Standard Operatin