Havi Tech Hub | Observability | SRE (Site Reliability Engineering) Team LeaderYour new company Havi, a global leader since 1974, employs over 10,000 people and serves customers in more than 100 countries. Specializing in the foodservice industry, Havi provides innovative supply chain and logistics solutions, including analytics, planning, distribution, and freight management.Havi'sdiverse teams collaborate seamlessly across locations and functions, embodying a spirit of integrity and creativity to serve their customers in the best way possible.Your new role This role will act as theSRE (Site Reliability Engineering) Leadwithin the Supply Chain Technology function, driving the enterprise-wide reliability strategy across global regions.This person will ensure that services are reliable, observable, scalable, and continuously improving, establishing SLO/SLI standards, error budgets, and automation practices across platforms.The SRE team operates in a follow‐the‐sun model between Portugal and Malaysia, focusing on engineering excellence – not 24/7 reactive support.TheSRE Leadwill: Lead the evolution of reliability practices, strengthen governance, and enhance engineering standards as services mature; Define and own the enterprise SRE strategy (including SLO, SLI, and error budget frameworks); Establish and enforce reliability, observability, and resilience standards across platforms and services; Lead engineering-level response during major incidents and ensure structured root cause analysis; Drive automation initiatives to reduce operational toil and improve system resilience; Align reliability objectives with Product, Platform Engineering, Security, and Service Operations; Monitor and report reliability performance (availability, MTTR, error budgets and capacity trends) to leadership; Promote observability best practices (logs, metrics and tracing) across services; Mentor and develop SRE Engineers across global regions.A typical day will include defining and refining reliability practices, analysing performance and service health, reviewing error budgets, improving driving automation, coordinating cross‐functional engineering activities, and providing leadership during major incidents.This role will also include day‐to‐day collaboration with Platform Engineering, Service Operations, Security, Product & Service Management, Application Development, ITSM, strategic partners and Cloud providers.What you will need to succeed What the As aSRE Lead, you will need: Bachelor's Degree in Computer Science, Engineering, or equivalent experience ; +7 years of experience in Cloud infrastructure and distributed systems ; +5 years in reliability Engineering, SRE, or Senior Infrastructure roles ; Proven experience in m ajor incident leadership and reliability improvement ; Experience with automation and scripting (Python, Bash, PowerShell, etc.); Experience defining SLO frameworks at enterprise scale; Experience leading cross functional engineering initiatives; Strong Cloud expertise (Azure preferred); Advanced knowledge of observability tooling (logs, metrics and tracing); Knowledge of CI/CD reliability and deployment safety; Deep understanding of SRE principles and distributed systems reliability; Familiarity with Infrastructure as Code (Terraform, ARM, etc.); Strong analytical, strategic, and organizational skills; Demonstrated leadership and team development experience ; Excellent communication skills – Fluency in English.What the Company can offer you Have the opportunity to join a cross-functional team in an international company with a multicultural working environment!SRE Lead (m/f/d) Hays Working for your tomorrow