Emprego
Meus anúncios
Meus alertas email de emprego
Fazer login
Encontrar um emprego Fichas de empresas
Procurar

Distributed system operations specialist

Portalegre
beBeeReliability
Anunciada dia 22 agosto
Descrição

Job Opportunity

">

We are seeking a highly skilled Site Reliability Engineer to join our team.

A system reliability engineer is a cross-functional role that combines software and systems engineering to build and run large-scale distributed systems. This includes ensuring that the system is stable, efficient, scalable, and reliable, as well as maintaining its overall quality and performance.

This position requires strong analytical skills and the ability to understand complex systems architecture. The ideal candidate will have experience with microservices operationalization, cloud environments (especially AWS), collaboration platforms, and observability tools. Excellent communication skills are also essential, as this role involves working closely with both technical and non-technical stakeholders. Additionally, fluency in English is required.

Key Responsibilities:

* Provide hands-on support to technical and business teams, monitoring systems proactively to detect and respond to incidents and service degradation.
* Investigate integration issues, gather information, and collaborate with internal and external teams to identify root causes and implement solutions.
* Prioritize multiple concurrent issues effectively with your team, understanding the business context and technical architecture of each system to better assess impact and urgency.
* Participate in on-call rotations to ensure platform stability and contribute to the continuous improvement of monitoring, alerting, logging, and incident response processes.
* Act as a liaison between technical and non-technical stakeholders, adapting communication accordingly.

Required Skills and Qualifications:

* 3+ years of experience in Application Support or Site Reliability Engineering.
* Strong analytical mindset: identify patterns, differentiate between isolated errors and systemic issues.
* Experience with microservices operationalization.
* Proficient with tools like ELK stack, Prometheus, and Grafana.
* Familiarity with cloud environments, especially AWS.
* Experience using collaboration platforms such as Jira, Confluence, GitLab.
* Ability to understand complex systems architecture and how components interact within a broader ecosystem.
* Strong proactivity in identifying risks through logs and metrics and suggesting improvements to observability.
* Excellent communication skills, especially when engaging with non-technical stakeholders.
* Fluency in English, both written and spoken.

Nice to Have:

* Hands-on coding experience with .NET Core, Python, or similar.
* Background in Retail or Logistics domains.
* Familiarity with Transport Management Systems (TMS) and logistics processes.

Se candidatar
Criar um alerta
Alerta activado
Salva
Salvar
Ofertas parecidas
Emprego Portalegre
Emprego Distrito de Portalegre
Página principal > Emprego > Distributed System Operations Specialist

Jobijoba Portugal

Encontre ofertas

  • Ofertas de emprego por função
  • Pesquisa de ofertas de emprego por sector
  • Empregos por empresas
  • Empregos por localização

Contacto / Parceria

  • Entre em contacto
  • Publique as suas ofertas no site Jobijoba

Menções legais - Menções legais e termos de utilização - Política de dados - Gerir os meus cookies - Acessibilidade: Não conforme

© 2025 Jobijoba Portugal - Todos os direitos reservados

Se candidatar
Criar um alerta
Alerta activado
Salva
Salvar