We are looking for a Site Reliability Engineer / Application Production Support Engineer to join a large financial institution in Lisbon. This isn't your typical support role: it's a high-octane position at the intersection of electronic trading, SRE practices, and infrastructure automation.
The Tech Stack:
* OS: Expert-level Linux (On-prem & AWS)
* Automation: Python, Bash, Ansible
* Observability: Prometheus, Grafana, ELK, Dynatrace
* CI/CD: Jenkins, GitOps
* Environment: C++-based trading systems, FIX Protocol, TCP/IP tuning
What You'll Do:
* Engine Room Operations: Rapid response to incidents in a high-pressure trading environment.
* Eliminate Toil: Use Python and Ansible to automate repetitive tasks and modernize our platform.
* Master Observability: Fine-tune monitoring stacks (Prometheus/Grafana) to detect anomalies before they impact the market.
* Bridge Dev & Ops: Participate in code reviews and influence the SDLC to ensure reliability is baked in from the start.
Who You Are:
* Experienced: 5+ years in SRE, DevOps, or Production Support (Finance/Trading experience is a massive plus).
* Troubleshooter: You have a "detective" mindset when it comes to logs, network packets, and system bottlenecks.
* Automation-First: You'd rather write a script than do the same task twice.
* Curious: Eager to dive into AWS, Kubernetes, and the intricacies of ultra-low-latency hardware.
Why Join Us?
* Market Impact: Work on systems that power global financial markets.
* Top-Tier Talent: Collaborate with some of the best engineers in the industry.
* Modernization: We are heavily invested in automation, IaC, and cloud transition.
* Growth: Long-term career paths within a premier global financial institution.
Ready to trade up? Apply now.