2 days ago Be among the first 25 applicants
Get AI-powered advice on this job and more exclusive features.
Transformation, adaptability and innovation are part of our DNA.
We're passionate about technology and we want to be part of your story.
Do we share the same passion? You've come to the right place, Smart.
We are looking for a talented Site Reliability & Monitoring Engineer to join our team, focused on ensuring the stability, performance, and reliability of our Order Management System and related services. The ideal candidate will have a strong analytical mindset, experience in monitoring complex systems, an automation-first approach to eliminate repetitive tasks, and the ability to collaborate effectively across technical and business teams in a fast-paced environment.
Main Tasks & Responsibilities
- Monitor the Critical User Journeys of the team's products.
- Support both technical and business teams, with a focus on the Order Management System area.
- Gather information and troubleshoot application integration issues.
- Work collaboratively with internal and external teams.
- Manage multiple concurrent issues, setting priorities together with the team.
Key Requirements
- Strong analytical mindset – if an error occurs once, it's an incident; if it happens repeatedly, it's a problem that must be thoroughly investigated.
- Dislike for repetitive tasks – develop software automation to eliminate manual work.
- Proactive in validating logs and metrics to detect potential problems or unusual patterns.
- Curious, customer-focused, resilient, and eager to learn.
- Monitor production and non-production services within the team's scope.
- Engage with the development team throughout the entire product lifecycle, ensuring product reliability.
- Ability to understand different systems, their functionalities, and how they fit within the ecosystem.
- Availability for on-call prevention duties.
Nice to Have
- Knowledge of Microservices.
- Familiarity with technologies such as:
- Shell
- AWS
- Kafka (nice to have)
- Experience with at least one scripting language (Python, Bash, Perl, Golang, or others) to automate daily or corrective processes.
- Knowledge of collaborative platforms (Jira, Confluence, GitLab).
- Good command of English, both written and spoken.
What will you find at SMART?
A dynamic, hard-working and co-operative team;
Career plan and defined objectives;
Initial and ongoing training ;
Follow-up meetings and performance appraisals;
Business bonuses;
Personal and family benefits;
Numerous events, partnerships and internal dynamics;
Seniority level
- Seniority level
Mid-Senior level
Employment type
- Employment type
Full-time
Job function
- Job function
Consulting
- Industries
IT Services and IT Consulting
Referrals increase your chances of interviewing at Smart Consulting by 2x
Get notified about new Site Reliability Engineer jobs in Porto, Porto, Portugal.
Site Reliability Engineer/ Infrastructure Engineer
Site Reliability Engineer/ Infrastructure Engineer
Site Reliability Engineer ID38563 ($3,000 signing bonus)
Mid Site Reliability Engineer (SRE) @Porto
Service and Maintenance Engineer - Software Platform
Senior Site Reliability / Gitops Engineer
Cloud Operations Engineer (m/f/w) - REF86080O - hybrid from any OESL legal entity
We're unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
#J-18808-Ljbffr