Anunciada dia 5 setembro
Descrição
Job Description
We are seeking a highly skilled IT professional to fill the role of Site Reliability Engineer. This is an exciting opportunity for someone who wants to work with cutting-edge technology and be part of a team that prioritizes innovation and excellence.
The ideal candidate will have experience in managing, operating, and supporting public cloud infrastructures, as well as knowledge of implementing services in cloud environments. They should also have good troubleshooting skills, focusing on system and application security, cost optimization, and resource consumption.
The successful candidate will be responsible for administering on-premises and cloud systems, maintaining cloud and Kubernetes applications, migrating legacy applications to Kubernetes/Openshift and/or the cloud, and providing support for application development teams.
* Cloud Infrastructure Management: Experience in managing, operating, and supporting public cloud infrastructures (GCP, Azure, AWS)
* Service Implementation: Knowledge of implementing services (new and migrations) in cloud environments (IaaS/PaaS/SaaS/AI/Kubernetes/others)
* Cloudflare Implementation: Knowledge or experience with the implementation and management of services in Cloudflare
* Troubleshooting and Security: Good troubleshooting/post-mortem skills, focusing on system and application security
* Cost Optimization: Focusing on cost and value of resources and consumption in the cloud
* Operating System Administration: Operating system administration: scripting, hardening, tuning
* Monitoring and Operational Support: Knowledge of monitoring and operational support platforms (Nagios, Prometheus, Dynatrace, etc.)
* Networking and Application Balancing: Knowledge of networking and application and service balancing
* Container Infrastructure: Knowledge and experience in container infrastructures such as Docker and Kubernetes
* Programming Knowledge: Programming knowledge (Shell Scripting, Python, Perl, Go, etc.)
* CICD: Knowledge of CI/CD
* Operational Automation: Knowledge of operational automation (Ansible/Terraform/Helm, etc.)
* ITIL Processes: Knowledge of key ITIL processes
Why Join Us?
At our company, we believe in fostering a culture of excellence and investing in the development and well-being of our employees. We offer a dynamic and supportive work environment that encourages innovation, teamwork, and continuous learning.
As a Site Reliability Engineer, you will have the opportunity to work with a talented team of professionals who share your passion for technology and commitment to excellence. You will also have access to ongoing training and development opportunities to help you grow professionally and personally.