We are seeking an experienced System Reliability Engineer to join our team. The ideal candidate will have a strong background in maintaining availability, automating release/deploy processes, and ensuring seamless monitoring and alerting of complex software solutions.
The successful candidate will work closely with developers to prototype and design new features as part of the infrastructure. They will be responsible for deploying, installing, configuring, and maintaining sophisticated trading/finance software.
Key Responsibilities:
* Maintain availability and reliability of complex software systems
* Automate release/deploy processes using Infrastructure as Code
* Design and implement monitoring and alerting systems
* Collaborate with developers to prototype and design new features
* Deploy, install, configure, and maintain sophisticated trading/finance software
Qualifications:
* 5+ years of experience in system administration or DevOps
* Strong experience with OS-level administration on Linux and/or UNIX
* Experience with Infrastructure as Code (IaC) using Ansible and/or Terraform
* Experience with Docker containers and orchestration tools
* In-depth knowledge of the TCP/IP stack and the ISO/OSI model
* Experience with monitoring and logging tools (Zabbix, Elasticsearch or OpenSearch, Grafana, Kibana, etc.)
* English proficiency at B2 level or higher