Infrastructure Reliability Specialist
100% Remote Position
Long-Term Contract Opportunity
Service Reliability Team Member
Job Description
The Infrastructure Reliability Specialist will provide ongoing support, monitoring, and troubleshooting to ensure the reliability and performance of our IT infrastructure. This role will work closely with our technical team and be located in a time zone that allows for seamless collaboration.
Key Responsibilities:
* Continuously monitor the health and performance of our IT infrastructure, utilizing various tools and technologies to detect and respond to incidents in real-time.
* Perform regular maintenance activities to ensure optimal performance and make recommendations for process improvements.
* Collaborate with our DevOps team to address and fix incidents, document root causes, and communicate resolutions to stakeholders.
* Develop and implement automation scripts to streamline monitoring and troubleshooting processes.
* Participate in post-incident reviews to identify areas for improvement and enhance our monitoring and support processes.
Required Skills and Qualifications:
* Strong understanding of IT infrastructure and systems.
* Excellent problem-solving and analytical skills.
* Ability to work independently and collaboratively as part of a team.
* Effective communication and documentation skills.
* Experience with automation tools and scripting languages.
Benefits:
* Opportunity to work on high-priority projects and contribute to the growth and development of our IT infrastructure.
* Collaborative and dynamic work environment.
* Ongoing training and professional development opportunities.
Other Information:
* This is a long-term contract position with a minimum duration of 12 months.
* The successful candidate will be required to work from home full-time, with occasional remote meetings and collaborations.