Senior MLOps Engineer Job Description
We are seeking an experienced Senior MLOps Engineer to join our team. As a key member of our organization, you will play a crucial role in shaping our hybrid cloud environment's MLOps strategy.
Key Responsibilities
* MLOps Strategy & Architecture
o Collaborate with the team to develop a comprehensive MLOps strategy that aligns with business objectives, security requirements, and industry best practices
o Design a self-service approach to each stage of the MLOps loop: data ingestion, feature engineering, model training, model validation, model deployment, and model monitoring
o Evaluate and select suitable MLOps tools and technologies
o Develop and maintain thorough documentation for MLOps processes and infrastructure
* Hybrid MLOps Loop Implementation
o Design and implement Infrastructure as Code (IaC) solutions for provisioning and managing cloud and on-premises resources using Terraform
o Implement robust security measures to safeguard sensitive data and ML models in both cloud and on-premises environments
o Implement and/or integrate tools to deliver the full MLOps loop
o Develop and implement monitoring dashboards and alerts to proactively identify and resolve MLOps platform issues
* Collaboration & Leadership
o Work closely with data scientists, software engineers, and infrastructure teams to deliver a high-quality MLOps solution
o Provide technical leadership and mentorship to junior engineers
o Communicate effectively with stakeholders at all levels, including technical and non-technical audiences
o Stay up to date with the latest MLOps trends and technologies
o Participate in code reviews and contribute to the development of best practices
Requirements
* A Master's or Ph.D. degree in Computer Science, Data Science, Machine Learning, or a related field
* At least 3 years of experience in MLOps, Data Engineering, or similar roles, with a strong focus on AI/ML use cases
* Strong knowledge of cloud platforms, preferably Azure
* Experience with on-premises infrastructure management, including server management, networking, and storage
* Experience with containerization and orchestration tools such as Docker, Kubernetes, and Apache Airflow
* Strong scripting and automation skills in languages such as Python and Bash
* Strong understanding of DevOps principles and practices
* Experience with CI/CD pipelines, preferably GitHub Actions
* Proficiency in IaC tools, preferably Terraform
Benefits
* A dynamic work environment with opportunities for career growth and professional development
* Flexible working conditions and a hybrid work model
* Exchange with colleagues worldwide and access to additional perks