OPSPROS is seeking a Machine Learning Infrastructure Engineering Lead to spearhead the development and optimization of our machine learning infrastructure. This role requires collaboration with software engineering teams to design scalable, efficient systems for deploying machine learning models.
Key Responsibilities:
Lead the design and implementation of robust machine learning infrastructure solutions.
Collaborate with software engineers to integrate ML models into production environments.
Ensure high availability and scalability of machine learning systems.
Develop best practices for data management, model training, and deployment.
Monitor system performance and implement improvements.
Qualifications:
Proven experience in machine learning infrastructure and systems engineering.
Strong proficiency in programming languages such as Python and Java.
Familiarity with cloud platforms (AWS, GCP, Azure) and containerization technologies (Docker, Kubernetes).
Excellent problem-solving skills and ability to work in a remote team environment.
Experience with CI/CD pipelines for machine learning applications.