DevJobs

Senior ML Engineer - OpenShift AI

Overview
Skills
  • Python Python
  • Git Git
  • AWS AWS
  • Azure Azure
  • GCP GCP
  • Kubernetes Kubernetes
  • Grafana Grafana
  • OpenShift
  • OpenTelemetry
  • Prometheus Prometheus
Job Summary

Are you ready to join a game-changing open-source AI platform that harnesses the power of hybrid cloud to drive innovation?

The Red Hat OpenShift AI (RHOAI) team is looking for a Senior Machine Learning Engineer with experience in building, scaling, and monitoring AI/ML systems to join our rapidly growing engineering team. Our focus is to create a platform, partner ecosystem, and community by which enterprise customers can solve problems to accelerate business success using AI. This is a very exciting opportunity to shape the observability and reliability of GenAI workloads, contribute to the development of the RHOAI product, participate in open source communities, and be at the forefront of the exciting evolution of AI. You’ll join an ecosystem that fosters continuous learning, career growth, and professional development.

As a core ML engineer for one of our OpenShift AI teams, you will have the opportunity to design and build systems that monitor, validate, and improve AI model performance in production. You will work as part of an evolving development team to rapidly design, secure, build, test, and release new capabilities. The role is primarily an individual contributor who collaborates closely with other ML engineers, software developers, and cross-functional teams. You should have a passion for observability, MLOps, and building robust systems for real-world AI.

What You Will Do

  • Design and build observability and optimization tools for large-scale GenAI workloads running on Kubernetes
  • Develop systems to collect and analyze model performance metrics, logs, and resource usage in real-time
  • Innovate in the MLOps and AI observability domain by contributing to upstream communities
  • Collaborate with product, engineering, and research teams to improve model trust and performance
  • Write unit and integration tests and work with quality engineers to ensure product quality
  • Use CI/CD best practices to deliver solutions into RHOAI as part of our productization efforts
  • Contribute to a culture of continuous improvement by sharing technical knowledge and insights
  • Communicate effectively with stakeholders and team members to ensure visibility of ML performance
  • Represent RHOAI in external engagements including open source communities and customer meetings
  • Mentor and guide junior engineers and contribute to team growth

What You Will Bring

  • Experience in machine learning engineering, with a focus on production-grade systems
  • Proficiency in Python with a focus on AI/ML infrastructure or tooling
  • Experience working with Kubernetes, OpenShift, or other cloud-native platforms
  • Familiarity with ML observability tools (e.g. Prometheus, OpenTelemetry, and Grafana)
  • Hands-on experience with source control tools such as Git
  • Passion for open-source technology and collaborative development
  • Strong troubleshooting skills and system-level thinking
  • Ability to work autonomously and thrive in a fast-paced environment
  • Excellent written and verbal communication skills

The Following Will Be Considered a Plus

  • Master’s degree or higher in computer science, machine learning, or related discipline
  • Contributions to open-source projects, especially in the MLOps or ML observability domain
  • Experience with public cloud services (AWS, GCP, Azure)
  • Background in developing or deploying MLOps platforms or AI monitoring tools

About Red Hat

Red Hat is the world’s leading provider of enterprise open source software solutions, using a community-powered approach to deliver high-performing Linux, cloud, container, and Kubernetes technologies. Spread across 40+ countries, our associates work flexibly across work environments, from in-office, to office-flex, to fully remote, depending on the requirements of their role. Red Hatters are encouraged to bring their best ideas, no matter their title or tenure. We're a leader in open source because of our open and inclusive environment. We hire creative, passionate people ready to contribute their ideas, help solve complex problems, and make an impact.

Inclusion at Red Hat

Red Hat’s culture is built on the open source principles of transparency, collaboration, and inclusion, where the best ideas can come from anywhere and anyone. When this is realized, it empowers people from different backgrounds, perspectives, and experiences to come together to share ideas, challenge the status quo, and drive innovation. Our aspiration is that everyone experiences this culture with equal opportunity and access, and that all voices are not only heard but also celebrated. We hope you will join our celebration, and we welcome and encourage applicants from all the beautiful dimensions that compose our global village.

Equal Opportunity Policy (EEO)

Red Hat is proud to be an equal opportunity workplace and an affirmative action employer. We review applications for employment without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, citizenship, age, veteran status, genetic information, physical or mental disability, medical condition, marital status, or any other basis prohibited by law.

Red Hat does not seek or accept unsolicited resumes or CVs from recruitment agencies. We are not responsible for, and will not pay, any fees, commissions, or any other payment related to unsolicited resumes or CVs except as required in a written contract between Red Hat and the recruitment agency or party requesting payment of a fee.

Red Hat supports individuals with disabilities and provides reasonable accommodations to job applicants. If you need assistance completing our online job application, email application-assistance@redhat.com. General inquiries, such as those regarding the status of a job application, will not receive a reply.

Red Hat