As a Machine Learning Engineer on the Video AI Platform team, you'll develop and maintain the core systems that power video understanding, summarization, and classification. Your work will center on building scalable pipelines for fine-tuning and retraining models, including LoRA-based workflows and large language models, ensuring they remain accurate and efficient over time.
Key Responsibilities
- Design and manage end-to-end ML pipelines for model training, versioning, and evaluation
- Implement and maintain vector search and semantic similarity systems for content retrieval
- Architect model serving solutions for open-source and foundation models at scale
- Establish RAG pipelines to support video metadata extraction, summarization, and question answering
- Monitor live models for performance drift and trigger retraining when needed
- Optimize compute usage across distributed training environments to control costs
- Integrate human-in-the-loop labeling into training workflows for improved data quality
- Support deployment of diverse model types, including vision-language models and embedding generators
- Build and maintain a feature store with robust data versioning capabilities
- Develop evaluation frameworks to measure LLM accuracy, hallucination rates, and structured output reliability
Qualifications
You bring at least five years of hands-on experience in machine learning engineering, with a strong foundation in full lifecycle ML development. You're skilled in deploying models using tools like Triton, TorchServe, or Ray Serve, and have experience managing distributed compute platforms such as Kubernetes, Ray, or SageMaker.
Proficiency with experiment tracking systems like MLflow or Weights & Biases is essential, as is familiarity with model versioning practices. You have deep knowledge in NLP, recommendation systems, and reinforcement learning, and you thrive in Agile settings while collaborating with global teams.
Experience with real-time inference, streaming data, or human-in-the-loop workflows is a plus. You're comfortable working across the stack to deliver reliable, production-grade AI services that power intelligent video applications.


