Luma AI is hiring a Research Scientist / Engineer – Multimodal Capabilities

About the Role

Luma AI is looking for a Research Scientist / Engineer – Multimodal Capabilities to unlock advanced behaviors in our foundation models. You'll join the Multimodal Capabilities team to conduct strategic research on how combining vision, audio, and language can answer fundamental questions.

What You'll Do

  • Collaborate with the Foundation Models team to identify capability gaps and research solutions.
  • Design datasets, experiments, and methodologies to systematically improve model capabilities across vision, audio, and language.
  • Develop evaluation frameworks and benchmarking approaches for multimodal AI capabilities.
  • Create prototypes and demonstrations that showcase new multimodal capabilities.

What We're Looking For

  • Strong programming skills in Python and PyTorch.
  • Experience with multimodal data processing pipelines and large-scale dataset curation.
  • Understanding of computer vision, audio processing, and/or natural language processing techniques.

Nice to Have

  • Expertise working with interleaved multimodal data.
  • Hands-on experience with Vision Language Models, Audio Language Models, or generative video models.

Technical Stack

  • Python, PyTorch

Team & Environment

You will be part of the Multimodal Capabilities team and collaborate closely with the Foundation Models team.

Benefits & Compensation

  • Salary: $200,000 - $300,000/yr + competitive equity in the form of stock options.
  • A comprehensive benefits plan.

Luma AI is an equal opportunity employer.

Required Skills
Python, PyTorch, Machine Learning, Multimodal AI, Computer Vision, Natural Language Processing, Deep Learning, Research, Model Training, Large-scale Systems
About the Company
Luma AI
Luma AI is a technology company focused on developing advanced multimodal AI foundation models, working on innovative approaches to combining vision, audio, and language data.
Job Details
Category: Data
Posted 8 months ago