Location: United States of America (Remote)

DataGalaxy is hiring a Software Engineer (GenAI / MLOps & ModelOps) - AI-Share Team

About the Role

Join a growing team building core infrastructure for Generative AI capabilities in a data governance environment. You'll focus on ModelOps and MLOps to ensure AI systems are robust, traceable, and efficiently managed in production. This role is central to enabling reliable deployment and operation of LLMs, retrieval-augmented generation (RAG), and autonomous agents.

What You'll Do

  • Advance the ModelOps platform by integrating GenAI providers, automating deployments, and refining configuration management and operational tooling.
  • Design and implement repeatable patterns for running GenAI in production, including evaluation frameworks, version control, reproducible workflows, and safe environment transitions.
  • Enhance CI/CD pipelines tailored to AI workloads, incorporating packaging, automated validation, deployment logic, and rollback mechanisms.
  • Strengthen traceability across AI artifacts such as model configurations, prompts, evaluation results, and version histories to support debugging and governance.
  • Build observability into AI systems by tracking latency, availability, cost, usage patterns, and quality metrics through dashboards and alerting.
  • Develop and refine GenAI-powered features including agent logic, RAG pipelines, and model control protocols, balancing innovation with stability.
  • Collaborate with product, data, and engineering teams to integrate AI functionality in a sustainable, maintainable way.
  • Participate in code reviews, incident analysis, and documentation efforts to ensure system reliability and knowledge sharing.

What We’re Looking For

  • A practical mindset with strong curiosity and a commitment to steady, thoughtful progress.
  • Proven experience contributing to MLOps or ModelOps platforms and workflows.
  • Proficiency in Python for automation, tooling, and integration tasks.
  • Hands-on experience with cloud platforms and managed GenAI services such as Azure AI Foundry, AWS Bedrock, or GCP Vertex.
  • Familiarity with containerization (Docker), CI/CD systems, and observability tools.
  • Ability to collaborate across technical teams and work effectively in polyglot environments.

Required Skills

Python, Azure AI Foundry, AWS Bedrock, GCP Vertex, CI/CD, Docker, containers, observability tooling, MLOps, ModelOps, GenAI, GenAI integration, production deployment, monitoring, cost control, traceability, cloud services, automation, polyglot systems, metadata management

About the Company
DataGalaxy
DataGalaxy is a leading data and AI product governance platform that enables organizations to connect strategy, product management, discovery, and business impact. The company is trusted by over 200 global enterprises and is committed to driving data culture and literacy.

Job Details
Category: Data
Posted: 2 months ago