List of accepted countries and locations Remote (Global)

G2i is hiring a Machine Learning Evaluation Specialist (Remote)

About the Role

Role Overview

A Machine Learning Evaluation Specialist is needed to develop and assess advanced, research-oriented problems that test the limits of artificial intelligence systems. This fully remote position demands deep subject-matter expertise and the ability to create evaluation tasks that go beyond standard machine learning workflows.

Key Responsibilities

  • Formulate original, research-level machine learning challenges grounded in specialized domain knowledge
  • Create evaluation frameworks that require insight beyond typical ML pipelines
  • Review AI-generated responses for technical correctness, innovation, and sound methodology
  • Identify and articulate specific shortcomings in proposed solutions
  • Document the complexity of each problem, required expertise, and anticipated failure patterns

Qualifications

Candidates must demonstrate advanced understanding in a scientific or technical field connected to machine learning. A graduate degree (MS or PhD preferred) is required. You should have hands-on familiarity with core ML practices such as model selection, feature engineering, and performance evaluation.

You must be deeply aware of current research frontiers in your domain—the kind of knowledge that reveals where conventional AI approaches break down. Exceptional written communication skills are essential for clearly explaining complex problems and critiques.

This role requires self-direction and the ability to thrive on intellectually rigorous, independent work.

Work Structure

This is a freelance, project-based contract position with no guaranteed hours. Work is conducted remotely on an independent contractor (1099) basis. Candidates must pass an evaluation task to qualify for paid work. Weekly availability ranges from 10 to 40 hours, depending on project needs.

Compensation

Hourly rates range from $200 to $400, based on domain specialization and experience level. Work is offered on a per-project basis with flexible scheduling.

Required Skills
Machine LearningModel SelectionFeature EngineeringEvaluation MetricsActive ResearchScientific Domain ExpertiseWritten CommunicationIndependent WorkML MethodsTechnical Writing model selectionfeature engineeringevaluation metricsmachine learningactive researchwritten communicationindependent workproblem solvingscientific domain expertiseML methods
Planning long-term in Thailand?

Full relocation support, start to finish

From visa strategy to housing, banking, and schools for your family — SVBL plans and manages every detail of your move to Thailand so nothing falls through the cracks.

Complete relocation planning
Family visa & school enrollment
Banking & insurance setup
Cultural integration support
Plan your move
One partner for everything
About company
G2i

G2i is a video-based platform for hiring contract or full-time engineers, designed to help companies hire world-class talent quickly and efficiently. Since 2016, G2i has focused on reducing hiring noise by using video-based technical screening and assessments to increase hiring signal.

The company emphasizes quality, speed, and flexibility, offering a 7-day free trial and matching engineers in days rather than months. G2i serves startups to enterprises and supports hiring across the US, Canada, Latin America, and Europe.

Born in the open-source ecosystem, G2i actively gives back through initiatives like React Miami, the Developer Health Fund, and Dev Health OS. The platform specializes in frontend, backend, full-stack, mobile, infrastructure, data science, product management, and product design roles.

All jobs at G2i Visit website
Job Details
Category data
Posted 24 days ago