Join a team advancing AI's ability to write reliable, efficient code by serving as a technical guide for machine learning models. Your primary responsibility will be evaluating AI-generated Python code—comparing multiple solutions, determining the strongest implementation, and clearly articulating the reasoning behind your assessment.
What You'll Do
- Review and rank code outputs based on correctness, efficiency, readability, and adherence to best practices
- Refactor and correct flawed or suboptimal code to meet production standards
- Provide structured written feedback that informs model training through reinforcement learning from human feedback (RLHF)
- Run test cases and validate fixes to ensure code reliability
- Help refine the feedback loop so models progressively improve at writing, reviewing, and enhancing code
What We're Looking For
- At least three years of professional experience developing in Python
- Strong analytical skills with the ability to quickly identify logic flaws, inefficiencies, and potential security vulnerabilities
- Exceptional attention to detail and clarity in written explanations
- Comfort interpreting technical documentation and language specifications
- Proven ability to work independently in an asynchronous environment
- Eligibility to work as an independent contractor in your country
- Ability to verify identity as part of onboarding
Preferred Experience
- Background in constraint programming or formal logic systems
Work Environment
This is a fully remote role with no required office location. Work is asynchronous, allowing you to contribute on your own schedule. Weekly hours range from a minimum of 15 to over 40, depending on project needs. Payment is issued weekly via PayPal or Stripe. Compensation ranges from $30 to $70 per hour, with most work falling at the $30/hour rate, adjusted for location and experience level.


