
Python - Software Engineer, AI

Work from home Full-time role Hiring

Before applying

This role is open to contractors in accepted locations only. Please confirm your country is on the list before applying; we're unable to process applications from unlisted locations. List of accepted countries and locations.

For US applicants

This is a 1099 independent contractor role. It is not compatible with F-1 OPT, STEM OPT, or any visa status that requires W-2 employment, guaranteed hours, or employer sponsorship. We are unable to provide offer letters or employment verification for this role.

What You'll Be Doing

Help train large language models (LLMs) to write production-grade code across a wide range of programming languages:

  • Compare and rank multiple code snippets, explaining which is best and why
  • Repair and refactor AI-generated code for correctness, efficiency, and style
  • Inject feedback (ratings, edits, test results) into the RLHF pipeline and keep it running smoothly
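
As a rough illustration of the ranking task above, one common way to turn a ranked list of snippets into training feedback is to expand it into pairwise preferences. This is a minimal sketch, not a description of the actual pipeline used for this role; the function name `ranking_to_pairs` and the snippet IDs are hypothetical.

```python
from itertools import combinations


def ranking_to_pairs(ranked_ids):
    """Expand a best-to-worst ranking into (preferred, rejected) pairs.

    Hypothetical helper: assumes ranked_ids is ordered from best to worst.
    """
    # combinations() preserves input order, so the first element of each
    # pair always ranks higher than the second.
    return list(combinations(ranked_ids, 2))


# A reviewer ranked three hypothetical snippets, best first.
pairs = ranking_to_pairs(["snippet_b", "snippet_a", "snippet_c"])
for preferred, rejected in pairs:
    print(f"{preferred} preferred over {rejected}")
```

Each pair records only that one snippet was judged better than another; the written justification a reviewer provides would travel alongside it.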

End result: the model learns to propose, critique, and improve code the way you do.

RLHF in one line: Generate code → expert engineers rank, edit, and justify → convert that feedback into reward signals → reinforcement learning tunes the model toward code you'd actually ship.

What You'll Need

  • 3+ years of professional software engineering experience in Python (constraint programming experience is a bonus, but not required)
  • Strong code-review instincts — you can spot logic errors, performance traps, and security issues quickly
  • Extreme attention to detail and excellent written communication skills. Much of this role involves explaining why one approach is better than another. This cannot be overstated.
  • Comfortable reading documentation and language specs, and able to work well in an asynchronous, low-oversight environment

Identity verification: Applicants will be required to verify their identity and confirm they have valid documentation to work as an independent contractor in their country of residence.

What You Don't Need

  • No prior RLHF or AI training experience

Logistics

  • Location: Fully remote — work from anywhere on the accepted locations list
  • Compensation: $30–$70/hr based on location and seniority. Note: the majority of projects run at around $30/hr — higher rates apply to senior profiles and specific project types
  • Hours: Minimum 15 hrs/week, up to 40+ hrs/week available — hours vary by project and are not guaranteed week to week
  • Engagement: 1099 independent contractor
  • Payment: Weekly via PayPal or Stripe

Important: Hours are project-dependent and can vary week to week. We recommend keeping other work options open alongside this engagement rather than relying on it as your sole source of income.
