← Jobb

Python developer for AI Model Response Evaluator

Budget: $15.0 - $25.0 HOURLY / FULL_TIME ⭐ 4.52 (4) USA

python, artificial-intelligence, natural-language-processing

Looking for an experienced developer to evaluate AI-generated coding responses. Python background is strongly preferred. This requires genuine engineering judgment, you need to understand code well enough to critically assess whether a solution is correct, complete, and well-reasoned. Not a coding job, but real coding experience is a must. Who I'm looking for: - Python-heavy background - Comfortable reading and reviewing code, not just writing it - Clear, direct communicator with evidence-based reasoning - Detail-oriented and consistent Prior experience with AI labeling is a plus but not required. To apply, tell me: - Your engineering background - languages, domains, years of experience - One example where you reviewed someone else's code and caught a real issue - Your availability and preferred payment structure - Github link
Öppna på Upwork