Design RLHF Failure Benchmark
Költségvetés: -
/
⭐ (0)
Artificial Intelligence, Data Analysis, Deep Learning, Documentation, Machine Learning (ML), Python, Reinforcement Learning, RLHF
I want to commission a small-scale benchmark that reliably exposes where reinforcement-learning-from-human-feedback (RLHF) agents break down when the information they receive is partial, ambiguous, or internally inconsistent... (Budget: €30 - €250 EUR, Jobs: Artificial Intelligence, Data Analysis, Deep Learning, Documentation, Machine Learning (ML), Python, Reinforcement Learning, RLHF)
Megnyitás Upworkön