Evaluator Agent

HOW IT WORKS

  • Worker Agent attempts the task using browser tools.
  • Evaluator Agent checks the result against your criteria.
  • If rejected, Worker retries with feedback.
  • Process repeats until success or clarification needed.
LIVE EXECUTION LOG

Ready to evaluate tasks.