An Agent-Computer-Interface project from Seattle, Washington.
Build Multi-Agent Systems for Data Generation
A Comprehensive Benchmark to Evaluate LLMs as Agents