by lbeurerkellner on 12/23/24, 3:51 PM with 0 comments
You can play the challenge at the linked URL. You are given a pre-existing agent implementation where you can only change the system prompt and your goal is to make a small test suite of unit tests pass. The tool implementations are complete, but the agent needs to be instructed to use them properly.
The challenge was created to showcase two open source projects we released recently: A trace viewer for agent trajectories (Explorer, https://github.com/invariantlabs-ai/explorer) and a library for building agent unit tests (Testing, https://github.com/invariantlabs-ai/invariant).
Let us know what you think, we'll be around in the comment section. Happy holidays everyone!