from Hacker News

Show HN: Try test-driven agent development in this holiday prompting challenge

by lbeurerkellner on 12/23/24, 3:51 PM with 0 comments

Our team at Invariant has built an awesome agent testing challenge for the holidays. It showcases how agent development can benefit from concepts known from test-driven software engineering.

You can play the challenge at the linked URL. You are given a pre-existing agent implementation in which you can only change the system prompt. Your goal is to make a small suite of unit tests pass. The tool implementations are complete, but the agent needs to be instructed to use them properly.
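To give a rough feel for that loop, here is a minimal, self-contained sketch. It is not the challenge's code and does not use Invariant's API: the toy agent, the `get_weather` tool, and the `check_uses_weather_tool` test are all hypothetical stand-ins, and a simple keyword check replaces the real LLM. The only thing you are allowed to vary is the system prompt.

```python
# Hypothetical sketch: none of these names come from the challenge or from
# Invariant's libraries. A keyword check stands in for the real LLM.
from dataclasses import dataclass, field


@dataclass
class ToolCall:
    name: str
    args: dict


@dataclass
class Trace:
    calls: list = field(default_factory=list)


def get_weather(city: str) -> str:
    """A complete, fixed tool implementation (stand-in for the challenge's tools)."""
    return f"Sunny in {city}"


def run_agent(system_prompt: str, user_message: str) -> Trace:
    """Toy agent: it only calls a tool that the system prompt mentions by name."""
    trace = Trace()
    if "get_weather" in system_prompt and "weather" in user_message.lower():
        # Take the last word of the question as the city (toy parsing).
        city = user_message.rstrip("?").split()[-1]
        get_weather(city)
        trace.calls.append(ToolCall("get_weather", {"city": city}))
    return trace


def check_uses_weather_tool(system_prompt: str) -> bool:
    """Unit test over the trace: the agent must call get_weather for Zurich."""
    trace = run_agent(system_prompt, "What is the weather in Zurich?")
    return any(
        call.name == "get_weather" and call.args == {"city": "Zurich"}
        for call in trace.calls
    )


# The test fails with a generic prompt and passes once the prompt
# tells the agent which tool to use.
failing_prompt = "You are a helpful assistant."
passing_prompt = "Use the get_weather tool for weather questions."
```

In this sketch the test asserts on the recorded tool calls rather than on the final text answer, which is the part that carries over from test-driven development: you write the expectation first, then iterate on the prompt until it holds.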

The challenge was created to showcase two open-source projects we released recently: a trace viewer for agent trajectories (Explorer, https://github.com/invariantlabs-ai/explorer) and a library for building agent unit tests (Testing, https://github.com/invariantlabs-ai/invariant).

Let us know what you think. We'll be around in the comment section. Happy holidays everyone!