This is really exciting! I'd love to test my own coding agents against your benchmark and see how they perform. I'm always looking to push the boundaries of what's possible with AI.