Festivus
sim
medium
eval_dataset
metric · varies
MCP Agent Trajectory Benchmark
Description
Hugging Face evaluation dataset: obaydata/mcp-agent-trajectory-benchmark
Source
https://huggingface.co/datasets/obaydata/mcp-agent-trajectory-benchmark
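A minimal sketch of how the dataset might be fetched, assuming the Hugging Face `datasets` library is installed and that the repository loads with its default configuration. Neither is confirmed by this page. The helper name `load_benchmark` is hypothetical, and the split and column names are not documented here, so they should be inspected after loading.

```python
# Repository ID and URL as listed on this page.
REPO_ID = "obaydata/mcp-agent-trajectory-benchmark"
DATASET_URL = f"https://huggingface.co/datasets/{REPO_ID}"


def load_benchmark(repo_id: str = REPO_ID):
    """Hypothetical helper: download all available splits (needs network access).

    Assumption: the repository has a single default configuration. If it
    defines several, load_dataset will ask for a configuration name.
    """
    # Imported lazily so the constants above are usable without the dependency.
    from datasets import load_dataset

    return load_dataset(repo_id)
```

Calling `load_benchmark()` and printing the result shows which splits and columns the repository actually provides.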