dataset
mcp-agent-trajectory-benchmark
obaydata
or hover any field below to flag it
Overview
Name
mcp-agent-trajectory-benchmark
Source
obaydata
Episodes
0
Robot count
0
Format
json
Description
MCP Agent Trajectory Benchmark
A benchmark dataset of 49 MCP (Model Context Protocol) agent trajectories (38 single-pass + 11 multi-conv) with complete tool-use traces in the ATIF v1.2 (Agent Trajectory Interchange Format) format. Each agent operates in a distinct business domain with custom tools, realistic user conversations, and full execution traces.
Designed for training and evaluating tool-use / function-calling capabilities of LLMs.
Overview
Item
Details… See the full description on the dataset page: https://huggingface.co/datasets/obaydata/mcp-agent-trajectory-benchmark.
Robots used
null
Links
HuggingFace dataset