dataset

mcp-agent-trajectory-benchmark

obaydata

Overview

Name
mcp-agent-trajectory-benchmark
Source
obaydata
Episodes
0
Robot count
0
Format
json
Description
MCP Agent Trajectory Benchmark: a benchmark dataset of 49 MCP (Model Context Protocol) agent trajectories (38 single-pass and 11 multi-conv) with complete tool-use traces in ATIF v1.2 (Agent Trajectory Interchange Format). Each agent operates in a distinct business domain with custom tools, realistic user conversations, and full execution traces. The dataset is designed for training and evaluating the tool-use / function-calling capabilities of LLMs. … See the full description on the dataset page: https://huggingface.co/datasets/obaydata/mcp-agent-trajectory-benchmark.
Robots used
null

Links