dataset

mcp-agent-trajectory-benchmark

obaydata

or hover any field below to flag it

Overview

Name

Source

obaydata

Episodes

Robot count

Format

json

Description

MCP Agent Trajectory Benchmark A benchmark dataset of 49 MCP (Model Context Protocol) agent trajectories (38 single-pass + 11 multi-conv) with complete tool-use traces in the ATIF v1.2 (Agent Trajectory Interchange Format) format. Each agent operates in a distinct business domain with custom tools, realistic user conversations, and full execution traces. Designed for training and evaluating tool-use / function-calling capabilities of LLMs. Overview Item Details… See the full description on the dataset page: https://huggingface.co/datasets/obaydata/mcp-agent-trajectory-benchmark.

Robots used

null

Links

HuggingFace dataset

obaydata/mcp-agent-trajectory-benchmark