dataset

Med-eval-logs

Med2026

or hover any field below to flag it

Overview

Name

Med-eval-logs

Source

Med2026

Episodes

Robot count

Format

csv

Description

MED Evaluation Logs Evaluation logs for the MED (Measure-Explain-Diagnose) framework analyzing vision tool-use reinforcement learning. Paper What Does Vision Tool-Use Reinforcement Learning Really Learn? Disentangling Tool-Induced and Intrinsic Effects for Crop-and-Zoom 📄 Paper: arXiv:2602.01334 💻 Code: github.com/GAIR-NLP/Med Dataset Description This dataset contains evaluation results from vision tool-use RL experiments, tracking model performance across… See the full description on the dataset page: https://huggingface.co/datasets/Med2026/Med-eval-logs.

Robots used

null

Links

HuggingFace dataset

Med2026/Med-eval-logs