dataset
Med-eval-logs
Med2026
or hover any field below to flag it
Overview
Name
Med-eval-logs
Source
Med2026
Episodes
0
Robot count
0
Format
csv
Description
MED Evaluation Logs
Evaluation logs for the MED (Measure-Explain-Diagnose) framework analyzing vision tool-use reinforcement learning.
Paper
What Does Vision Tool-Use Reinforcement Learning Really Learn? Disentangling Tool-Induced and Intrinsic Effects for Crop-and-Zoom
📄 Paper: arXiv:2602.01334
💻 Code: github.com/GAIR-NLP/Med
Dataset Description
This dataset contains evaluation results from vision tool-use RL experiments, tracking model performance across… See the full description on the dataset page: https://huggingface.co/datasets/Med2026/Med-eval-logs.
Robots used
null
Links
HuggingFace dataset