dataset

Med-eval-logs

Med2026

or hover any field below to flag it

Overview

Name
Med-eval-logs
Source
Med2026
Episodes
0
Robot count
0
Format
csv
Description
MED Evaluation Logs Evaluation logs for the MED (Measure-Explain-Diagnose) framework analyzing vision tool-use reinforcement learning. Paper What Does Vision Tool-Use Reinforcement Learning Really Learn? Disentangling Tool-Induced and Intrinsic Effects for Crop-and-Zoom 📄 Paper: arXiv:2602.01334 💻 Code: github.com/GAIR-NLP/Med Dataset Description This dataset contains evaluation results from vision tool-use RL experiments, tracking model performance across… See the full description on the dataset page: https://huggingface.co/datasets/Med2026/Med-eval-logs.
Robots used
null

Links

HuggingFace dataset