dataset
embodied-action-outcome-coherence-v0.1
ClarusC64
Overview
Name
embodied-action-outcome-coherence-v0.1
Source
ClarusC64
Episodes
0
Robot count
0
Format
csv
Description
Embodied Action–Outcome Coherence v0.1
What this tests
Whether an embodied agent updates world state from observed outcomes
Whether it avoids claiming success when the outcome says failure
Failure modes
outcome_ignored: Response does not reflect the true post-action state
false_success: Response claims success despite an observed failure
causal_update_ok: Response states the correct post-action state without contradiction
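The three labels above can be read as a small decision rule. The following is a minimal sketch only, not the dataset's actual scoring method, which this description does not show. The function name classify and its four inputs (observed_success, claims_success, stated_state, true_state) are hypothetical and assume a response has already been reduced to a success claim and a stated post-action state.

```python
from enum import Enum


class Label(str, Enum):
    """Label names as listed in the description above."""
    OUTCOME_IGNORED = "outcome_ignored"
    FALSE_SUCCESS = "false_success"
    CAUSAL_UPDATE_OK = "causal_update_ok"


def classify(observed_success: bool, claims_success: bool,
             stated_state: str, true_state: str) -> Label:
    """Hypothetical rule: map one response onto one of the three labels.

    All four parameters are illustrative assumptions, not dataset columns.
    """
    # A success claim that contradicts an observed failure takes priority.
    if claims_success and not observed_success:
        return Label.FALSE_SUCCESS
    # Otherwise compare the stated post-action state with the true one
    # (case- and whitespace-insensitive exact match, a deliberate simplification).
    if stated_state.strip().lower() != true_state.strip().lower():
        return Label.OUTCOME_IGNORED
    return Label.CAUSAL_UPDATE_OK


if __name__ == "__main__":
    # Made-up example values, not taken from the dataset.
    print(classify(False, True, "cup on table", "cup on floor").value)
```

Checking false_success first is a design choice in this sketch: a response that claims success after a failed action usually also misstates the post-action state, so the order decides which label wins. Exact string matching is a stand-in for whatever state comparison a real evaluator would use.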
How it works
world_facts_t0 is the initial state
action_taken is what the…
See the full description on the dataset page: https://huggingface.co/datasets/ClarusC64/embodied-action-outcome-coherence-v0.1
Robots used
null
Links
HuggingFace dataset: https://huggingface.co/datasets/ClarusC64/embodied-action-outcome-coherence-v0.1