Accepted evidence
Submit via the official submission page with your videos and metadata JSON.
Submission format
Zip your run package with the task/episode folder structure shown in the Submission section.
Update cadence
Submissions ingest daily, verification weekly, leaderboard publish weekly (UTC).
Leaderboard
Hugging Face Static Space format using results.json.
Loading results.json...
| Rank | Team / Model | Composite | Success | Avg Return | Submitted | Status |
|---|
Submission
Step 1
Prepare videos, metadata JSON, and contact details.
Step 2
Use the submission page to upload or register your zip package and metadata JSON with team/contact information.
Step 3
We verify evidence and publish validated results in weekly leaderboard updates.
submission_package/ --task1/ --episode_0.mp4 --episode_1.mp4 --... --task2/ --episode_0.mp4 --... --metadata.json
Submission JSON schema
{
"team_name": "Your Team Name",
"contact_email": "team@example.com",
"affiliation": "Example University",
"model_name": "Your Model Name",
"model_version": "v1.1",
"eval_setting": "Open-vocab tabletop; seed=5; 50 episodes",
"metrics": {
"composite_score": 89.3,
"success_rate": 83.0,
"avg_return": 71.2
},
"videos": [
{ "label": "long-horizon demo", "url": "https://huggingface.co/spaces/your-space/video1" }
],
"notes": "Any deviations from reference protocol."
}
Ranking updates and governance
- Ingest daily: new submissions are queued.
- Validate weekly: videos and metadata are checked.
- Publish weekly: rankings refresh after validation.
- Tie-break: composite score, then success rate, then earlier submission.