CUSTOM GRADE · VIDEO

Egocentric Task Demonstrations

First-person video of real-world task completion with synchronized audio, hand-tracking, and environmental metadata for robotics and embodied AI.

Frame rate: Up to 60 fps
Quality: Multi-layer QC
Availability: Sample clips on request

[ OVERVIEW ]

First-person video capturing how humans actually complete real-world tasks: cooking, repair, assembly, navigation, craft, and household work. Recordings include synchronized audio narration, hand-tracking data, environmental metadata, and step-level task annotations. Built for teams training embodied AI agents, robotic manipulation policies, and task-understanding models. Every contributor has consented to derivative-work usage, and demographic metadata is kept on file for each contributor.

[ KEY HIGHLIGHTS ]

  • First-person perspective with synchronized audio narration
  • Hand-tracking and keypoint data where the task and lighting allow
  • Step-level task annotations with verb-object labels per action
  • Environmental metadata: location class, lighting, clutter, object count
  • 1080p video at 30-60 fps, captured with standard or action cameras
  • Diverse task domains: cooking, repair, assembly, navigation, craft
  • Licensed per task domain or across the full egocentric corpus
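
As a rough illustration of how the highlights above could fit together in a single record, here is a minimal Python sketch. The class and field names (`StepAnnotation`, `Environment`, `ClipRecord`, `location_class`, `hand_keypoints_path`, and so on) and all example values are hypothetical, not taken from the delivered schema; confirm the actual field names against sample clips.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class StepAnnotation:
    # One step-level action labeled as a verb-object pair (hypothetical fields).
    start_sec: float
    end_sec: float
    verb: str
    obj: str


@dataclass
class Environment:
    # Environmental metadata named in the highlights; the value vocabularies are assumed.
    location_class: str
    lighting: str
    clutter: str
    object_count: int


@dataclass
class ClipRecord:
    clip_id: str
    task_domain: str
    fps: int
    environment: Environment
    steps: List[StepAnnotation] = field(default_factory=list)
    # Hand tracking is present only where the task and lighting allow, so it is optional.
    hand_keypoints_path: Optional[str] = None

    def verb_object_pairs(self) -> List[str]:
        """Return the verb-object label of each step, in order."""
        return [f"{s.verb} {s.obj}" for s in self.steps]


# Made-up example values, for illustration only.
clip = ClipRecord(
    clip_id="example-0001",
    task_domain="cooking",
    fps=60,
    environment=Environment(
        location_class="kitchen", lighting="indoor", clutter="medium", object_count=12
    ),
    steps=[
        StepAnnotation(0.0, 3.2, "crack", "egg"),
        StepAnnotation(3.2, 9.0, "whisk", "egg"),
    ],
)
print(clip.verb_object_pairs())
```

Keeping hand tracking optional reflects the caveat that keypoints are delivered only where the task and lighting allow, so downstream code should not assume every clip has them.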

[ TECHNICAL SPECIFICATIONS ]

Files: MP4 H.264 at 1080p, 30-60 fps, with synchronized WAV audio; action-camera variants available
Annotations: Step-level task labels · verb-object pairs · hand-tracking data · environmental tags
Schema: COCO-style JSON · Ego4D-compatible schema · custom task taxonomies on request
Licensing: Commercial training rights · derivative-work rights signed · per-domain or full-corpus
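
To show how step-level annotations in a COCO-style JSON file might be aligned with video frames, here is a short Python sketch. "COCO-style" is taken here to mean separate top-level lists joined by ID, in the spirit of COCO's `images` and `annotations` lists. The `videos` key, the per-annotation fields (`start_sec`, `end_sec`, `verb`, `object`), and the sample values are assumptions made for illustration, not the delivered schema.

```python
import json
import math

# Hypothetical annotation file contents; key names and values are assumed.
SAMPLE = """
{
  "videos": [
    {"id": 1, "file_name": "example-0001.mp4", "fps": 30, "width": 1920, "height": 1080}
  ],
  "annotations": [
    {"id": 10, "video_id": 1, "start_sec": 0.0, "end_sec": 2.5,
     "verb": "pick up", "object": "screwdriver"},
    {"id": 11, "video_id": 1, "start_sec": 2.5, "end_sec": 6.0,
     "verb": "tighten", "object": "screw"}
  ]
}
"""


def step_frame_ranges(doc):
    """Map each step to an inclusive (first_frame, last_frame) range.

    The frame rate is looked up per video because clips may be 30 or 60 fps.
    """
    fps_by_video = {v["id"]: v["fps"] for v in doc["videos"]}
    ranges = []
    for ann in doc["annotations"]:
        fps = fps_by_video[ann["video_id"]]
        first = math.floor(ann["start_sec"] * fps)
        # The end time is treated as exclusive, so the last frame is one before it.
        last = max(first, math.ceil(ann["end_sec"] * fps) - 1)
        ranges.append((ann["verb"], ann["object"], first, last))
    return ranges


doc = json.loads(SAMPLE)
ranges = step_frame_ranges(doc)
for verb, obj, first, last in ranges:
    print(f"{verb} {obj}: frames {first}-{last}")
```

Treating each step's end time as exclusive keeps adjacent steps from sharing a frame; whether the delivered annotations follow that convention is something to verify on sample clips.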
