HUMAN_POV
HUMAN POV | ORGANIZING | 3D HAND POSE

Sorting and Organizing Objects

In this video, the person organizes items on a small wooden table, sorting them into two bowls while handling objects such as a toothbrush, glasses, a mouse, and small accessories.

Environment / bedroom

Scene Analysis

Objects and actions detected in the capture, segmented with SAM3 and labeled with a vision model.

Detected Objects
wooden table, purple bowl, orange bowl, toothbrush, glasses, mouse, box, jar, yellow clip, red cord
Detected Actions
sorting objects, picking up items, placing items in bowls

Interactive 3D Hand Pose

21 landmarks per hand, both hands.
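
As a rough illustration of what "21 landmarks per hand, both hands" implies for one frame of data, the sketch below lays out a possible in-memory structure. The page does not document the actual file format, so the class name `HandFrame`, the helper functions, the hand ordering, and the assumption that each landmark is an (x, y, z) point are all hypothetical.

```python
from dataclasses import dataclass

NUM_HANDS = 2            # "both hands", as stated on the page
LANDMARKS_PER_HAND = 21  # as stated on the page
COORDS = 3               # assumption: each landmark is an (x, y, z) point


@dataclass
class HandFrame:
    """Hypothetical container for one frame: hands[hand][landmark] -> (x, y, z)."""
    hands: list


def empty_frame() -> HandFrame:
    """Build a frame with every landmark at the origin (placeholder values)."""
    return HandFrame(
        hands=[
            [(0.0, 0.0, 0.0) for _ in range(LANDMARKS_PER_HAND)]
            for _ in range(NUM_HANDS)
        ]
    )


def values_per_frame() -> int:
    """Number of scalar coordinates stored per frame under these assumptions."""
    return NUM_HANDS * LANDMARKS_PER_HAND * COORDS
```

Under these assumptions a single frame carries 126 scalar values; a real loader would need to follow whatever layout the dataset actually ships.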

Transcript

Frame-aligned narration of the demonstration.

0:00 - 0:07

I begin by surveying the table and reaching for the yellow clip. I pick it up and place it into the purple bowl.

0:07 - 0:14

Next, I grab the black mouse from the table and drop it into the orange bowl, which appears to be for storing these items.

0:14 - 0:21

I pick up the red cord and carefully add it to the orange bowl with the mouse. It seems like I’m organizing accessories here.

0:21 - 0:29

Now, I reach for a small box, examine it briefly, and place it into the purple bowl, keeping similar items together.

0:29 - 0:36

After that, I grab the glasses lying on the table and set them aside, choosing not to place them in either of the bowls.

0:36 - 0:43

The toothbrush is next. I reach for it and put it into the purple bowl, possibly organizing hygiene items separately.

0:43 - 0:50

I pick up the black jar and add it to the purple bowl as well, organizing similar items together in this bowl.

0:50 - 0:57

Finally, I take a moment to adjust the contents of both bowls, ensuring everything is neat and in place before stepping back.
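
Because the narration is frame-aligned, each time range above can be mapped to a range of frame indices. The sketch below is one way to do that, assuming the 24 fps frame rate and 1,366-frame count listed under Sample Specs, 0-based frame indices, and half-open ranges. The function names `to_seconds` and `frame_range` are hypothetical and not part of any published tooling for this dataset.

```python
FPS = 24             # from Sample Specs
TOTAL_FRAMES = 1366  # from Sample Specs


def to_seconds(stamp: str) -> int:
    """Convert an 'M:SS' timestamp such as '0:07' to whole seconds."""
    minutes, seconds = stamp.strip().split(":")
    return int(minutes) * 60 + int(seconds)


def frame_range(segment: str, fps: int = FPS, total_frames: int = TOTAL_FRAMES) -> range:
    """Map a segment such as '0:07 - 0:14' to a half-open range of frame indices.

    The end is clamped to total_frames because the transcript timestamps are
    rounded to whole seconds and the last one slightly overruns the clip.
    """
    start, end = (to_seconds(part) for part in segment.split("-"))
    first = start * fps
    last = min(end * fps, total_frames)
    return range(first, last)


if __name__ == "__main__":
    for seg in ("0:00 - 0:07", "0:07 - 0:14", "0:50 - 0:57"):
        frames = frame_range(seg)
        print(seg, "->", frames.start, "to", frames.stop - 1)
```

The clamp matters only for the final segment, whose nominal end of 0:57 corresponds to frame 1,368, two frames past the end of the clip.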

Sample Specs

Task Family: Organizing
Duration: 56.9 seconds
Frames: 1,366
Frame Rate: 24 fps
Resolution: Egocentric
Hand Landmarks: 21 x 2 hands, per frame
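
The figures above are mutually consistent, which a short calculation can confirm. This is only a sanity-check sketch using the numbers listed in the specs; the variable and function names are illustrative.

```python
FRAMES = 1366             # from the specs
FPS = 24                  # from the specs
LANDMARKS_PER_FRAME = 21 * 2  # 21 landmarks x 2 hands, from the specs


def duration_seconds(frames: int, fps: int) -> float:
    """Clip duration implied by the frame count and frame rate."""
    return frames / fps


duration = duration_seconds(FRAMES, FPS)
rounded_duration = round(duration, 1)          # matches the listed 56.9 seconds
total_landmarks = FRAMES * LANDMARKS_PER_FRAME  # landmark points across the whole clip

if __name__ == "__main__":
    print(f"duration: {rounded_duration} s")
    print(f"landmark points in clip: {total_landmarks}")
```

Dividing 1,366 frames by 24 fps gives roughly 56.92 seconds, which rounds to the listed 56.9, and 42 landmarks per frame over 1,366 frames yields 57,372 landmark points in total.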