Develop a node that uses an ML model to recognize activities from video
The node should:
- subscribe to a camera topic (for RGB images)
- accumulate N frames (this model appears to use 64 frames)
- transform them into the layout the model expects (each frame resized to 224x224, then the sequence arranged as 3x64x224x224, i.e. channels x frames x height x width)
- classify the sequence of frames
- repeat for the next sequence of frames
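
The accumulate/transform/repeat steps above can be sketched independently of the middleware and the model. The following is a minimal sketch, not a definitive implementation: `ClipBuffer` and `resize_nearest` are hypothetical names, NumPy is assumed to be available, nearest-neighbour resizing stands in for whatever resize the model's preprocessing really uses, and any pixel scaling or normalization the model needs is left out. Windows are non-overlapping, which is one reading of "repeat for the next sequence".

```python
import numpy as np

CLIP_LEN = 64      # frames per clip, taken from the task description
FRAME_SIZE = 224   # target height and width, taken from the task description


def resize_nearest(frame, size=FRAME_SIZE):
    """Resize an HxWx3 image to size x size using nearest-neighbour sampling."""
    h, w = frame.shape[:2]
    rows = np.arange(size) * h // size   # source row for each output row
    cols = np.arange(size) * w // size   # source column for each output column
    return frame[rows[:, None], cols[None, :]]


class ClipBuffer:
    """Hypothetical helper: collects frames and emits one clip per clip_len frames."""

    def __init__(self, clip_len=CLIP_LEN, size=FRAME_SIZE):
        self.clip_len = clip_len
        self.size = size
        self.frames = []

    def add(self, frame):
        """Add one HxWx3 RGB frame; return a 3xTxHxW float32 clip when full, else None."""
        self.frames.append(resize_nearest(frame, self.size))
        if len(self.frames) < self.clip_len:
            return None
        clip = np.stack(self.frames).astype(np.float32)  # T x H x W x 3
        self.frames = []                                 # start the next sequence
        return clip.transpose(3, 0, 1, 2)                # 3 x T x H x W
```

In the node, the image callback would convert each incoming message to an RGB array, call `add()`, and pass any returned clip (with a batch dimension added if the model expects one) to the classifier.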
Use this model with the pretrained weights for the Charades dataset
Class labels for the Charades dataset can be found in the file 'Charades_v1_classes.txt' here
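
To turn the model's output into readable activity names, the class file can be loaded into a list indexed by class number. This is a rough sketch under two assumptions that should be checked against the actual file and model: each line of 'Charades_v1_classes.txt' is assumed to look like `c000 Holding some clothes` (a code, a space, then the description), and the line order is assumed to match the order of the model's output scores. `parse_class_labels` and `top_labels` are hypothetical helper names.

```python
def parse_class_labels(lines):
    """Return descriptions in file order, assuming lines like 'c000 Holding some clothes'."""
    labels = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        _code, _, description = line.partition(" ")
        labels.append(description)
    return labels


def top_labels(scores, labels, k=5):
    """Return the k (label, score) pairs with the highest scores."""
    ranked = sorted(zip(labels, scores), key=lambda pair: pair[1], reverse=True)
    return ranked[:k]


# Typical use (file path is whatever the downloaded file is saved as):
# with open("Charades_v1_classes.txt") as f:
#     labels = parse_class_labels(f)
```

Because several activities can occur in the same clip, reporting the top few scores may be more useful than a single best class.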
The node should: