Skip to content

Latest commit

 

History

History
20 lines (16 loc) · 2.42 KB

File metadata and controls

20 lines (16 loc) · 2.42 KB

Pipeline Specifications

Generated: 2025-10-31T16:38:14.268275Z

This file is auto-generated. Do not edit by hand.

Name Display Name Family Variant Tasks Modalities Capabilities Backends Stability Outputs
audio_processing Audio Processing (Speech + Diarization) audio whisper-pyannote speech-transcription,speaker-diarization audio streaming,embedding pytorch beta WebVTT:transcript;RTTM:speaker_turns
face_analysis Face Analysis (DeepFace) face deepface face-detection,emotion-recognition,age-estimation,gender-prediction video person-linking,frame-level-analysis tensorflow,opencv stable COCO:face_detections/emotions/demographics
face_laion_clip LAION CLIP Face Semantic Embedding face laion-clip-face face-embedding,face-recognition,emotion-recognition image,video zero-shot,embedding,real-time pytorch experimental JSON:embeddings/attributes
face_openface3_embedding OpenFace3 Face Embedding face openface3-embedding face-embedding image,video embedding onnx,pytorch experimental JSON:embeddings
laion_voice LAION Empathic Voice Analysis audio laion-empathic-whisper emotion-recognition,audio-analysis audio embedding,empathic-analysis pytorch,huggingface stable JSON:emotion_segments/empathic_scores;WebVTT:emotion_timeline
person_tracking Person Tracking & Pose person yolov11n-pose-bytetrack object-tracking,pose-estimation video real-time,identity-persistence pytorch beta COCO:person_detection/keypoints/tracking
scene_detection Scene Detection scene pyscenedetect-clip scene-detection,scene-segmentation video batch,embedding pytorch beta JSON:scene_boundary/scene_category
speaker_diarization Speaker Diarization audio pyannote speaker-diarization,speaker-segmentation audio timeline,speaker-turns pytorch stable RTTM:speaker_turns
speech_recognition Speech Recognition audio whisper speech-transcription,automatic-speech-recognition audio streaming,word-timestamps,multilingual pytorch stable WebVTT:transcript
voice_emotion_baseline Voice Emotion + Transcription (Baseline) audio whisper-spectral-emotion speech-transcription,emotion-recognition audio streaming,embedding pytorch experimental WebVTT:transcript;JSON:emotion_segments

Total pipelines: 10