You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Enhance multimodal capabilities with new projector types and suppress tokens support
- Introduced support for new projector types: Gemma 4 Unified Vision (GEMMA4UV) and Gemma 4 Unified Audio (GEMMA4UA).
- Added functionality to suppress specific tokens during processing, improving model performance and flexibility.
- Updated tensor mappings and constants to accommodate new projector types and suppress tokens.
- Enhanced audio preprocessing for GEMMA4UA to handle raw waveform inputs efficiently.
- Revised model building logic for GEMMA4UV to utilize LayerNorm and positional embeddings effectively.
0 commit comments