Hi Is the cross attention for Pi07 between the VLM and action expert layer by layer or just at the start ?
Hi
Is the cross attention for Pi07 between the VLM and action expert layer by layer or just at the start ?