1.1x prefill and decode speedup (attention/activations) #4624
Annotations
2 errors and 1 warning
| Job | Message |
|---|---|
| windows-latest (windows) Release | The operation was canceled. |
| windows-latest (windows) Release | Canceling since a higher priority waiting request for build-refs/heads/test_773579903-windows-latest-windows-Release exists |
| bazel | Failed to restore: getCacheEntry failed: Cache service responded with 503 |
Artifacts
Produced during runtime
| Name | Size | Digest |
|---|---|---|
| gemma-macos-latest-make-Release (Expired) | 1.55 MB | sha256:7ba1f2fe1022ef33ffc9bde7193bfc63f47c2c55ac9986e5e72e342c6b14dbce |
| gemma-ubuntu-latest-make-Release (Expired) | 8.17 MB | sha256:ce55ad50fd3c29ed1db30fa48a4ed9d278aed43ddd44a0e1ba926457805ce44d |