Commit 49a4412
committed
Update base for Update on "[Cria][Lllama runner] Use caching temp allocator"
Use of caching allocator improves TITO model performance by 6+ %.
Will add repro instructions here but requires next diff to see the impact
Differential Revision: [D85532078](https://our.internmc.facebook.com/intern/diff/D85532078/)
[ghstack-poisoned]1 parent 0e66111 commit 49a4412
0 file changed
0 commit comments