vitisbertl

Experiments for low-latency BERT-large inference on the Alveo U50.

The Vitis HLS C++ module "feeder" is a matrix multiplication kernel that uses 1024 DSPs in parallel.

It computes the product (nmat*1024, 1024) . (1024, vec), giving a result of shape (nmat*1024, vec), for nmat in [1,8] and vec in [1,128].
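
For checking the shapes, a plain software reference of the same product can be useful. The sketch below is not the HLS kernel: the name feeder_reference, the float element type and the row-major layout are all assumptions made for illustration, and the repository may use a different data type and memory layout.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Inner dimension of the product, fixed at 1024 by the shapes stated above.
constexpr std::size_t K = 1024;

// Hypothetical CPU reference, not the "feeder" kernel itself.
// a: row-major (nmat*1024, 1024), b: row-major (1024, vec).
// Returns c: row-major (nmat*1024, vec).
// Assumption: float elements; the real kernel's data type is not stated.
std::vector<float> feeder_reference(const std::vector<float>& a,
                                    const std::vector<float>& b,
                                    std::size_t nmat, std::size_t vec) {
    // Supported ranges as given in the README.
    assert(nmat >= 1 && nmat <= 8);
    assert(vec >= 1 && vec <= 128);

    const std::size_t rows = nmat * K;
    assert(a.size() == rows * K);
    assert(b.size() == K * vec);

    std::vector<float> c(rows * vec, 0.0f);
    for (std::size_t i = 0; i < rows; ++i) {
        for (std::size_t k = 0; k < K; ++k) {
            const float aik = a[i * K + k];
            // Accumulate one scaled row of b into row i of c.
            for (std::size_t j = 0; j < vec; ++j) {
                c[i * vec + j] += aik * b[k * vec + j];
            }
        }
    }
    return c;
}
```

Calling feeder_reference(a, b, nmat, vec) with a of size nmat*1024*1024 and b of size 1024*vec returns nmat*1024*vec values. The loop order is chosen for readability on a CPU and says nothing about how the kernel schedules its 1024 DSPs.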