Commit a257995

Merge branch 'main' of github.com:WukLab/wuklab_sysml
2 parents e88ddf4 + 214ffc3 commit a257995

1 file changed

Lines changed: 2 additions & 2 deletions

content/posts/vdcores.md

@@ -83,7 +83,7 @@ Hard to tell which one is faster Huh?
 Manually morphing between these two schedules requires significant changes to the kernel implementation. With decoupled cores abstraction, switching between them requires **instruction flow level change**, all tasks remain composable, without sacrificing performance.
 We try both with in 10 minutes with VDCores, and get a quick 7% performance gain in this operator.

-## 3. Turning GPU SMs into Virtual Decoupled Cores
+## Turning GPU SMs into Virtual Decoupled Cores

 > We turn every SM on H200 into a pair of Memory/Compute decoupeld cores, connected by message queues, all run at the speed of GPU!

@@ -106,7 +106,7 @@ Under this principle, some designs emerges to further optimize the performance w



-## 4. Decoupled Cores: In Live Action and in the Wild
+## Decoupled Cores: In Live Action and in the Wild

 > We are working to bring VDCores to the open-source community and to a wider range of cores and hardware platforms. Stay tuned!
