-
Notifications
You must be signed in to change notification settings - Fork 0
Expand file tree
/
Copy pathindex.html
More file actions
262 lines (261 loc) · 16.4 KB
/
index.html
File metadata and controls
262 lines (261 loc) · 16.4 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
<!DOCTYPE html>
<html >
<head>
<meta charset="utf-8">
<title>Amory Hoste</title>
<link rel="manifest" href="./site.webmanifest">
<meta content="Amory Hoste - Software Engineer" name="description">
<meta content="Amory Hoste" property="og:title">
<meta content="Amory Hoste - Software Engineer" property="og:description">
<meta content="Amory Hoste" property="twitter:title">
<meta content="Amory Hoste - Software Engineer" property="twitter:description">
<meta property="og:type" content="website">
<meta content="summary_large_image" name="twitter:card">
<meta content="width=device-width, initial-scale=1" name="viewport">
<link href="css/normalize.css" rel="stylesheet" type="text/css">
<link href="css/site.css" rel="stylesheet" type="text/css">
<link href="https://fonts.googleapis.com" rel="preconnect">
<link href="https://fonts.gstatic.com" rel="preconnect" crossorigin="anonymous">
<script src="https://ajax.googleapis.com/ajax/libs/webfont/1.6.26/webfont.js" type="text/javascript"></script>
<script type="text/javascript">WebFont.load({ google: { families: ["Roboto:300,regular,500,700","Roboto Condensed:300,regular,700","Roboto Slab:300,regular,700","JetBrains Mono:regular,700","Rubik:regular,500,700"] }});</script>
<script type="text/javascript">!function(o,c){var n=c.documentElement,t=" w-mod-";n.className+=t+"js",("ontouchstart"in o||o.DocumentTouch&&c instanceof DocumentTouch)&&(n.className+=t+"touch")}(window,document);</script>
<link href="images/logo/favicon.ico" rel="shortcut icon" type="image/x-icon">
<link rel="apple-touch-icon" sizes="180x180" href="images/logo/apple-touch-icon.png">
<link rel="icon" type="image/png" sizes="32x32" href="images/logo/favicon-32x32.png">
<link rel="icon" type="image/png" sizes="16x16" href="images/logo/favicon-16x16.png">
<meta content="yes" name="apple-mobile-web-app-capable">
<meta content="yes" name="mobile-web-app-capable">
<meta content="default" name="apple-mobile-web-app-status-bar-style">
<meta property="og:title" content="Amory Hoste - Software Engineer"/>
</head>
<body>
<div class="all-content w-clearfix" id="all-content">
<header class="head-content header">
<div class="headline">
<div class="headline-terminal">
<h1 class="terminal-typed">> cat amoryhoste.md</h1>
</div>
<div class="header-text">Amory Hoste</div>
<div class="headline-text">Hi there, I'm Amory. I am a Senior Research Engineer at <a href="https://www.linkedin.com/company/huawei-technologies-research-development-uk-ltd/" target="_blank" class="url-link">Huawei R&D UK</a> specializing in high-performance AI infrastructure, with a focus on low-level LLM inference optimization (inference engine, custom kernels, high-speed networking). Previously, I worked on building high performance & resource efficient cloud infrastructure for general compute and serverless workloads. <br><br>I obtained my Bachelor's degree at <a href="https://www.ugent.be/en" target="_blank" class="url-link">Ghent University </a>and my Master's degree at the <a href="https://systems.ethz.ch" target="_blank" class="url-link">Systems Group</a> at <a href="https://inf.ethz.ch" target="_blank" class="url-link">ETH Zurich</a>. I like to work across the hardware-software stack and transform new ideas into robust solutions with tangible business impact. <br><br>I am always open to new projects and exciting ideas, so please <a href="https://www.linkedin.com/in/amoryhoste" target="_blank" class="url-link">reach out</a> if you you have something interesting.</div>
<div class="headline-divider"></div>
<div class="headline-buttons">
<a href="https://amoryhoste.com/files/AmoryHoste_Resume.pdf" target="_blank" class="headline-button w-button"><span class="button-icon-pdf"></span> Resume</a>
<a href="https://www.linkedin.com/in/amoryhoste" target="_blank" class="headline-button w-button"><span class="button-icon-linkedin"></span> Linkedin</a>
</div>
<div class="headline-terminal interactive-terminal">
<h1 class="terminal-typing">></h1><span class="cursor"></span>
</div>
</div>
</header>
<div class="resume-content content" id="content">
<div class="main-content">
<div class="resume-terminal">
<h1 class="terminal-typed">> less resume.pdf</h1>
</div>
<div class="resume-item">
<div class="resume-text-item">
<div class="resume-text-item-left">
<div class="resume-header">Summary</div>
</div>
<div class="resume-text-item-right">
<div class="resume-text">Senior Systems Research Engineer specializing in high-performance AI and cloud infrastructure, with a focus on low-level LLM inference optimization (inference engine, custom kernels, networking).</div>
</div>
</div>
</div>
<div class="divider"></div>
<div class="resume-item">
<div class="resume-header">Work Experience</div>
<div class="resume-date-item">
<div class="date-item-info">
<div class="date-item-info-header"><img src="images/GB.svg" loading="lazy" alt="" class="flag">
<div class="date-item-header-text">Huawei R&D UK</div>
</div>
<div class="date-item-location">Edinburgh, Scotland, United Kingdom</div>
<div class="date-item-date date-item-calc">05/2023 - Present</div>
</div>
<div class="date-item-separator-solid">
<div class="separator-hider">■</div>
<div class="separator-bullet"></div>
</div>
<div class="date-item-content">
<div class="date-item-header">Senior Systems Research Engineer</div>
<div class="resume-text"><span class="resume-text-bold">Large scale LLM Inference optimization for Huawei Ascend NPUs.</span></div>
<div class="resume-text">Led multiple key projects to production integration and supervised two research interns. Currently working on long-context LLM inference and sparse attention.</div>
<ul role="list" class="list">
<li>
<div class="resume-text">Developed lightweight NPU Peer-to-Peer (P2P) Transfer Library, increasing KV cache transfer bandwidth by 2.3x, significantly outperforming existing NPU libraries for both RoCE and HCCS.</div>
</li>
<li>
<div class="resume-text">Wrote high-performance NPU kernels for several critical scenarios including Mixture of Experts Dispatch/Combine, Large Recommendation Model Embedding Retrieval and KV Cache Transfer.</div>
</li>
<li>
<div class="resume-text">Contributed support for LLM Prefill-Decode (PD) Disaggregation and P2P KV Cache Sharing on vLLM-Ascend to the open-source <a href="https://github.com/LMCache/LMCache-Ascend" target="_blank" class="url-link">LMCache-Ascend</a> project.</div>
</li>
<li>
<div class="resume-text">Improved Ascend 910B point-to-point bandwidth by 5.57x over single-path baseline by developing a software-based multipath transfer library tailored for its mesh-based topology.</div>
</li>
<li>
<div class="resume-text">Developed a QoS aware NPU-sharing mechanism, improving resource utilization by enabling colocation of smaller models while maintaining SLOs.</div>
</li>
</ul>
<div class="experience-skills-text"><strong class="experience-skills-text-bold">Awards: </strong>2x President’s Award - Significant Business Contribution, European Research Institute Excellent Contributor Award, 2012 Labs Outstanding Contributor Award, Quality Star Award<br></div>
</div>
</div>
<div class="resume-date-item">
<div class="date-item-info">
<div class="date-item-info-header"><img src="images/GB.svg" loading="lazy" alt="" class="flag">
<div class="date-item-header-text">Huawei R&D UK</div>
</div>
<div class="date-item-location">Edinburgh, Scotland, United Kingdom</div>
<div class="date-item-date date-item-calc">11/2021 - 05/2023</div>
</div>
<div class="date-item-separator-dots">
<div class="separator-bullet"></div>
</div>
<div class="date-item-content">
<div class="date-item-header">Systems Research Engineer</div>
<div class="resume-text"><span class="resume-text-bold">Performance & resource efficiency optimization of Huawei cloud workloads.</span></div>
<ul role="list" class="list">
<li>
<div class="resume-text">Developed a distributed Kubernetes scheduler optimized for real-time, high-throughput scheduling decisions, utilizing eBPF for fine-grained, low-overhead monitoring.</div>
</li>
<li>
<div class="resume-text">Designed and implemented custom scheduling algorithms to maximize resource utilization and ensure performance isolation for colocated cloud workloads.</div>
</li>
<li>
<div class="resume-text">Created a comprehensive benchmark suite and load generator to evaluate new algorithms and architectures against representative production scenarios.</div>
</li>
</ul>
<div class="experience-skills-text"><strong class="experience-skills-text-bold">Awards: </strong>Future Star Award<br></div>
</div>
</div>
<div class="resume-date-item">
<div class="date-item-info">
<div class="date-item-info-header"><img src="images/BE.svg" loading="lazy" alt="" class="flag">
<div class="date-item-header-text">Imec IDLab</div>
</div>
<div class="date-item-location">Ghent Area, Belgium</div>
<div class="date-item-date date-item-calc">08/2019 - 09/2019</div>
</div>
<div class="date-item-separator-solid">
<div class="separator-bullet"></div>
</div>
<div class="date-item-content">
<div class="date-item-header">Research Intern</div>
<div class="resume-text">Built web archival and automated quality analysis tools for the Royal Library of Belgium.<br></div>
</div>
</div>
<div class="extender">
<div class="extender-content">
<div class="extender-text">I</div>
</div>
</div>
<div class="resume-date-item-end">
<div class="date-item-info">
<div class="date-item-info-header"><img src="images/BE.svg" loading="lazy" alt="" class="flag">
<div class="date-item-header-text">Imec IDLab</div>
</div>
<div class="date-item-location">Ghent Area, Belgium</div>
<div class="date-item-date date-item-calc">08/2018</div>
</div>
<div class="date-item-separator-hidden">
<div class="separator-bullet"></div>
</div>
<div class="date-item-content">
<div class="date-item-header">Research Intern</div>
<div class="resume-text">Developed a fragmented R-tree index to enable efficient geospatial querying of linked data.</div>
</div>
</div>
</div>
<div class="divider"></div>
<div class="resume-item">
<div class="resume-header">Education</div>
<div class="resume-date-item">
<div class="date-item-info">
<div class="date-item-info-header"><img src="images/CH.svg" loading="lazy" alt="" class="flag">
<div class="date-item-header-text">ETH Zurich</div>
</div>
<div class="date-item-location">Zurich, Switzerland</div>
<div class="date-item-date">Sep 2019 - Sep 2021</div>
</div>
<div class="date-item-separator-dots">
<div class="separator-hider">■</div>
<div class="separator-bullet"></div>
</div>
<div class="date-item-content">
<div class="date-item-header">Master of Science in Computer Science</div>
<div class="education-grades">Grade: 5.71/6 (Top 10% of class)</div>
<div class="resume-text">Focus on (Distributed) Systems and High Performance Computing.</div>
<div class="education-thesis"><span class="resume-text-bold">Master thesis:</span> Analysis and Optimization of Serverless Cold Start Latencies through Function Snapshots at the <a href="https://systems.ethz.ch/research/easl.html" target="_blank" class="link-small">ETH Efficient Architectures and Systems Lab</a> under supervision of <a href="https://anakli.inf.ethz.ch" target="_blank" class="link-small">Prof. Ana Klimovic</a><br></div>
</div>
</div>
<div class="extender">
<div class="extender-content">
<div class="extender-text">I</div>
</div>
</div>
<div class="resume-date-item-end">
<div class="date-item-info">
<div class="date-item-info-header"><img src="images/BE.svg" loading="lazy" alt="" class="flag">
<div class="date-item-header-text">Ghent University</div>
</div>
<div class="date-item-location">Ghent, Belgium</div>
<div class="date-item-date">Sep 2016 - Jun 2019</div>
</div>
<div class="date-item-separator-hidden">
<div class="separator-bullet"></div>
</div>
<div class="date-item-content">
<div class="date-item-header">Bachelor of Science in Computer Science</div>
<div class="education-grades">Grade: 808/1000 (1st of class)</div>
<div class="resume-text">Minor in Electronics and Telecommunication.</div>
</div>
</div>
</div>
<div class="divider"></div>
<div class="resume-item">
<div class="resume-header">Technical Skills</div>
<div class="w-layout-grid skills-grid">
<div class="skills-header">Languages:</div>
<div class="resume-text">Python, C/C++, Go</div>
<div class="skills-header">ML & Inference:</div>
<div class="resume-text">vLLM Internals, PyTorch, Kernel Development, RDMA/RoCE, CUDA</div>
<div class="skills-header">Cloud:</div>
<div class="resume-text">Kubernetes, Container Runtimes, eBPF, Serverless, DevOps & Observability</div>
</div>
</div>
<div class="skills-note"><em class="italic">Note: this list is non-exhaustive and only includes recently used skills. Generally, I am quick and eager to learn new languages, frameworks and technologies where necessary.</em></div>
<div class="divider"></div>
<div class="resume-item">
<div class="resume-text-item">
<div class="resume-text-item-left">
<div class="resume-header">Interests</div>
</div>
<div class="resume-text-item-right">
<div class="resume-text">Outside of work, I enjoy reading up on recent hardware & infrastructure buildouts, playing tennis, running, hiking and ski touring.</div>
</div>
</div>
</div>
<div class="terminal-right">
<h1 class="terminal-typing">></h1><img src="images/cursor_thin.gif" loading="lazy" alt="" class="cursor">
</div>
</div>
</div>
<div class="scroll-counter">
<div class="counter-text" id="curscroll">[0/140]</div>
</div>
<div class="tmux-bar">
<div class="tmux-bar-left">
<div class="tmux-text">[0] 0:[tmux]*</div>
</div>
<div class="tmux-bar-right">
<div class="tmux-text-info">"amoryhoste.com"</div>
<div class="tmux-date" id="curdate">Last updated ... </div>
</div>
</div>
</div>
<script src="js/jquery-3.5.1.min.dc5e7f18c8.js" type="text/javascript" integrity="sha256-9/aliU8dGd2tb6OSsuzixeV4y/faTqgFtohetphbbj0=" crossorigin="anonymous"></script>
<script src="js/site.js"></script>
<script src="js/terminal.js"></script>
</body>
</html>