Commit a385633

Prepare release 0.3.0
1 parent fff273f commit a385633

4 files changed

Lines changed: 36 additions & 4 deletions

CHANGELOG.md

Lines changed: 32 additions & 0 deletions
@@ -0,0 +1,32 @@
+# Changelog
+
+All notable changes to GPULlama3.java will be documented in this file.
+
+## [0.3.0] - 2025-12-11
+
+### Model Support
+
+- [refactor] Generalize the design of `tornadovm` package to support multiple new models and types for GPU exec ([#62](https://github.com/beehive-lab/GPULlama3.java/pull/62))
+- Refactor/cleanup model loaders ([#58](https://github.com/beehive-lab/GPULlama3.java/pull/58))
+- Add Support for Q8_0 Models ([#59](https://github.com/beehive-lab/GPULlama3.java/pull/59))
+
+### Bug Fixes
+
+- [fix] Normalization compute step for non-nvidia hardware ([#84](https://github.com/beehive-lab/GPULlama3.java/pull/84))
+
+### Other Changes
+
+- Update README to enhance TornadoVM performance section and clarify GP… ([#85](https://github.com/beehive-lab/GPULlama3.java/pull/85))
+- Simplify installation by replacing TornadoVM submodule with pre-built SDK ([#82](https://github.com/beehive-lab/GPULlama3.java/pull/82))
+- [FP16] Improved performance by fusing dequantize with compute in kernels: 20-30% Inference Speedup ([#78](https://github.com/beehive-lab/GPULlama3.java/pull/78))
+- [cicd] Prevent workflows from running on forks ([#83](https://github.com/beehive-lab/GPULlama3.java/pull/83))
+- [CI][packaging] Automate process of deploying a new release with Github actions ([#81](https://github.com/beehive-lab/GPULlama3.java/pull/81))
+- [Opt] Manipulation of Q8_0 tensors with Tornado `ByteArray`s ([#79](https://github.com/beehive-lab/GPULlama3.java/pull/79))
+- Optimization in Q8_0 loading ([#74](https://github.com/beehive-lab/GPULlama3.java/pull/74))
+- [opt] GGUF Load Optimization for tensors in TornadoVM layout ([#71](https://github.com/beehive-lab/GPULlama3.java/pull/71))
+- Add `SchedulerType` support to all TornadoVM layer planners and layer… ([#66](https://github.com/beehive-lab/GPULlama3.java/pull/66))
+- Weight Abstractions ([#65](https://github.com/beehive-lab/GPULlama3.java/pull/65))
+- Bug fixes in sizes and names of GridScheduler ([#64](https://github.com/beehive-lab/GPULlama3.java/pull/64))
+- Add Maven wrapper support ([#56](https://github.com/beehive-lab/GPULlama3.java/pull/56))
+- Add changes used in Devoxx Demo ([#54](https://github.com/beehive-lab/GPULlama3.java/pull/54))
+
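
Reviewer note on the Q8_0 entries (#59, #79, #74): for readers unfamiliar with the format, the sketch below shows how a Q8_0 byte buffer can be consumed directly in a dot product. It assumes the standard GGML Q8_0 block layout (one little-endian fp16 scale followed by 32 signed 8-bit values, 34 bytes per block). The class and method names (`Q80DotSketch`, `halfToFloat`, `dot`) are hypothetical and plain Java arrays stand in for TornadoVM types; this is not the project's kernel code. Applying the scale once per block, without first materializing a float array, is similar in spirit to the dequantize/compute fusion that #78 describes for FP16.

```java
public class Q80DotSketch {
    static final int BLOCK_SIZE = 32;               // quantized values per Q8_0 block
    static final int BLOCK_BYTES = 2 + BLOCK_SIZE;  // 2-byte fp16 scale + 32 int8 values

    // Decode IEEE 754 half-precision bits into a float (manual, no Java 20+ API needed).
    static float halfToFloat(short h) {
        int bits = h & 0xFFFF;
        int sign = (bits & 0x8000) << 16;
        int exp = (bits >>> 10) & 0x1F;
        int mant = bits & 0x3FF;
        if (exp == 0) {
            // Zero or subnormal: value is mant * 2^-24.
            float v = mant * (1.0f / (1 << 24));
            return sign != 0 ? -v : v;
        }
        if (exp == 31) {
            // Infinity or NaN.
            return Float.intBitsToFloat(sign | 0x7F800000 | (mant << 13));
        }
        // Normal number: rebias the exponent from 15 to 127.
        return Float.intBitsToFloat(sign | ((exp + 112) << 23) | (mant << 13));
    }

    // Dot product of n Q8_0-quantized weights with n float activations.
    // n is assumed to be a multiple of BLOCK_SIZE.
    static float dot(byte[] q, float[] x, int n) {
        float sum = 0f;
        for (int b = 0; b < n / BLOCK_SIZE; b++) {
            int off = b * BLOCK_BYTES;
            // The scale is stored little-endian at the start of the block.
            short scaleBits = (short) ((q[off] & 0xFF) | ((q[off + 1] & 0xFF) << 8));
            float scale = halfToFloat(scaleBits);
            float partial = 0f;
            for (int i = 0; i < BLOCK_SIZE; i++) {
                // Java bytes are signed, which matches the int8 payload.
                partial += q[off + 2 + i] * x[b * BLOCK_SIZE + i];
            }
            sum += scale * partial;  // scale applied once per block
        }
        return sum;
    }

    public static void main(String[] args) {
        byte[] q = new byte[BLOCK_BYTES];
        q[1] = 0x40;                                  // fp16 0x4000 = 2.0
        for (int i = 0; i < BLOCK_SIZE; i++) q[2 + i] = 1;
        float[] x = new float[BLOCK_SIZE];
        java.util.Arrays.fill(x, 1.0f);
        System.out.println(dot(q, x, BLOCK_SIZE));
    }
}
```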

CITATION.cff

Lines changed: 2 additions & 2 deletions
@@ -15,6 +15,6 @@ authors:
 given-names: "Christos"
 title: "GPULlama3.java"
 license: MIT License
-version: 0.1.0-beta
-date-released: "2025-05-30"
+version: 0.3.0
+date-released: 2025-12-11
 url: "https://github.com/beehive-lab/GPULlama3.java"

README.md

Lines changed: 1 addition & 1 deletion
@@ -165,7 +165,7 @@ You can add **GPULlama3.java** directly to your Maven project by including the f
 <dependency>
 <groupId>io.github.beehive-lab</groupId>
 <artifactId>gpu-llama3</artifactId>
-<version>0.2.2</version>
+<version>0.3.0</version>
 </dependency>
```

pom.xml

Lines changed: 1 addition & 1 deletion
@@ -7,7 +7,7 @@
 <!-- Use your verified namespace -->
 <groupId>io.github.beehive-lab</groupId>
 <artifactId>gpu-llama3</artifactId>
-<version>0.2.2</version> <!-- release version (no -SNAPSHOT) -->
+<version>0.3.0</version> <!-- release version (no -SNAPSHOT) -->
 
 <name>GPU Llama3</name>
 <description>GPU-accelerated LLaMA3 inference using TornadoVM</description>
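
Reviewer note: this commit bumps the version string in three places (`pom.xml`, the README dependency snippet, and `CITATION.cff`). A small check along the following lines could catch a missed bump in future releases. It is only a sketch: `VersionCheckSketch` and its methods are hypothetical names, the inputs are file contents passed in as strings, and it assumes the first `<version>` element in each XML text is the project's own version (which would not hold for a POM that declares a `<parent>` first).

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class VersionCheckSketch {
    private static final Pattern XML_VERSION =
            Pattern.compile("<version>([^<]+)</version>");
    // Anchored at line start so that "cff-version:" is not matched.
    private static final Pattern CFF_VERSION =
            Pattern.compile("(?m)^version:\\s*\"?([^\"\\s]+)\"?\\s*$");

    // First <version>...</version> value in pom.xml or README text, or null if absent.
    static String xmlVersion(String text) {
        Matcher m = XML_VERSION.matcher(text);
        return m.find() ? m.group(1).trim() : null;
    }

    // Top-level "version:" value in CITATION.cff text, or null if absent.
    static String cffVersion(String yaml) {
        Matcher m = CFF_VERSION.matcher(yaml);
        return m.find() ? m.group(1) : null;
    }

    // True only when all three files declare the same, non-null version.
    static boolean consistent(String pom, String readme, String cff) {
        String v = xmlVersion(pom);
        return v != null && v.equals(xmlVersion(readme)) && v.equals(cffVersion(cff));
    }

    public static void main(String[] args) {
        String pom = "<artifactId>gpu-llama3</artifactId>\n<version>0.3.0</version>";
        String readme = "<dependency>\n<version>0.3.0</version>\n</dependency>";
        String cff = "cff-version: 1.2.0\nversion: 0.3.0\ndate-released: 2025-12-11";
        System.out.println(consistent(pom, readme, cff));
    }
}
```

In a release workflow the three strings would be read from the files on disk; that wiring is omitted here.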
