Commit 6aeedf6

Add multiple model inference steps to workflow
1 parent f24531a commit 6aeedf6

1 file changed

Lines changed: 29 additions & 1 deletion

.github/workflows/build-and-run.yml

@@ -77,7 +77,7 @@ jobs:
           which tornado || { echo "::error::tornado unavailable during GPULlama3 build"; exit 1; }
           tornado --version
           make
-      - name: Run Test Inference
+      - name: Run Llama-3.2-1B-Instruct-F16.gguf
         run: |
           set -x
           cd ${{ github.workspace }}
@@ -86,3 +86,31 @@ jobs:
           ./llama-tornado --gpu --opencl \
             --model /home/michalis/models/Llama-3.2-1B-Instruct-F16.gguf \
             --prompt "Say hello"
+      - name: Run Llama-3.2-1B-Instruct-Q8_0.gguf
+        run: |
+          set -x
+          cd ${{ github.workspace }}
+          export PATH="$TORNADO_SDK/bin:$JAVA_HOME/bin:$PATH"
+          which tornado || { echo "::error::tornado not found at runtime"; exit 1; }
+          ./llama-tornado --gpu --opencl \
+            --model /opt/models/Llama-3.2-1B-Instruct-Q8_0.gguf \
+            --prompt "Say hello"
+      - name: Run Qwen3-4B-f16.gguf
+        run: |
+          set -x
+          cd ${{ github.workspace }}
+          export PATH="$TORNADO_SDK/bin:$JAVA_HOME/bin:$PATH"
+          which tornado || { echo "::error::tornado not found at runtime"; exit 1; }
+          ./llama-tornado --gpu --opencl \
+            --model /opt/models/Qwen3-4B-f16.gguf \
+            --prompt "Say hello"
+      - name: Run Qwen3-4B-Q8_0.gguf
+        run: |
+          set -x
+          cd ${{ github.workspace }}
+          export PATH="$TORNADO_SDK/bin:$JAVA_HOME/bin:$PATH"
+          which tornado || { echo "::error::tornado not found at runtime"; exit 1; }
+          ./llama-tornado --gpu --opencl \
+            --model /opt/models/Qwen3-4B-Q8_0.gguf \
+            --prompt "Say hello"
+
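
Note: the three added steps differ only in the model file name. A minimal bash sketch of one way to share that logic is shown here; it is not part of this commit. The names `MODEL_DIR`, `RUNNER`, `DRY_RUN`, `run_model`, and `run_all` are hypothetical, and only the `./llama-tornado` flags that appear in the diff are used. The F16 Llama model is left out because its step reads from a different directory.

```shell
#!/usr/bin/env bash
# Sketch only: runs the same smoke prompt against each model added in this commit.
set -euo pipefail

MODEL_DIR="${MODEL_DIR:-/opt/models}"   # directory used by the added steps
RUNNER="${RUNNER:-./llama-tornado}"     # launcher invoked by the workflow
DRY_RUN="${DRY_RUN:-1}"                 # hypothetical switch: 1 prints commands, 0 runs them

# Model files taken from the three steps that read from /opt/models.
MODELS=(
  Llama-3.2-1B-Instruct-Q8_0.gguf
  Qwen3-4B-f16.gguf
  Qwen3-4B-Q8_0.gguf
)

run_model() {
  local model="$MODEL_DIR/$1"
  local cmd=("$RUNNER" --gpu --opencl --model "$model" --prompt "Say hello")
  if [ "$DRY_RUN" = "1" ]; then
    # Print the command instead of executing it (quoting is flattened for display).
    printf '%s\n' "${cmd[*]}"
    return 0
  fi
  # Fail with a GitHub Actions error annotation if the model file is missing.
  [ -f "$model" ] || { echo "::error::model not found: $model"; return 1; }
  "${cmd[@]}"
}

run_all() {
  local m
  for m in "${MODELS[@]}"; do
    run_model "$m"
  done
}

run_all
```

With the default `DRY_RUN=1` the script only prints the three commands, so it can be checked on a machine without a GPU. A workflow step would presumably set `DRY_RUN=0` after exporting `PATH` as the existing steps do.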
