cmake -B dflash/build -S dflash -DCMAKE_BUILD_TYPE=Release -DCMAKE_CUDA_ARCHITECTURES=70
-- The C compiler identification is GNU 13.3.0
-- The CXX compiler identification is GNU 13.3.0
-- The CUDA compiler identification is NVIDIA 12.0.140
-- Detecting C compiler ABI info
-- Detecting C compiler ABI info - done
-- Check for working C compiler: /usr/bin/cc - skipped
-- Detecting C compile features
-- Detecting C compile features - done
-- Detecting CXX compiler ABI info
-- Detecting CXX compiler ABI info - done
-- Check for working CXX compiler: /usr/bin/c++ - skipped
-- Detecting CXX compile features
-- Detecting CXX compile features - done
-- Detecting CUDA compiler ABI info
-- Detecting CUDA compiler ABI info - done
-- Check for working CUDA compiler: /usr/bin/nvcc - skipped
-- Detecting CUDA compile features
-- Detecting CUDA compile features - done
CMake Warning (dev) at /usr/share/cmake-3.28/Modules/FetchContent.cmake:1331 (message):
The DOWNLOAD_EXTRACT_TIMESTAMP option was not given and policy CMP0135 is
not set. The policy's OLD behavior will be used. When using a URL
download, the timestamps of extracted files should preferably be that of
the time of extraction, otherwise code that depends on the extracted
contents might not be rebuilt if the URL changes. The OLD behavior
preserves the timestamps from the archive instead, but this is usually not
what you want. Update your project to the NEW behavior or specify the
DOWNLOAD_EXTRACT_TIMESTAMP option with a value of true to avoid this
robustness issue.
Call Stack (most recent call first):
CMakeLists.txt:22 (FetchContent_Declare)
This warning is for project developers. Use -Wno-dev to suppress it.
-- Using the multi-header code from /media/per/work/tmp/lucebox-hub/dflash/build/_deps/json-src/include/
-- The ASM compiler identification is GNU
-- Found assembler: /usr/bin/cc
-- Performing Test CMAKE_HAVE_LIBC_PTHREAD
-- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Success
-- Found Threads: TRUE
-- Warning: ccache not found - consider installing it for faster compilation or disable this warning with GGML_CCACHE=OFF
-- CMAKE_SYSTEM_PROCESSOR: x86_64
-- GGML_SYSTEM_ARCH: x86
-- Including CPU backend
-- Found OpenMP_C: -fopenmp (found version "4.5")
-- Found OpenMP_CXX: -fopenmp (found version "4.5")
-- Found OpenMP: TRUE (found version "4.5")
-- x86 detected
-- Adding CPU backend variant ggml-cpu: -march=native
-- Found CUDAToolkit: /usr/include (found version "12.0.140")
-- CUDA Toolkit found
-- Using CMAKE_CUDA_ARCHITECTURES=70 CMAKE_CUDA_ARCHITECTURES_NATIVE=70-real
-- Could NOT find NCCL (missing: NCCL_LIBRARY NCCL_INCLUDE_DIR)
-- Warning: NCCL not found, performance for multiple CUDA GPUs will be suboptimal
-- CUDA host compiler is GNU 12.4.0
-- Including CUDA backend
-- ggml version: 0.9.11
-- ggml commit: b896cf696
CMake Warning at CMakeLists.txt:420 (message):
DFLASH27B_ENABLE_BSA=ON requested but CUDA_ARCHITECTURES contains '70'
(<80); disabling BSA (the spec-prefill path will fall back to the WMMA
kernel).
-- Configuring done (2.3s)
-- Generating done (0.0s)
-- Build files have been written to: /media/per/work/tmp/lucebox-hub/dflash/build
cmake --build dflash/build --target test_dflash -j
[ 1%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml-base.dir/ggml.cpp.o
[ 1%] Building C object deps/llama.cpp/ggml/src/CMakeFiles/ggml-base.dir/ggml.c.o
[ 2%] Building C object deps/llama.cpp/ggml/src/CMakeFiles/ggml-base.dir/ggml-alloc.c.o
[ 2%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml-base.dir/ggml-backend.cpp.o
[ 2%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml-base.dir/ggml-opt.cpp.o
[ 2%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml-base.dir/ggml-backend-meta.cpp.o
[ 3%] Building C object deps/llama.cpp/ggml/src/CMakeFiles/ggml-base.dir/ggml-quants.c.o
[ 3%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml-base.dir/gguf.cpp.o
[ 3%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml-base.dir/ggml-threading.cpp.o
[ 3%] Linking CXX shared library libggml-base.so
[ 3%] Built target ggml-base
[ 4%] Building C object deps/llama.cpp/ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/ggml-cpu.c.o
[ 4%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/ggml-cpu.cpp.o
[ 4%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/repack.cpp.o
[ 4%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/hbm.cpp.o
[ 4%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/traits.cpp.o
[ 5%] Building C object deps/llama.cpp/ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/quants.c.o
[ 5%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/amx/amx.cpp.o
[ 6%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/binary-ops.cpp.o
[ 6%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/amx/mmq.cpp.o
[ 6%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/unary-ops.cpp.o
[ 8%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/ops.cpp.o
[ 8%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/vec.cpp.o
[ 8%] Building C object deps/llama.cpp/ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/arch/x86/quants.c.o
[ 8%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/arch/x86/repack.cpp.o
[ 10%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/arange.cu.o
[ 11%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/conv2d-dw.cu.o
[ 11%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/concat.cu.o
[ 10%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/argmax.cu.o
[ 10%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/add-id.cu.o
[ 12%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/conv2d-transpose.cu.o
[ 12%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/convert.cu.o
[ 12%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/conv2d.cu.o
[ 13%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/cross-entropy-loss.cu.o
[ 13%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/cpy.cu.o
[ 13%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/count-equal.cu.o
[ 10%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/binbcast.cu.o
[ 10%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/clamp.cu.o
[ 13%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/cumsum.cu.o
[ 10%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/acc.cu.o
[ 10%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/argsort.cu.o
[ 11%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/conv-transpose-1d.cu.o
[ 13%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/diagmask.cu.o
[ 13%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/diag.cu.o
[ 14%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/fattn-chunked.cu.o
[ 14%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/fattn-tile.cu.o
[ 14%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/fattn-sparse.cu.o
[ 14%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/fattn.cu.o
[ 16%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/fattn-wmma-f16.cu.o
[ 17%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/gated_delta_net.cu.o
[ 17%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/gla.cu.o
[ 17%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/fill.cu.o
[ 17%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/mmq.cu.o
[ 18%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/im2col.cu.o
[ 18%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/mmf.cu.o
[ 18%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/ggml-cuda.cu.o
[ 18%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/getrows.cu.o
[ 18%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/mean.cu.o
[ 18%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/norm.cu.o
[ 19%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/mmvq.cu.o
[ 20%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/opt-step-sgd.cu.o
[ 19%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/moe-fused.cu.o
[ 20%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/opt-step-adamw.cu.o
[ 21%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/mmid.cu.o
[ 21%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/mmvf.cu.o
[ 21%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/out-prod.cu.o
[ 21%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/pad.cu.o
[ 22%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/pad_reflect_1d.cu.o
[ 22%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/quantize.cu.o
[ 22%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/pool2d.cu.o
[ 24%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/roll.cu.o
[ 24%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/rope.cu.o
[ 24%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/softcap.cu.o
[ 24%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/set.cu.o
[ 24%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/ssm-conv.cu.o
[ 25%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/set-rows.cu.o
[ 25%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/scale.cu.o
[ 26%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/solve_tri.cu.o
[ 26%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/softmax.cu.o
[ 26%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/ssm-scan.cu.o
[ 27%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/sum.cu.o
[ 27%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/sumrows.cu.o
[ 27%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-tile-instance-dkq256-dv256.cu.o
[ 27%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/tri.cu.o
[ 27%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-tile-instance-dkq128-dv128.cu.o
[ 27%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/top-k.cu.o
[ 28%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/turbo-wht.cu.o
[ 29%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/topk-moe.cu.o
[ 29%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/tsembd.cu.o
[ 31%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-tile-instance-dkq40-dv40.cu.o
[ 32%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-tile-instance-dkq112-dv112.cu.o
[ 32%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/upscale.cu.o
[ 32%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/wkv.cu.o
[ 32%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-tile-instance-dkq576-dv512.cu.o
[ 32%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/unary.cu.o
[ 32%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-tile-instance-dkq512-dv512.cu.o
[ 33%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu.o
[ 33%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-tile-instance-dkq72-dv72.cu.o
[ 34%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu.o
[ 35%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu.o
[ 39%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_32.cu.o
[ 39%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu.o
[ 39%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu.o
[ 39%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu.o
[ 33%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu.o
[ 41%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu.o
[ 41%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu.o
[ 41%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu.o
[ 33%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-tile-instance-dkq96-dv96.cu.o
[ 41%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu.o
[ 41%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu.o
[ 35%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-tile-instance-dkq64-dv64.cu.o
[ 35%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu.o
[ 41%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu.o
[ 41%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-iq1_s.cu.o
[ 41%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu.o
[ 35%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_32.cu.o
[ 41%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-iq2_xxs.cu.o
[ 35%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu.o
[ 35%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-tile-instance-dkq80-dv80.cu.o
[ 41%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-iq4_nl.cu.o
[ 42%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-mxfp4.cu.o
[ 42%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-iq4_xs.cu.o
[ 43%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-iq2_xs.cu.o
[ 36%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu.o
[ 43%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-iq3_s.cu.o
[ 43%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-iq2_s.cu.o
[ 37%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu.o
[ 44%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-iq3_xxs.cu.o
[ 37%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu.o
[ 44%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-q2_k.cu.o
[ 44%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-nvfp4.cu.o
[ 45%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-q5_0.cu.o
[ 45%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-q4_k.cu.o
[ 45%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-q4_1.cu.o
[ 47%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-q3_k.cu.o
[ 47%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-q5_1.cu.o
[ 47%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-q4_0.cu.o
[ 47%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-q8_0.cu.o
[ 47%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-q5_k.cu.o
[ 47%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmf-instance-ncols_3.cu.o
[ 47%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmf-instance-ncols_1.cu.o
[ 48%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmf-instance-ncols_7.cu.o
[ 49%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmf-instance-ncols_9.cu.o
[ 50%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-bf16-bf16.cu.o
[ 51%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmf-instance-ncols_10.cu.o
[ 48%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmf-instance-ncols_6.cu.o
[ 48%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmf-instance-ncols_12.cu.o
[ 48%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmf-instance-ncols_4.cu.o
[ 48%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmf-instance-ncols_8.cu.o
[ 49%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmf-instance-ncols_5.cu.o
[ 50%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-q6_k.cu.o
[ 54%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-bf16-q5_1.cu.o
[ 51%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmf-instance-ncols_11.cu.o
[ 55%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-f16-tq3_0.cu.o
[ 55%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-f16-f16.cu.o
[ 55%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-f16-q8_0.cu.o
[ 51%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-bf16-q4_1.cu.o
[ 51%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-bf16-f16.cu.o
[ 57%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmf-instance-ncols_13.cu.o
[ 57%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmf-instance-ncols_16.cu.o
[ 57%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmf-instance-ncols_15.cu.o
[ 58%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmf-instance-ncols_14.cu.o
[ 52%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-bf16-q4_0.cu.o
/media/per/work/tmp/lucebox-hub/dflash/deps/llama.cpp/ggml/src/ggml-cpu/ggml-cpu.c: In function ‘ggml_compute_forward’:
[ 52%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-bf16-q5_0.cu.o
[ 55%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-bf16-q8_0.cu.o
[ 59%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q4_0-q5_1.cu.o
[ 55%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-bf16-tq3_0.cu.o
[ 62%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q4_0-q8_0.cu.o
[ 62%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmf-instance-ncols_2.cu.o
[ 56%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-f16-q5_0.cu.o
[ 57%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-f16-bf16.cu.o
[ 57%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-f16-q5_1.cu.o
[ 57%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-f16-q4_1.cu.o
[ 57%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q4_0-bf16.cu.o
[ 57%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q4_0-f16.cu.o
[ 57%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-f16-q4_0.cu.o
[ 58%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q4_0-q5_0.cu.o
[ 62%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q4_1-bf16.cu.o
[ 58%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q4_0-q4_1.cu.o
/media/per/work/tmp/lucebox-hub/dflash/deps/llama.cpp/ggml/src/ggml-cpu/ggml-cpu.c:1704:5: warning: enumeration value ‘GGML_OP_MOE_FUSED’ not handled in switch [-Wswitch]
1704 | switch (tensor->op) {
| ^~~~~~
[ 64%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q4_1-tq3_0.cu.o
[ 59%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q4_0-q4_0.cu.o
[ 64%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q4_1-q8_0.cu.o
[ 64%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q4_1-q4_1.cu.o
[ 63%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q4_1-f16.cu.o
[ 63%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q4_1-q4_0.cu.o
[ 64%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q5_0-q4_0.cu.o
[ 65%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q4_1-q5_0.cu.o
[ 66%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q5_0-q4_1.cu.o
[ 63%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q4_0-tq3_0.cu.o
[ 66%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q4_1-q5_1.cu.o
[ 66%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q5_0-bf16.cu.o
[ 66%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q5_0-f16.cu.o
[ 66%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q5_0-q5_0.cu.o
[ 67%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q5_0-q8_0.cu.o
[ 67%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q5_1-bf16.cu.o
[ 67%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q5_0-q5_1.cu.o
[ 67%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q5_0-tq3_0.cu.o
[ 68%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q5_1-f16.cu.o
[ 68%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q5_1-q4_1.cu.o
[ 68%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q5_1-q4_0.cu.o
[ 68%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q5_1-q5_0.cu.o
[ 70%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q5_1-q5_1.cu.o
[ 70%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q8_0-f16.cu.o
[ 70%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q5_1-q8_0.cu.o
[ 71%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q8_0-bf16.cu.o
[ 71%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q8_0-q4_0.cu.o
[ 72%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q8_0-q4_1.cu.o
[ 72%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q5_1-tq3_0.cu.o
[ 72%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q8_0-q5_0.cu.o
[ 73%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q8_0-q8_0.cu.o
[ 74%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-tq3_0-q4_0.cu.o
[ 74%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q8_0-q5_1.cu.o
[ 74%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q8_0-tq3_0.cu.o
[ 74%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-tq3_0-q5_0.cu.o
[ 74%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-tq3_0-q4_1.cu.o
[ 74%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-tq3_0-q8_0.cu.o
[ 74%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-tq3_0-f16.cu.o
[ 74%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-tq3_0-bf16.cu.o
[ 75%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-tq3_0-q5_1.cu.o
[ 75%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-tq3_0-tq3_0.cu.o
/media/per/work/tmp/lucebox-hub/dflash/deps/llama.cpp/ggml/src/ggml-cpu/ops.cpp: In function ‘void ggml_compute_forward_clamp(const ggml_compute_params*, ggml_tensor*)’:
/media/per/work/tmp/lucebox-hub/dflash/deps/llama.cpp/ggml/src/ggml-cpu/ops.cpp:5553:12: warning: enumeration value ‘GGML_TYPE_TQ3_0’ not handled in switch [-Wswitch]
5553 | switch (src0->type) {
| ^
/media/per/work/tmp/lucebox-hub/dflash/deps/llama.cpp/ggml/src/ggml-cuda/moe-fused.cu(12): warning #177-D: function "dot_iq2_xs" was declared but never referenced
Remark: The warnings can be suppressed with "-diag-suppress <warning-number>"
/media/per/work/tmp/lucebox-hub/dflash/deps/llama.cpp/ggml/src/ggml-cuda/moe-fused.cu(32): warning #177-D: function "dot_iq3_xxs" was declared but never referenced
[ 77%] Linking CXX shared library libggml-cpu.so
[ 77%] Built target ggml-cpu
[ 78%] Linking CUDA shared library libggml-cuda.so
[ 78%] Built target ggml-cuda
[ 78%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml.dir/ggml-backend-dl.cpp.o
[ 79%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml.dir/ggml-backend-reg.cpp.o
[ 79%] Linking CXX shared library libggml.so
[ 79%] Built target ggml
[ 79%] Building CXX object CMakeFiles/dflash_common.dir/src/errors.cpp.o
[ 79%] Building CXX object CMakeFiles/dflash_common.dir/src/draft/draft_gguf_loader.cpp.o
[ 80%] Building CXX object CMakeFiles/dflash_common.dir/src/draft/draft_safetensors_loader.cpp.o
[ 80%] Building CXX object CMakeFiles/dflash_common.dir/src/qwen35/qwen35_target_graph.cpp.o
[ 80%] Building CXX object CMakeFiles/dflash_common.dir/src/qwen35/gguf_target_loader.cpp.o
[ 81%] Building CXX object CMakeFiles/dflash_common.dir/src/qwen3/qwen3_drafter.cpp.o
[ 81%] Building CXX object CMakeFiles/dflash_common.dir/src/draft/draft_graph.cpp.o
[ 81%] Building CXX object CMakeFiles/dflash_common.dir/src/qwen3/qwen3_graph.cpp.o
[ 81%] Building CXX object CMakeFiles/dflash_common.dir/src/qwen3/qwen3_loader.cpp.o
[ 81%] Building CXX object CMakeFiles/dflash_common.dir/src/qwen3/qwen3_daemon.cpp.o
[ 81%] Building CXX object CMakeFiles/dflash_common.dir/src/gemma4/gemma4_loader.cpp.o
[ 81%] Building CXX object CMakeFiles/dflash_common.dir/src/gemma4/gemma4_backend.cpp.o
[ 81%] Building CXX object CMakeFiles/dflash_common.dir/src/gemma4/gemma4_daemon.cpp.o
[ 83%] Building CXX object CMakeFiles/dflash_common.dir/src/qwen3/qwen3_backend.cpp.o
[ 83%] Building CXX object CMakeFiles/dflash_common.dir/src/gemma4/gemma4_graph.cpp.o
[ 83%] Building CXX object CMakeFiles/dflash_common.dir/src/gemma4/gemma4_dflash_target.cpp.o
[ 85%] Building CXX object CMakeFiles/dflash_common.dir/src/flashprefill_q8.cpp.o
[ 85%] Building CXX object CMakeFiles/dflash_common.dir/src/kv_quant.cpp.o
[ 85%] Building CXX object CMakeFiles/dflash_common.dir/src/kv_cache.cpp.o
[ 85%] Building CXX object CMakeFiles/dflash_common.dir/src/laguna/laguna_target_loader.cpp.o
[ 86%] Building CXX object CMakeFiles/dflash_common.dir/src/delta_net_chunked.cpp.o
[ 87%] Building CXX object CMakeFiles/dflash_common.dir/src/laguna/laguna_daemon.cpp.o
[ 87%] Building CXX object CMakeFiles/dflash_common.dir/src/laguna/laguna_target_graph.cpp.o
[ 87%] Building CXX object CMakeFiles/dflash_common.dir/src/laguna/laguna_backend.cpp.o
[ 87%] Building CXX object CMakeFiles/dflash_common.dir/src/common/dflash_feature_ring.cpp.o
[ 87%] Building CXX object CMakeFiles/dflash_common.dir/src/common/dflash_draft_ipc.cpp.o
[ 88%] Building CXX object CMakeFiles/dflash_common.dir/src/common/dflash_draft_ipc_daemon.cpp.o
[ 88%] Building CXX object CMakeFiles/dflash_common.dir/src/common/dflash_draft_graph.cpp.o
[ 88%] Building CXX object CMakeFiles/dflash_common.dir/src/common/dflash_spec_decode.cpp.o
[ 89%] Building CXX object CMakeFiles/dflash_common.dir/src/common/dflash_capture.cpp.o
[ 89%] Building CXX object CMakeFiles/dflash_common.dir/src/qwen35/layer_split_forward.cpp.o
[ 90%] Building CXX object CMakeFiles/dflash_common.dir/src/qwen35/layer_split_daemon.cpp.o
[ 90%] Building CXX object CMakeFiles/dflash_common.dir/src/qwen35/qwen35_dflash_target.cpp.o
[ 91%] Building CXX object CMakeFiles/dflash_common.dir/src/qwen35/qwen35_backend.cpp.o
[ 91%] Building CXX object CMakeFiles/dflash_common.dir/src/qwen35/qwen35_layer_split_dflash_target.cpp.o
[ 91%] Building CXX object CMakeFiles/dflash_common.dir/src/qwen35/graph_builders.cpp.o
[ 91%] Building CXX object CMakeFiles/dflash_common.dir/src/qwen35/layer_split_daemon_loop.cpp.o
[ 91%] Building CXX object CMakeFiles/dflash_common.dir/src/common/sampler.cpp.o
[ 93%] Building CXX object CMakeFiles/dflash_common.dir/src/common/daemon_loop.cpp.o
[ 94%] Building CXX object CMakeFiles/dflash_common.dir/src/common/layer_split_utils.cpp.o
[ 94%] Building CXX object CMakeFiles/dflash_common.dir/src/common/gguf_inspect.cpp.o
[ 94%] Building CXX object CMakeFiles/dflash_common.dir/src/common/peer_access.cpp.o
[ 94%] Building CXX object CMakeFiles/dflash_common.dir/src/common/ddtree.cpp.o
[ 94%] Building CXX object CMakeFiles/dflash_common.dir/src/server/sse_emitter.cpp.o
[ 94%] Building CXX object CMakeFiles/dflash_common.dir/src/common/backend_factory.cpp.o
[ 95%] Building CXX object CMakeFiles/dflash_common.dir/src/server/tokenizer.cpp.o
[ 96%] Building CXX object CMakeFiles/dflash_common.dir/src/server/reasoning.cpp.o
[ 96%] Building CXX object CMakeFiles/dflash_common.dir/src/server/chat_template.cpp.o
[ 94%] Building CXX object CMakeFiles/dflash_common.dir/src/server/tool_parser.cpp.o
[ 94%] Building CXX object CMakeFiles/dflash_common.dir/src/qwen35/qwen35_daemon.cpp.o
[ 96%] Building CXX object CMakeFiles/dflash_common.dir/src/server/prefix_cache.cpp.o
[ 95%] Building CXX object CMakeFiles/dflash_common.dir/src/server/tool_memory.cpp.o
[ 97%] Building CXX object CMakeFiles/dflash_common.dir/src/server/disk_prefix_cache.cpp.o
[ 97%] Building CUDA object CMakeFiles/dflash_common.dir/src/flashprefill_f16.cu.o
[ 97%] Building CXX object CMakeFiles/dflash_common.dir/src/cuda_cross_device_copy.cpp.o
[ 98%] Building CXX object CMakeFiles/dflash_common.dir/src/flashprefill.cpp.o
[ 98%] Building CXX object CMakeFiles/dflash_common.dir/src/flashprefill_select.cpp.o
/media/per/work/tmp/lucebox-hub/dflash/src/common/ddtree.cpp:24: warning: ignoring ‘#pragma omp parallel’ [-Wunknown-pragmas]
24 | #pragma omp parallel for schedule(static)
|
/media/per/work/tmp/lucebox-hub/dflash/src/gemma4/gemma4_graph.cpp: In function ‘bool dflash::common::gemma4_prefill_bsa(ggml_backend_t, const Gemma4Weights&, Gemma4Cache&, const float*, const int32_t*, int, std::vector<float>&)’:
/media/per/work/tmp/lucebox-hub/dflash/src/gemma4/gemma4_graph.cpp:1150:19: warning: unused variable ‘kv_source_il’ [-Wunused-variable]
1150 | const int kv_source_il = cache.kv_source[il];
| ^~~~~~~~~~~~
/media/per/work/tmp/lucebox-hub/dflash/src/flashprefill.cpp: In function ‘int dflash::common::flashprefill::flash_prefill_forward_f16(const void*, const void*, const void*, void*, int, int, int, int, int, float, const FlashPrefillConfig&)’:
/media/per/work/tmp/lucebox-hub/dflash/src/flashprefill.cpp:624:15: warning: unused variable ‘sm’ [-Wunused-variable]
624 | const int sm = prop.major * 10 + prop.minor;
| ^~
/media/per/work/tmp/lucebox-hub/dflash/src/common/backend_factory.cpp: In function ‘std::unique_ptr<dflash::common::ModelBackend> dflash::common::create_backend(const BackendArgs&)’:
/media/per/work/tmp/lucebox-hub/dflash/src/common/backend_factory.cpp:93:35: error: ‘const struct dflash::common::BackendArgs’ has no member named ‘draft_gpu’
93 | gcfg.draft_gpu = args.draft_gpu;
| ^~~~~~~~~
In file included from /media/per/work/tmp/lucebox-hub/dflash/src/common/dflash_draft_ipc.h:16,
from /media/per/work/tmp/lucebox-hub/dflash/src/common/dflash_spec_decode.h:16,
from /media/per/work/tmp/lucebox-hub/dflash/src/common/dflash_spec_decode.cpp:3:
/media/per/work/tmp/lucebox-hub/dflash/src/common/io_utils.h: In function ‘void dflash::common::stream_emit_fd(int, int32_t)’:
/media/per/work/tmp/lucebox-hub/dflash/src/common/io_utils.h:93:18: warning: ignoring return value of ‘ssize_t write(int, const void*, size_t)’ declared with attribute ‘warn_unused_result’ [-Wunused-result]
93 | (void)::write(stream_fd, &v, sizeof(v));
| ~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~
gmake[3]: *** [CMakeFiles/dflash_common.dir/build.make:650: CMakeFiles/dflash_common.dir/src/common/backend_factory.cpp.o] Error 1
gmake[3]: *** Waiting for unfinished jobs....
In file included from /media/per/work/tmp/lucebox-hub/dflash/src/common/dflash_draft_ipc.h:16,
from /media/per/work/tmp/lucebox-hub/dflash/src/qwen35/layer_split_forward.h:10,
from /media/per/work/tmp/lucebox-hub/dflash/src/qwen35/layer_split_daemon.h:9,
from /media/per/work/tmp/lucebox-hub/dflash/src/qwen35/layer_split_daemon.cpp:3:
/media/per/work/tmp/lucebox-hub/dflash/src/common/io_utils.h: In function ‘void dflash::common::stream_emit_fd(int, int32_t)’:
/media/per/work/tmp/lucebox-hub/dflash/src/common/io_utils.h:93:18: warning: ignoring return value of ‘ssize_t write(int, const void*, size_t)’ declared with attribute ‘warn_unused_result’ [-Wunused-result]
93 | (void)::write(stream_fd, &v, sizeof(v));
| ~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~
/media/per/work/tmp/lucebox-hub/dflash/src/qwen3/qwen3_backend.cpp: In member function ‘bool dflash::common::Qwen3Backend::do_step(const float*, int, int, std::vector<float>&)’:
/media/per/work/tmp/lucebox-hub/dflash/src/qwen3/qwen3_backend.cpp:177:15: warning: unused variable ‘ff’ [-Wunused-variable]
177 | const int ff = w_.n_ff;
| ^~
In file included from /media/per/work/tmp/lucebox-hub/dflash/src/common/dflash_draft_ipc.h:16,
from /media/per/work/tmp/lucebox-hub/dflash/src/qwen35/layer_split_forward.h:10,
from /media/per/work/tmp/lucebox-hub/dflash/src/qwen35/layer_split_daemon.h:9,
from /media/per/work/tmp/lucebox-hub/dflash/src/qwen35/layer_split_daemon_loop.cpp:5:
/media/per/work/tmp/lucebox-hub/dflash/src/common/io_utils.h: In function ‘void dflash::common::stream_emit_fd(int, int32_t)’:
/media/per/work/tmp/lucebox-hub/dflash/src/common/io_utils.h:93:18: warning: ignoring return value of ‘ssize_t write(int, const void*, size_t)’ declared with attribute ‘warn_unused_result’ [-Wunused-result]
93 | (void)::write(stream_fd, &v, sizeof(v));
| ~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~
gmake[2]: *** [CMakeFiles/Makefile2:218: CMakeFiles/dflash_common.dir/all] Error 2
gmake[1]: *** [CMakeFiles/Makefile2:790: CMakeFiles/test_dflash.dir/rule] Error 2
gmake: *** [Makefile:426: test_dflash] Error 2