Skip to content

Build error #253

@merbanan

Description

@merbanan
cmake -B dflash/build -S dflash -DCMAKE_BUILD_TYPE=Release -DCMAKE_CUDA_ARCHITECTURES=70
-- The C compiler identification is GNU 13.3.0
-- The CXX compiler identification is GNU 13.3.0
-- The CUDA compiler identification is NVIDIA 12.0.140
-- Detecting C compiler ABI info
-- Detecting C compiler ABI info - done
-- Check for working C compiler: /usr/bin/cc - skipped
-- Detecting C compile features
-- Detecting C compile features - done
-- Detecting CXX compiler ABI info
-- Detecting CXX compiler ABI info - done
-- Check for working CXX compiler: /usr/bin/c++ - skipped
-- Detecting CXX compile features
-- Detecting CXX compile features - done
-- Detecting CUDA compiler ABI info
-- Detecting CUDA compiler ABI info - done
-- Check for working CUDA compiler: /usr/bin/nvcc - skipped
-- Detecting CUDA compile features
-- Detecting CUDA compile features - done
CMake Warning (dev) at /usr/share/cmake-3.28/Modules/FetchContent.cmake:1331 (message):
  The DOWNLOAD_EXTRACT_TIMESTAMP option was not given and policy CMP0135 is
  not set.  The policy's OLD behavior will be used.  When using a URL
  download, the timestamps of extracted files should preferably be that of
  the time of extraction, otherwise code that depends on the extracted
  contents might not be rebuilt if the URL changes.  The OLD behavior
  preserves the timestamps from the archive instead, but this is usually not
  what you want.  Update your project to the NEW behavior or specify the
  DOWNLOAD_EXTRACT_TIMESTAMP option with a value of true to avoid this
  robustness issue.
Call Stack (most recent call first):
  CMakeLists.txt:22 (FetchContent_Declare)
This warning is for project developers.  Use -Wno-dev to suppress it.

-- Using the multi-header code from /media/per/work/tmp/lucebox-hub/dflash/build/_deps/json-src/include/
-- The ASM compiler identification is GNU
-- Found assembler: /usr/bin/cc
-- Performing Test CMAKE_HAVE_LIBC_PTHREAD
-- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Success
-- Found Threads: TRUE  
-- Warning: ccache not found - consider installing it for faster compilation or disable this warning with GGML_CCACHE=OFF
-- CMAKE_SYSTEM_PROCESSOR: x86_64
-- GGML_SYSTEM_ARCH: x86
-- Including CPU backend
-- Found OpenMP_C: -fopenmp (found version "4.5") 
-- Found OpenMP_CXX: -fopenmp (found version "4.5") 
-- Found OpenMP: TRUE (found version "4.5")  
-- x86 detected
-- Adding CPU backend variant ggml-cpu: -march=native 
-- Found CUDAToolkit: /usr/include (found version "12.0.140") 
-- CUDA Toolkit found
-- Using CMAKE_CUDA_ARCHITECTURES=70 CMAKE_CUDA_ARCHITECTURES_NATIVE=70-real
-- Could NOT find NCCL (missing: NCCL_LIBRARY NCCL_INCLUDE_DIR) 
-- Warning: NCCL not found, performance for multiple CUDA GPUs will be suboptimal
-- CUDA host compiler is GNU 12.4.0
-- Including CUDA backend
-- ggml version: 0.9.11
-- ggml commit:  b896cf696
CMake Warning at CMakeLists.txt:420 (message):
  DFLASH27B_ENABLE_BSA=ON requested but CUDA_ARCHITECTURES contains '70'
  (<80); disabling BSA (the spec-prefill path will fall back to the WMMA
  kernel).


-- Configuring done (2.3s)
-- Generating done (0.0s)
-- Build files have been written to: /media/per/work/tmp/lucebox-hub/dflash/build

cmake --build dflash/build --target test_dflash -j

[  1%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml-base.dir/ggml.cpp.o
[  1%] Building C object deps/llama.cpp/ggml/src/CMakeFiles/ggml-base.dir/ggml.c.o
[  2%] Building C object deps/llama.cpp/ggml/src/CMakeFiles/ggml-base.dir/ggml-alloc.c.o
[  2%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml-base.dir/ggml-backend.cpp.o
[  2%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml-base.dir/ggml-opt.cpp.o
[  2%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml-base.dir/ggml-backend-meta.cpp.o
[  3%] Building C object deps/llama.cpp/ggml/src/CMakeFiles/ggml-base.dir/ggml-quants.c.o
[  3%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml-base.dir/gguf.cpp.o
[  3%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml-base.dir/ggml-threading.cpp.o
[  3%] Linking CXX shared library libggml-base.so
[  3%] Built target ggml-base
[  4%] Building C object deps/llama.cpp/ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/ggml-cpu.c.o
[  4%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/ggml-cpu.cpp.o
[  4%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/repack.cpp.o
[  4%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/hbm.cpp.o
[  4%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/traits.cpp.o
[  5%] Building C object deps/llama.cpp/ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/quants.c.o
[  5%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/amx/amx.cpp.o
[  6%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/binary-ops.cpp.o
[  6%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/amx/mmq.cpp.o
[  6%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/unary-ops.cpp.o
[  8%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/ops.cpp.o
[  8%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/vec.cpp.o
[  8%] Building C object deps/llama.cpp/ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/arch/x86/quants.c.o
[  8%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/arch/x86/repack.cpp.o
[ 10%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/arange.cu.o
[ 11%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/conv2d-dw.cu.o
[ 11%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/concat.cu.o
[ 10%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/argmax.cu.o
[ 10%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/add-id.cu.o
[ 12%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/conv2d-transpose.cu.o
[ 12%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/convert.cu.o
[ 12%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/conv2d.cu.o
[ 13%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/cross-entropy-loss.cu.o
[ 13%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/cpy.cu.o
[ 13%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/count-equal.cu.o
[ 10%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/binbcast.cu.o
[ 10%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/clamp.cu.o
[ 13%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/cumsum.cu.o
[ 10%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/acc.cu.o
[ 10%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/argsort.cu.o
[ 11%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/conv-transpose-1d.cu.o
[ 13%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/diagmask.cu.o
[ 13%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/diag.cu.o
[ 14%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/fattn-chunked.cu.o
[ 14%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/fattn-tile.cu.o
[ 14%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/fattn-sparse.cu.o
[ 14%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/fattn.cu.o
[ 16%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/fattn-wmma-f16.cu.o
[ 17%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/gated_delta_net.cu.o
[ 17%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/gla.cu.o
[ 17%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/fill.cu.o
[ 17%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/mmq.cu.o
[ 18%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/im2col.cu.o
[ 18%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/mmf.cu.o
[ 18%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/ggml-cuda.cu.o
[ 18%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/getrows.cu.o
[ 18%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/mean.cu.o
[ 18%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/norm.cu.o
[ 19%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/mmvq.cu.o
[ 20%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/opt-step-sgd.cu.o
[ 19%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/moe-fused.cu.o
[ 20%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/opt-step-adamw.cu.o
[ 21%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/mmid.cu.o
[ 21%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/mmvf.cu.o
[ 21%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/out-prod.cu.o
[ 21%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/pad.cu.o
[ 22%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/pad_reflect_1d.cu.o
[ 22%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/quantize.cu.o
[ 22%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/pool2d.cu.o
[ 24%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/roll.cu.o
[ 24%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/rope.cu.o
[ 24%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/softcap.cu.o
[ 24%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/set.cu.o
[ 24%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/ssm-conv.cu.o
[ 25%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/set-rows.cu.o
[ 25%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/scale.cu.o
[ 26%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/solve_tri.cu.o
[ 26%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/softmax.cu.o
[ 26%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/ssm-scan.cu.o
[ 27%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/sum.cu.o
[ 27%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/sumrows.cu.o
[ 27%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-tile-instance-dkq256-dv256.cu.o
[ 27%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/tri.cu.o
[ 27%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-tile-instance-dkq128-dv128.cu.o
[ 27%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/top-k.cu.o
[ 28%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/turbo-wht.cu.o
[ 29%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/topk-moe.cu.o
[ 29%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/tsembd.cu.o
[ 31%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-tile-instance-dkq40-dv40.cu.o
[ 32%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-tile-instance-dkq112-dv112.cu.o
[ 32%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/upscale.cu.o
[ 32%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/wkv.cu.o
[ 32%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-tile-instance-dkq576-dv512.cu.o
[ 32%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/unary.cu.o
[ 32%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-tile-instance-dkq512-dv512.cu.o
[ 33%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu.o
[ 33%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-tile-instance-dkq72-dv72.cu.o
[ 34%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu.o
[ 35%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu.o
[ 39%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_32.cu.o
[ 39%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu.o
[ 39%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu.o
[ 39%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu.o
[ 33%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu.o
[ 41%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu.o
[ 41%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu.o
[ 41%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu.o
[ 33%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-tile-instance-dkq96-dv96.cu.o
[ 41%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu.o
[ 41%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu.o
[ 35%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-tile-instance-dkq64-dv64.cu.o
[ 35%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu.o
[ 41%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu.o
[ 41%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-iq1_s.cu.o
[ 41%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu.o
[ 35%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_32.cu.o
[ 41%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-iq2_xxs.cu.o
[ 35%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu.o
[ 35%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-tile-instance-dkq80-dv80.cu.o
[ 41%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-iq4_nl.cu.o
[ 42%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-mxfp4.cu.o
[ 42%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-iq4_xs.cu.o
[ 43%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-iq2_xs.cu.o
[ 36%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu.o
[ 43%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-iq3_s.cu.o
[ 43%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-iq2_s.cu.o
[ 37%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu.o
[ 44%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-iq3_xxs.cu.o
[ 37%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu.o
[ 44%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-q2_k.cu.o
[ 44%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-nvfp4.cu.o
[ 45%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-q5_0.cu.o
[ 45%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-q4_k.cu.o
[ 45%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-q4_1.cu.o
[ 47%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-q3_k.cu.o
[ 47%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-q5_1.cu.o
[ 47%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-q4_0.cu.o
[ 47%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-q8_0.cu.o
[ 47%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-q5_k.cu.o
[ 47%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmf-instance-ncols_3.cu.o
[ 47%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmf-instance-ncols_1.cu.o
[ 48%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmf-instance-ncols_7.cu.o
[ 49%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmf-instance-ncols_9.cu.o
[ 50%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-bf16-bf16.cu.o
[ 51%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmf-instance-ncols_10.cu.o
[ 48%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmf-instance-ncols_6.cu.o
[ 48%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmf-instance-ncols_12.cu.o
[ 48%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmf-instance-ncols_4.cu.o
[ 48%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmf-instance-ncols_8.cu.o
[ 49%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmf-instance-ncols_5.cu.o
[ 50%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmq-instance-q6_k.cu.o
[ 54%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-bf16-q5_1.cu.o
[ 51%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmf-instance-ncols_11.cu.o
[ 55%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-f16-tq3_0.cu.o
[ 55%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-f16-f16.cu.o
[ 55%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-f16-q8_0.cu.o
[ 51%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-bf16-q4_1.cu.o
[ 51%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-bf16-f16.cu.o
[ 57%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmf-instance-ncols_13.cu.o
[ 57%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmf-instance-ncols_16.cu.o
[ 57%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmf-instance-ncols_15.cu.o
[ 58%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmf-instance-ncols_14.cu.o
[ 52%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-bf16-q4_0.cu.o
/media/per/work/tmp/lucebox-hub/dflash/deps/llama.cpp/ggml/src/ggml-cpu/ggml-cpu.c: In function ‘ggml_compute_forward’:
[ 52%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-bf16-q5_0.cu.o
[ 55%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-bf16-q8_0.cu.o
[ 59%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q4_0-q5_1.cu.o
[ 55%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-bf16-tq3_0.cu.o
[ 62%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q4_0-q8_0.cu.o
[ 62%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/mmf-instance-ncols_2.cu.o
[ 56%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-f16-q5_0.cu.o
[ 57%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-f16-bf16.cu.o
[ 57%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-f16-q5_1.cu.o
[ 57%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-f16-q4_1.cu.o
[ 57%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q4_0-bf16.cu.o
[ 57%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q4_0-f16.cu.o
[ 57%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-f16-q4_0.cu.o
[ 58%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q4_0-q5_0.cu.o
[ 62%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q4_1-bf16.cu.o
[ 58%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q4_0-q4_1.cu.o
/media/per/work/tmp/lucebox-hub/dflash/deps/llama.cpp/ggml/src/ggml-cpu/ggml-cpu.c:1704:5: warning: enumeration value ‘GGML_OP_MOE_FUSED’ not handled in switch [-Wswitch]
 1704 |     switch (tensor->op) {
      |     ^~~~~~
[ 64%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q4_1-tq3_0.cu.o
[ 59%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q4_0-q4_0.cu.o
[ 64%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q4_1-q8_0.cu.o
[ 64%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q4_1-q4_1.cu.o
[ 63%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q4_1-f16.cu.o
[ 63%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q4_1-q4_0.cu.o
[ 64%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q5_0-q4_0.cu.o
[ 65%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q4_1-q5_0.cu.o
[ 66%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q5_0-q4_1.cu.o
[ 63%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q4_0-tq3_0.cu.o
[ 66%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q4_1-q5_1.cu.o
[ 66%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q5_0-bf16.cu.o
[ 66%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q5_0-f16.cu.o
[ 66%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q5_0-q5_0.cu.o
[ 67%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q5_0-q8_0.cu.o
[ 67%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q5_1-bf16.cu.o
[ 67%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q5_0-q5_1.cu.o
[ 67%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q5_0-tq3_0.cu.o
[ 68%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q5_1-f16.cu.o
[ 68%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q5_1-q4_1.cu.o
[ 68%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q5_1-q4_0.cu.o
[ 68%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q5_1-q5_0.cu.o
[ 70%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q5_1-q5_1.cu.o
[ 70%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q8_0-f16.cu.o
[ 70%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q5_1-q8_0.cu.o
[ 71%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q8_0-bf16.cu.o
[ 71%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q8_0-q4_0.cu.o
[ 72%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q8_0-q4_1.cu.o
[ 72%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q5_1-tq3_0.cu.o
[ 72%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q8_0-q5_0.cu.o
[ 73%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q8_0-q8_0.cu.o
[ 74%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-tq3_0-q4_0.cu.o
[ 74%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q8_0-q5_1.cu.o
[ 74%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-q8_0-tq3_0.cu.o
[ 74%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-tq3_0-q5_0.cu.o
[ 74%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-tq3_0-q4_1.cu.o
[ 74%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-tq3_0-q8_0.cu.o
[ 74%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-tq3_0-f16.cu.o
[ 74%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-tq3_0-bf16.cu.o
[ 75%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-tq3_0-q5_1.cu.o
[ 75%] Building CUDA object deps/llama.cpp/ggml/src/ggml-cuda/CMakeFiles/ggml-cuda.dir/template-instances/fattn-vec-instance-tq3_0-tq3_0.cu.o
/media/per/work/tmp/lucebox-hub/dflash/deps/llama.cpp/ggml/src/ggml-cpu/ops.cpp: In function ‘void ggml_compute_forward_clamp(const ggml_compute_params*, ggml_tensor*)’:
/media/per/work/tmp/lucebox-hub/dflash/deps/llama.cpp/ggml/src/ggml-cpu/ops.cpp:5553:12: warning: enumeration value ‘GGML_TYPE_TQ3_0’ not handled in switch [-Wswitch]
 5553 |     switch (src0->type) {
      |            ^
/media/per/work/tmp/lucebox-hub/dflash/deps/llama.cpp/ggml/src/ggml-cuda/moe-fused.cu(12): warning #177-D: function "dot_iq2_xs" was declared but never referenced

Remark: The warnings can be suppressed with "-diag-suppress <warning-number>"

/media/per/work/tmp/lucebox-hub/dflash/deps/llama.cpp/ggml/src/ggml-cuda/moe-fused.cu(32): warning #177-D: function "dot_iq3_xxs" was declared but never referenced

[ 77%] Linking CXX shared library libggml-cpu.so
[ 77%] Built target ggml-cpu
[ 78%] Linking CUDA shared library libggml-cuda.so
[ 78%] Built target ggml-cuda
[ 78%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml.dir/ggml-backend-dl.cpp.o
[ 79%] Building CXX object deps/llama.cpp/ggml/src/CMakeFiles/ggml.dir/ggml-backend-reg.cpp.o
[ 79%] Linking CXX shared library libggml.so
[ 79%] Built target ggml
[ 79%] Building CXX object CMakeFiles/dflash_common.dir/src/errors.cpp.o
[ 79%] Building CXX object CMakeFiles/dflash_common.dir/src/draft/draft_gguf_loader.cpp.o
[ 80%] Building CXX object CMakeFiles/dflash_common.dir/src/draft/draft_safetensors_loader.cpp.o
[ 80%] Building CXX object CMakeFiles/dflash_common.dir/src/qwen35/qwen35_target_graph.cpp.o
[ 80%] Building CXX object CMakeFiles/dflash_common.dir/src/qwen35/gguf_target_loader.cpp.o
[ 81%] Building CXX object CMakeFiles/dflash_common.dir/src/qwen3/qwen3_drafter.cpp.o
[ 81%] Building CXX object CMakeFiles/dflash_common.dir/src/draft/draft_graph.cpp.o
[ 81%] Building CXX object CMakeFiles/dflash_common.dir/src/qwen3/qwen3_graph.cpp.o
[ 81%] Building CXX object CMakeFiles/dflash_common.dir/src/qwen3/qwen3_loader.cpp.o
[ 81%] Building CXX object CMakeFiles/dflash_common.dir/src/qwen3/qwen3_daemon.cpp.o
[ 81%] Building CXX object CMakeFiles/dflash_common.dir/src/gemma4/gemma4_loader.cpp.o
[ 81%] Building CXX object CMakeFiles/dflash_common.dir/src/gemma4/gemma4_backend.cpp.o
[ 81%] Building CXX object CMakeFiles/dflash_common.dir/src/gemma4/gemma4_daemon.cpp.o
[ 83%] Building CXX object CMakeFiles/dflash_common.dir/src/qwen3/qwen3_backend.cpp.o
[ 83%] Building CXX object CMakeFiles/dflash_common.dir/src/gemma4/gemma4_graph.cpp.o
[ 83%] Building CXX object CMakeFiles/dflash_common.dir/src/gemma4/gemma4_dflash_target.cpp.o
[ 85%] Building CXX object CMakeFiles/dflash_common.dir/src/flashprefill_q8.cpp.o
[ 85%] Building CXX object CMakeFiles/dflash_common.dir/src/kv_quant.cpp.o
[ 85%] Building CXX object CMakeFiles/dflash_common.dir/src/kv_cache.cpp.o
[ 85%] Building CXX object CMakeFiles/dflash_common.dir/src/laguna/laguna_target_loader.cpp.o
[ 86%] Building CXX object CMakeFiles/dflash_common.dir/src/delta_net_chunked.cpp.o
[ 87%] Building CXX object CMakeFiles/dflash_common.dir/src/laguna/laguna_daemon.cpp.o
[ 87%] Building CXX object CMakeFiles/dflash_common.dir/src/laguna/laguna_target_graph.cpp.o
[ 87%] Building CXX object CMakeFiles/dflash_common.dir/src/laguna/laguna_backend.cpp.o
[ 87%] Building CXX object CMakeFiles/dflash_common.dir/src/common/dflash_feature_ring.cpp.o
[ 87%] Building CXX object CMakeFiles/dflash_common.dir/src/common/dflash_draft_ipc.cpp.o
[ 88%] Building CXX object CMakeFiles/dflash_common.dir/src/common/dflash_draft_ipc_daemon.cpp.o
[ 88%] Building CXX object CMakeFiles/dflash_common.dir/src/common/dflash_draft_graph.cpp.o
[ 88%] Building CXX object CMakeFiles/dflash_common.dir/src/common/dflash_spec_decode.cpp.o
[ 89%] Building CXX object CMakeFiles/dflash_common.dir/src/common/dflash_capture.cpp.o
[ 89%] Building CXX object CMakeFiles/dflash_common.dir/src/qwen35/layer_split_forward.cpp.o
[ 90%] Building CXX object CMakeFiles/dflash_common.dir/src/qwen35/layer_split_daemon.cpp.o
[ 90%] Building CXX object CMakeFiles/dflash_common.dir/src/qwen35/qwen35_dflash_target.cpp.o
[ 91%] Building CXX object CMakeFiles/dflash_common.dir/src/qwen35/qwen35_backend.cpp.o
[ 91%] Building CXX object CMakeFiles/dflash_common.dir/src/qwen35/qwen35_layer_split_dflash_target.cpp.o
[ 91%] Building CXX object CMakeFiles/dflash_common.dir/src/qwen35/graph_builders.cpp.o
[ 91%] Building CXX object CMakeFiles/dflash_common.dir/src/qwen35/layer_split_daemon_loop.cpp.o
[ 91%] Building CXX object CMakeFiles/dflash_common.dir/src/common/sampler.cpp.o
[ 93%] Building CXX object CMakeFiles/dflash_common.dir/src/common/daemon_loop.cpp.o
[ 94%] Building CXX object CMakeFiles/dflash_common.dir/src/common/layer_split_utils.cpp.o
[ 94%] Building CXX object CMakeFiles/dflash_common.dir/src/common/gguf_inspect.cpp.o
[ 94%] Building CXX object CMakeFiles/dflash_common.dir/src/common/peer_access.cpp.o
[ 94%] Building CXX object CMakeFiles/dflash_common.dir/src/common/ddtree.cpp.o
[ 94%] Building CXX object CMakeFiles/dflash_common.dir/src/server/sse_emitter.cpp.o
[ 94%] Building CXX object CMakeFiles/dflash_common.dir/src/common/backend_factory.cpp.o
[ 95%] Building CXX object CMakeFiles/dflash_common.dir/src/server/tokenizer.cpp.o
[ 96%] Building CXX object CMakeFiles/dflash_common.dir/src/server/reasoning.cpp.o
[ 96%] Building CXX object CMakeFiles/dflash_common.dir/src/server/chat_template.cpp.o
[ 94%] Building CXX object CMakeFiles/dflash_common.dir/src/server/tool_parser.cpp.o
[ 94%] Building CXX object CMakeFiles/dflash_common.dir/src/qwen35/qwen35_daemon.cpp.o
[ 96%] Building CXX object CMakeFiles/dflash_common.dir/src/server/prefix_cache.cpp.o
[ 95%] Building CXX object CMakeFiles/dflash_common.dir/src/server/tool_memory.cpp.o
[ 97%] Building CXX object CMakeFiles/dflash_common.dir/src/server/disk_prefix_cache.cpp.o
[ 97%] Building CUDA object CMakeFiles/dflash_common.dir/src/flashprefill_f16.cu.o
[ 97%] Building CXX object CMakeFiles/dflash_common.dir/src/cuda_cross_device_copy.cpp.o
[ 98%] Building CXX object CMakeFiles/dflash_common.dir/src/flashprefill.cpp.o
[ 98%] Building CXX object CMakeFiles/dflash_common.dir/src/flashprefill_select.cpp.o
/media/per/work/tmp/lucebox-hub/dflash/src/common/ddtree.cpp:24: warning: ignoring ‘#pragma omp parallel’ [-Wunknown-pragmas]
   24 |     #pragma omp parallel for schedule(static)
      | 
/media/per/work/tmp/lucebox-hub/dflash/src/gemma4/gemma4_graph.cpp: In function ‘bool dflash::common::gemma4_prefill_bsa(ggml_backend_t, const Gemma4Weights&, Gemma4Cache&, const float*, const int32_t*, int, std::vector<float>&)’:
/media/per/work/tmp/lucebox-hub/dflash/src/gemma4/gemma4_graph.cpp:1150:19: warning: unused variable ‘kv_source_il’ [-Wunused-variable]
 1150 |         const int kv_source_il = cache.kv_source[il];
      |                   ^~~~~~~~~~~~
/media/per/work/tmp/lucebox-hub/dflash/src/flashprefill.cpp: In function ‘int dflash::common::flashprefill::flash_prefill_forward_f16(const void*, const void*, const void*, void*, int, int, int, int, int, float, const FlashPrefillConfig&)’:
/media/per/work/tmp/lucebox-hub/dflash/src/flashprefill.cpp:624:15: warning: unused variable ‘sm’ [-Wunused-variable]
  624 |     const int sm = prop.major * 10 + prop.minor;
      |               ^~
/media/per/work/tmp/lucebox-hub/dflash/src/common/backend_factory.cpp: In function ‘std::unique_ptr<dflash::common::ModelBackend> dflash::common::create_backend(const BackendArgs&)’:
/media/per/work/tmp/lucebox-hub/dflash/src/common/backend_factory.cpp:93:35: error: ‘const struct dflash::common::BackendArgs’ has no member named ‘draft_gpu’
   93 |         gcfg.draft_gpu     = args.draft_gpu;
      |                                   ^~~~~~~~~
In file included from /media/per/work/tmp/lucebox-hub/dflash/src/common/dflash_draft_ipc.h:16,
                 from /media/per/work/tmp/lucebox-hub/dflash/src/common/dflash_spec_decode.h:16,
                 from /media/per/work/tmp/lucebox-hub/dflash/src/common/dflash_spec_decode.cpp:3:
/media/per/work/tmp/lucebox-hub/dflash/src/common/io_utils.h: In function ‘void dflash::common::stream_emit_fd(int, int32_t)’:
/media/per/work/tmp/lucebox-hub/dflash/src/common/io_utils.h:93:18: warning: ignoring return value of ‘ssize_t write(int, const void*, size_t)’ declared with attribute ‘warn_unused_result’ [-Wunused-result]
   93 |     (void)::write(stream_fd, &v, sizeof(v));
      |           ~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~
gmake[3]: *** [CMakeFiles/dflash_common.dir/build.make:650: CMakeFiles/dflash_common.dir/src/common/backend_factory.cpp.o] Error 1
gmake[3]: *** Waiting for unfinished jobs....
In file included from /media/per/work/tmp/lucebox-hub/dflash/src/common/dflash_draft_ipc.h:16,
                 from /media/per/work/tmp/lucebox-hub/dflash/src/qwen35/layer_split_forward.h:10,
                 from /media/per/work/tmp/lucebox-hub/dflash/src/qwen35/layer_split_daemon.h:9,
                 from /media/per/work/tmp/lucebox-hub/dflash/src/qwen35/layer_split_daemon.cpp:3:
/media/per/work/tmp/lucebox-hub/dflash/src/common/io_utils.h: In function ‘void dflash::common::stream_emit_fd(int, int32_t)’:
/media/per/work/tmp/lucebox-hub/dflash/src/common/io_utils.h:93:18: warning: ignoring return value of ‘ssize_t write(int, const void*, size_t)’ declared with attribute ‘warn_unused_result’ [-Wunused-result]
   93 |     (void)::write(stream_fd, &v, sizeof(v));
      |           ~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~
/media/per/work/tmp/lucebox-hub/dflash/src/qwen3/qwen3_backend.cpp: In member function ‘bool dflash::common::Qwen3Backend::do_step(const float*, int, int, std::vector<float>&)’:
/media/per/work/tmp/lucebox-hub/dflash/src/qwen3/qwen3_backend.cpp:177:15: warning: unused variable ‘ff’ [-Wunused-variable]
  177 |     const int ff     = w_.n_ff;
      |               ^~
In file included from /media/per/work/tmp/lucebox-hub/dflash/src/common/dflash_draft_ipc.h:16,
                 from /media/per/work/tmp/lucebox-hub/dflash/src/qwen35/layer_split_forward.h:10,
                 from /media/per/work/tmp/lucebox-hub/dflash/src/qwen35/layer_split_daemon.h:9,
                 from /media/per/work/tmp/lucebox-hub/dflash/src/qwen35/layer_split_daemon_loop.cpp:5:
/media/per/work/tmp/lucebox-hub/dflash/src/common/io_utils.h: In function ‘void dflash::common::stream_emit_fd(int, int32_t)’:
/media/per/work/tmp/lucebox-hub/dflash/src/common/io_utils.h:93:18: warning: ignoring return value of ‘ssize_t write(int, const void*, size_t)’ declared with attribute ‘warn_unused_result’ [-Wunused-result]
   93 |     (void)::write(stream_fd, &v, sizeof(v));
      |           ~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~
gmake[2]: *** [CMakeFiles/Makefile2:218: CMakeFiles/dflash_common.dir/all] Error 2
gmake[1]: *** [CMakeFiles/Makefile2:790: CMakeFiles/test_dflash.dir/rule] Error 2
gmake: *** [Makefile:426: test_dflash] Error 2

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions