Commit 193b81e

bump limit to 128
1 parent beb4e98 commit 193b81e

1 file changed

Lines changed: 1 addition & 1 deletion

ggml/src/ggml-cuda/common.cuh

@@ -152,7 +152,7 @@ static int ggml_cuda_highest_compiled_arch(const int arch) {
 #define MATRIX_ROW_PADDING 512 // last row of quant. matrices is a multiple of this to avoid out-of-bounds memory accesses
 
 #define GGML_CUDA_MAX_STREAMS 8
-#define GGML_CUDA_MAX_GRAPHS 64
+#define GGML_CUDA_MAX_GRAPHS 128
 
 [[noreturn]]
 void ggml_cuda_error(const char * stmt, const char * func, const char * file, int line, const char * msg);
