Configuration Guide

Configuration Discovery

AutoLoader loads configuration in the following priority (highest first):

Environment variable — FASTSAFETENSORS_CONFIG=/path/to/config.json
Default path — ./fastsafetensors.json in the working directory (if it exists)
Built-in defaults — LoaderConfig() dataclass defaults

All fields are optional. Unspecified fields fall back to built-in defaults.

Default Configuration

When no config file is found, AutoLoader uses these defaults:

{
  "loader": "base",
  "framework": "pytorch",
  "parallel": {
    "use_pipeline": false
  },
  "debug": {
    "debug_log": false,
    "set_numa": true,
    "disable_cache": true
  }
}

The base loader extension defaults to copier_type: "gds" (GPU Direct Storage).

queue_size Semantics

`queue_size`	Mode	GPU Memory	Behavior
`-1`	Fully serial	1 batch	`copy_files → broadcast → copy_files → ...`
`0`	Unbuffered pipeline	Up to 2 batches	1 batch copying + 1 batch broadcasting concurrently
`>0`	Buffered pipeline	Up to `queue_size+1` batches	Producer fills queue while consumer broadcasts

use_pipeline: false forces queue_size=-1 (serial, minimal GPU memory).

Configuration Examples

1. Minimal — All Defaults (no config file needed)

from fastsafetensors import SingleGroup, AutoLoader

pg = SingleGroup()
loader = AutoLoader(pg, files, device="cuda:0")
for key, tensor in loader.iterate_weights():
    process(key, tensor)
loader.close()

No config file. Uses loader="base", gds, serial mode.

2. Base Loader with GDS

{
  "loader": "base",
  "base": {
    "copier_type": "gds"
  }
}

Enables GPU Direct Storage for NVMe-to-GPU transfers, bypassing host CPU/memory.

3. Base Loader with Pipeline Mode

{
  "parallel": {
    "use_pipeline": true,
    "max_concurrent_producers": 1,
    "queue_size": 0,
    "use_tqdm_on_load": true
  }
}

Overlaps copy_files with broadcast for higher throughput.

4. 3FS Loader

{
  "loader": "3fs",
  "3fs": {
    "mount_point": "/mnt/3fs",
    "entries": 64,
    "io_depth": 0,
    "buffer_size": 67108864
  }
}

Uses ThreeFSLoader with 3FS USRBIO backend.

5. Full Reference

{
  "loader": "base",
  "framework": "pytorch",
  "base": {
    "copier_type": "gds",
    "bbuf_size_kb": 16384,
    "max_threads": 16
  },
  "3fs": {
    "mount_point": "/mnt/3fs",
    "entries": 64,
    "io_depth": 0,
    "buffer_size": 67108864
  },
  "parallel": {
    "use_pipeline": false,
    "max_concurrent_producers": 1,
    "queue_size": 0,
    "use_tqdm_on_load": true
  },
  "debug": {
    "debug_log": false,
    "set_numa": true,
    "disable_cache": true
  }
}

Each loader type has its own extension section (e.g., base, 3fs). Adding a new loader only requires a new section — no changes to config.py.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Configuration Guide

Configuration Discovery

Default Configuration

queue_size Semantics

Configuration Examples

1. Minimal — All Defaults (no config file needed)

2. Base Loader with GDS

3. Base Loader with Pipeline Mode

4. 3FS Loader

5. Full Reference

FilesExpand file tree

configuration.md

Latest commit

History

configuration.md

File metadata and controls

Configuration Guide

Configuration Discovery

Default Configuration

queue_size Semantics

Configuration Examples

1. Minimal — All Defaults (no config file needed)

2. Base Loader with GDS

3. Base Loader with Pipeline Mode

4. 3FS Loader

5. Full Reference