Is your feature request related to a problem? Please describe.
Bitsandbytes now has basic support for the Apple MPS backend, as far as I can tell from bitsandbytes-foundation/bitsandbytes#1818 and bitsandbytes-foundation/bitsandbytes#1875.
The issue is that diffusers does not let me use this quantization on Apple hardware, because of the error `No GPU found. A GPU is needed for quantization.` raised by these lines in `src/diffusers/quantizers/bitsandbytes/bnb_quantizer.py` (lines 64 to 65 at f2be8bd):

```python
if not (torch.cuda.is_available() or torch.xpu.is_available()):
    raise RuntimeError("No GPU found. A GPU is needed for quantization.")
```
Describe the solution you'd like.
I modified the above lines to

```python
if not (torch.cuda.is_available() or torch.xpu.is_available() or torch.mps.is_available()):
    raise RuntimeError("No GPU found. A GPU is needed for quantization.")
```
and tested the change with the quantized version of FLUX.2-dev as described in https://github.com/black-forest-labs/flux2/blob/main/docs/flux2_dev_hf.md#4-bit-transformer-and-4-bit-text-encoder-20g-of-vram, and everything worked fine.
I am wondering whether this change could be adopted in the repository. I can open a PR if there is interest.
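For context, this is the kind of loading path the change would unblock on Apple silicon. Below is a minimal sketch using the diffusers `BitsAndBytesConfig` API; it uses the FLUX.1-dev transformer purely to illustrate the API (the FLUX.2 class names may differ), and actually running it requires network access plus a Hugging Face token:

```python
import torch
from diffusers import BitsAndBytesConfig, FluxTransformer2DModel

# 4-bit NF4 quantization via bitsandbytes; with the relaxed device check,
# this path would also be reachable on machines where only MPS is available.
quant_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

transformer = FluxTransformer2DModel.from_pretrained(
    "black-forest-labs/FLUX.1-dev",  # illustrative model id, not FLUX.2
    subfolder="transformer",
    quantization_config=quant_config,
    torch_dtype=torch.bfloat16,
)
```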
Describe alternatives you've considered.
None.
Additional context.
None.