Update sentence-transformers requirement from ~=5.5.1 to ~=5.6.0 by dependabot[bot] · Pull Request #336 · DefinetlyNotAI/Logicytics

dependabot · 2026-06-17T05:12:52Z

Updates the requirements on sentence-transformers to permit the latest version.

Release notes

Sourced from sentence-transformers's releases.

v5.6.0 - Fixes for Causal LM Rerankers, Hard-Negative Mining, and More

This minor version is a correctness- and robustness-focused release. It fixes a silent scoring bug for causal-LM rerankers, corrects several hard-negative mining and GIST loss edge cases, restores TSDAE on transformers v5, and adds Apple Silicon (MPS) support for the cached losses.

The headline fix affects chat-template models that read the final token position, i.e. causal-LM rerankers (like Qwen3-Reranker) and last-token-pooling embedders: when an over-long input was truncated, the chat template's trailing suffix (e.g. the assistant prefill the model scores from) was silently dropped, producing wrong scores with no error. There's also a forward-looking deprecation: loading local custom code without trust_remote_code=True now warns, and will require it from v6.0.

Install this version with
# Training + Inference
pip install sentence-transformers[train]==5.6.0
Inference only, use one of:
pip install sentence-transformers==5.6.0
pip install sentence-transformers[onnx-gpu]==5.6.0
pip install sentence-transformers[onnx]==5.6.0
pip install sentence-transformers[openvino]==5.6.0
Multimodal dependencies (optional):
pip install sentence-transformers[image]==5.6.0
pip install sentence-transformers[audio]==5.6.0
pip install sentence-transformers[video]==5.6.0
Or combine as needed:
pip install sentence-transformers[train,onnx,image]==5.6.0
Fixed silently wrong scores when truncation drops chat-template suffixes (#3787)

Chat-template models render the full conversation to a flat string before tokenizing, so when the rendered input is longer than the tokenizer's model_max_length, the tokenizer truncates it from the right and drops the template's trailing suffix: the fixed tokens a template appends after the content, e.g. a prompt, instruction, [/INST], or a trailing EOS. For models that read the final token position, this silently corrupted the result:

causal-LM rerankers (e.g. Qwen/Qwen3-Reranker-0.6B) score a pair from the last token's yes/no logits, and

last-token-pooling embedders read the final hidden state.

When the suffix was truncated away, that final position landed mid-document instead of after the prefill, so the score or embedding came from the wrong place.

Transformer.preprocess now detects when truncation drops the suffix and splices it back onto the tail of each truncated row. Because the fix lives in the shared base Transformer, it applies across SentenceTransformer, CrossEncoder, and SparseEncoder. It's enabled by default and saved to the model configuration. Pass processing_kwargs={"chat_template": {"restore_suffix": False}} to opt back into raw truncation.

Hard-negative mining and GIST loss correctness (#3821, #3817, #3816)

A trio of correctness and scalability fixes for hard-negative mining and the GIST losses:

Sign-independent relative margin: mine_hard_negatives(relative_margin=...) and the margin_strategy="relative" branch of GISTEmbedLoss / CachedGISTEmbedLoss used a multiplicative threshold (positive * (1 - margin)) that only behaves correctly when the positive-pair similarity is positive. When that similarity was negative, the threshold moved the wrong way and let through false negatives: candidates more similar to the anchor than the true positive. The threshold is now positive - |positive| * margin, identical to before for positive scores but correct for negative ones.

Distributed positive masking in the GIST losses: with gather_across_devices=True and a non-zero margin, the false-negative suppression mask protected the wrong columns on ranks beyond the first (it ignored the per-rank offset into the gathered batch), which set the true positive's logit to -inf and produced a +inf loss. The mask now accounts for the cross-rank offset, so multi-GPU GIST training stays finite.

Memory-bounded mining without FAISS: mine_hard_negatives(use_faiss=False) (the default) materialized the full (queries × corpus) similarity matrix at once, which could OOM on large corpora. It now batches over the query axis (controlled by faiss_batch_size, default 16384), bounding peak memory while producing identical results.

TSDAE weight tying restored on transformers v5 (#3781)

transformers v5 removed the private PreTrainedModel._tie_encoder_decoder_weights helper that DenoisingAutoEncoderLoss (TSDAE) used to tie its separate encoder and decoder. As a stopgap, v5.5 raised a RuntimeError for the default tie_encoder_decoder=True on transformers >= 5.0.0, effectively breaking TSDAE there unless you pinned an older transformers or disabled tying. TSDAE now ships its own tying routine that shares storage between encoder and decoder, so it works on both transformers <5 and >=5 with the default settings.

Deprecation: loading local custom code without trust_remote_code (#3807)

... (truncated)

Commits

9c73df3 Release v5.6.0
222b052 [fix] Don't override device_map placement with the device argument (#3823)
a38a6bf Fix causal LM reranker scoring when max_length truncates chat-template suffix...
1812103 [fix] Make relative margin sign-independent in mining and GIST losses (#3821)
ae1acc3 Warn when loading local custom code without trust_remote_code (#3807)
429cf5d [fix] Support MPS in the cached losses' RandContext (#3812)
77fdbff [fix] fix MPS errors (#3818)
bfba988 [fix] Fix positive masking in GIST losses with multi-GPU + gather_across_devi...
29e382b [docs] Fix doc build problems (part 1) (#3811)
d16e6bf [fix] Avoid materializing the full similarity matrix in mine_hard_negatives w...
Additional commits viewable in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

@dependabot rebase will rebase this PR
@dependabot recreate will recreate this PR, overwriting any edits that have been made to it
@dependabot show <dependency name> ignore conditions will show all of the ignore conditions of the specified dependency
@dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
@dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
@dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Updates the requirements on [sentence-transformers](https://github.com/huggingface/sentence-transformers) to permit the latest version. - [Release notes](https://github.com/huggingface/sentence-transformers/releases) - [Commits](huggingface/sentence-transformers@v5.5.1...v5.6.0) --- updated-dependencies: - dependency-name: sentence-transformers dependency-version: 5.6.0 dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com>

github-actions · 2026-06-17T05:13:12Z

Hi! Thanks for contributing for the first time to Logicytics 🤗 We hope it goes as smoothly as possible and appreciate your valuable contribution.

dependabot Bot added the type/Dependencies Pull requests that update a dependency file label Jun 17, 2026

dependabot Bot requested a review from DefinetlyNotAI as a code owner June 17, 2026 05:12

dependabot Bot added the type/Dependencies Pull requests that update a dependency file label Jun 17, 2026

github-project-automation Bot added this to Issue Board Jun 17, 2026

github-project-automation Bot moved this to Todo in Issue Board Jun 17, 2026

pull-request-size Bot added the size/XS Extra Small size pr label Jun 17, 2026

DefinetlyNotAI self-assigned this Jun 17, 2026

DefinetlyNotAI approved these changes Jun 17, 2026

View reviewed changes

github-project-automation Bot moved this from Todo to Waiting on Review in Issue Board Jun 17, 2026

DefinetlyNotAI merged commit 87b7eb8 into main Jun 17, 2026
9 checks passed

github-project-automation Bot moved this from Waiting on Review to Done in Issue Board Jun 17, 2026

DefinetlyNotAI deleted the dependabot/pip/sentence-transformers-approx-eq-5.6.0 branch June 17, 2026 07:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update sentence-transformers requirement from ~=5.5.1 to ~=5.6.0#336

Update sentence-transformers requirement from ~=5.5.1 to ~=5.6.0#336
DefinetlyNotAI merged 1 commit into
mainfrom
dependabot/pip/sentence-transformers-approx-eq-5.6.0

dependabot Bot commented on behalf of github Jun 17, 2026

Uh oh!

github-actions Bot commented Jun 17, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

dependabot Bot commented on behalf of github Jun 17, 2026

v5.6.0 - Fixes for Causal LM Rerankers, Hard-Negative Mining, and More

Fixed silently wrong scores when truncation drops chat-template suffixes (#3787)

Hard-negative mining and GIST loss correctness (#3821, #3817, #3816)

TSDAE weight tying restored on transformers v5 (#3781)

Deprecation: loading local custom code without trust_remote_code (#3807)

Uh oh!

github-actions Bot commented Jun 17, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

TSDAE weight tying restored on `transformers` v5 (#3781)

Deprecation: loading local custom code without `trust_remote_code` (#3807)