fix(deps): update dependency transformers to v4.50.0 [security]#86
Closed
dreadnode-renovate-bot[bot] wants to merge 1 commit into
Closed
fix(deps): update dependency transformers to v4.50.0 [security]#86dreadnode-renovate-bot[bot] wants to merge 1 commit into
dreadnode-renovate-bot[bot] wants to merge 1 commit into
Conversation
| datasource | package | from | to | | ---------- | ------------ | ------ | ------ | | pypi | transformers | 4.49.0 | 4.50.0 |
Contributor
Author
Edited/Blocked NotificationRenovate will not automatically rebase this PR, because it does not recognize the last commit author and assumes somebody else may have edited the PR. You can manually request rebase by checking the rebase/retry box above. |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
This PR contains the following updates:
| Package | Type | Update | Change |
|
Generated Summary
No relevant changes.
This summary was generated with ❤️ by rigging
| transformers | extras | minor |
4.49.0->4.50.0|GitHub Vulnerability Alerts
CVE-2025-1194
A Regular Expression Denial of Service (ReDoS) vulnerability was identified in the huggingface/transformers library, specifically in the file
tokenization_gpt_neox_japanese.pyof the GPT-NeoX-Japanese model. The vulnerability occurs in the SubWordJapaneseTokenizer class, where regular expressions process specially crafted inputs. The issue stems from a regex exhibiting exponential complexity under certain conditions, leading to excessive backtracking. This can result in high CPU usage and potential application downtime, effectively creating a Denial of Service (DoS) scenario. The affected version is v4.48.1 (latest).Release Notes
huggingface/transformers (transformers)
v4.50.0Compare Source
Release v4.50.0
New Model Additions
Model-based releases
Starting with version v4.49.0, we have been doing model-based releases, additionally to our traditional, software-based monthly releases. These model-based releases provide a tag from which models may be installed.
Contrarily to our software-releases; these are not pushed to pypi and are kept on our GitHub. Each release has a tag attributed to it, such as:
v4.49.0-Gemma-3v4.49.0-AyaVisionEach new model release will always be based on the current state of the main branch at the time of its creation. This ensures that new models start with the latest features and fixes available.
For example, if two models—Gemma-3 and AyaVision—are released from main, and then a fix for gemma3 is merged, it will look something like this:
We strive to merge model specific fixes on their respective branches as fast as possible!
Gemma 3
Gemma 3 is heavily referenced in the following model-based release and we recommend reading these if you want all the information relative to that model.
The Gemma 3 model was proposed by Google. It is a vision-language model composed by a SigLIP vision encoder and a Gemma 2 language decoder linked by a multimodal linear projection.
It cuts an image into a fixed number of tokens same way as Siglip if the image does not exceed certain aspect ratio. For images that exceed the given aspect ratio, it crops the image into multiple smaller pacthes and concatenates them with the base image embedding.
One particularity is that the model uses bidirectional attention on all the image tokens. Also, the model interleaves sliding window local attention with full causal attention in the language backbone, where each sixth layer is a full causal attention layer.
Shield Gemma2
ShieldGemma 2 is built on Gemma 3, is a 4 billion (4B) parameter model that checks the safety of both synthetic and natural images against key categories to help you build robust datasets and models. With this addition to the Gemma family of models, researchers and developers can now easily minimize the risk of harmful content in their models across key areas of harm as defined below:
We recommend using ShieldGemma 2 as an input filter to vision language models, or as an output filter of image generation systems. To train a robust image safety model, we curated training datasets of natural and synthetic images and instruction-tuned Gemma 3 to demonstrate strong performance.
Aya Vision
AyaVision is heavily referenced in the following model-based release and we recommend reading these if you want all the information relative to that model.
The Aya Vision 8B and 32B models is a state-of-the-art multilingual multimodal models developed by Cohere For AI. They build on the Aya Expanse recipe to handle both visual and textual information without compromising on the strong multilingual textual performance of the original model.
Aya Vision 8B combines the
Siglip2-so400-384-14vision encoder with the Cohere CommandR-7B language model further post-trained with the Aya Expanse recipe, creating a powerful vision-language model capable of understanding images and generating text across 23 languages. Whereas, Aya Vision 32B uses Aya Expanse 32B as the language model.Key features of Aya Vision include:
Mistral 3.1
Mistral 3.1 is heavily referenced in the following model-based release and we recommend reading these if you want all the information relative to that model.
Building upon Mistral Small 3 (2501), Mistral Small 3.1 (2503) adds state-of-the-art vision understanding and enhances long context capabilities up to 128k tokens without compromising text performance. With 24 billion parameters, this model achieves top-tier capabilities in both text and vision tasks.
It is ideal for:
Smol VLM 2
SmolVLM-2 is heavily referenced in the following model-based release and we recommend reading these if you want all the information relative to that model.
SmolVLM2 is an adaptation of the Idefics3 model with two main differences:
SigLIP-2
SigLIP-2 is heavily referenced in the following model-based release and we recommend reading these if you want all the information relative to that model.
The SigLIP2 model was proposed in SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features by Michael Tschannen, Alexey Gritsenko, Xiao Wang, Muhammad Ferjad Naeem, Ibrahim Alabdulmohsin,
Nikhil Parthasarathy, Talfan Evans, Lucas Beyer, Ye Xia, Basil Mustafa, Olivier Hénaff, Jeremiah Harmsen,
Andreas Steiner and Xiaohua Zhai.
The model comes in two variants
transformers)Prompt Depth Anything
PromptDepthAnything is a high-resolution, accurate metric depth estimation model that leverages prompting, inspired by its success in vision-language (VLMs) and large language models (LLMs). Using iPhone LiDAR as a prompt, the model generates precise depth maps at up to 4K resolution, unlocking the potential of depth foundation models.
New tool: attention visualization
We add a new tool to
transformersto visualize the attention layout of a given model. It only requires a model ID as input, and will load the relevant tokenizer/model and display what the attention mask looks like. Some examples:Deprecating transformers.agents in favor of smolagents
We are deprecating
transformers.agentsin favour of thesmolagentslibrary. Read more about smolagents here.Quantization
We support adding custom quantization method by using the
@register_quantization_configand@register_quantizerdecorator:AMD is developing its in-house quantizer named Quark released under MIT license, which supports a broad range of quantization pre-processing, algorithms, dtypes and target hardware. You can now load a model quantized by quark library:
Torchao is augmented with
autoquantsupport, CPU-quantization, as well as newAOBaseConfigobject instances for more advanced configuration.Tensor Parallelism implementation changes
At loading time, the parallelization is now applied module-by-module, so that no memory overhead is required compared to what the final weight distribution will be!
Generation
This release includes two speed upgrades to
generate:do_sample=True;num_beams. The speedup is more visible on smaller models, wheremodel.forwarddoesn't dominate the total run time.CandidateGeneratorby @keyboardAnt, @jmamou, and @gauravjain14 in #35029Documentation
A significant redesign of our documentation has wrapped-up. The goal was to greatly simplify the
transformersdocumentation, making it much more easy to navigate. Let us know what you think!Notable repo maintenance
The research examples folder that was hosted in
transformersis no more. We have moved it out oftransformersand in the following repo: github.com/huggingface/transformers-research-projects/We have updated our flex attention support so as to have it be on-par with our Flash Attention 2 support.
More models support flex attention now thanks to @qubvel
First integration of hub kernels for deformable detr!
Bugfixes and improvements
EsmModelIntegrationTest::test_inference_bitsandbytesby @faaany in #36225LlavaForConditionalGenerationModelTest::test_configafter #36077 by @ydshieh in #36230/generationby @gante in #36235test_export_to_onnxby @gante in #36241test_fast_is_faster_than_slowby @ydshieh in #36240Speech2TextFeatureExtractorAPI. by @KarelVesely84 in #34638pt_tfequivalence tests by @gante in #36253test_from_pretrained_low_cpu_mem_usage_equalless flaky by @gante in #36255GenerationTesterMixininheritance is correct 🐛 🔫 by @gante in #36180mainby @ydshieh in #36375is_causalfail with compile by @Cyrilvallez in #36374benchmark.ymlby @ydshieh in #36402CandidateGeneratorby @keyboardAnt in #35029contents: writeby @ydshieh in #36445torch.distributed-compatibleDynamicCacheby @gante in #36373src/transformers/image_utils.pyby @hmellor in #36435hub_retryby @ydshieh in #36449TRUST_REMOTE_CODEforRealmRetrieverfor security by @ydshieh in #36511input_idspassed toPrefixConstrainedLogitsProcessoris zero by @HiDolen in #36489DataCollatorForLanguageModelingby @capemox in #36457HybridCache] disable automatic compilation by @gante in #36620make fix-copiesby @gante in #36664from_pretrainedby @Cyrilvallez in #36033metadevice by @gante in #36543gc.collect()if only 1 shard is used by @gante in #36721test_eager_matches_sdpa_inferenceby @gante in #36650generation_config, overwrite default values with the model's basegeneration_configby @gante in #36684TrainingArguments.torch_empty_cache_stepspost_init check by @pkuderov in #36734test_eager_matches_sdpa_inferenceby @gante in #36740is_decoderusage in PretrainedConfig documentation by @d-kleine in #36724tj-actions/changed-filesby @ydshieh in #36795dist": "loadfile"forpytestin CircleCI jobs by @ydshieh in [#36811](https://redirect.github.com/huggingface/transformers/issueConfiguration
📅 Schedule: Branch creation - "" (UTC), Automerge - At any time (no schedule defined).
🚦 Automerge: Enabled.
♻ Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.
🔕 Ignore: Close this PR and you won't be reminded about this update again.
This PR has been generated by Renovate Bot.