You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Make the U55 annotation and lowering pipelines handle SDPA
aten._safe_softmax consistently by decomposing it instead of leaving it
in the graph.
Previously, transform_for_annotation_pipeline still used
skip_safe_softmax for U55, which left aten._safe_softmax in annotated
SDPA graphs and prevented delegation.
Add a regression test that verifies U55 SDPA graphs no longer contain
aten._safe_softmax after the annotation pipeline runs.
Also warn when aten._safe_softmax is decomposed as regular softmax in
the annotation pipeline, since this is only semantics-preserving when
no row is fully masked at runtime.
Remove the unstable softmax decomposition path and its remaining
references now that the Arm backend uses the stable decomposition path.
Update the related pipeline-config and softmax tests accordingly.
Signed-off-by: per.held@arm.com
Change-Id: I7a5147d5492974ead52ea92326352f7f4407bd67
"The stable softmax decomposition is now supported by all arm targets and will be made default in a future release. Overwrite the default config using `compile_spec.set_pass_pipeline_config(ArmPassPipelineConfig())` to use the stable algorithm and avoid this error."
32
-
)
33
-
defdisable_masked_softmax(self) ->None:
34
-
"""
35
-
.. warning::
36
-
37
-
The stable softmax decomposition is now supported by all arm targets and will be made default in a future release. Overwrite the default config using `compile_spec.set_pass_pipeline_config(ArmPassPipelineConfig())` to use the stable algorithm and avoid this error."
0 commit comments