Small fix in inference cli: Do final multiplication in Pytorch instead by sognetic · Pull Request #578 · numz/ComfyUI-SeedVR2_VideoUpscaler

sognetic · 2026-05-03T11:10:02Z

Hi, first of all: Thanks for the nice software and especially the great docs, running the model is really a breeze with this tooling.

I've run this in standalone mode on a RTX 5090 server node to process a >1h video files in ~10 minute chunks and noticed noticeable slowdown (more than 10 minutes) after each chunk, with only a single core out of 192 actually doing anything. I think the culprit is doing this final transformation in numpy instead of e.g. pytorch, multiplying first in pytorch and then casting to numpy removed the bottleneck. The clamp is just to make the transformation a bit more obvious wrt. value range.
I've only tested this with a video file in a single-GPU setup on that specific machine and there might be ways to do this better (e.g. multiplying entirely on the GPU) but I think this already solves the issue sufficiently.
Let me know what you think and thanks again for the great project!

Do multiplication in Pytorch instead

09cea9f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Small fix in inference cli: Do final multiplication in Pytorch instead#578

Small fix in inference cli: Do final multiplication in Pytorch instead#578
sognetic wants to merge 1 commit into
numz:mainfrom
sognetic:main

sognetic commented May 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

sognetic commented May 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant