Firstly, thank you for your work on making a clean model for medical image generation
I have problems in trying to insert my 5 channel cell images in the model.
My problem shows:
RuntimeError: Given groups=1, weight of size [128, 5, 3, 3], expected input[1, 3 , 256, 256] to have 5 channels, but got 3 channels instead
where it starts at:
noise_pred = model(sample=noisy_images, timestep=timesteps, return_dict=False)[0]
in training.py file
I have inserted and checked noisy images shape as torch.size(5, 256, 256) before it enters the model code, and timestep as torch.size(5).
same problem arises when I use 3 channel images, but the problem arises are:
RuntimeError: Given groups=1, weight of size [128, 3, 3, 3], expected input[1, 2 , 256, 256] to have 3 channels, but got 2 channels instead
Whenever it goes into to the model, the size of the channel changes for some reason.
For reference, the model for 5 channel is:
model = diffusers.UNet2DModel(
sample_size=config.image_size, # the target image resolution
in_channels=5, # the number of input channels, 3 for RGB images
out_channels=5, # the number of output channels
layers_per_block=2, # how many ResNet layers to use per UNet block
block_out_channels=(128, 128, 256, 256, 512, 512), # the number of output channels for each UNet block
down_block_types=(
"DownBlock2D", # a regular ResNet downsampling block
"DownBlock2D",
"DownBlock2D",
"DownBlock2D",
"AttnDownBlock2D", # a ResNet downsampling block with spatial self-attention
"DownBlock2D",
),
up_block_types=(
"UpBlock2D", # a regular ResNet upsampling block
"AttnUpBlock2D", # a ResNet upsampling block with spatial self-attention
"UpBlock2D",
"UpBlock2D",
"UpBlock2D",
"UpBlock2D"
),
)
Thank you so much if you can help out on that.
Firstly, thank you for your work on making a clean model for medical image generation
I have problems in trying to insert my 5 channel cell images in the model.
My problem shows:
RuntimeError: Given groups=1, weight of size [128, 5, 3, 3], expected input[1, 3 , 256, 256] to have 5 channels, but got 3 channels instead
where it starts at:
noise_pred = model(sample=noisy_images, timestep=timesteps, return_dict=False)[0]
in training.py file
I have inserted and checked noisy images shape as torch.size(5, 256, 256) before it enters the model code, and timestep as torch.size(5).
same problem arises when I use 3 channel images, but the problem arises are:
RuntimeError: Given groups=1, weight of size [128, 3, 3, 3], expected input[1, 2 , 256, 256] to have 3 channels, but got 2 channels instead
Whenever it goes into to the model, the size of the channel changes for some reason.
For reference, the model for 5 channel is:
Thank you so much if you can help out on that.