attn_loss=0

I also couldn't do 
```
wasserstein_ext = torch.utils.cpp_extension.load_inline("wasserstein", cpp_sources="", cuda_sources=cuda_source,
                                                    extra_cuda_cflags=["--expt-relaxed-constexpr"]   )
```
so I just commented it out and used the cpu version below in the method sinkstep. However, for some reason during training, the attn_loss is always zero and the results are very bad. how to resolve this issue?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

attn_loss=0 #6

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

attn_loss=0 #6

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions