The ACDC dataset now has two folders: training and testing.

After using the split from #6 to divide the training folder into 3 parts, I reproduced your result:

But when I tested the model on the testing folder (patients 101 to 150), the result was low:

Were all the results on the ACDC leaderboard evaluated on the testing folder, or on the training folder as in your paper?
What's more, in my repo https://github.com/kingo233/FCT-Pytorch, I trained on the training folder and got a Dice score of 90 on the testing folder. But I can't improve it any further. Is this the upper limit of FCT?