Hi there, first of all, thanks for the great work. I came across some strange loss behavior during training, and I just want to make sure my training procedure is the same as the one in your paper. Thanks in advance.