Thanks your opened source code,I use your train code and dadaset, during train I get the train loss is 'nan',I want to know this reason,thank you