When the `</s>` token is tokenized, it is not mapped to a single special token; instead, it is split into multiple subtokens.
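
To make the symptom concrete, here is a minimal, self-contained sketch of how this kind of split can happen. It does not use any real tokenizer library: `VOCAB`, `greedy_split`, and `tokenize` are hypothetical names made up for illustration, and the toy vocabulary is an assumption. The idea is only that a string like `</s>` stays whole when it is matched as a special token before ordinary subword splitting, and falls apart into pieces when it is not.

```python
import re

# Hypothetical toy vocabulary, not taken from any real tokenizer.
VOCAB = ["<", "/", "s", ">", "</", "hello"]


def greedy_split(text, vocab):
    """Split text into vocabulary pieces using greedy longest-match."""
    ordered = sorted(vocab, key=len, reverse=True)
    pieces = []
    i = 0
    while i < len(text):
        for piece in ordered:
            if text.startswith(piece, i):
                pieces.append(piece)
                i += len(piece)
                break
        else:
            # No vocabulary piece matched: keep the character on its own.
            pieces.append(text[i])
            i += 1
    return pieces


def tokenize(text, vocab, special_tokens=()):
    """Tokenize text, keeping any registered special tokens intact."""
    if not special_tokens:
        return greedy_split(text, vocab)
    # Split on special tokens first; the capture group keeps them in the output.
    pattern = "(" + "|".join(re.escape(t) for t in special_tokens) + ")"
    tokens = []
    for chunk in re.split(pattern, text):
        if not chunk:
            continue
        if chunk in special_tokens:
            tokens.append(chunk)
        else:
            tokens.extend(greedy_split(chunk, vocab))
    return tokens


# Without special-token handling, "</s>" is broken into subword pieces.
without_special = tokenize("hello</s>", VOCAB)
# With "</s>" registered as special, it survives as one token.
with_special = tokenize("hello</s>", VOCAB, special_tokens=("</s>",))

print(without_special)
print(with_special)
```

In this toy setup the first call yields several pieces for `</s>` while the second keeps it as one token. Whether the actual tokenizer behaves this way for the same reason is an assumption; the sketch only illustrates the general mechanism.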