In my understanding, the layout for each class needs a specific decoder, right? So it is a binary classification problem for each class?