The Problem of Segmentation Tasks #17

@moonboy12138

Great work! I'd like to know how to apply ChannelViT to image segmentation tasks. Because ChannelViT embeds each channel separately, every channel gets its own set of spatial tokens. For example, given an input of shape (B, 3, 64, 64) with a patch size of 16, it produces a token sequence of shape (B, 3*8*8), whereas a typical ViT would produce (B, 8*8), which can simply be reshaped into a feature map and fed into a decoder. Since the sequence produced by ChannelViT contains per-channel tokens, how should I pass it to a decoder? Looking forward to your reply!
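To make the question concrete, here is a minimal sketch of one possible way to turn per-channel tokens into a single spatial feature map. It is not the ChannelViT authors' method. It assumes the tokens are ordered channel-major (all patches of channel 0, then channel 1, and so on), that each token carries an embedding dimension D, and that an optional class token sits at the front of the sequence. The function name `channel_tokens_to_feature_map` and its parameters are hypothetical, and NumPy is used only to illustrate the reshaping; the same index logic would apply to framework tensors.

```python
import numpy as np


def channel_tokens_to_feature_map(tokens, num_channels, grid_size,
                                  has_cls=False, reduce="mean"):
    """Fold per-channel tokens into a decoder-style feature map.

    tokens: array of shape (B, [1 +] C*h*w, D).
    ASSUMPTION: tokens are ordered channel-major, i.e. index = c*h*w + i*w + j.
    Returns (B, D, h, w) for reduce="mean" or (B, C*D, h, w) for "concat".
    """
    if has_cls:
        # Drop the leading class token (assumed to be at position 0).
        tokens = tokens[:, 1:, :]

    b, n, d = tokens.shape
    h, w = grid_size
    if n != num_channels * h * w:
        raise ValueError(
            f"expected {num_channels * h * w} tokens, got {n}"
        )

    # (B, C*h*w, D) -> (B, C, h, w, D)
    x = tokens.reshape(b, num_channels, h, w, d)

    if reduce == "mean":
        # Average the channel tokens that share a spatial location.
        x = x.mean(axis=1)                 # (B, h, w, D)
        return x.transpose(0, 3, 1, 2)     # (B, D, h, w)

    if reduce == "concat":
        # Stack channel embeddings along the feature axis instead.
        x = x.transpose(0, 1, 4, 2, 3)     # (B, C, D, h, w)
        return x.reshape(b, num_channels * d, h, w)

    raise ValueError(f"unknown reduce mode: {reduce!r}")


# Example: 3 channels on an 8x8 token grid, embedding dimension 4.
demo_tokens = np.arange(2 * 3 * 8 * 8 * 4, dtype=np.float64).reshape(2, 192, 4)
mean_map = channel_tokens_to_feature_map(demo_tokens, 3, (8, 8))
concat_map = channel_tokens_to_feature_map(demo_tokens, 3, (8, 8), reduce="concat")
print(mean_map.shape, concat_map.shape)
```

Averaging keeps the feature map the same size as a standard ViT's, so an existing decoder could be reused unchanged; concatenation preserves per-channel information at the cost of a wider input to the decoder. Which of the two (or a learned fusion) works better for segmentation is an open choice here, not something the source confirms.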
