Skip to content

Gradient Checkpointing for OpenCLIP should be optional #36

@apolinario

Description

@apolinario

I know hardcoding it came from me but while Gradient Checkpointing makes things faster and use less VRAM so very useful on some use-cases, but can break things on A100 and also break cutn_batches on most text-to-image implementations, so ideally it should be optional for the user

More broadly we should think on how to load options that pertain to particular loaders/modules/perceptors while not breaking the overall mocking logics

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions