Draft: Initial draft implementation of CFG for LLM by ottonemo · Pull Request #996 · skorch-dev/skorch

ottonemo · 2023-07-19T15:11:07Z

Based on the paper

    Sanchez, Guillaume, et al.
    "Stay on topic with Classifier-Free Guidance."
    arXiv preprint arXiv:2306.17806 (2023).

this is a draft implementation of classifier-free guidance (CFG).
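For reference, the guidance rule from the paper can be written as below. Here $c$ is the prompt, $w_i$ the token being scored, $w_{j<i}$ the preceding tokens, and $\gamma$ the guidance strength. This is a restatement for readers of this thread, not a description of the code in this PR.

$$
\log \hat{P}(w_i \mid w_{j<i}, c) = \log P(w_i \mid w_{j<i}) + \gamma \left( \log P(w_i \mid w_{j<i}, c) - \log P(w_i \mid w_{j<i}) \right)
$$

With $\gamma = 1$ the unconditional terms cancel and the ordinary conditional log-probability is recovered. Values above 1 push the scores further in the direction the prompt suggests.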

This is simply for sharing internally and might very well be completely wrong. It is debatable if we should expose such a feature as a flag to the network or make it a separate classifier instance (or a mixin). In the past we were very much against special (potentially short-lived) feature flags and it was much nicer to have this implemented as an addon/callback. We might need to do something similar here as well.

Open tasks:

- evaluate existing examples
- write explicit test cases


BenjaminBossan · 2023-07-19T15:32:28Z

The paper in question is this one:

https://arxiv.org/abs/2306.17806

Note that this method should have a greater effect the longer the labels are.

Some random comments:

- At the moment, two forward passes are needed. Shouldn't we be able to pre-compute (or cache) `P_wi_wji`, since the labels are always the same and known from the start?
- How about exposing `cfg_gamma` instead of `use_cfg`? If it is 1 (or `None`), don't use CFG; otherwise apply that gamma instead of basically hard-coding it to 1.5.
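To make the two suggestions concrete, here is a minimal sketch in plain Python. All names in it (`log_softmax`, `apply_cfg`, `CachedUnconditional`, `score_fn`) are hypothetical and not taken from the PR. It assumes that the unconditional scores depend only on the label tokens, which is what would make caching them per label valid.

```python
import math
from typing import Callable, Dict, List, Optional, Sequence, Tuple


def log_softmax(logits: Sequence[float]) -> List[float]:
    """Numerically stable log-softmax over a flat list of logits."""
    m = max(logits)
    lse = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - lse for x in logits]


def apply_cfg(
    cond_logits: Sequence[float],
    uncond_logits: Sequence[float],
    cfg_gamma: Optional[float] = None,
) -> List[float]:
    """Return log-probabilities, optionally adjusted by CFG.

    cfg_gamma of None or 1 disables guidance, so the unconditional
    logits are not used at all in that case.
    """
    cond = log_softmax(cond_logits)
    if cfg_gamma is None or cfg_gamma == 1:
        return cond
    uncond = log_softmax(uncond_logits)
    mixed = [u + cfg_gamma * (c - u) for c, u in zip(cond, uncond)]
    # Renormalize so the result is again a valid log-distribution.
    return log_softmax(mixed)


class CachedUnconditional:
    """Cache unconditional scores per label (hypothetical helper).

    score_fn stands in for the forward pass without the prompt; it
    takes a tuple of label token ids and returns whatever the model
    would return for them.
    """

    def __init__(self, score_fn: Callable[[Tuple[int, ...]], List[float]]):
        self._score_fn = score_fn
        self._cache: Dict[Tuple[int, ...], List[float]] = {}

    def __call__(self, label_token_ids: Sequence[int]) -> List[float]:
        key = tuple(label_token_ids)
        if key not in self._cache:
            self._cache[key] = self._score_fn(key)
        return self._cache[key]
```

With a cache like this, the second forward pass would run once per label rather than once per sample, and with `cfg_gamma=None` it would not run at all.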

> It is debatable if we should expose such a feature as a flag to the network or make it a separate classifier instance (or a mixin). In the past we were very much against special (potentially short-lived) feature flags and it was much nicer to have this implemented as an addon/callback.

If this method works really well, I can see it being added explicitly. Alternatively, we could have a callbacks equivalent for logits processors, with _LogitsRecorder being the default.
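As a rough illustration of the second option, the sketch below chains callables that each take the token ids seen so far plus the current scores and return new scores. The class and function names are made up for this example and do not reflect how `_LogitsRecorder` is implemented in skorch. For brevity it assumes the scores are already log-probabilities and skips renormalization.

```python
from typing import Callable, List, Sequence

# A processor maps (input_ids, scores) -> scores.
Processor = Callable[[Sequence[int], List[float]], List[float]]


class RecordingProcessor:
    """Default processor: records the scores and passes them through."""

    def __init__(self) -> None:
        self.recorded: List[List[float]] = []

    def __call__(self, input_ids: Sequence[int], scores: List[float]) -> List[float]:
        self.recorded.append(list(scores))
        return scores


class CFGProcessor:
    """Applies the CFG mixing rule; uncond_fn is a hypothetical stand-in
    for the forward pass without the prompt."""

    def __init__(self, uncond_fn: Callable[[Sequence[int]], List[float]], gamma: float) -> None:
        self.uncond_fn = uncond_fn
        self.gamma = gamma

    def __call__(self, input_ids: Sequence[int], scores: List[float]) -> List[float]:
        uncond = self.uncond_fn(input_ids)
        return [u + self.gamma * (c - u) for c, u in zip(scores, uncond)]


def run_processors(
    processors: Sequence[Processor], input_ids: Sequence[int], scores: List[float]
) -> List[float]:
    """Apply each processor in order, feeding the output of one to the next."""
    for processor in processors:
        scores = processor(input_ids, scores)
    return scores
```

Placing the recorder last in the list would record the guided scores, while placing it first would record the raw ones. That ordering is one of the design questions such an interface would have to settle.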


- `label_id` was misleading since it is actually a list of token ids related to a label and not a scalar value. Also, the general process of generating logits is not related to labels at all but rather just to tokens.
- `kwargs` was named to be similar to the transformers `generate` convention, but it is meant to be passed to `generate` and is therefore, in the context of `generate_logits`, a model input. This should help the reader distinguish between expected input (`token_ids`) and model input (`model_input`).

Use cfg_gamma instead of use_cfg boolean flag

1c34aca

- Makes it possible to set the gamma parameter
- Setting it to `None` disables the functionality completely

ottonemo force-pushed the feature/llm-classifier-free-guidance branch from 96de091 to 1c34aca on July 19, 2023 18:45

ottonemo wants to merge 3 commits into master from feature/llm-classifier-free-guidance
