Refactor classification example to avoid hardcoded label token IDs #497

Ashish-Patnaik · 2026-01-09T13:56:52Z

This PR improves the robustness of "examples/classification.py" by removing hardcoded
token IDs for the classification labels.

Fixes Issue #496

Changes

Dynamically computes the token IDs for "Yes" and "No" using the tokenizer instead
of relying on fixed numeric IDs.
Adds a small validation check to ensure each label maps to exactly one token,
preventing silent errors if tokenization changes.
Fixes minor typos in the prompt template and comments:
- "grammaticaly" = "grammatically"
- "respectivelly" = "respectively"

Why?
Hardcoding token IDs makes the example fragile if the tokenizer vocabulary changes
or if the script is reused with a different model variant. This change keeps the
example functionally identical while making it safer and easier to maintain.

google-cla · 2026-01-09T13:56:58Z

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

Refactor classification example to avoid hardcoded label token IDs

033a936

Ashish-Patnaik force-pushed the new branch from 2b177ff to 033a936 Compare January 9, 2026 15:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor classification example to avoid hardcoded label token IDs #497

Refactor classification example to avoid hardcoded label token IDs #497

Uh oh!

Ashish-Patnaik commented Jan 9, 2026 •

edited

Loading

Uh oh!

google-cla bot commented Jan 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Refactor classification example to avoid hardcoded label token IDs #497

Are you sure you want to change the base?

Refactor classification example to avoid hardcoded label token IDs #497

Uh oh!

Conversation

Ashish-Patnaik commented Jan 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

google-cla bot commented Jan 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Ashish-Patnaik commented Jan 9, 2026 •

edited

Loading