feat: add initial prompt (prompt_ids) support for Whisper generation #1540

Open

jhlee111 wants to merge 1 commit into huggingface:main from jhlee111:feat/whisper-prompt-ids

Conversation

@jhlee111

Closes #923
Closes #1028

Summary

Add prompt_ids support to WhisperForConditionalGeneration.generate(), enabling initial prompt conditioning for Whisper transcription. This is a long-requested feature (both issues are from 2024) that matches the behavior of the Python transformers library.

What this does

When prompt_ids (an array of token IDs, typically starting with <|startofprev|>) is provided via the generation config, it is prepended to init_tokens, following the Whisper training format:

[<|startofprev|>, ...prompt_text..., <|startoftranscript|>, <|lang|>, <|task|>, ...]

After generation, the prompt tokens are stripped from the output sequences so they don't appear in the transcription results.
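
Internally, the change boils down to two steps around the existing generation call. A simplified sketch of the logic (condensed from the description above, not the exact diff; variable names other than init_tokens are illustrative):

// Before generation: prepend the prompt to the forced decoder tokens
if (prompt_ids) {
  init_tokens = [...prompt_ids, ...init_tokens];
}

// ...generation runs as usual...

// After generation: drop the prompt tokens from each output sequence
const num_prompt_tokens = prompt_ids?.length ?? 0;
const stripped_sequences = sequences.map((seq) => seq.slice(num_prompt_tokens));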

Usage

// Encode prompt (e.g., domain-specific terms)
const prompt_ids = tokenizer.encode("<|startofprev|> " + text, { add_special_tokens: false });

// Pass to pipeline or model.generate()
const output = await model.generate({
  inputs: features,
  prompt_ids,
  language: "en",
  return_timestamps: true,
});
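
The same option should also work through the speech-recognition pipeline, per the comment above about passing it "to pipeline or model.generate()". A sketch of that path (the package import and model checkpoint are illustrative, not part of this PR):

// Pipeline usage sketch; the checkpoint name is just an example
import { pipeline } from "@huggingface/transformers";

const transcriber = await pipeline("automatic-speech-recognition", "onnx-community/whisper-tiny");
const result = await transcriber(audio, {
  prompt_ids,
  language: "en",
  return_timestamps: true,
});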

Changes

  • packages/transformers/src/models/whisper/modeling_whisper.js — 1 file, ~20 lines added
    • Prepend prompt_ids to init_tokens when provided
    • Strip prompt tokens from output sequences after generation

No breaking changes — when prompt_ids is not provided, behavior is identical to before.

Checklist

  • Build passes (pnpm build)
  • Prettier formatting passes (pnpm format:check)
  • No breaking changes to existing functionality
  • Tests — no existing Whisper generation tests found in the repo; happy to add one if desired (see the sketch below)
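
One possible shape for such a test, as a rough sketch (assumes a Jest-style runner with tokenizer, model, and features prepared in setup; none of this code is in the PR):

// Verify that prompt tokens do not leak into the decoded output
it("strips prompt_ids from generated sequences", async () => {
  const prompt_ids = tokenizer.encode("<|startofprev|> Hugging Face", { add_special_tokens: false });
  const output = await model.generate({ inputs: features, prompt_ids });
  const [decoded] = tokenizer.batch_decode(output, { skip_special_tokens: false });
  expect(decoded).not.toContain("<|startofprev|>");
});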

Commit message

Implement prompt_ids handling in WhisperForConditionalGeneration.generate() to support initial prompt conditioning, matching the Python transformers library behavior.

When prompt_ids is provided via generation config, it is prepended to
init_tokens following the Whisper training format:
[<|startofprev|>, ...prompt_text..., <|startoftranscript|>, <|lang|>, <|task|>, ...]

The prompt tokens are stripped from output sequences after generation
to prevent them from appearing in transcription results.

Closes huggingface#923
Closes huggingface#1028
