:bug: Fix pad token id resolution in padding function by gkumbhat · Pull Request #512 · foundation-model-stack/foundation-model-stack

gkumbhat · 2026-03-03T22:31:34Z

Changes

Currently, pad_input_ids does not take the model's padding token id into account and assumes it is 0 for all users. That is not always the case: for mistral-3 models, for example, the padding token id is actually 11. The mismatch can cause very confusing problems. This PR fixes it by accepting an optional pad_token_id parameter and using that value instead of building the padding with torch.zeros.

Signed-off-by: Gaurav-Kumbhat <Gaurav.Kumbhat@ibm.com>
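
The idea behind the change can be illustrated with a minimal sketch. This is not the code from the PR: `pad_batch` is a hypothetical helper, plain Python lists stand in for tensors so the example runs without dependencies, left padding is assumed, and falling back to 0 when no id is given is an assumption made to mirror the old behaviour.

```python
from typing import List, Optional


def pad_batch(
    sequences: List[List[int]],
    pad_token_id: Optional[int] = None,
) -> List[List[int]]:
    """Left-pad every sequence to the length of the longest one.

    Hypothetical helper for illustration only; not part of fms.
    """
    # Assumption: keep 0 as the fallback so existing callers see no change.
    fill = 0 if pad_token_id is None else pad_token_id
    max_len = max(len(seq) for seq in sequences)
    # Prepend as many pad tokens as each sequence is short of max_len.
    return [[fill] * (max_len - len(seq)) + seq for seq in sequences]


batch = [[5, 6, 7], [8]]
default_padded = pad_batch(batch)                   # pads with 0
mistral_padded = pad_batch(batch, pad_token_id=11)  # pads with 11
```

With a hard-coded 0, the second sequence would be padded with a token the model may treat as ordinary vocabulary. Passing the model's own id keeps the padded positions consistent with what the tokenizer and model expect.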

Ssukriti · 2026-04-28T22:30:50Z

    use_cache: bool = False,
    contiguous_cache: bool = False,
    eos_token_id: Optional[int] = None,
+    pad_token_id: Optional[int] = None,


Where is this used in generate, @gkumbhat?

Ssukriti · 2026-04-28T22:31:05Z

Can you update the PR, @gkumbhat?

gkumbhat · 2026-04-28T23:05:53Z

@Ssukriti yep. I'll update the PR

gkumbhat added 2 commits March 3, 2026 16:28

🐛 Fix pad token id resolution in padding function

c593a95

Signed-off-by: Gaurav-Kumbhat <Gaurav.Kumbhat@ibm.com>

✨ Add pad token id to generate function

c48ce21

Signed-off-by: Gaurav-Kumbhat <Gaurav.Kumbhat@ibm.com>

gkumbhat marked this pull request as ready for review March 31, 2026 18:30

Merge branch 'main' into fix_pad_token_id_resolution

b1df276

gkumbhat requested a review from flaviabeo April 3, 2026 17:50

Ssukriti reviewed Apr 28, 2026

Ssukriti mentioned this pull request Apr 28, 2026

add_mistral_support foundation-model-stack/aiu-fms-testing-utils#196

Merged

Ssukriti reviewed Apr 29, 2026

Comment thread fms/utils/generation.py

📝 Add comment for parity with pad_token_id

396328a

Signed-off-by: Gaurav-Kumbhat <Gaurav.Kumbhat@ibm.com>

Ssukriti approved these changes Apr 29, 2026

Ssukriti merged commit 476e18e into foundation-model-stack:main Apr 29, 2026
4 checks passed

Ssukriti merged 4 commits into foundation-model-stack:main from gkumbhat:fix_pad_token_id_resolution