Skip to content

Transformers upgrade to 4.57.6#516

Merged
flaviabeo merged 5 commits into
foundation-model-stack:mainfrom
flaviabeo:transformers
Mar 10, 2026
Merged

Transformers upgrade to 4.57.6#516
flaviabeo merged 5 commits into
foundation-model-stack:mainfrom
flaviabeo:transformers

Conversation

@flaviabeo

@flaviabeo flaviabeo commented Mar 10, 2026

Copy link
Copy Markdown
Collaborator

The newer versions of transformers have a parameter called cache_position that would require implementation like was done in transformers to have full compatibility with the generation done in FMS. The way to disable this is by passing use_cache=False in our tests that compare the output to transformers.

Signed-off-by: Flavia Beo <flavia.beo@ibm.com>
Signed-off-by: Flavia Beo <flavia.beo@ibm.com>
Signed-off-by: Flavia Beo <flavia.beo@ibm.com>
Signed-off-by: Flavia Beo <flavia.beo@ibm.com>
["hello how are you?", "a: this is a test. b: this is another test. a:"],
]
use_cache_options = [True, False, None]
use_cache_options = [False]

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you please add a comment here explaining why we are setting it to false now?

Signed-off-by: Flavia Beo <flavia.beo@ibm.com>

@gkumbhat gkumbhat left a comment

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This means we are disabling test for cache use-case. However given the change from transformers and this generate only getting used for testing and example use-cases, I think this is fine for now.

@flaviabeo flaviabeo merged commit 1abd95e into foundation-model-stack:main Mar 10, 2026
4 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants