Fixed paged implementation to include cuda #481

Merged
ani300 merged 4 commits into main from fix_paged_cuda on Nov 24, 2025

@JRosenkranz (Collaborator)

This PR adds CUDA as a device for the paged_attn_compute/store operations and removes the CPU-specific calls from compute.
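
For readers unfamiliar with per-device operator registration, here is a minimal, pure-Python sketch of the general idea. It is not the code from this PR and not the PyTorch API: `register_kernel`, `dispatch`, and the toy `paged_attn_store` body are hypothetical names invented for illustration. It only shows why an operation registered for `"cpu"` alone fails on another device, and how listing `"cuda"` as well makes the same device-agnostic body reachable.

```python
from typing import Callable, Dict, Iterable, Tuple

# Hypothetical registry mapping (op name, device type) -> kernel function.
_KERNELS: Dict[Tuple[str, str], Callable] = {}


def register_kernel(op_name: str, device_types: Iterable[str]) -> Callable:
    """Register one implementation under every listed device type."""
    def decorator(fn: Callable) -> Callable:
        for device in device_types:
            _KERNELS[(op_name, device)] = fn
        return fn
    return decorator


def dispatch(op_name: str, device: str, *args):
    """Look up the kernel registered for this device and call it."""
    try:
        kernel = _KERNELS[(op_name, device)]
    except KeyError:
        raise NotImplementedError(
            f"{op_name} has no kernel for device '{device}'"
        ) from None
    return kernel(*args)


# Assumption: one device-agnostic body can serve both devices, so it is
# registered for "cpu" and "cuda" instead of "cpu" only.
@register_kernel("paged_attn_store", device_types=("cpu", "cuda"))
def paged_attn_store(cache: dict, slot: int, value: float) -> dict:
    # Toy stand-in for writing one entry into a paged cache slot.
    cache[slot] = value
    return cache
```

With this sketch, `dispatch("paged_attn_store", "cuda", {}, 3, 1.5)` reaches the same body as the `"cpu"` call, while an unregistered device such as `"xpu"` raises `NotImplementedError`.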

Signed-off-by: Joshua Rosenkranz <jmrosenk@us.ibm.com>
ani300 merged commit 070dcdc into main on Nov 24, 2025
4 checks passed

2 participants