Open
Labels
AutoDeploy: <NV> AutoDeploy Backend
feature request: New feature or request. This includes new model, dtype, functionality support
Description
🚀 The feature, motivation and pitch
Feature: Add an export patch for IBM Granite MoE models (ibm-granite/granite-3.x-*b-a800m-instruct) to enable torch.export compatibility in AutoDeploy.
Motivation: GraniteMoe models currently fail during AutoDeploy engine compilation with the error:
"Cannot copy out of meta tensor; no data!"
The error is caused by GraniteMoeTopKGating calling .tolist() on a tensor. The meta tensors used during torch.export tracing carry only shape and dtype, not data, so there are no values to copy into a Python list.
The failure was observed on the AutoDeploy Dashboard for models including ibm-granite/granite-3.1-3b-a800m-instruct.
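To make the failure mode concrete, here is a minimal sketch that reproduces it outside of AutoDeploy and shows one possible data-independent alternative. It is an illustration under assumptions, not the actual GraniteMoeTopKGating code and not necessarily how the export patch should be written: tolist_fails_on_meta and dense_topk_gates are hypothetical names, and the dense gate matrix is just one way to keep routing as pure tensor ops.

```python
import torch


def tolist_fails_on_meta(num_experts: int = 4) -> bool:
    """Return True if .tolist() raises on a meta tensor, which has no data."""
    # Stand-in for the per-expert token counts; the real gating code differs.
    expert_size = torch.zeros(num_experts, dtype=torch.long, device="meta")
    try:
        expert_size.tolist()
    except RuntimeError:  # NotImplementedError is a RuntimeError subclass
        return True
    return False


def dense_topk_gates(logits: torch.Tensor, top_k: int) -> torch.Tensor:
    """Hypothetical helper: dense [tokens, experts] gates using tensor ops only.

    No .tolist() or .item() is needed, so the function only depends on
    shapes and can run on meta tensors.
    """
    top_vals, top_idx = logits.topk(top_k, dim=1)
    top_gates = torch.softmax(top_vals, dim=1)
    # Scatter the top-k gate weights into a zero matrix; other experts stay 0.
    return torch.zeros_like(logits).scatter(1, top_idx, top_gates)


if __name__ == "__main__":
    print("tolist fails on meta:", tolist_fails_on_meta())
    gates = dense_topk_gates(torch.randn(6, 4, device="meta"), top_k=2)
    print("dense gates shape on meta:", tuple(gates.shape))
```

The dense form trades the per-expert Python list for a fixed-shape tensor, so downstream expert dispatch would also need to consume a dense gate matrix (or a dedicated MoE op) instead of variable-length splits.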
Alternatives
No response
Additional context
No response
Before submitting a new issue...
- Make sure you already searched for relevant issues, and checked the documentation and examples for answers to frequently asked questions.
Status
Ready