[Feature]: AutoDeploy: Add torch.export support for GraniteMoe models #10170

@karthikvetrivel

Description

🚀 The feature, motivation and pitch

Feature: Add an export patch for IBM Granite MoE models (ibm-granite/granite-3.x-*b-a800m-instruct) to enable torch.export compatibility in AutoDeploy.

Motivation: GraniteMoe models currently fail during AutoDeploy engine compilation with the error:

"Cannot copy out of meta tensor; no data!"

This is caused by GraniteMoeTopKGating calling .tolist() on a tensor. .tolist() copies the tensor's values to the host, but the meta tensors used during torch.export tracing carry only shape and dtype, so there is no data to copy.
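A minimal sketch of the failure mode, assuming only stock PyTorch. The helper `expert_sizes` is a hypothetical stand-in for the per-expert token counting done in the gating module, not the actual transformers code. It shows that the counting itself works on the meta device and that only the conversion to a Python list fails:

```python
import torch


def expert_sizes(top_k_indices: torch.Tensor, num_experts: int) -> torch.Tensor:
    """Count how many tokens are routed to each expert, staying in tensor form.

    Hypothetical helper for illustration; not the transformers implementation.
    """
    one_hot = torch.zeros(
        top_k_indices.size(0), num_experts,
        dtype=torch.long, device=top_k_indices.device,
    )
    # Mark each (token, expert) pair selected by top-k routing.
    one_hot = one_hot.scatter(1, top_k_indices, 1)
    return one_hot.sum(0)


# With real data, converting the counts to a Python list works.
cpu_indices = torch.tensor([[0, 1], [1, 2], [0, 3]])
cpu_sizes = expert_sizes(cpu_indices, num_experts=4).tolist()

# On the meta device the tensor has a shape and dtype but no storage.
meta_indices = torch.empty((3, 2), dtype=torch.long, device="meta")
meta_sizes = expert_sizes(meta_indices, num_experts=4)  # shape-only result

try:
    meta_sizes.tolist()
    tolist_failed = False
except RuntimeError as err:  # NotImplementedError is a RuntimeError subclass
    tolist_failed = True
    print(f"{type(err).__name__}: {err}")
```

If this matches what happens inside GraniteMoeTopKGating, an export patch would presumably need to keep the expert sizes as a tensor (or otherwise avoid the host copy) on the traced path. The exact shape of that patch is left open here.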

The failure was observed on the AutoDeploy Dashboard for models including ibm-granite/granite-3.1-3b-a800m-instruct.

Alternatives

No response

Additional context

No response

Before submitting a new issue...

  • Make sure you already searched for relevant issues, and checked the documentation and examples for answers to frequently asked questions.

Metadata

Assignees

Labels

AutoDeploy: <NV> AutoDeploy Backend

feature request: New feature or request. This includes new model, dtype, functionality support

Type

No type

Projects

Status

Ready

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests