Hi! I was trying to run the inference.ipynb notebook and got a RuntimeError.
When I ran model = LlamaForCausalLM.from_pretrained(train_config.model_name, device_map="auto", config=config).to(device), I got RuntimeError: You can't move a model that has some modules offloaded to cpu or disk.
I suspect it is related to device = "cuda" if torch.cuda.is_available() else "cpu", but my GPU memory is available and not full.
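For context, this is the device selection as I understand it from the notebook (a minimal sketch; the full notebook also builds config and train_config before loading the model):

```python
import torch

# Device selection from the notebook: prefer the GPU when one is visible.
device = "cuda" if torch.cuda.is_available() else "cpu"

# My understanding (please correct me if wrong): from_pretrained(...,
# device_map="auto") already dispatches the model's modules across devices,
# possibly offloading some to CPU or disk, so the trailing .to(device) then
# tries to move an already-dispatched model and raises the RuntimeError.
print(device)
```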
How can I solve this? Thank you.
Best regards,
Maggie
