Move fqn mapping logic to StateDictAdapter #1557
Conversation
from .model import BaseModelArgs
...
-class StateDictAdapter(ABC):
+class BaseStateDictAdapter(ABC):
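For context, the resulting split looks roughly like this; the method names (to_hf / from_hf) and constructor signature below are illustrative assumptions, not copied from the PR:

# Illustrative sketch only; the actual torchtitan definitions may differ.
from abc import ABC, abstractmethod
from typing import Any


class BaseStateDictAdapter(ABC):
    """Pure interface for converting between torchtitan and HF state dicts."""

    @abstractmethod
    def to_hf(self, state_dict: dict[str, Any]) -> dict[str, Any]: ...

    @abstractmethod
    def from_hf(self, hf_state_dict: dict[str, Any]) -> dict[str, Any]: ...


class StateDictAdapter(BaseStateDictAdapter):
    """Shared base that owns the fqn_to_index_mapping logic this PR moves up."""

    def __init__(self, model_args, hf_assets_path: str | None):
        # model.safetensors.index.json under hf_assets_path is parsed here to
        # build fqn_to_index_mapping (see the sketch at the end of the thread).
        self.fqn_to_index_mapping: dict[str, int] | None = None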
Maybe we don't need this BaseStateDictAdapter -- could you think of a case where people would like to inherit BaseStateDictAdapter but not StateDictAdapter? What do you think?
I'm not sure if it would be better to handle the state dict adapter this way, but there are some multi-modal repositories that may have multiple safetensors.index.json files, such as https://huggingface.co/black-forest-labs/FLUX.1-dev/tree/main. I think you could handle such cases by using an sd_adapter on each of the individual sub-models, but perhaps someone may prefer having them all under the same class. In the case of FLUX, the extra models are just autoencoders that are not being trained, but perhaps someone may want to use torchtitan in a way that trains multiple models at the same time.
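If someone did want everything under one class, one hedged option is a thin wrapper that composes one adapter per sub-model. Everything below is hypothetical (the wrapper class, the fqn-prefix convention, and the idea that sub-adapters are passed in by name), building on the BaseStateDictAdapter sketch above:

# Hypothetical sketch of the "all under the same class" option; not part of the PR.
class MultiModelStateDictAdapter(BaseStateDictAdapter):
    def __init__(self, sub_adapters: dict[str, BaseStateDictAdapter]):
        # e.g. {"transformer": <transformer adapter>, "vae": <autoencoder adapter>}
        self.sub_adapters = sub_adapters

    def to_hf(self, state_dict):
        # Assumes fqns are prefixed with the sub-model name, e.g. "transformer.layers.0...".
        out = {}
        for name, adapter in self.sub_adapters.items():
            sub = {k[len(name) + 1:]: v for k, v in state_dict.items() if k.startswith(name + ".")}
            out.update({f"{name}.{k}": v for k, v in adapter.to_hf(sub).items()})
        return out

    def from_hf(self, hf_state_dict):
        # Symmetric to to_hf; details omitted in this sketch.
        ...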
In the case of FLUX, how would it work?
- Does our approach still work? It sounds to me that we are not ready to read https://huggingface.co/black-forest-labs/FLUX.1-dev/tree/main/transformer
- We don't support loading multiple models from multiple folders anyway, so I'm not sure overgeneralizing helps. In the future, if we support that, we would probably need multiple StateDictAdapters?
All that said, I'm OK with this change, but we probably need to change https://github.com/pytorch/torchtitan/blob/main/torchtitan/protocols/train_spec.py#L24 to use the base one.
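For reference, that change would be roughly the following one-line diff; the field name and annotation are how I read the linked line, so treat this as an assumption rather than an exact patch:

 # torchtitan/protocols/train_spec.py (sketch)
-    state_dict_adapter: type[StateDictAdapter] | None = None
+    state_dict_adapter: type[BaseStateDictAdapter] | None = None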
I think our approach will still work currently for all models using tokenizer_path, since its logic for downloading tokenizer files is identical to the previous download_tokenizer script. For loading the model in FLUX, we can pass the hf_assets_path of the full repo to FluxStateDictAdapter, and FluxStateDictAdapter can then pass hf_assets_path + "transformer" to the parent class. I agree, though, that I can iterate on the download_hf_assets options to narrow the search to certain subfolders, or to pattern match on full path names instead of base names.
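A minimal sketch of that handoff, assuming the constructor takes (model_args, hf_assets_path) like the base class sketched above; exact signatures and path handling are assumptions:

import os

# Sketch: FluxStateDictAdapter points the shared index-parsing logic at the
# "transformer" subfolder of the full FLUX repo checkout.
class FluxStateDictAdapter(StateDictAdapter):
    def __init__(self, model_args, hf_assets_path: str | None):
        if hf_assets_path is not None:
            hf_assets_path = os.path.join(hf_assets_path, "transformer")
        super().__init__(model_args, hf_assets_path)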
This moves the logic that parses model.safetensors.index.json and generates the fqn_to_index_mapping to StateDictAdapter, since this logic should be shared by all classes that inherit from StateDictAdapter.
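Concretely, the shared piece is roughly the following; this is a hedged sketch of the parsing step, not the exact code from the PR, and it assumes the standard HF index format where "weight_map" maps each fqn to a shard filename:

import json
import os

# Sketch: parse model.safetensors.index.json and build the fqn -> shard-index map.
def build_fqn_to_index_mapping(hf_assets_path: str) -> dict[str, int] | None:
    index_path = os.path.join(hf_assets_path, "model.safetensors.index.json")
    if not os.path.exists(index_path):
        return None  # single-file checkpoints ship without an index
    with open(index_path) as f:
        weight_map = json.load(f)["weight_map"]
    # Filenames look like "model-00002-of-00005.safetensors"; keep the shard number.
    return {fqn: int(fname.split("-")[1]) for fqn, fname in weight_map.items()}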