Add Qwen2 GGUF loading support #31175
Conversation
Thanks a lot for this great contribution! Can you confirm the other slow tests pass? 🙏 I left a few minor comments, what do you think?
Great work! Thanks for adding Qwen2 support for GGUF files! Can you run the styling checks (`make fixup`)? After that this PR is ready IMO.
Thanks for adding!
What does this PR do?

Use `model_type` for GGUF tokenizer converter selection instead of `tokenizer_type`. According to `convert-hf-to-gguf.py`, most models may register their tokenizer as a `gpt2` tokenizer, so `model_type` is used to select the corresponding tokenizer converter instead of `tokenizer_type`.

Before submitting
- Did you read the contributor guideline, Pull Request section?
- Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
- Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
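The selection logic the PR describes can be sketched as follows. This is an illustrative simplification, not transformers' actual code: the converter functions and the registry dict here are hypothetical stand-ins for the library's real GGUF tokenizer-converter mapping.

```python
# Illustrative sketch: pick a tokenizer converter from GGUF metadata
# keyed by model_type rather than tokenizer_type. Qwen2 (like several
# other models) registers its tokenizer as "gpt2" in GGUF metadata,
# so keying on tokenizer_type alone would be ambiguous.

def gpt2_converter(metadata: dict) -> str:
    # Stand-in for the real GPT2-style tokenizer converter.
    return f"GPT2Tokenizer for {metadata['model_type']}"

def llama_converter(metadata: dict) -> str:
    # Stand-in for the real Llama-style tokenizer converter.
    return f"LlamaTokenizer for {metadata['model_type']}"

# Hypothetical registry keyed by model_type (not tokenizer_type).
GGUF_TOKENIZER_CONVERTERS = {
    "llama": llama_converter,
    "qwen2": gpt2_converter,
}

def select_converter(metadata: dict) -> str:
    # model_type disambiguates models that share a tokenizer_type.
    return GGUF_TOKENIZER_CONVERTERS[metadata["model_type"]](metadata)
```

With this shape, a Qwen2 GGUF file whose metadata reports `tokenizer_type == "gpt2"` still resolves to the converter registered for `qwen2`.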
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.