add support for mixtral-8x7b and mixtral-8x7b-instruct #408
Conversation
model-engine/model_engine_server/domain/use_cases/llm_model_endpoint_use_cases.py
@@ -58,6 +58,8 @@ def get_default_supported_models_info() -> Dict[str, ModelInfo]:
        ),
        "mistral-7b": ModelInfo("mistralai/Mistral-7B-v0.1", None),
        "mistral-7b-instruct": ModelInfo("mistralai/Mistral-7B-Instruct-v0.1", None),
Let's also update mistral-7b-instruct to use the newer version released today (https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2). We can do this in a follow-up PR, but it's good to do as well.
Should we add it as a separate model instead of replacing the current one? Also, we should do that as a follow-up PR.
Personally, I wouldn't make it a separate model; I would just add the weights to S3 and use them in favor of the v0.1 weights. I suppose there is some value in having both models for completeness, though. @yunfeng-scale, thoughts on this?
I don't think there's a need to add it as a new model.
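A minimal sketch of the follow-up change discussed above, assuming the existing entry is updated in place rather than a new key being added:

```python
# Assumed follow-up: repoint the existing key at the v0.2 weights
# instead of registering a separate "mistral-7b-instruct-v0.2" model.
"mistral-7b-instruct": ModelInfo("mistralai/Mistral-7B-Instruct-v0.2", None),
```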
Pull Request Summary
Bump vLLM to v0.2.4 to support the mixtral-8x7b models.
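The summary doesn't show the dependency change itself; presumably it's a version pin along these lines (the requirements file path is an assumption about this repo's layout):

```
# requirements file for the inference image (path assumed)
vllm==0.2.4
```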
Test Plan and Usage Guide
Spin up the model for local inference.
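The test plan doesn't list the exact commands; a minimal sketch of local inference against the new model using the vLLM 0.2.4 offline API directly (this bypasses the model-engine endpoint, and the tensor-parallel setting is an assumption about the test hardware):

```python
from vllm import LLM, SamplingParams

# Mixtral-8x7B is too large for a single typical GPU; tensor_parallel_size=2
# is an assumption about the local test machine.
llm = LLM(
    model="mistralai/Mixtral-8x7B-Instruct-v0.1",
    tensor_parallel_size=2,
)

params = SamplingParams(temperature=0.7, max_tokens=128)
outputs = llm.generate(["What is a mixture-of-experts model?"], params)
for out in outputs:
    print(out.outputs[0].text)
```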