Skip to content

Conversation

saiatmakuri
Copy link
Contributor

Pull Request Summary

bump vllm to v0.2.4 to support mixtral-8x7b models

Test Plan and Usage Guide

spin up model for local inference

@@ -58,6 +58,8 @@ def get_default_supported_models_info() -> Dict[str, ModelInfo]:
),
"mistral-7b": ModelInfo("mistralai/Mistral-7B-v0.1", None),
"mistral-7b-instruct": ModelInfo("mistralai/Mistral-7B-Instruct-v0.1", None),
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

let's also update mistral-7b-Instruct to use the newer version released today: https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2 we can do this in a follow up pr but good to do as well

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

should we add it as a separate model instead of replacing the current one? also, should do that as a follow-up PR

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Personally I wouldn't make it a separate model, I would just add the weights to s3 and use them in favor of the v0.1 weights. I suppose there is some value to having both models though for completeness, @yunfeng-scale thoughts on this?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

don't think there's need to add as a new model

@saiatmakuri saiatmakuri merged commit 474155e into main Dec 14, 2023
@saiatmakuri saiatmakuri deleted the saiatmakuri/add-mixtral-8x7b branch December 14, 2023 22:26
@yunfeng-scale yunfeng-scale mentioned this pull request Mar 6, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants