mabrowning

Llama 3.1 70B and Llama 3.3 70B have identical performance, but the latter has further fine-tuned weights. Cerebras now hosts only Llama 3.3 70B.

I see that another PR (#132) adds a Llama 3.3 model under the Llama 3.1 label. Is there anything special about the display name, or should we add a new set of models (LLAMA_33_70B_CHAT et al.)?

Llama 3.1 and 3.3 have identical performance, but the latter has further
fine-tuned weights. Cerebras has transitioned over to only hosting the
latter.