k8s-llm-stack

Ready to run LLM stack configuration for Kuberenetes using Open Web UI / Ollama / HuggingFace TGI

kubectl create ns k8s-llm-stack
kubectl apply -f https://raw.githubusercontent.com/kubernetes/ingress-nginx/controller-v1.8.2/deploy/static/provider/cloud/deploy.yaml
kubectl apply -f nvidia-runtime.yaml
kubectl apply -f open-webui.yaml
kubectl apply -f ollama.yaml
kubectl apply -f tgi.yaml
kubectl apply -f vllm.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

k8s-llm-stack

About

Uh oh!

Releases

Packages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.gitignore		.gitignore
README.md		README.md
nvidia-runtime.yaml		nvidia-runtime.yaml
ollama.yaml		ollama.yaml
open-webui.yaml		open-webui.yaml
tgi.yaml		tgi.yaml
vllm.yaml		vllm.yaml

mshahzeb/k8s-llm-stack

Folders and files

Latest commit

History

Repository files navigation

k8s-llm-stack

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages