Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Support Youtu-VL Model examples python python script changes
#18315 opened Dec 23, 2025 by f291400 Loading…
Add metal count equal op Apple Metal https://en.wikipedia.org/wiki/Metal_(API) documentation Improvements or additions to documentation ggml changes relating to the ggml tensor library for machine learning
#18314 opened Dec 23, 2025 by gatbontonpc Loading…
vulkan: handle rope with large number of rows ggml changes relating to the ggml tensor library for machine learning testing Everything test related Vulkan Issues specific to the Vulkan backend
#18306 opened Dec 22, 2025 by jeffbolznv Loading…
vulkan: fix command buffer corruption in ggml_backend_vk_event_wait ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#18302 opened Dec 22, 2025 by jeffbolznv Loading…
vulkan: extend topk_moe to handle sigmoid w/exp_probs_b for nemotron ggml changes relating to the ggml tensor library for machine learning testing Everything test related Vulkan Issues specific to the Vulkan backend
#18295 opened Dec 22, 2025 by jeffbolznv Loading…
Vulkan: Tune Flash Attention for MoE on AMD GPUs ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#18280 opened Dec 22, 2025 by 0cc4m Loading…
KYLIN: fix compile error for cuda backend ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#18275 opened Dec 22, 2025 by lizhenneng Loading…
docs: Fix typos in SYCL documentation documentation Improvements or additions to documentation SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#18269 opened Dec 21, 2025 by yoka Loading…
add LLAMA_ARG_OVERRIDE_TENSOR env var for -ot arg
#18267 opened Dec 21, 2025 by ddh0 Loading…
Add Gemma3n multimodal support with MobileNetV5 vision encoder examples model Model specific python python script changes
#18256 opened Dec 21, 2025 by simrnsingh Loading…
ggml rpc : Add missing check for rpc buffer type ggml changes relating to the ggml tensor library for machine learning
#18242 opened Dec 21, 2025 by struct Loading…
ggml-cpu: parallelize tensor repacking with OpenMP ggml changes relating to the ggml tensor library for machine learning
#18239 opened Dec 21, 2025 by pestopoppa Loading…
webui: Fix the header backdrop blur examples server
#18230 opened Dec 20, 2025 by ImadSaddik Loading…
server: /v1/responses (text generation only) examples python python script changes server
#18227 opened Dec 20, 2025 by openingnow Loading…
ggml-metal: guard buffer map slicing Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning
#18225 opened Dec 20, 2025 by SzymonPrajs Loading…
ProTip! Follow long discussions with comments:>50.