-
Notifications
You must be signed in to change notification settings - Fork 12.6k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
chat : only remove double bos/eos if added
testing
Everything test related
#15086
opened Aug 5, 2025 by
CISC
Loading…
model : add reasoning/tool parsing to Llama 3.x Nemotron
testing
Everything test related
#15083
opened Aug 5, 2025 by
aldehir
Loading…
ggml: WebGPU disable SET_ROWS for now
devops
improvements to build systems and github actions
ggml
changes relating to the ggml tensor library for machine learning
#15078
opened Aug 5, 2025 by
reeselevine
Loading…
OpenCL: fix profiling crash in llama-bench
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
#15072
opened Aug 4, 2025 by
rmatif
Loading…
CANN: GGML_OP_CPY optimization
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#15070
opened Aug 4, 2025 by
noemotiovon
Loading…
CANN: add optional support for ACL Graph execution
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#15065
opened Aug 4, 2025 by
noemotiovon
Loading…
quantize : configurable neutral imatrix prior
examples
generation quality
Quality of model output
need feedback
Testing and feedback with results are needed
research 🔬
#15060
opened Aug 3, 2025 by
compilade
Loading…
1 of 3 tasks
ggml-cpu : add basic RVV support for vector f32 ops
ggml
changes relating to the ggml tensor library for machine learning
#15057
opened Aug 3, 2025 by
xctan
Loading…
vulkan: conv2d addressing optimizations
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#15056
opened Aug 3, 2025 by
jeffbolznv
Loading…
Support streaming delta.reasoning_content in WebUI
examples
server
#15052
opened Aug 3, 2025 by
mostlygeek
Loading…
Fix: respect localStorage base URL override in Web UI
examples
server
#15048
opened Aug 3, 2025 by
insanerest
Loading…
Fix: flush partial stop string when <EOG> is reached in /completion endpoint in streaming mode
examples
server
#15007
opened Aug 1, 2025 by
matteoserva
Loading…
fix compile bug when the BASE_CUDA_DEV_CONTAINER is based on Ubuntu 2…
devops
improvements to build systems and github actions
#15005
opened Aug 1, 2025 by
simevo
Loading…
Add support for CogVLM model
examples
python
python script changes
#15002
opened Aug 1, 2025 by
Tianyue-Zhao
Loading…
2 of 4 tasks
OpenCL: add initial FA support
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
#14987
opened Jul 31, 2025 by
rmatif
Loading…
CUDA: add set
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#14980
opened Jul 31, 2025 by
jeemzz147
Loading…
ggml: initial IBM zDNN backend
devops
improvements to build systems and github actions
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
#14975
opened Jul 30, 2025 by
taronaeo
Loading…
Optimize l2_norm_f32 op with SIMD
ggml
changes relating to the ggml tensor library for machine learning
#14970
opened Jul 30, 2025 by
TIKki43
Loading…
Implementation of GGML_NUMA_MIRROR for inferencing performance gain on numa systems
examples
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
ggml : fix field name when new ggml_backend
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
OpenCL
Issues specific to the OpenCL backend
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
Vulkan
Issues specific to the Vulkan backend
#14944
opened Jul 29, 2025 by
aisk
Loading…
ops: add MUSA
documentation
Improvements or additions to documentation
#14941
opened Jul 29, 2025 by
yeahdongcn
Loading…
mtmd : support home-cooked Mistral Small Omni
examples
#14928
opened Jul 28, 2025 by
ngxson
Loading…
repack : optimize mul_mat_id path
ggml
changes relating to the ggml tensor library for machine learning
#14918
opened Jul 28, 2025 by
ggerganov
Loading…
1 task
opencl: fixed a typo
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
#14908
opened Jul 27, 2025 by
l29ah
Loading…
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.