Diego Devesa
|
a815940e0e
|
ggml : add predefined list of CPU backend variants to build (llama/10626)
* ggml : add predefined list of CPU backend variants to build
* update CPU dockerfiles
|
2024-12-08 20:14:35 +02:00 |
|
Diego Devesa
|
3daeacad24
|
ggml : move AMX to the CPU backend (llama/10570)
ggml : automatic selection of best CPU backend (llama/10606)
|
2024-12-08 20:14:35 +02:00 |
|
Shupei Fan
|
330273901f
|
ggml-cpu: support IQ4_NL_4_4 by runtime repack (llama/10541)
* ggml-cpu: support IQ4_NL_4_4 by runtime repack
* ggml-cpu: add __ARM_FEATURE_DOTPROD guard
|
2024-12-08 20:14:35 +02:00 |
|
Diego Devesa
|
77e3e4a090
|
ggml : add support for dynamic loading of backends (llama/10469)
* ggml : add support for dynamic loading of backends
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
|
2024-12-08 20:14:35 +02:00 |
|
Charles Xu
|
3298916e5e
|
backend cpu: add online flow for aarch64 Q4_0 GEMV/GEMM kernels (llama/9921)
* backend-cpu: add online flow for aarch64 Q4_0 GEMV/GEMM kernels
---------
Co-authored-by: Diego Devesa <slarengh@gmail.com>
|
2024-11-20 21:00:08 +02:00 |
|
Diego Devesa
|
746bf2596f
|
ggml : build backends as libraries (llama/10256)
* ggml : build backends as libraries
---------
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
Co-authored-by: R0CKSTAR <xiaodong.ye@mthreads.com>
|
2024-11-20 21:00:08 +02:00 |
|