whisper.cpp/ggml/src
Meng, Hengyu 17e49d3ab2 fallback mmvq (llama/9088)
* fallback mmvq to mul_mat

* mmvq in cuda path

* Update ggml/src/ggml-sycl.cpp

Co-authored-by: Alberto Cabrera Pérez <alberto.cabrera@codeplay.com>

---------

Co-authored-by: Alberto Cabrera Pérez <alberto.cabrera@codeplay.com>
2024-08-28 13:22:20 +03:00
..
ggml-cann ggml : move rope type enum to ggml.h (llama/8949) 2024-08-28 13:22:20 +03:00
ggml-cuda ggml : move rope type enum to ggml.h (llama/8949) 2024-08-28 13:22:20 +03:00
ggml-sycl fallback mmvq (llama/9088) 2024-08-28 13:22:20 +03:00
kompute-shaders whisper : reorganize source code + improve CMake (#2256) 2024-06-26 19:34:09 +03:00
vulkan-shaders sync : ggml vulkan (ggml/0) 2024-08-21 11:07:13 +03:00
CMakeLists.txt cann: update cmake (llama/8765) 2024-08-08 22:48:46 +03:00
ggml-aarch64.c ggml : reading the runtime sve config of the cpu (llama/8709) 2024-08-08 22:48:46 +03:00
ggml-aarch64.h ggml : add ggml-aarch64 (ggml/0) 2024-08-08 22:48:46 +03:00
ggml-alloc.c ggml : reduce hash table reset cost (llama/8698) 2024-08-08 22:48:46 +03:00
ggml-backend-impl.h whisper : reorganize source code + improve CMake (#2256) 2024-06-26 19:34:09 +03:00
ggml-backend.c ggml : dynamic ggml_sched_max_splits based on graph_size (llama/9047) 2024-08-28 13:22:20 +03:00
ggml-blas.cpp ggml : reduce hash table reset cost (llama/8698) 2024-08-08 22:48:46 +03:00
ggml-cann.cpp ggml : add CANN backend (llama/0) 2024-08-09 09:58:16 +03:00
ggml-common.h feat: Support Moore Threads GPU (llama/8383) 2024-08-08 22:48:46 +03:00
ggml-cuda.cu feat: ref. cross entropy, add CUDA, fix grad test (ggml/929) 2024-08-28 13:22:20 +03:00
ggml-impl.h ggml : reading the runtime sve config of the cpu (llama/8709) 2024-08-08 22:48:46 +03:00
ggml-kompute.cpp ggml : reduce hash table reset cost (llama/8698) 2024-08-08 22:48:46 +03:00
ggml-metal.m ggml : move rope type enum to ggml.h (llama/8949) 2024-08-28 13:22:20 +03:00
ggml-metal.metal yolo : add backend support (ggml/924) 2024-08-21 11:07:13 +03:00
ggml-quants.c ggml : fix typo in ggml-quants.c comment (ggml/922) 2024-08-21 11:07:13 +03:00
ggml-quants.h ggml : reading the runtime sve config of the cpu (llama/8709) 2024-08-08 22:48:46 +03:00
ggml-rpc.cpp rpc : print error message when failed to connect endpoint (llama/9042) 2024-08-28 13:22:20 +03:00
ggml-sycl.cpp fallback mmvq (llama/9088) 2024-08-28 13:22:20 +03:00
ggml-vulkan.cpp ggml : move rope type enum to ggml.h (llama/8949) 2024-08-28 13:22:20 +03:00
ggml.c ggml : move rope type enum to ggml.h (llama/8949) 2024-08-28 13:22:20 +03:00
sgemm.cpp whisper : reorganize source code + improve CMake (#2256) 2024-06-26 19:34:09 +03:00
sgemm.h whisper : reorganize source code + improve CMake (#2256) 2024-06-26 19:34:09 +03:00