whisper.cpp

mirror/whisper.cpp

fb36a1538a readme : fix real-time audio input example build instructions (#2692) master Samuel Durante 2025-01-02 07:05:38 -0300
c81b8b910b objc : rename ggml-cpu-aarch64.c to .cpp (#2687) Alter 2025-01-02 10:05:09 +0000
85b60f31d0 docs : replace Core ML with OpenVINO (#2686) Konosuke Sakai 2025-01-02 19:03:02 +0900
227b5ffa36 make : fix "main" -> "whisper-cli" Georgi Gerganov 2024-12-31 11:46:17 +0200
36a64a253f ci : re-enable Windows cublas build (#2676) Nikolaj Olsson 2024-12-31 10:11:42 +0100
c84b83c370 ruby : Fix of C++ header guard name, model URI support, type signature and more (#2683) KITAITI Makoto 2024-12-30 21:26:35 +0900
5136fd92c2 examples : handle "main.exe" deprecation Georgi Gerganov 2024-12-30 13:00:18 +0200
7d55637f0b cli : add --suppress_nst support (#2664) Andreas Lubbe 2024-12-24 08:30:07 +0100
0994506054 cli : add no_speech_thold (#2663) Andreas Lubbe 2024-12-24 08:29:19 +0100
53c9a3a984 cmake : remove hardcoded install rpath Georgi Gerganov 2024-12-23 21:22:10 +0200
ed09075ca0 server : fix help print Georgi Gerganov 2024-12-22 15:32:05 +0200
f07a81aa9f ruby : bug fix on callbacks and no_speech_prob (#2656) KITAITI Makoto 2024-12-22 04:52:06 +0900
4183517076 server : add no-speech threshold parameter and functionality (#2654) Sacha Arbonel 2024-12-21 16:00:08 +0100
f4668169a0 whisper : rename suppress_non_speech_tokens to suppress_nst (#2653) Georgi Gerganov 2024-12-21 12:54:35 +0200
944ce49439 server : add option to suppress non-speech tokens (#2649) Sacha Arbonel 2024-12-21 11:05:05 +0100
2e59dced12 whisper : rename binaries + fix install (#2648) Georgi Gerganov 2024-12-21 09:43:49 +0200
e4e05981d6 ruby : update gem version to v1.3.1 Georgi Gerganov 2024-12-20 11:53:27 +0200
3de9deead5 release : v1.7.3 Georgi Gerganov 2024-12-18 18:12:40 +0200
47f989f9b3 ci : msys enable SDL2 build (#2635) Georgi Gerganov 2024-12-18 12:52:41 +0200
acc4e13dee ruby : sync ggml (#2643) KITAITI Makoto 2024-12-18 19:51:48 +0900
ba6c2a8fd9 android : try to fix build Georgi Gerganov 2024-12-18 09:54:23 +0200
6576af00d7 files : remove old sources Georgi Gerganov 2024-12-18 08:39:39 +0200
8ac5db0169 sync : ggml Georgi Gerganov 2024-12-18 08:38:56 +0200
61edb117a0 talk-llama : sync llama.cpp Georgi Gerganov 2024-12-17 21:19:28 +0200
eb97b257eb sync : ggml Georgi Gerganov 2024-12-17 19:25:44 +0200
479499dc0e ggml : update ggml_backend_cpu_device_supports_op (llama/10867) Georgi Gerganov 2024-12-17 18:35:42 +0200
d420a759c5 vulkan: bugfixes for small subgroup size systems + llvmpipe test (llama/10809) Eve 2024-12-17 05:52:55 +0000
a1ab9b5e91 rwkv6: add wkv6 support for Vulkan backend (llama/10829) Zhiyuan Li 2024-12-17 05:00:46 +0800
e22d38e4f2 llama : add Qwen2VL support + multimodal RoPE (llama/10361) HimariO 2024-12-14 20:43:46 +0800
856fbaa92f Introducing experimental OpenCL backend with support for Qualcomm Adreno GPUs (llama/10693) lhez 2024-12-13 12:23:52 -0800
2c05efa4b1 Fix crash caused by ggml_backend_load_all when launching on Android Activity (llama/10812) 谢乃闻 2024-12-13 12:56:07 +0000
c21fb10b28 vulkan: small mul_mat_vec optimizations (llama/10665) Eve 2024-12-13 08:42:04 +0000
26c9fd0cdc SYCL: Reduce most of the compiler warnings (llama/10748) Akarshan Biswas 2024-12-13 12:12:15 +0530
e6eed605cf ggml : Fix compilation issues on ARM platform when building without fp16 (llama/10811) Karol Kontny 2024-12-13 01:04:19 +0100
abe3102cb7 CUDA: faster non-contiguous concat (llama/10760) a3sh 2024-12-13 02:09:50 +0800
1193e494a9 remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (llama/10797) Diego Devesa 2024-12-12 19:02:49 +0100
e5e951672e Vulkan: Use improved q4_k and q5_k dequant code in dequant shaders (llama/10798) 0cc4m 2024-12-12 18:36:00 +0100
0e24559ad9 Vulkan: Add VK_EXT_subgroup_size_control support to ensure full subgroups for coopmats (llama/10721) 0cc4m 2024-12-12 18:35:37 +0100
527ac800cf ggml: load all backends from a user-provided search path (llama/10699) Gilad S 2024-12-11 02:47:21 +0200
479bd77169 vulkan: request round-to-even for fp16 in im2col/rope_head (llama/10767) Jeff Bolz 2024-12-10 14:23:17 -0600
d8bf63a41b vulkan: dynamic subgroup size for the remaining k quants (llama/10745) Eve 2024-12-10 19:33:23 +0000
b82c8d76dc CUDA: rename macros to avoid conflicts with WinAPI (llama/10736) Andreas Kieslinger 2024-12-10 18:23:24 +0100
86346f811e vulkan: disable spirv-opt for coopmat shaders (llama/10763) Jeff Bolz 2024-12-10 11:22:20 -0600
c635f40a34 ggml : remove return from ggml_gallocr_allocate_node (ggml/1048) Daniel Bevenius 2024-12-14 03:23:08 +0100
e0be0de1ee ggml : add check for grad_accs (ggml/1046) Daniel Bevenius 2024-12-13 08:19:38 +0100
60dc6d003f common : remove old types Georgi Gerganov 2024-12-10 17:19:09 +0200
eb27e0d834 CUDA: fix shared memory access condition for mmv (llama/10740) Johannes Gäßler 2024-12-09 20:07:12 +0100
a682fdce0c vulkan: fix compile warnings (llama/10731) Jeff Bolz 2024-12-09 01:24:01 -0600
9ffbd3d969 Vulkan: fix NaN in tanh.comp with AMD proprietary driver on Windows (llama/10723) stduhpf 2024-12-08 19:19:19 +0100
6585a890b4 vulkan: compile a test shader in cmake to check for coopmat2 support (llama/10713) Jeff Bolz 2024-12-08 02:05:55 -0600
d0a050b51f ggml : disable iq4_nl interleave size 8 (llama/10709) Georgi Gerganov 2024-12-07 18:38:15 +0200
e990d1b791 ggml : refactor online repacking (llama/10446) Djip007 2024-12-07 13:37:50 +0100
4a6d52efe6 Vulkan: VK_KHR_cooperative_matrix support to speed up prompt processing (llama/10597) 0cc4m 2024-12-07 10:24:15 +0100
8b841d430a metal : Extend how Llama.cpp locates metal resources (llama/10676) Robert Ormandi 2024-12-07 01:55:01 -0600
b74b68212a vulkan: Add VK_NV_cooperative_matrix2 support for mul_mat and flash attention (llama/10206) Jeff Bolz 2024-12-05 13:15:05 -0600
3a27b2b91b ruby : Add no_speech_thold (#2641) KITAITI Makoto 2024-12-18 18:00:50 +0900
d34445e960 stream : improve consistency in README (#2642) crummyh 2024-12-18 00:43:48 -0600
f897eb7670 whisper : support no_speech_thold (#2625) Karthick 2024-12-17 22:45:47 +0530
2f2841bfce whisper : add single-timestamp logic (#2629) Karthick 2024-12-17 22:37:08 +0530
09a1b61218 readme : fix typo (#2637) crummyh 2024-12-17 11:05:35 -0600
94e7da1ff2 cmake : fix "amd64" processor string (#2638) Georgi Gerganov 2024-12-17 18:34:32 +0200
c4aed6831e vulkan : fix soft_max.comp division by zero (#2633) gn64 2024-12-16 19:34:38 +0900
199579652e common : add cstdio header Georgi Gerganov 2024-12-16 08:57:04 +0200
d17e7139d8 stream : update build instructions Georgi Gerganov 2024-12-15 21:55:36 +0200
6a52eaea74 android : fix build and ci (#2624) Thamster 2024-12-14 10:25:53 -0500
6aa1d7b892 models : fix typo in download-ggml-model.sh (#2623) Michael Rienstra 2024-12-12 08:02:00 -0800
262e865a70 ruby : Sync whisper.cpp and model download feature (#2617) KITAITI Makoto 2024-12-09 20:17:50 +0900
ed733e85a1 scripts : update to new build system Georgi Gerganov 2024-12-09 11:30:16 +0200
5980b1ae77 devops : add cmake Georgi Gerganov 2024-12-08 23:09:26 +0200
0415a66044 devops : update make commands Georgi Gerganov 2024-12-08 23:07:29 +0200
7d134e3737 ggml : remove old files (skip) (#0) Georgi Gerganov 2024-12-08 23:04:26 +0200
9df53b357e ggml : sync remnants (skip) (#0) Georgi Gerganov 2024-12-08 22:48:25 +0200
b2115b4d9b scripts : remove amx from sync Georgi Gerganov 2024-12-08 22:48:14 +0200
0164427dd5 ci : disable freeBSD builds [no ci] Georgi Gerganov 2024-12-08 15:52:57 +0200
627b11c78a readme : update build instructions Georgi Gerganov 2024-12-08 15:48:14 +0200
472464453d ci : disable CUDA and Android builds Georgi Gerganov 2024-12-08 15:36:01 +0200
11dddfbc9e ci : disable Obj-C build + fixes Georgi Gerganov 2024-12-08 13:35:35 +0200
384e214cc7 make : shim cmake Georgi Gerganov 2024-12-06 15:34:53 +0200
f2c680f893 talk-llama : sync llama.cpp Georgi Gerganov 2024-12-05 14:30:33 +0200
fbe66da0e5 sync : ggml Georgi Gerganov 2024-12-05 14:29:18 +0200
a815940e0e ggml : add predefined list of CPU backend variants to build (llama/10626) Diego Devesa 2024-12-04 14:45:40 +0100
904e307bce ggml-cpu : fix HWCAP2_I8MM value (llama/10646) Diego Devesa 2024-12-04 14:40:44 +0100
491ec076b4 vulkan: Implement "fast divide" (mul+shift) for unary ops like copy (llama/10642) Jeff Bolz 2024-12-04 01:28:59 -0600
966433fdf2 SYCL : Move to compile time oneMKL interface backend selection for NVIDIA backend (llama/10584) Nicolò Scipione 2024-12-04 02:29:20 +0100
6f1ba9d82d Avoid using __fp16 on ARM with old nvcc (llama/10616) Frankie Robertson 2024-12-04 02:41:37 +0200
015ecd0001 vulkan: optimize and reenable split_k (llama/10637) Jeff Bolz 2024-12-03 13:29:54 -0600
b7c64a4352 ggml: add GGML_SET Metal kernel + i32 CPU kernel (ggml/1037) PAB 2024-12-04 09:19:30 +0100
7895d39508 ggml : add GGML_PAD_REFLECT_1D operation (ggml/1034) PAB 2024-12-03 20:20:04 +0100
22616f00f9 files : remove make artifacts Georgi Gerganov 2024-12-03 20:29:32 +0200
02c6fcbc2c common : fix compile warning Georgi Gerganov 2024-12-03 20:25:37 +0200
3daeacad24 ggml : move AMX to the CPU backend (llama/10570) Diego Devesa 2024-12-03 20:22:12 +0200
4d73962da4 metal : small-batch mat-mul kernels (llama/10581) Georgi Gerganov 2024-12-03 11:52:33 +0200
068812650e SYCL: Fix and switch to GGML_LOG system instead of fprintf (llama/10579) Akarshan Biswas 2024-12-02 12:34:11 +0530
4b7e059e15 ggml-cpu: replace AArch64 NEON assembly with intrinsics in ggml_gemv_q4_0_4x4_q8_0() (llama/10567) Adrien Gallouët 2024-11-30 18:13:18 +0100
30e35d7271 vulkan: Dynamic subgroup size support for Q6_K mat_vec (llama/10536) Eve 2024-11-30 07:00:02 +0000
3623bd58f2 ggml : fix I8MM Q4_1 scaling factor conversion (llama/10562) Georgi Gerganov 2024-11-29 16:25:39 +0200
cb847c20a7 ggml-cpu: fix typo in gemv/gemm iq4_nl_4_4 (llama/10580) Shupei Fan 2024-11-29 21:49:02 +0800
964b154a2a sycl : offload of get_rows set to 0 (llama/10432) Alberto Cabrera Pérez 2024-11-29 12:38:45 +0000
d7c2a04bce sycl : Reroute permuted mul_mats through oneMKL (llama/10408) Alberto Cabrera Pérez 2024-11-29 09:49:43 +0000
2bb4ca9cba CANN: RoPE operator optimization (llama/10563) Chenguang Li 2024-11-29 14:46:55 +0800