Commit Graph

  • fb36a1538a
    readme : fix real-time audio input example build instructions (#2692) master Samuel Durante 2025-01-02 07:05:38 -0300
  • c81b8b910b
    objc : rename ggml-cpu-aarch64.c to .cpp (#2687) Alter 2025-01-02 10:05:09 +0000
  • 85b60f31d0
    docs : replace Core ML with OpenVINO (#2686) Konosuke Sakai 2025-01-02 19:03:02 +0900
  • 227b5ffa36
    make : fix "main" -> "whisper-cli" Georgi Gerganov 2024-12-31 11:46:17 +0200
  • 36a64a253f
    ci : re-enable Windows cublas build (#2676) Nikolaj Olsson 2024-12-31 10:11:42 +0100
  • c84b83c370
    ruby : Fix of C++ header guard name, model URI support, type signature and more (#2683) KITAITI Makoto 2024-12-30 21:26:35 +0900
  • 5136fd92c2
    examples : handle "main.exe" deprecation Georgi Gerganov 2024-12-30 13:00:18 +0200
  • 7d55637f0b
    cli : add --suppress_nst support (#2664) Andreas Lubbe 2024-12-24 08:30:07 +0100
  • 0994506054
    cli : add no_speech_thold (#2663) Andreas Lubbe 2024-12-24 08:29:19 +0100
  • 53c9a3a984
    cmake : remove hardcoded install rpath Georgi Gerganov 2024-12-23 21:22:10 +0200
  • ed09075ca0
    server : fix help print Georgi Gerganov 2024-12-22 15:32:05 +0200
  • f07a81aa9f
    ruby : bug fix on callbacks and no_speech_prob (#2656) KITAITI Makoto 2024-12-22 04:52:06 +0900
  • 4183517076
    server : add no-speech threshold parameter and functionality (#2654) Sacha Arbonel 2024-12-21 16:00:08 +0100
  • f4668169a0
    whisper : rename suppress_non_speech_tokens to suppress_nst (#2653) Georgi Gerganov 2024-12-21 12:54:35 +0200
  • 944ce49439
    server : add option to suppress non-speech tokens (#2649) Sacha Arbonel 2024-12-21 11:05:05 +0100
  • 2e59dced12
    whisper : rename binaries + fix install (#2648) Georgi Gerganov 2024-12-21 09:43:49 +0200
  • e4e05981d6
    ruby : update gem version to v1.3.1 Georgi Gerganov 2024-12-20 11:53:27 +0200
  • 3de9deead5
    release : v1.7.3 Georgi Gerganov 2024-12-18 18:12:40 +0200
  • 47f989f9b3
    ci : msys enable SDL2 build (#2635) Georgi Gerganov 2024-12-18 12:52:41 +0200
  • acc4e13dee ruby : sync ggml (#2643) KITAITI Makoto 2024-12-18 19:51:48 +0900
  • ba6c2a8fd9 android : try to fix build Georgi Gerganov 2024-12-18 09:54:23 +0200
  • 6576af00d7 files : remove old sources Georgi Gerganov 2024-12-18 08:39:39 +0200
  • 8ac5db0169 sync : ggml Georgi Gerganov 2024-12-18 08:38:56 +0200
  • 61edb117a0 talk-llama : sync llama.cpp Georgi Gerganov 2024-12-17 21:19:28 +0200
  • eb97b257eb sync : ggml Georgi Gerganov 2024-12-17 19:25:44 +0200
  • 479499dc0e ggml : update ggml_backend_cpu_device_supports_op (llama/10867) Georgi Gerganov 2024-12-17 18:35:42 +0200
  • d420a759c5 vulkan: bugfixes for small subgroup size systems + llvmpipe test (llama/10809) Eve 2024-12-17 05:52:55 +0000
  • a1ab9b5e91 rwkv6: add wkv6 support for Vulkan backend (llama/10829) Zhiyuan Li 2024-12-17 05:00:46 +0800
  • e22d38e4f2 llama : add Qwen2VL support + multimodal RoPE (llama/10361) HimariO 2024-12-14 20:43:46 +0800
  • 856fbaa92f Introducing experimental OpenCL backend with support for Qualcomm Adreno GPUs (llama/10693) lhez 2024-12-13 12:23:52 -0800
  • 2c05efa4b1 Fix crash caused by ggml_backend_load_all when launching on Android Activity (llama/10812) 谢乃闻 2024-12-13 12:56:07 +0000
  • c21fb10b28 vulkan: small mul_mat_vec optimizations (llama/10665) Eve 2024-12-13 08:42:04 +0000
  • 26c9fd0cdc SYCL: Reduce most of the compiler warnings (llama/10748) Akarshan Biswas 2024-12-13 12:12:15 +0530
  • e6eed605cf ggml : Fix compilation issues on ARM platform when building without fp16 (llama/10811) Karol Kontny 2024-12-13 01:04:19 +0100
  • abe3102cb7 CUDA: faster non-contiguous concat (llama/10760) a3sh 2024-12-13 02:09:50 +0800
  • 1193e494a9 remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (llama/10797) Diego Devesa 2024-12-12 19:02:49 +0100
  • e5e951672e Vulkan: Use improved q4_k and q5_k dequant code in dequant shaders (llama/10798) 0cc4m 2024-12-12 18:36:00 +0100
  • 0e24559ad9 Vulkan: Add VK_EXT_subgroup_size_control support to ensure full subgroups for coopmats (llama/10721) 0cc4m 2024-12-12 18:35:37 +0100
  • 527ac800cf ggml: load all backends from a user-provided search path (llama/10699) Gilad S 2024-12-11 02:47:21 +0200
  • 479bd77169 vulkan: request round-to-even for fp16 in im2col/rope_head (llama/10767) Jeff Bolz 2024-12-10 14:23:17 -0600
  • d8bf63a41b vulkan: dynamic subgroup size for the remaining k quants (llama/10745) Eve 2024-12-10 19:33:23 +0000
  • b82c8d76dc CUDA: rename macros to avoid conflicts with WinAPI (llama/10736) Andreas Kieslinger 2024-12-10 18:23:24 +0100
  • 86346f811e vulkan: disable spirv-opt for coopmat shaders (llama/10763) Jeff Bolz 2024-12-10 11:22:20 -0600
  • c635f40a34 ggml : remove return from ggml_gallocr_allocate_node (ggml/1048) Daniel Bevenius 2024-12-14 03:23:08 +0100
  • e0be0de1ee ggml : add check for grad_accs (ggml/1046) Daniel Bevenius 2024-12-13 08:19:38 +0100
  • 60dc6d003f common : remove old types Georgi Gerganov 2024-12-10 17:19:09 +0200
  • eb27e0d834 CUDA: fix shared memory access condition for mmv (llama/10740) Johannes Gäßler 2024-12-09 20:07:12 +0100
  • a682fdce0c vulkan: fix compile warnings (llama/10731) Jeff Bolz 2024-12-09 01:24:01 -0600
  • 9ffbd3d969 Vulkan: fix NaN in tanh.comp with AMD proprietary driver on Windows (llama/10723) stduhpf 2024-12-08 19:19:19 +0100
  • 6585a890b4 vulkan: compile a test shader in cmake to check for coopmat2 support (llama/10713) Jeff Bolz 2024-12-08 02:05:55 -0600
  • d0a050b51f ggml : disable iq4_nl interleave size 8 (llama/10709) Georgi Gerganov 2024-12-07 18:38:15 +0200
  • e990d1b791 ggml : refactor online repacking (llama/10446) Djip007 2024-12-07 13:37:50 +0100
  • 4a6d52efe6 Vulkan: VK_KHR_cooperative_matrix support to speed up prompt processing (llama/10597) 0cc4m 2024-12-07 10:24:15 +0100
  • 8b841d430a metal : Extend how Llama.cpp locates metal resources (llama/10676) Robert Ormandi 2024-12-07 01:55:01 -0600
  • b74b68212a vulkan: Add VK_NV_cooperative_matrix2 support for mul_mat and flash attention (llama/10206) Jeff Bolz 2024-12-05 13:15:05 -0600
  • 3a27b2b91b
    ruby : Add no_speech_thold (#2641) KITAITI Makoto 2024-12-18 18:00:50 +0900
  • d34445e960
    stream : improve consistency in README (#2642) crummyh 2024-12-18 00:43:48 -0600
  • f897eb7670
    whisper : support no_speech_thold (#2625) Karthick 2024-12-17 22:45:47 +0530
  • 2f2841bfce
    whisper : add single-timestamp logic (#2629) Karthick 2024-12-17 22:37:08 +0530
  • 09a1b61218
    readme : fix typo (#2637) crummyh 2024-12-17 11:05:35 -0600
  • 94e7da1ff2
    cmake : fix "amd64" processor string (#2638) Georgi Gerganov 2024-12-17 18:34:32 +0200
  • c4aed6831e
    vulkan : fix soft_max.comp division by zero (#2633) gn64 2024-12-16 19:34:38 +0900
  • 199579652e
    common : add cstdio header Georgi Gerganov 2024-12-16 08:57:04 +0200
  • d17e7139d8
    stream : update build instructions Georgi Gerganov 2024-12-15 21:55:36 +0200
  • 6a52eaea74
    android : fix build and ci (#2624) Thamster 2024-12-14 10:25:53 -0500
  • 6aa1d7b892
    models : fix typo in download-ggml-model.sh (#2623) Michael Rienstra 2024-12-12 08:02:00 -0800
  • 262e865a70
    ruby : Sync whisper.cpp and model download feature (#2617) KITAITI Makoto 2024-12-09 20:17:50 +0900
  • ed733e85a1
    scripts : update to new build system Georgi Gerganov 2024-12-09 11:30:16 +0200
  • 5980b1ae77
    devops : add cmake Georgi Gerganov 2024-12-08 23:09:26 +0200
  • 0415a66044
    devops : update make commands Georgi Gerganov 2024-12-08 23:07:29 +0200
  • 7d134e3737
    ggml : remove old files (skip) (#0) Georgi Gerganov 2024-12-08 23:04:26 +0200
  • 9df53b357e
    ggml : sync remnants (skip) (#0) Georgi Gerganov 2024-12-08 22:48:25 +0200
  • b2115b4d9b
    scripts : remove amx from sync Georgi Gerganov 2024-12-08 22:48:14 +0200
  • 0164427dd5 ci : disable freeBSD builds [no ci] Georgi Gerganov 2024-12-08 15:52:57 +0200
  • 627b11c78a readme : update build instructions Georgi Gerganov 2024-12-08 15:48:14 +0200
  • 472464453d ci : disable CUDA and Android builds Georgi Gerganov 2024-12-08 15:36:01 +0200
  • 11dddfbc9e ci : disable Obj-C build + fixes Georgi Gerganov 2024-12-08 13:35:35 +0200
  • 384e214cc7 make : shim cmake Georgi Gerganov 2024-12-06 15:34:53 +0200
  • f2c680f893 talk-llama : sync llama.cpp Georgi Gerganov 2024-12-05 14:30:33 +0200
  • fbe66da0e5 sync : ggml Georgi Gerganov 2024-12-05 14:29:18 +0200
  • a815940e0e ggml : add predefined list of CPU backend variants to build (llama/10626) Diego Devesa 2024-12-04 14:45:40 +0100
  • 904e307bce ggml-cpu : fix HWCAP2_I8MM value (llama/10646) Diego Devesa 2024-12-04 14:40:44 +0100
  • 491ec076b4 vulkan: Implement "fast divide" (mul+shift) for unary ops like copy (llama/10642) Jeff Bolz 2024-12-04 01:28:59 -0600
  • 966433fdf2 SYCL : Move to compile time oneMKL interface backend selection for NVIDIA backend (llama/10584) Nicolò Scipione 2024-12-04 02:29:20 +0100
  • 6f1ba9d82d Avoid using __fp16 on ARM with old nvcc (llama/10616) Frankie Robertson 2024-12-04 02:41:37 +0200
  • 015ecd0001 vulkan: optimize and reenable split_k (llama/10637) Jeff Bolz 2024-12-03 13:29:54 -0600
  • b7c64a4352 ggml: add GGML_SET Metal kernel + i32 CPU kernel (ggml/1037) PAB 2024-12-04 09:19:30 +0100
  • 7895d39508 ggml : add GGML_PAD_REFLECT_1D operation (ggml/1034) PAB 2024-12-03 20:20:04 +0100
  • 22616f00f9 files : remove make artifacts Georgi Gerganov 2024-12-03 20:29:32 +0200
  • 02c6fcbc2c common : fix compile warning Georgi Gerganov 2024-12-03 20:25:37 +0200
  • 3daeacad24 ggml : move AMX to the CPU backend (llama/10570) Diego Devesa 2024-12-03 20:22:12 +0200
  • 4d73962da4 metal : small-batch mat-mul kernels (llama/10581) Georgi Gerganov 2024-12-03 11:52:33 +0200
  • 068812650e SYCL: Fix and switch to GGML_LOG system instead of fprintf (llama/10579) Akarshan Biswas 2024-12-02 12:34:11 +0530
  • 4b7e059e15 ggml-cpu: replace AArch64 NEON assembly with intrinsics in ggml_gemv_q4_0_4x4_q8_0() (llama/10567) Adrien Gallouët 2024-11-30 18:13:18 +0100
  • 30e35d7271 vulkan: Dynamic subgroup size support for Q6_K mat_vec (llama/10536) Eve 2024-11-30 07:00:02 +0000
  • 3623bd58f2 ggml : fix I8MM Q4_1 scaling factor conversion (llama/10562) Georgi Gerganov 2024-11-29 16:25:39 +0200
  • cb847c20a7 ggml-cpu: fix typo in gemv/gemm iq4_nl_4_4 (llama/10580) Shupei Fan 2024-11-29 21:49:02 +0800
  • 964b154a2a sycl : offload of get_rows set to 0 (llama/10432) Alberto Cabrera Pérez 2024-11-29 12:38:45 +0000
  • d7c2a04bce sycl : Reroute permuted mul_mats through oneMKL (llama/10408) Alberto Cabrera Pérez 2024-11-29 09:49:43 +0000
  • 2bb4ca9cba CANN: RoPE operator optimization (llama/10563) Chenguang Li 2024-11-29 14:46:55 +0800