whisper.cpp

History

mahorozte 4af9626702 CUDA: remove unnecessary warp reduce in FA (ggml/1032) * kqmax_new_j in every thread within warp is same after operate at line 199,this reduce can be omit * same problem in vec32 --------- Co-authored-by: ZhaoXiaoYu <zhao.xiaoyu@zte.com.cn>		2024-12-08 20:14:35 +02:00
..
include	ggml: new optimization interface (ggml/988)	2024-11-20 21:00:08 +02:00
src	CUDA: remove unnecessary warp reduce in FA (ggml/1032)	2024-12-08 20:14:35 +02:00
.gitignore	whisper : reorganize source code + improve CMake (#2256 )	2024-06-26 19:34:09 +03:00
CMakeLists.txt	ggml : sync resolve (skip) (#0 )	2024-11-20 21:00:08 +02:00