Mirror of https://github.com/ggerganov/llama.cpp.git, synced 2025-03-06 20:48:53 +01:00
* backend : add eval callback (ggml-ci)
* backend : group nodes in a single compute when the user doesn't need them
* backend : clean up the implementation (ggml-ci)
* simple : do not perform tensor data copy if not needed
* simple : fix
* imatrix : offload to GPU support
* imatrix : fix ggml_mul_mat_id handling (ggml-ci)
* ci : add imatrix test (ggml-ci)
* ci : rearrange output (ggml-ci)
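The first group of commits adds an eval callback to the ggml backend scheduler, letting user code observe tensors as graph nodes are computed while the scheduler keeps fusing the nodes the user does not ask about; the imatrix GPU offload appears to build on this hook. Below is a minimal sketch of such a callback, assuming the `ggml_backend_sched_eval_callback` / `ggml_backend_sched_set_eval_callback` API declared in `ggml-backend.h`; exact names and signatures may differ between ggml revisions.

```cpp
// Sketch of a ggml backend-scheduler eval callback (assumes ggml-backend.h).
#include "ggml.h"
#include "ggml-backend.h"
#include <cstdio>

// The scheduler calls this twice per node: first with ask == true
// ("do you want to observe this tensor?"), then, if we returned true,
// with ask == false once the tensor data has been computed.
// Returning false on the second call aborts the graph compute.
static bool observe_cb(struct ggml_tensor * t, bool ask, void * user_data) {
    (void) user_data;
    if (ask) {
        // only observe matrix multiplications; everything else stays fused
        return t->op == GGML_OP_MUL_MAT || t->op == GGML_OP_MUL_MAT_ID;
    }
    printf("computed %-16s op=%s\n", t->name, ggml_op_name(t->op));
    return true; // continue the computation
}

// After creating the scheduler, the callback would be registered roughly as:
//   ggml_backend_sched_set_eval_callback(sched, observe_cb, nullptr);
```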
Files:

* CMakeLists.txt
* imatrix.cpp