Retire the ggml_mul_mat() branch for transposed src0 (ggerganov#500)
* Retire the ggml_mul_mat() branch for transposed src0

- A transposed src0 can always be made contiguous with ggml_cpy()
- The code is now simpler
- The results are deterministic with respect to the number of threads
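
A minimal sketch of the idea behind the bullets above, written in plain C rather than against the ggml API. The `view2d` struct and the functions `transposed_view`, `make_contiguous`, and `matmul_rowmajor` are hypothetical names for illustration only and are not taken from this commit. It assumes a transposed operand is just a strided view: once that view is copied into a dense buffer, a single row-major kernel is enough.

```c
#include <stddef.h>

/* Hypothetical strided 2-D view (not a ggml type): element (r, c) lives at
 * data[r * row_stride + c * col_stride], strides counted in elements. */
typedef struct {
    const float *data;
    size_t rows, cols;
    size_t row_stride, col_stride;
} view2d;

/* View a dense row-major (rows x cols) matrix as its transpose
 * by swapping the dimensions and the strides. No data is moved. */
view2d transposed_view(const float *data, size_t rows, size_t cols) {
    view2d v = { data, cols, rows, 1, cols };
    return v;
}

/* Copy any strided view into a dense row-major buffer of rows * cols floats. */
void make_contiguous(view2d v, float *dst) {
    for (size_t r = 0; r < v.rows; r++) {
        for (size_t c = 0; c < v.cols; c++) {
            dst[r * v.cols + c] = v.data[r * v.row_stride + c * v.col_stride];
        }
    }
}

/* The only kernel that remains: out (m x n) = a (m x k) * b (k x n),
 * all dense row-major. Each output element is accumulated by one loop
 * in a fixed order, so the sum does not depend on how the rows of the
 * output would be divided among worker threads. */
void matmul_rowmajor(const float *a, const float *b, float *out,
                     size_t m, size_t k, size_t n) {
    for (size_t i = 0; i < m; i++) {
        for (size_t j = 0; j < n; j++) {
            float sum = 0.0f;
            for (size_t p = 0; p < k; p++) {
                sum += a[i * k + p] * b[p * n + j];
            }
            out[i * n + j] = sum;
        }
    }
}
```

In this sketch the caller pays for one extra copy, and in exchange the multiply routine needs no stride-specific code path.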

* SIMD-ify dequantize_row_q4_0() for ARM_NEON (ggerganov#502)

* Attempt to SIMD-ify dequantize_row_q4_0() for ARM_NEON

* Fix dequantization - forgot to interleave the quants
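
For reference, a scalar sketch of what dequantizing a q4_0 row involves, and why interleaving matters. The block layout here (32 values per block, one float scale, two 4-bit quants per byte with the low nibble first, offset by 8) is an assumption for illustration and is not quoted from this commit. The `_sketch` names are hypothetical.

```c
#include <stdint.h>

#define QK 32 /* assumed number of values per block */

/* Assumed block layout, for illustration only. */
typedef struct {
    float   d;          /* per-block scale */
    uint8_t qs[QK / 2]; /* 4-bit quants, two per byte */
} block_q4_0_sketch;

/* Expand k quantized values (k must be a multiple of QK) into floats. */
void dequantize_row_q4_0_sketch(const block_q4_0_sketch *x, float *y, int k) {
    const int nb = k / QK;
    for (int i = 0; i < nb; i++) {
        const float d = x[i].d;
        for (int l = 0; l < QK; l += 2) {
            const uint8_t b = x[i].qs[l / 2];
            const int lo = (b & 0x0F) - 8; /* low nibble  -> even output slot */
            const int hi = (b >> 4) - 8;   /* high nibble -> odd output slot  */
            y[i * QK + l + 0] = (float)lo * d;
            y[i * QK + l + 1] = (float)hi * d;
        }
    }
}
```

Under this layout, a vectorized version that unpacks all low nibbles into one register and all high nibbles into another would have to zip the two back together before storing, otherwise the outputs land in the wrong order. That is presumably the kind of interleaving the fix above refers to.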
ggerganov committed Mar 25, 2023
1 parent 502a400 commit ecbe466
Showing 1 changed file with 237 additions and 716 deletions.