-
Intel
- Shanghai
-
ollama Public
Forked from felipeagc/ollamaGet up and running with Llama 2, Mistral, and other large language models locally.
-
auto-round Public
Forked from intel/auto-roundSOTA Weight-only Quantization Algorithm for LLMs. This is official implementation of "Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs"
Python Apache License 2.0 UpdatedJun 11, 2024 -
data-parallel-CPP Public
Forked from Apress/data-parallel-CPPSource code for 'Data Parallel C++: Mastering DPC++ for Programming of Heterogeneous Systems using C++ and SYCL' by James Reinders, Ben Ashbaugh, James Brodman, Michael Kinsner, John Pennycook, Xin…
CMake Other UpdatedFeb 20, 2024 -
-
intel-extension-for-pytorch Public
Forked from intel/intel-extension-for-pytorchA Python package for extending the official PyTorch that can easily obtain performance on Intel platform
Python Apache License 2.0 UpdatedOct 23, 2023 -
bitsandbytes Public
Forked from bitsandbytes-foundation/bitsandbytes8-bit CUDA functions for PyTorch
Python MIT License UpdatedJul 26, 2023 -
oneDNN Public
Forked from oneapi-src/oneDNNoneAPI Deep Neural Network Library (oneDNN)
C++ Apache License 2.0 UpdatedJun 19, 2023