-
LLaMA-Efficient-Tuning Public
Forked from hiyouga/LLaMA-Factory
Easy-to-use fine-tuning framework using PEFT (PT+SFT+RLHF with QLoRA) (LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, ChatGLM2)
Python Apache License 2.0 Updated Sep 14, 2024
-
llama3 Public
Forked from meta-llama/llama3
The official Meta Llama 3 GitHub site
Python Other Updated Sep 12, 2024
-
MiniCPM-V Public
Forked from OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
Python Apache License 2.0 Updated Sep 10, 2024
-
InternVL Public
Forked from OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model approaching GPT-4o performance
Python MIT License Updated Aug 30, 2024
-
quiet-star Public
Forked from ezelikman/quiet-star
Code for Quiet-STaR
Python Apache License 2.0 Updated Aug 21, 2024
-
FlagEmbedding Public
Forked from FlagOpen/FlagEmbedding
BGE embedding
Python MIT License Updated Aug 9, 2024
-
cosmopedia Public
Forked from huggingface/cosmopedia
Synthetic data: a good example
Python Apache License 2.0 Updated Jul 8, 2024
-
infini-attention Public
Forked from vmarinowski/infini-attention
An unofficial pytorch implementation of 'Efficient Infinite Context Transformers with Infini-attention'
Python Updated Jun 3, 2024
-
st-moe-pytorch Public
Forked from lucidrains/st-moe-pytorch
Implementation of ST-MoE, the latest incarnation of MoE after years of research at Brain, in Pytorch
Python MIT License Updated May 28, 2024
-
mergekit Public
Forked from arcee-ai/mergekit
Tools for merging pretrained large language models.
Python GNU Lesser General Public License v3.0 Updated May 27, 2024
-
MathCoder Public
Forked from mathllm/MathCoder
Family of LLMs for mathematical reasoning.
Python Apache License 2.0 Updated May 22, 2024
-
llama-mistral Public
Forked from dzhulgakov/llama-mistral
Fork code: an MoE code structure implemented on top of llama
Python Other Updated May 20, 2024
-
LLMTest_NeedleInAHaystack Public
Forked from gkamradt/LLMTest_NeedleInAHaystack
Doing simple retrieval from LLM models at various context lengths to measure accuracy
Jupyter Notebook Other Updated May 10, 2024
-
Pai-Megatron-Patch Public
Forked from alibaba/Pai-Megatron-Patch
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
Python Apache License 2.0 Updated Apr 22, 2024
-
Megatron-LM Public
Forked from NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Python Other Updated Apr 22, 2024
-
Awesome-Chinese-LLM Public
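Forked from HqWu-HITCS/Awesome-Chinese-LLM
A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are cheaper to train, including base models, vertical-domain fine-tuning and applications, datasets, and tutorials.
Updated Feb 23, 2024
-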
opencompass Public
Forked from open-compass/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.
Python Apache License 2.0 Updated Feb 22, 2024
-
awesome-pretrained-chinese-nlp-models Public
Forked from lonePatient/awesome-pretrained-chinese-nlp-models
Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pretrained models, large models, multimodal models, and large language models
Python MIT License Updated Feb 19, 2024
-
Qwen Public
Forked from QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Python Apache License 2.0 Updated Feb 17, 2024
-
MiniCPM Public
Forked from OpenBMB/MiniCPM
MiniCPM-2B: An end-side LLM that outperforms Llama2-13B.
Python Apache License 2.0 Updated Feb 5, 2024
-
UltraEval Public
Forked from OpenBMB/UltraEval
An open source framework for evaluating foundation models.
Python Apache License 2.0 Updated Feb 1, 2024
-
sentencepiece Public
Forked from google/sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
C++ Apache License 2.0 Updated Jan 29, 2024
-
BCEmbedding Public
Forked from netease-youdao/BCEmbedding
Netease Youdao's open-source embedding and reranker models for RAG products.
Python Apache License 2.0 Updated Jan 29, 2024
-
Chinese-LLaMA-Alpaca Public
Forked from ymcui/Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment
Python Apache License 2.0 Updated Jan 19, 2024
-
pytorch_pretrained_BERT_lm Public
Forked from ninjawork007/pytorch_pretrained_BERT
Jupyter Notebook Apache License 2.0 Updated Jan 15, 2024
-
AlignBench Public
Forked from THUDM/AlignBench
A multi-dimensional Chinese alignment evaluation benchmark | Benchmarking Chinese Alignment of LLMs
Python Updated Dec 29, 2023
-
llama-moe Public
Forked from pjlab-sys4nlp/llama-moe
⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training
Python Apache License 2.0 Updated Dec 25, 2023
-
WizardLM Public
Forked from nlpxucan/WizardLM
LLMs built upon Evol Instruct: WizardLM, WizardCoder, WizardMath
Python Updated Dec 19, 2023
-
magicoder Public
Forked from ise-uiuc/magicoder
Magicoder: Source Code Is All You Need
Python MIT License Updated Dec 19, 2023