Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 27,985 3,170 Updated Sep 30, 2024

Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.

9,094 1,489 Updated Aug 31, 2023

Retrieval and Retrieval-augmented LLMs

Python 6,996 511 Updated Sep 26, 2024

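🤖 Free Search with AI, 💡 Open-Source Perplexity, 📍 Support Ollama/SearXNG, Support Docker deployment. Let large AI models and search engines answer your questions, with support for local models (Ollama), the SearXNG metasearch engine, and one-click Docker deployment.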

TypeScript 489 107 Updated Sep 28, 2024

Introductory tutorial on recommender systems; read online at https://datawhalechina.github.io/fun-rec/

Jupyter Notebook 4,219 805 Updated Jun 11, 2024

An AI-powered search engine with a generative UI

TypeScript 6,025 1,497 Updated Sep 30, 2024

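Chinese named entity recognition (NER). Includes the latest Chinese NER papers, Chinese entity recognition tools, and datasets, as well as Chinese pretrained models, word embeddings, and NER surveys.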

588 46 Updated May 21, 2024

Video+code lecture on building nanoGPT from scratch

Python 3,434 473 Updated Aug 13, 2024

Hands-on TensorFlow exercises, covering reinforcement learning, recommender systems, NLP, and more

Python 6,689 3,276 Updated Sep 24, 2023

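A collection of industry practice articles on search, recommendation, advertising, user growth, and more (sources: Zhihu, Datafuntalk, technical WeChat official accounts)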

Python 2,251 287 Updated Sep 30, 2024

Computational advertising / recommender systems / machine learning / click-through rate (CTR) / conversion rate (CVR) estimation / CTR prediction

1,941 437 Updated Dec 17, 2019

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

Python 6,823 772 Updated Aug 24, 2023

Multilingual/multidomain question generation datasets, models, and python library for question generation.

Python 315 30 Updated Aug 20, 2024

Generate question/answer training pairs out of raw text.

Python 198 28 Updated Dec 10, 2023

Collection of data science projects in Python

Jupyter Notebook 1,664 431 Updated Nov 11, 2023

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,071 838 Updated Jul 1, 2024

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 13,205 1,058 Updated May 23, 2024

Curated list of data science interview questions and answers

3,216 739 Updated Sep 29, 2024

Data science interview questions and answers

HTML 8,806 1,958 Updated Sep 5, 2024

Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines

Python 6,743 662 Updated Sep 30, 2024

An LLM-powered advanced RAG pipeline built from scratch

Python 789 49 Updated Jan 26, 2024

A beautiful resume/cover letter LaTeX template pair that is extraordinarily easy to use.

TeX 332 140 Updated Feb 21, 2024

A framework for large scale recommendation algorithms.

Python 1,736 316 Updated Sep 27, 2024
Python 1,583 137 Updated Sep 27, 2024

Start building LLM-empowered multi-agent applications in an easier way.

Python 4,926 300 Updated Sep 30, 2024

CTR prediction model based on spark(LR, GBDT, DNN)

Scala 905 260 Updated Mar 6, 2020