  • Hong Kong University of Science and Technology
  • Hong Kong


Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 3,847 297 Updated Sep 19, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 27,119 3,981 Updated Sep 22, 2024

Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.

Python 2,031 128 Updated Aug 21, 2024

The code used to train and run inference with the ColPali architecture.

Python 527 54 Updated Sep 20, 2024

Improved file parsing for LLMs

Python 2,382 90 Updated Sep 17, 2024

Camelot: PDF Table Extraction for Humans

Python 3,643 355 Updated Jan 5, 2023

File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.

Python 503 36 Updated Aug 20, 2024

E2M converts various file types (doc, docx, epub, html, htm, url, pdf, ppt, pptx, mp3, m4a) into Markdown. It’s easy to install, with dedicated parsers and converters, supporting custom configs. E2…

Jupyter Notebook 221 11 Updated Sep 8, 2024

Community maintained fork of pdfminer - we fathom PDF

Python 5,824 921 Updated Aug 2, 2024

A one-stop, open-source, high-quality data extraction tool that supports PDF, webpage, and multi-format e-book extraction.

Python 11,457 859 Updated Sep 20, 2024

Convert PDF to Markdown via OpenAI multi-modal text/vision model.

Python 13 2 Updated Aug 6, 2024
Python 2 Updated Mar 7, 2024
Jupyter Notebook 3 1 Updated Aug 19, 2024
Jupyter Notebook 1 Updated May 7, 2024

Inverse Design of Vitrimeric Polymers by Molecular Dynamics and Generative Modeling: https://arxiv.org/abs/2312.03690

Python 3 Updated Jan 3, 2024

ChatGPT Chemistry Assistant

Jupyter Notebook 70 9 Updated Aug 7, 2023

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Python 31,144 3,837 Updated Sep 19, 2024

Python PDF parser for scientific publications: content and figures

Python 328 53 Updated Mar 21, 2024

The code for "Graph Diffusion Transformer for Multi-Conditional Molecular Generation"

Python 8 Updated May 25, 2024

[KDD'22] Source codes of "Graph Rationalization with Environment-based Augmentations"

Python 33 5 Updated Jun 16, 2024

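PDF parsing (text, sections, tables, images, references); PDF question answering, summarization, and information extraction based on large models (ChatGLM2-6B, RWKV) + langchain + streamlit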

Python 144 27 Updated Oct 17, 2023

Code and data for the publication "Structured information extraction from scientific text with large language models" by Dagdelen & Dunn et al.

Python 59 10 Updated Dec 22, 2023

ChatPDF: PDF parsing and reading based on LangChain and LLMs (ChatGLM, GPT...)

Python 40 5 Updated Jun 5, 2024

The official repository for "Extracting Polymer Nanocomposite Samples from Full-Length Documents"

Python 4 Updated Jun 4, 2024

The Block Copolymer Phase Behavior Database (BCDB)

Python 14 3 Updated Mar 1, 2024

Implementation of Denoising Diffusion Probabilistic Model in PyTorch

Python 7,955 995 Updated Sep 5, 2024
Python 5 2 Updated Jul 19, 2024