Hong Kong University of Science and Technology
- Hong Kong
Stars
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
A high-throughput and memory-efficient inference and serving engine for LLMs
Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.
The code used to train and run inference with the ColPali architecture.
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
E2M converts various file types (doc, docx, epub, html, htm, url, pdf, ppt, pptx, mp3, m4a) into Markdown. It’s easy to install, with dedicated parsers and converters, supporting custom configs. E2…
Community maintained fork of pdfminer - we fathom PDF
A one-stop, open-source, high-quality data extraction tool that supports PDF, webpage, and multi-format e-book extraction.
Convert PDF to Markdown via OpenAI multi-modal text/vision model.
Inverse Design of Vitrimeric Polymers by Molecular Dynamics and Generative Modeling: https://arxiv.org/abs/2312.03690
ChatGPT Chemistry Assistant
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
Python PDF parser for scientific publications: content and figures
The code for "Graph Diffusion Transformer for Multi-Conditional Molecular Generation"
[KDD'22] Source codes of "Graph Rationalization with Environment-based Augmentations"
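PDF parsing (text, sections, tables, images, references); PDF question answering, summarization, and information extraction based on large models (ChatGLM2-6B, RWKV) + langchain + streamlit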
Code and data for the publication "Structured information extraction from scientific text with large language models" by Dagdelen & Dunn et al.
ChatPDF: PDF parsing and reading based on LangChain and LLMs (ChatGLM, GPT...)
The official repository for "Extracting Polymer Nanocomposite Samples from Full-Length Documents"
The Block Copolymer Phase Behavior Database (BCDB)
Implementation of Denoising Diffusion Probabilistic Model in PyTorch