E2M converts various file types (doc, docx, epub, html, htm, url, pdf, ppt, pptx, mp3, m4a) into Markdown. It’s easy to install, with dedicated parsers and converters, supporting custom configs. E2…

Jupyter Notebook 221 11 Updated Sep 8, 2024

pdfminer / pdfminer.six

Community maintained fork of pdfminer - we fathom PDF

Python 5,824 921 Updated Aug 2, 2024

opendatalab / MinerU

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具，支持PDF/网页/多格式电子书提取。

Python 11,457 859 Updated Sep 20, 2024

zyocum / pdf2md

Convert PDF to Markdown via OpenAI multi-modal text/vision model.

Python 13 2 Updated Aug 6, 2024

defnecirci / MatSciTableExtract

Python 2 Updated Mar 7, 2024

hatanaka-lab / CopDDB

Jupyter Notebook 3 1 Updated Aug 19, 2024

ZHM-Sesame / polymert

Jupyter Notebook 1 Updated May 7, 2024

yiwenzheng98 / VitrimerVAE

Inverse Design of Vitrimeric Polymers by Molecular Dynamics and Generative Modeling: https://arxiv.org/abs/2312.03690

Python 3 Updated Jan 3, 2024

zach-zhiling-zheng / ChatGPT_Chemistry_Assistant

ChatGPT Chemistry Assistant

Jupyter Notebook 70 9 Updated Aug 7, 2023

hiyouga / LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Python 31,144 3,837 Updated Sep 19, 2024

titipata / scipdf_parser

Python PDF parser for scientific publications: content and figures

Python 328 53 Updated Mar 21, 2024

liugangcode / Graph-DiT

The code for "Graph Diffusion Transformer for Multi-Conditional Molecular Generation"

Python 8 Updated May 25, 2024

liugangcode / GREA

[KDD'22] Source codes of "Graph Rationalization with Environment-based Augmentations"

Python 33 5 Updated Jun 16, 2024

ck-unifr / pdf_parsing

PDF解析（文字，章节，表格，图片，参考），基于大模型(ChatGLM2-6B, RWKV)+langchain+streamlit的PDF问答，摘要，信息抽取

Python 144 27 Updated Oct 17, 2023

lbnlp / NERRE

Code and data for the publication "Structured information extraction from scientific text with large language models" by Dagdelen & Dunn et al.

Python 59 10 Updated Dec 22, 2023

ZhouhaoJiang / PdfReader-LangChian-LLM

ChatPDF Implement PDF parsing based on LangChain and LLM language model(ChatGLM,GPT...) | ChatPDF 基于LangChain和LLM语言模型实现PDF解析阅读

Python 40 5 Updated Jun 5, 2024

ghazalkhalighinejad / PNCExtract

The official repository for "Extracting Polymer Nanocomposite Samples from Full-Length Documents"

Python 4 Updated Jun 4, 2024

olsenlabmit / BCDB

The Block Copolymer Phase Behavior Database (BCDB)

Python 14 3 Updated Mar 1, 2024

CMMAi / Mechanical_Properties_Prediction_of_Random_Copolymers

Jupyter Notebook 1 Updated Mar 6, 2024

Ramprasad-Group / polymer_knowledge_extraction

Jupyter Notebook 23 6 Updated Sep 3, 2024

lucidrains / denoising-diffusion-pytorch

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Python 7,955 995 Updated Sep 5, 2024

TRI-AMDD / PolyGen

Python 5 2 Updated Jul 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Haifan zhf3564859793

Highlights

Block or report zhf3564859793

Lists (2)

llm

polymer

Stars

Ucas-HaoranWei / GOT-OCR2.0

vllm-project / vllm

Dicklesworthstone / llm_aided_ocr

illuin-tech / colpali

jamesymwang / Kp-predict_MACCS-and-Molecular-Transformer

Filimoa / open-parse

atlanhq / camelot

QuivrHQ / MegaParse

wisupai / e2m