Stars
Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/
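A visual no-code web crawler/spider. 易采集: a visual tool for browser automation testing, data collection, and crawling that lets you design and run crawler tasks graphically without writing code. Alias: ServiceWrapper, an intelligent service-wrapping system for web applications.
Fast Python port of arc90's readability tool, updated to match latest readability.js!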
Search for words, documents, images, videos, news, maps and text translation using the DuckDuckGo.com search engine. Downloading files and images to a local hard drive.
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.
Perplexica is an AI-powered search engine. It is an open-source alternative to Perplexity AI.
🔥 🔥 🔥 Open Source JIRA, Linear, Monday, and Asana Alternative. Plane helps you track your issues, epics, and product roadmaps in the simplest way possible.
Official Python client for Elasticsearch
Elasticsearch integration into LangChain
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra-lightweight OCR system, supports recognition of 80+ languages, provides data annotation and synthesis tools, supports training and…
Desktop app for prototyping and debugging LangGraph applications locally.
A one-stop, open-source, high-quality data extraction tool that supports PDF, webpage, and multi-format e-book extraction.
Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models
Ongoing research training transformer models at scale
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
A Comprehensive Toolkit for High-Quality PDF Content Extraction
Multilingual Voice Understanding Model
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Awesome OCR toolkits for multiple programming languages, based on ONNXRuntime, OpenVINO and PaddlePaddle. (PaddleOCR models converted to run inference with ONNXRuntime, which is very fast.)
A cloud-native vector database, storage for next generation AI applications
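Similarities: a toolkit for similarity calculation and semantic search. Supports text-to-text, text-to-image, and image-to-image search over hundreds of millions of records; written in Python 3 and ready to use out of the box.
GoMate: RAG framework with reliable input, trusted output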
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
Agent framework and applications built upon Qwen2.x, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
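
The first entry in this list describes converting a URL to LLM-friendly input "with a simple prefix". A minimal sketch of what that prefixing could look like is below. The helper name `to_reader_url` is hypothetical and not part of any listed project; the only detail taken from the description is the prefix `https://r.jina.ai/`. The sketch only builds the prefixed URL and makes no network request.

```python
# Prefix taken from the list entry above; everything else is illustrative.
READER_PREFIX = "https://r.jina.ai/"


def to_reader_url(url: str) -> str:
    """Return the URL with the reader prefix prepended (hypothetical helper).

    Assumption: the service accepts the full original URL appended
    directly after the prefix, as the description suggests.
    """
    url = url.strip()
    if not url:
        raise ValueError("url must not be empty")
    # Avoid double-prefixing a URL that was already converted.
    if url.startswith(READER_PREFIX):
        return url
    return READER_PREFIX + url


if __name__ == "__main__":
    print(to_reader_url("https://example.com/article"))
```

Fetching the resulting URL with any HTTP client would be the next step; it is left out here so the sketch stays free of network access.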