Stars
This is a continuously updated handbook for readers to easily track the latest NL2SQL techniques in the literature and provide practical guidance for researchers and practitioners.
Official repository for the paper “The Dawn of Natural Language to SQL: Are We Fully Ready?” (VLDB'24)
Official repository for the paper “HAIChart: Human and AI Paired Visualization System” (VLDB'24)
Automatic Generation of Visualizations and Infographics using Large Language Models
《开源大模型食用指南》基于Linux环境快速部署开源大模型,更适合中国宝宝的部署教程
KaggleBench是一个用于评价可视化推荐效果的公开benchmark。其数据来源是数据分析网站Kaggle上的数据集及其对应的数据可视化结果。Benchmark总共包含18个数据集,每个数据集对应一个有序的可视化结果。
benchmark dataset for visualization recommendation
Code and data of KDD'21 paper "Table2Charts: Recommending Charts by Learning Shared Table Representations"
⏰ Collaboratively track deadlines of conferences recommended by CCF (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~
Data2Vis: Automatic Generation of Data Visualizations Using Sequence to Sequence Recurrent Neural Networks
PICARD - Parsing Incrementally for Constrained Auto-Regressive Decoding from Language Models. PICARD is a ServiceNow Research project that was started at Element AI.
Official repository for the paper “Learned Data-aware Image Representations of Line Charts for Similarity Search” (SIGMOD'23)
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…
This repo includes ChatGPT prompt curation to use ChatGPT better.
Coresets over Multiple Tables for Feature-rich and Data-efficient Machine Learning
A pytorch implementation for FACE: A Normalizing Flow based Cardinality Estimator
A self-tuning anomaly detection system to address the challenges of method selection and hyper-parameter tuning without access to a sufficient number of human supplied, ground truth labels.