awesome-public-datasets

A topic-centric list of HQ open datasets. PR ☛☛☛

37.4 k

OpenRefine

OpenRefine is a free, open source power tool for working with messy data and improving it

Java7.54 k

datasets

🤗 Fast, efficient, open-access datasets and evaluation metrics for Natural Language Processing and more in PyTorch, TensorFlow, NumPy and Pandas

Python4.4 k

doccano

Open source text annotation tool for machine learning practitioner.

Python3.72 k

pipedream

Serverless integration and compute platform. Free for developers.

JavaScript1.19 k

ChineseGLUE

Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models,corpus and leaderboard

Python841

2019-wuhan-coronavirus-data

2019 Wuhan Coronavirus data (COVID-19 / 2019-nCoV)

PHP614

audino

Open source audio annotation tool for humans™

JavaScript605

CLUEDatasetSearch

搜索所有中文NLP数据集,附常用英文NLP数据集

Python447

Ⓒ2020 GitHub Index - 🔨Under Construction
📧 admin@githubs.cn - Forum