Libre dataset scripts for Parrot.
https://parrot.codes/
Parrot Datasets
Datasets for Parrot Libre AI IDE.
Libre Datasets
This repository lists libre datasets suitable for training a libre instruct model.
It also notes other well-known datasets and their license suitability.
Parrot Dataset Licensing
The model may use data that is under a license that appears on one of these three lists as an acceptable free/open license:
Unsuitable Licenses
Licenses that are not free, libre, or open are unsuitable, even if they claim to be "open source".
These licenses are not "Wikimedia Commons compatible". For example:
- Creative Commons Non-commercial (NC).
- Proprietary licenses.
- Any "custom" license that hasn't been reviewed by the general community.
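The three categories above could be screened for automatically when cataloguing datasets. The following is a minimal sketch, not part of this repository: it assumes each dataset record carries an SPDX-style license identifier, and the function names, the `PROPRIETARY` label, and the record layout are all hypothetical. Passing this check does not make a license suitable; it must still appear on one of the accepted lists.

```python
# Hypothetical sketch: none of these names come from the Parrot repository.
# Assumption: each dataset's license is recorded as an SPDX-style identifier.


def is_unsuitable(license_id: str) -> bool:
    """Return True if the license matches one of the unsuitable categories."""
    normalized = license_id.strip().upper()

    # Creative Commons Non-commercial variants carry an "NC" element,
    # for example CC-BY-NC-4.0 or CC-BY-NC-SA-4.0.
    if normalized.startswith("CC-") and "NC" in normalized.split("-"):
        return True

    # Assumption: proprietary data is labelled with the placeholder "PROPRIETARY".
    if normalized == "PROPRIETARY":
        return True

    # SPDX reserves the "LicenseRef-" prefix for custom licenses that are
    # not on the SPDX License List.
    if normalized.startswith("LICENSEREF-"):
        return True

    return False


def filter_datasets(datasets: list[dict]) -> list[dict]:
    """Drop records whose license is clearly unsuitable."""
    return [d for d in datasets if not is_unsuitable(d["license"])]
```

A record such as `{"name": "example", "license": "CC-BY-NC-4.0"}` would be dropped by `filter_datasets`, while one licensed `CC-BY-SA-4.0` would be kept for manual review against the accepted lists.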
Datasets Table
Table of datasets. See also the spreadsheet datasets.ods.
Datasets
Datasets that may be built and used.
The Smack
Libre version of The Stack.
See: datasets/the-smack.
License
Creative Commons Attribution-ShareAlike 4.0 International
Copyright © 2023, Jeff Moe.