Libre dataset scripts for Parrot. https://parrot.codes/
Jeff Moe cbe6149161 v0.0.9 2023-11-25 11:53:00 -07:00
docs v0.0.9 2023-11-25 11:53:00 -07:00
img dataset table img 2023-11-23 11:56:44 -07:00
src/the_stack re-arrange directory structure 2023-11-25 11:43:15 -07:00
tests re-arrange directory structure 2023-11-25 11:43:15 -07:00
.gitattributes Spreadsheets with LFS 2023-11-17 08:45:32 -07:00
.gitignore re-arrange directory structure 2023-11-25 11:43:15 -07:00
BUILD.md Version revving, noted, fixed 2023-11-25 11:51:54 -07:00
CHANGELOG.txt v0.0.9 2023-11-25 11:53:00 -07:00
LICENSE-CC Creative Commons Attribution-ShareAlike 4.0 International 2023-11-16 11:18:44 -07:00
README.md The Smack, noted 2023-11-24 17:30:36 -07:00
datasets.ods More datasets in table... 2023-11-17 22:56:39 -07:00
pyproject.toml v0.0.9 2023-11-25 11:53:00 -07:00
requirements.txt re-arrange directory structure 2023-11-25 11:43:15 -07:00

README.md

Parrot Datasets

Datasets for Parrot Libre AI IDE.

https://parrot.codes

Libre Datasets

This repository lists libre datasets suitable for training a libre instruct model.

It also notes other well-known datasets and whether their licenses are suitable.

Parrot Dataset Licensing

The model may use data that is under a license that appears on one of these three lists as an acceptable free/open license:

Unsuitable Licenses

Licenses that are not free, libre, or open are unsuitable, even if they claim to be "open source".

These are not "Wikimedia Commons compatible". For example:

  • Creative Commons Non-commercial (NC).
  • Proprietary licenses.
  • Any "custom" license that hasn't been reviewed by the general community.
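
The three categories above could be checked mechanically when triaging candidate datasets. The following is a minimal sketch, not part of the Parrot scripts: it assumes each dataset's license is recorded as an SPDX-style identifier (such as `CC-BY-NC-4.0`), and the `Dataset` class, the `community_reviewed` flag, the `PROPRIETARY_MARKERS` set, and the `unsuitable_reason` function are all hypothetical names introduced for illustration.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical markers for proprietary terms; not taken from the Parrot repository.
PROPRIETARY_MARKERS = {"proprietary", "all-rights-reserved"}


@dataclass
class Dataset:
    name: str
    license_id: str                  # assumed SPDX-style identifier, e.g. "CC-BY-SA-4.0"
    community_reviewed: bool = True  # False for a "custom" license nobody has vetted


def unsuitable_reason(ds: Dataset) -> Optional[str]:
    """Return why the license is unsuitable, or None if none of the three rules match."""
    parts = ds.license_id.upper().split("-")
    # Rule 1: any Creative Commons license carrying the NC element.
    if parts[0] == "CC" and "NC" in parts:
        return "Creative Commons Non-commercial (NC)"
    # Rule 2: licenses explicitly marked as proprietary.
    if ds.license_id.lower() in PROPRIETARY_MARKERS:
        return "proprietary license"
    # Rule 3: custom licenses that have had no community review.
    if not ds.community_reviewed:
        return "custom license not reviewed by the community"
    return None
```

A result of `None` only means that none of these three rules matched. Under the licensing policy above, the license would still need to appear on one of the accepted lists before the dataset is used.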

Datasets Table

Table of datasets. See also the spreadsheet datasets.ods.

Table of Datasets

Datasets

Datasets that may be built and used.

The Smack

Libre version of The Stack. See: datasets/the-smack.

License

Creative Commons Attribution-ShareAlike 4.0 International

Copyright © 2023, Jeff Moe.