parrot-datasets/README.md

68 lines
1.4 KiB
Markdown

.. _parrot-datasets:
Parrot Datasets
===============
Datasets for Parrot Libre AI IDE.
.. _parrot-libre-datasets:
Libre Datasets
---------------
A list of libre datasets suitable for training a libre instruct model shall be listed.
Note other well known datasets, and their license suitability.
.. _parrot-dataset-licensing:
Parrot Dataset Licensing
-------------------------
The model may use data that is under a license that appears on one of these three lists as an acceptable free/open license:
* https://www.gnu.org/licenses/license-list.html
* https://opensource.org/licenses/
* https://commons.wikimedia.org/wiki/Commons:Licensing
.. _unsuitable-licenses:
Unsuitable Licenses
--------------------
Licenses that are not free, libre, open, even if they may claim to be "open source".
These are not "Wikipedia Commons compatible", for example:
* Creative Commons Non-commercial (NC).
* Proprietary licenses.
* Any "custom" license that hasn't been reviewed by the general community.
.. _datasets-table:
Datasets Table
--------------
Table of datasets. See also the spreadsheet `datasets.ods`.
.. image:: img/datasets-table.png
:alt: Table of Datasets
.. _datasets:
Datasets
--------
Datasets perhaps to be built and used.
* The Smack
Libre version of The Stack. See: `datasets/the-smack`.
.. _license:
License
-------
Creative Commons Attribution-ShareAlike 4.0 International
Copyright © 2023, Jeff Moe.