parrot-datasets/README.md

68 lines
1.4 KiB
Markdown
Raw Permalink Normal View History

2023-11-25 12:43:16 -07:00
.. _parrot-datasets:
Parrot Datasets
===============
2023-11-25 12:43:16 -07:00
Datasets for Parrot Libre AI IDE.
2023-11-25 12:43:16 -07:00
.. _parrot-libre-datasets:
2023-11-16 13:43:52 -07:00
2023-11-25 12:43:16 -07:00
Libre Datasets
---------------
A list of libre datasets suitable for training a libre instruct model shall be listed.
2023-11-16 13:43:52 -07:00
Note other well known datasets, and their license suitability.
2023-11-25 12:43:16 -07:00
.. _parrot-dataset-licensing:
2023-11-16 13:43:52 -07:00
2023-11-25 12:43:16 -07:00
Parrot Dataset Licensing
-------------------------
2023-11-16 13:43:52 -07:00
2023-11-25 12:43:16 -07:00
The model may use data that is under a license that appears on one of these three lists as an acceptable free/open license:
2023-11-16 13:43:52 -07:00
2023-11-25 12:43:16 -07:00
* https://www.gnu.org/licenses/license-list.html
2023-11-16 13:43:52 -07:00
* https://opensource.org/licenses/
* https://commons.wikimedia.org/wiki/Commons:Licensing
2023-11-25 12:43:16 -07:00
.. _unsuitable-licenses:
2023-11-16 13:43:52 -07:00
2023-11-25 12:43:16 -07:00
Unsuitable Licenses
--------------------
2023-11-16 14:39:54 -07:00
2023-11-25 12:43:16 -07:00
Licenses that are not free, libre, open, even if they may claim to be "open source".
2023-11-16 14:39:54 -07:00
These are not "Wikipedia Commons compatible", for example:
* Creative Commons Non-commercial (NC).
* Proprietary licenses.
* Any "custom" license that hasn't been reviewed by the general community.
2023-11-25 12:43:16 -07:00
.. _datasets-table:
Datasets Table
--------------
2023-11-16 14:39:54 -07:00
2023-11-23 11:56:55 -07:00
Table of datasets. See also the spreadsheet `datasets.ods`.
2023-11-16 20:50:26 -07:00
2023-11-25 12:43:16 -07:00
.. image:: img/datasets-table.png
:alt: Table of Datasets
2023-11-16 14:39:54 -07:00
2023-11-25 12:43:16 -07:00
.. _datasets:
Datasets
--------
2023-11-16 14:39:54 -07:00
2023-11-24 17:30:36 -07:00
Datasets perhaps to be built and used.
2023-11-25 12:43:16 -07:00
* The Smack
Libre version of The Stack. See: `datasets/the-smack`.
.. _license:
2023-11-24 17:30:36 -07:00
2023-11-25 12:43:16 -07:00
License
-------
2023-11-24 17:30:36 -07:00
Creative Commons Attribution-ShareAlike 4.0 International
2023-11-25 12:43:16 -07:00
Copyright © 2023, Jeff Moe.