2023-11-16 11:18:07 -07:00
|
|
|
|
2023-11-25 12:43:16 -07:00
|
|
|
.. _parrot-datasets:
|
|
|
|
|
|
|
|
Parrot Datasets
|
|
|
|
===============
|
2023-11-16 11:18:07 -07:00
|
|
|
|
2023-11-25 12:43:16 -07:00
|
|
|
Datasets for Parrot Libre AI IDE.
|
2023-11-16 11:18:07 -07:00
|
|
|
|
2023-11-25 12:43:16 -07:00
|
|
|
.. _parrot-libre-datasets:
|
2023-11-16 13:43:52 -07:00
|
|
|
|
2023-11-25 12:43:16 -07:00
|
|
|
Libre Datasets
|
|
|
|
---------------
|
|
|
|
|
|
|
|
A list of libre datasets suitable for training a libre instruct model shall be listed.
|
2023-11-16 13:43:52 -07:00
|
|
|
Note other well known datasets, and their license suitability.
|
|
|
|
|
2023-11-25 12:43:16 -07:00
|
|
|
.. _parrot-dataset-licensing:
|
2023-11-16 13:43:52 -07:00
|
|
|
|
2023-11-25 12:43:16 -07:00
|
|
|
Parrot Dataset Licensing
|
|
|
|
-------------------------
|
2023-11-16 13:43:52 -07:00
|
|
|
|
2023-11-25 12:43:16 -07:00
|
|
|
The model may use data that is under a license that appears on one of these three lists as an acceptable free/open license:
|
2023-11-16 13:43:52 -07:00
|
|
|
|
2023-11-25 12:43:16 -07:00
|
|
|
* https://www.gnu.org/licenses/license-list.html
|
2023-11-16 13:43:52 -07:00
|
|
|
* https://opensource.org/licenses/
|
|
|
|
* https://commons.wikimedia.org/wiki/Commons:Licensing
|
|
|
|
|
2023-11-25 12:43:16 -07:00
|
|
|
.. _unsuitable-licenses:
|
2023-11-16 13:43:52 -07:00
|
|
|
|
2023-11-25 12:43:16 -07:00
|
|
|
Unsuitable Licenses
|
|
|
|
--------------------
|
2023-11-16 14:39:54 -07:00
|
|
|
|
2023-11-25 12:43:16 -07:00
|
|
|
Licenses that are not free, libre, open, even if they may claim to be "open source".
|
2023-11-16 14:39:54 -07:00
|
|
|
These are not "Wikipedia Commons compatible", for example:
|
|
|
|
|
|
|
|
* Creative Commons Non-commercial (NC).
|
|
|
|
* Proprietary licenses.
|
|
|
|
* Any "custom" license that hasn't been reviewed by the general community.
|
|
|
|
|
2023-11-25 12:43:16 -07:00
|
|
|
.. _datasets-table:
|
|
|
|
|
|
|
|
Datasets Table
|
|
|
|
--------------
|
2023-11-16 14:39:54 -07:00
|
|
|
|
2023-11-23 11:56:55 -07:00
|
|
|
Table of datasets. See also the spreadsheet `datasets.ods`.
|
2023-11-16 20:50:26 -07:00
|
|
|
|
2023-11-25 12:43:16 -07:00
|
|
|
.. image:: img/datasets-table.png
|
|
|
|
:alt: Table of Datasets
|
2023-11-16 14:39:54 -07:00
|
|
|
|
2023-11-25 12:43:16 -07:00
|
|
|
.. _datasets:
|
|
|
|
|
|
|
|
Datasets
|
|
|
|
--------
|
2023-11-16 14:39:54 -07:00
|
|
|
|
2023-11-24 17:30:36 -07:00
|
|
|
Datasets perhaps to be built and used.
|
|
|
|
|
2023-11-25 12:43:16 -07:00
|
|
|
* The Smack
|
|
|
|
Libre version of The Stack. See: `datasets/the-smack`.
|
|
|
|
|
|
|
|
.. _license:
|
2023-11-24 17:30:36 -07:00
|
|
|
|
2023-11-25 12:43:16 -07:00
|
|
|
License
|
|
|
|
-------
|
2023-11-24 17:30:36 -07:00
|
|
|
|
2023-11-16 11:18:07 -07:00
|
|
|
Creative Commons Attribution-ShareAlike 4.0 International
|
|
|
|
|
2023-11-25 12:43:16 -07:00
|
|
|
Copyright © 2023, Jeff Moe.
|