Suitable and unsuitable datasets

main
Jeff Moe 2023-11-16 14:39:54 -07:00
parent e46cf57c08
commit 77538cafb3
1 changed files with 35 additions and 0 deletions

@ -22,6 +22,41 @@ of these three lists as an acceptable free/open license:
* https://commons.wikimedia.org/wiki/Commons:Licensing
# Datasets to Evaluate
Datasets that are freely available to download, but may not have a suitable license.
Determine which, if any, are OK to use.
* StackOverflow.
# Suitable Datasets
Datasets from the following sources may be suitable to use for training.
* Wikipedia.
# Unsuitable Licenses
Licenses that are not free, libre, or open, even if they claim to
be "open source".
These are not "Wikimedia Commons compatible". For example:
* Creative Commons Non-commercial (NC).
* Proprietary licenses.
* Any "custom" license that hasn't been reviewed by the general community.
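
The categories above could be applied mechanically when screening a dataset's declared license. The following is a minimal sketch, not part of this repository: the function name `is_unsuitable`, the SPDX-style identifier strings, and the small `REVIEWED_FREE_LICENSES` set are all assumptions chosen for illustration. A real check would draw the reviewed set from the license lists linked earlier.

```python
# Hypothetical screening helper; names and the allowlist are illustrative assumptions.

# Assumed examples of community-reviewed free licenses (SPDX-style identifiers).
REVIEWED_FREE_LICENSES = {"CC-BY-4.0", "CC-BY-SA-4.0", "CC0-1.0"}


def is_unsuitable(license_id: str) -> bool:
    """Return True if the license falls into one of the unsuitable categories."""
    normalized = license_id.strip().upper()

    # Creative Commons Non-commercial (NC) variants, e.g. CC-BY-NC-4.0.
    if normalized.startswith("CC-") and "-NC" in normalized:
        return True

    # Proprietary and unreviewed "custom" licenses: anything outside the reviewed set.
    return normalized not in REVIEWED_FREE_LICENSES


if __name__ == "__main__":
    for candidate in ["CC-BY-SA-4.0", "CC-BY-NC-4.0", "Custom-License-1.0"]:
        verdict = "unsuitable" if is_unsuitable(candidate) else "may be suitable"
        print(f"{candidate}: {verdict}")
```

Treating everything outside the reviewed set as unsuitable is a deliberately conservative choice: an unknown license is rejected until someone reviews it.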
# Unsuitable Datasets
Datasets that are not free, libre, or open, even if they claim to
be "open source".
## Unsuitable Model License
The following models are unsuitable because they use an unsuitable license.
### Non-commercial
Non-commercial licenses are not open source and are not suitable.
# License
Creative Commons Attribution-ShareAlike 4.0 International