100 Commits (main)

Author SHA1 Message Date
Jeff Moe 208d4c2c0d underscore muh 2023-11-24 18:32:56 -07:00
Jeff Moe 83310747f5 Add pytest for license scriptlet 2023-11-24 18:21:44 -07:00
Jeff Moe 38a8c9d29b v0.0.4 2023-11-24 18:10:44 -07:00
Jeff Moe 5a1441441f Print license list by default 2023-11-24 18:03:54 -07:00
Jeff Moe d29e41c8fa arrow 2023-11-24 17:33:16 -07:00
Jeff Moe b1db281aa5 The Smack, noted 2023-11-24 17:30:36 -07:00
Jeff Moe b58a115998 sort stack license output 2023-11-24 17:26:32 -07:00
Jeff Moe df6a972c29 Add option to print unique licenses used by The Stack 2023-11-24 17:23:47 -07:00
Jeff Moe 29e9146847 all the pretty records 2023-11-24 17:20:00 -07:00
Jeff Moe ac47373b79 stack licenses color option, record selection 2023-11-24 17:06:45 -07:00
Jeff Moe 9fe78fbc20 Add scriptlet to read licenses from The Stack 2023-11-24 16:57:12 -07:00
Jeff Moe 046f68e844 Revert "Print panda dataframe types of headers" (This reverts commit 44f43a5fcc.) 2023-11-24 16:46:09 -07:00
Jeff Moe 44f43a5fcc Print panda dataframe types of headers 2023-11-24 16:43:42 -07:00
Jeff Moe 3debab7105 marginally faster without multiprocessing 2023-11-24 16:38:58 -07:00
Jeff Moe 7003b5766c dont iterate all dirs, header reader 2023-11-24 16:36:55 -07:00
Jeff Moe f5ed65602f format with black 2023-11-24 16:34:11 -07:00
Jeff Moe 8cf313a5de multiprocessing, but not faster 2023-11-24 16:24:57 -07:00
Jeff Moe a3881bcf43 cleanup, improve header reader 2023-11-24 16:20:30 -07:00
Jeff Moe 067ca1ddee v0.0.3 2023-11-24 16:10:51 -07:00
Jeff Moe 9fa9846c93 the-stack-headers, noted 2023-11-24 16:10:15 -07:00
Jeff Moe 7f57a9c85d Update data/ path 2023-11-24 16:06:17 -07:00
Jeff Moe 865da0222b Rename to the-stack-headers 2023-11-24 15:53:19 -07:00
Jeff Moe 35c43af45a Slightly better readability 2023-11-24 15:51:51 -07:00
Jeff Moe 4705e4ffb7 Read metadata scriptlet 2023-11-24 15:42:20 -07:00
Jeff Moe d69fd04acd ignore venv env 2023-11-24 15:36:32 -07:00
Jeff Moe 75e43a6c13 no rust thx 2023-11-24 15:35:25 -07:00
Jeff Moe e5314d316e example path 2023-11-24 15:34:57 -07:00
Jeff Moe 60cd1a40e6 Add example metadata python script 2023-11-24 14:16:13 -07:00
Jeff Moe 80d0b5e957 New rust The Smack project 2023-11-24 14:12:26 -07:00
Jeff Moe dadc4d5381 The Smack Dataset 2023-11-24 13:56:20 -07:00
Jeff Moe 63f8f8dfa7 v0.0.2 2023-11-23 11:57:19 -07:00
Jeff Moe a9cb42505d dataset table 2023-11-23 11:56:55 -07:00
Jeff Moe 96b07adc61 dataset table img 2023-11-23 11:56:44 -07:00
Jeff Moe c1e66595ea More datasets in table... 2023-11-17 22:56:39 -07:00
Jeff Moe 59bd05df73 More datasets in table 2023-11-17 11:01:27 -07:00
Jeff Moe 0d41a4d2f6 Updates to table 2023-11-17 10:59:34 -07:00
Jeff Moe 7e78d5e222 Updates to table 2023-11-17 10:20:24 -07:00
Jeff Moe a0175f99e7 Table of a few datasets 2023-11-17 09:30:39 -07:00
Jeff Moe 02f1a12085 Blank spreadsheet 2023-11-17 09:19:46 -07:00
Jeff Moe a1b9f5fdc9 ignore libreoffice temp files 2023-11-17 08:45:54 -07:00
Jeff Moe 9256093143 Spreadsheets with LFS 2023-11-17 08:45:32 -07:00
Jeff Moe 98fdcb3068 Datasets, perhaps 2023-11-17 08:45:16 -07:00
Jeff Moe 7badc64cd1 No Common Crawl 2023-11-16 20:50:26 -07:00
Jeff Moe cd64cff3b5 v0.0.1 2023-11-16 14:40:16 -07:00
Jeff Moe 77538cafb3 Suitable and unsuitable datasets 2023-11-16 14:39:54 -07:00
Jeff Moe e46cf57c08 Libre datasets, licensing 2023-11-16 13:43:52 -07:00
Jeff Moe c6789ef06e v0.0.0 2023-11-16 11:19:05 -07:00
Jeff Moe 456faf9a4d ignore temp files 2023-11-16 11:18:55 -07:00
Jeff Moe f6f20dc4e4 Creative Commons Attribution-ShareAlike 4.0 International 2023-11-16 11:18:44 -07:00
Jeff Moe 8b89ef28cd Datasets for Parrot Libre AI IDE readme 2023-11-16 11:18:07 -07:00