1
0
Fork 0
tinygrab/extra
George Hotz b8dfbba703 hip_matmul: f16 gemm 2048x2048 gets 36 TFLOPS 2023-07-08 00:35:45 +00:00
..
accel Remove POW llop and add SQRT llop (#1104) 2023-07-05 18:07:58 -07:00
datasets Fix naming conflict with huggingface datasets (#1161) 2023-07-07 10:43:44 -07:00
disk Fast DiskTensor to other Tensor (#916) 2023-06-03 12:25:41 -07:00
gemm hip_matmul: f16 gemm 2048x2048 gets 36 TFLOPS 2023-07-08 00:35:45 +00:00
intel intel benchmark matmul gets 60 TFLOPS? 2023-06-04 17:01:50 +00:00
ptx ptx example 2023-05-26 19:28:51 -07:00
rocm faster RDNA assembly backend (#990) 2023-06-16 12:06:38 -07:00
triton A Triton backend for tinygrad (#470) 2023-02-01 11:53:57 -08:00
archprobe.py no werror in archprobe 2023-05-03 19:34:17 +00:00
augment.py Fix naming conflict with huggingface datasets (#1161) 2023-07-07 10:43:44 -07:00
gradcheck.py Fix: Jacobian tests [WIP] (#1126) 2023-07-05 15:36:22 -07:00
helpers.py helpers.py: improved test coverage + exception handling (#1165) 2023-07-07 10:26:05 -07:00
hip_wrapper.py hip matmul 2023-06-28 19:21:01 +00:00
introspection.py Devicebufferless (#708) 2023-03-18 14:40:23 -07:00
kernel_search.py move to shapetracker.py 2023-03-11 07:50:07 -08:00
lib_test_ast.py fix ast test 2023-03-04 07:49:25 -08:00
lr_scheduler.py Over 90% on CIFAR with examples/hlb_cifar10.py (#1073) 2023-07-06 20:46:22 -07:00
onnx.py Use generators instead of lists in `any`s and `all`s (#1111) 2023-07-03 16:06:06 -07:00
onnx_ops.py Handle broadcast flag on gemm (#1103) 2023-07-02 22:15:07 -07:00
thneed.py include missing queue in thneed load (#1095) 2023-07-02 12:33:59 -07:00
training.py Add return statement to the train function (#1135) 2023-07-05 08:13:38 -07:00
utils.py Fix: auto create parent dir when downloading file (#1173) 2023-07-07 13:40:29 -07:00