tinygrab

deepcrayon

Author	SHA1	Message	Date
Jeff Moe	fe8d9753f0	docstrings, requirements webgpu	2023-12-06 11:31:18 -07:00
Jeff Moe	96fb6d334b	Add torch docstrings, and to requirements	2023-12-06 11:03:20 -07:00
Jeff Moe	e999513df0	Metal docstrings with fail	2023-12-06 10:53:45 -07:00
Jeff Moe	f1af595918	docstrings for llvm	2023-12-06 10:44:45 -07:00
Jeff Moe	042dd69139	ops_gpu docstrings	2023-12-06 10:22:47 -07:00
Jeff Moe	421396343e	ops_disk doctrings	2023-12-06 10:14:14 -07:00
Jeff Moe	6b70666dd9	cuda doctrings; tinygrad notice	2023-12-06 10:07:16 -07:00
Jeff Moe	dcf9c9438d	ops_clang, ops_cpu docstrings	2023-12-06 09:46:46 -07:00
Jeff Moe	661dcc5ed0	Reformat, uh, everything, with black	2023-12-04 22:01:04 -07:00
George Hotz	41d696145d	hotfix: forking works okay in HIP now	2023-12-04 21:59:18 +00:00
George Hotz	09b6e254a3	hip compile speed (#2606)	2023-12-04 13:47:40 -08:00
George Hotz	664475f247	vals is an argument (#2599) * vals is an argument * don't even know how that's legal python	2023-12-03 21:50:43 -08:00
George Hotz	fcd0b2ee6c	fix multigpu on tinybox (#2595) * fix multigpu on tinybox * fixed multigpu	2023-12-03 16:48:07 -08:00
George Hotz	bbeba8ec85	use default dict for external_model_benchmark (#2592) * device default * Device.DEFAULT * half max for cuda * CUDA_INCLUDE_PATH * closer to working * cuda fixups * Update ops_cuda.py	2023-12-03 15:25:43 -08:00
George Hotz	171543fc8d	cleanups to save lines and files (#2577) * runtime/graph -> features/graph * put all the cstyle renderers in cstyle * same line for those * how did that pass mypy	2023-12-02 16:29:56 -08:00
George Hotz	a9a76639c8	that's not needed (#2574)	2023-12-02 16:01:29 -08:00
nimlgen	065495e0c9	save a few lines in ops_gpu (#2564) Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com>	2023-12-02 15:05:22 -08:00
George Hotz	d6b404ac11	No dtype alloc (#2570) * fix all allocs * improve docs * ugh fix fake alloc	2023-12-02 13:29:40 -08:00
George Hotz	5068e99d18	refactor to remove extra kernel params (#2563) * refactor to have compiled kernel * bugfixes * docs/beautiful.py * revert that * fix tests	2023-12-02 00:32:25 -08:00
George Hotz	27481b9206	Switch ops_gpu -> gpuctypes (#2532) * ops_gpu is go * fix size 0 * fix image, and add more tests * nerf openpilot test, doesn't test thneed * run the schedule * better * oops, new inputs * delete pyopencl * Update ops_gpu.py	2023-12-01 22:30:21 -08:00
George Hotz	217cda81ba	hotfix: no metalgraph if there's weird ops	2023-12-01 19:32:55 -08:00
Christopher Mauri Milan	077567f62d	Remove as_buffer for TORCH (#2554) * remove as_buffer for torch * enable torch zerocopy if on cpu * remove as_buffer even on torch:cpu	2023-12-01 18:51:38 -08:00
chenyu	67f4e03724	rewrite 0 size loadop into a CONST (#2556) * rewrite 0 size loadop into a CONST * check alloc size * EMPTY is better * Revert "EMPTY is better" This reverts commit 574fe0f9ed28f1b97da5a81afdfd2cd5d9a94ff9. * no ast is created * fix test	2023-12-01 18:29:06 -05:00
George Hotz	f9b1de598f	hotfix: metal fastpath on sonoma	2023-12-01 14:55:34 -08:00
George Hotz	f5de21e753	fast path for copy (#2548) * fast copy * ruff first * flat_mv on malloc * order + webgpu test	2023-12-01 11:34:47 -08:00
George Hotz	d8175a4380	simple fix (#2543)	2023-12-01 09:42:15 -08:00
nimlgen	badc97f824	hip & cuda to gpuctypes (#2539) * cuda with gpuctypes * hip gpuctypes * graphs * rename + linter happy * use cpu_time_execution * no ji in build_kernel_node_params * remove hip_wrapper * hip fix * no arc * smalle changes * no clean moduke in cudacpu	2023-12-01 09:25:27 -08:00
chenyu	7fec966b5e	bye bye NOOP (#2534) * bye bye NOOP * SIN * NEG	2023-11-30 23:10:35 -08:00
George Hotz	12fa846122	zero copy (#2531) * zero copy * zero copy test * loads coder in milliseconds * zero copy for cpu and torch * src_from_buffer is None * SLOW_METAL_COPY there	2023-11-30 18:38:41 -08:00
George Hotz	2c363b5f0b	new style device (#2530) * cpu tests pass * torch works * works * metal works * fix ops_disk * metal jit works * fix openpilot * llvm and clang work * fix webgpu * docs are rly broken * LRU works on metal * delete comment * revert name to ._buf. LRU only on Compiled * changes * allocator * allocator, getting closer * lru alloc * LRUAllocator * all pass * metal * cuda * test examples * linearizer * test fixes * fix custom + clean realize * fix hip * skip tests * fix tests * fix size=0 * fix MOCKHIP * fix thneed * copy better * simple * old style metal copy * fix thneed * np reshape * give cuda a device	2023-11-30 17:07:16 -08:00
George Hotz	6707f2588e	use copyin (#2500) * it's always copyin * all RawBuffer are RawBufferCopyIn * cleanups * this fixes it * requirements='C' * more correct	2023-11-29 09:34:00 -08:00
George Hotz	e333672675	realize cleanup (#2496) * move that logic * revert that change * clean up transfer and asserts * what's that junk	2023-11-28 21:08:39 -08:00
George Hotz	5629fc368c	Use Buffer.STORE at the end of ASTs (#2494) * work * store broken * interpreteds work * this passes * symbolic cpu * fix tests * fix opt tests * images fail * fix InterpretedFlopCounter * stupid hack for images	2023-11-28 20:11:37 -08:00
George Hotz	ab5d14d4ba	MEM -> LOAD (#2492) * MEM -> LOAD * keep legacy working	2023-11-28 16:46:37 -08:00
Davi Silva	136dbd8b36	HIP CI that compiles (to RDNA3) but doesn't have to run (#2482) * hip amd compilation * gate the test properly * cleanup unused import * remove superfluous numpy conversion * add SpeedyNet tests (f32 [passes] & f16 [fails]) * make CI verbose (error log from hip compiler) * test the real ops_hip * Merge branch 'tinygrad:master' into ci/hip-compilation * fix CI * cleanup * really fix CI * Fix CI Three: the refixening --------- Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com>	2023-11-27 21:17:06 -08:00
George Hotz	756b01f46f	why were these ever called buffer (#2483)	2023-11-27 21:02:07 -08:00
George Hotz	acbe6d1b53	Revert "HIP compilation on CI targeting RDNA3 (#2459)" (#2481) This reverts commit `d275ff930a`.	2023-11-27 20:41:21 -08:00
qtkite	cb507a9389	Remove the toCPU copy (#2445) * Remove the rawbuffer copy in runtime/lib.py on line 44 * remove buffer view * added metadata back, oops * delayed cpu testcase * whitespace * whitespace * buffer behavior as is * Update test_jit.py	2023-11-27 20:37:13 -08:00
Davi Silva	d275ff930a	HIP compilation on CI targeting RDNA3 (#2459) * hip amd compilation * gate the test properly * cleanup unused import * remove superfluous numpy conversion * add SpeedyNet tests (f32 [passes] & f16 [fails]) * make CI verbose (error log from hip compiler) * test the real ops_hip * Merge branch 'tinygrad:master' into ci/hip-compilation * fix CI * cleanup * really fix CI	2023-11-27 20:33:11 -08:00
George Hotz	9e07824542	move device to device.py (#2466) * move device to device.py * pylint test --disable R,C,W,E --enable E0611 * fix tests	2023-11-27 11:34:37 -08:00
George Hotz	8e9cdef61f	clean up the buffers (#2447) * clean up the buffers * remove allocate_output * functools.lru_cache is methodcache * add TestShapeTrackerSize * cache_clear * no 0 sz buffer, add _ on functions that shouldn't be imported * fix size * if -> while	2023-11-26 11:02:29 -08:00
George Hotz	9eb2746d62	fix copy issue + add regression test (#2441)	2023-11-25 14:06:08 -08:00
andresgit	259a869fc1	Fix UnicodeDecodeError when debugging on Intel APU (#2421) * test DEBUG=5 * print prg if NVIDIA, fixes error on Intel APU	2023-11-25 12:30:50 -08:00
nimlgen	e68aebfff9	bring hip graph back (#2385) * bring hip graph back * share with metal * fix linter * remove hasattrs * Update ops_hip.py * hip wrapper does not use _buf --------- Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com>	2023-11-24 07:53:44 -08:00
George Hotz	8f89e21fca	torch and numpy don't share ops anymore (#2412) * torch and numpy don't share ops anymore * that should be filtered out elsewhere * still const * graph + enet example cleanup * hmm, we do still need it because of symbolic	2023-11-23 16:58:10 -08:00
George Hotz	193be14b6c	that had bugs, force an order (#2411)	2023-11-23 15:52:16 -08:00
George Hotz	5bb720a777	Cocoa is no longer used	2023-11-23 14:31:21 -08:00
qazal	b927942d58	Move HIP render logic to its dedicated place (#2394) * update HIP language * vectorized render_cast with special treatment for hip only * test coverage for all cases --------- Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com>	2023-11-23 13:03:29 -08:00
George Hotz	0505c5ea50	remove force_wait, refactor to graph (#2405) * remove force_wait * refactor * get rid of stupid ASTRunner * fix del in diskbuffer * BufferOps.FROM_UNDERLYING * put offset in the rawbuffer * fix bugs * use exec	2023-11-23 12:46:07 -08:00
George Hotz	e4026dc197	don't pass lazybuffer to rawbuffer (#2400) * don't pass lazybuffer to rawbuffer * tensor comments	2023-11-23 09:40:28 -08:00

326 Commits (deepcrayon)