tinygrad


CaltropHungerton 38fb1e14a2 Intel XMX Tensor Core Support (#5622 )
* fixed xmx demo
* i think i'm invoking the DPAS but it's slow
* compiler build arg to stop register spilling, indicated where to fix flop counter
* don't mind this
* do NOT mind me
* do not mind me
* do not view
* i will add bf16 later
* in process of figuring out tc fields
* we figured out the fields!!!
* added check for cl device vendor, added seperate IntelRenderer
* remove tc thread_local_aliases
* cleaning debris before draft pr
* edits for linter
* deduping and checking device extensions
* i will find more line reductions in other places
* before merge upstream
* double grf size in compiler to fix register spilling (bandaid), device checking changes
* tc python emulation
* fixed emulation
* tests for emulated intel tensor core
* TC=0, 1 working on upstream, fixed perf
* test
* debris
* check for specialized cl device when we canonicalize device
* bf16 support, tc=3 test added
* address tests
* revert half2 loads on intel tc, cleanup
* linter
* fold_expanded revert
* lint, whitespace fix
* cuda bf16 (only one with bf16) is skipped in test tensor cores, so i will skip for intel bf16 too
* make line shorter, no need for noqa E501
* removed device intel
* fix python emulation
---------
Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com>
2024-08-16 09:19:21 -07:00
external	fix qcom process_replay for kernel diff (#6079 )	2024-08-14 15:05:49 -04:00
imported	update pylint path to check indent/space for all (#6022 )	2024-08-10 14:41:09 -04:00
models	add failing regression test for image (#5540 )	2024-07-17 17:27:18 -07:00
testextra	names shadowing builtins (#5179 )	2024-06-27 08:15:01 -04:00
unit	UOp pattern DEFINE_VAR with min==max is also CONST (#6095 )	2024-08-15 12:09:44 -04:00
Dockerfile	…
__init__.py	…
helpers.py	minor test fixups from the AST is UOp diff (#6081 )	2024-08-14 23:58:04 +03:00
test_arange.py	fix tests locally on metal (#6025 )	2024-08-10 12:36:22 -07:00
test_assign.py	test masked assign views (#4599 )	2024-05-15 15:06:48 +03:00
test_compile_failures.py	new test for e2e compile failures (#6101 )	2024-08-15 18:56:22 -07:00
test_const_folding.py	MetaOps.KERNEL (#5543 )	2024-07-17 19:41:23 -07:00
test_conv.py	db in wal mode (#5388 )	2024-07-12 20:43:36 -07:00
test_conv_shapetracker.py	rename lazyops to parents [run_process_replay] (#6091 )	2024-08-15 17:27:32 +03:00
test_copy_speed.py	…
test_custom_function.py	s/loadops/metaops [run_process_replay] (#5421 )	2024-07-12 13:26:50 -07:00
test_device_speed.py	move uopgraph to file [run_process_replay] (#5364 )	2024-07-10 17:34:50 -07:00
test_dtype.py	Shape change bitcast for more dtypes (#6047 )	2024-08-14 10:03:34 -07:00
test_dtype_alu.py	rename lazyops to parents [run_process_replay] (#6091 )	2024-08-15 17:27:32 +03:00
test_fusion_op.py	increase test_recursive_add time from 1s to 2s (#6078 )	2024-08-14 13:52:02 -04:00
test_fuzz_shape_ops.py	fix typing for test to run in py38 (#4930 )	2024-06-12 13:22:30 -04:00
test_gc.py	threefry again (#3785 )	2024-03-18 16:47:07 -04:00
test_graph.py	fix hcq sync (#5062 )	2024-06-26 17:50:37 +03:00
test_hcq.py	fix non-jitted transfers in profile (#5980 )	2024-08-08 17:58:08 +03:00
test_image_dtype.py	add failing regression test for image (#5540 )	2024-07-17 17:27:18 -07:00
test_jit.py	remove realize from threefry (#5969 )	2024-08-07 15:08:49 -07:00
test_kernel_cache.py	…
test_lazybuffer.py	use assertIs in test_schedule (#6035 )	2024-08-11 19:19:18 +03:00
test_lazyop.py	scheduleitem is not Tuple [run_process_replay] (#5425 )	2024-07-12 15:13:19 -07:00
test_linearizer.py	Intel XMX Tensor Core Support (#5622 )	2024-08-16 09:19:21 -07:00
test_linearizer_dumb.py	spec for in order LOAD/STORE indexing (#6073 )	2024-08-14 19:18:00 +03:00
test_linearizer_failures.py	rewrite bool ADD to OR and MUL to AND (#6084 )	2024-08-15 10:11:57 -04:00
test_linearizer_overflows.py	lowerer is kernel [run_process_replay] (#5437 )	2024-07-12 18:50:55 -07:00
test_masked_st.py	…
test_method_cache.py	simple LoadOps.ASSIGN (#3745 )	2024-03-14 20:44:34 -07:00
test_multitensor.py	added fix and reg tests (#6060 )	2024-08-12 21:00:48 -04:00
test_net_speed.py	nv mockgpu (#4600 )	2024-05-15 23:46:08 +03:00
test_nn.py	add LSTMCell to nn (#6080 )	2024-08-14 12:08:42 -07:00
test_ocl.py	touchup cl_errors (#6058 )	2024-08-13 13:06:59 -04:00
test_ops.py	bilinear interp uint8 fails (#6103 )	2024-08-15 19:34:39 -07:00
test_optim.py	improve test_dropout_on_shard (#4912 )	2024-06-11 11:36:02 -04:00
test_pattern_matcher.py	revert UOps eq, this needs to be isolated in realize.py (#6063 )	2024-08-13 18:02:34 +03:00
test_pickle.py	some TestPickleJIT tests (#5860 )	2024-08-01 12:39:59 -07:00
test_profiler.py	hcq profiler support args (#5989 )	2024-08-09 00:18:36 +03:00
test_randomness.py	jit sampling functionn in test_randomness.test_multinomial (#5034 )	2024-06-18 14:21:05 -04:00
test_renderer_failures.py	Fix compile error for max with inline const (#5840 )	2024-08-12 23:40:39 +08:00
test_sample.py	…
test_schedule.py	rename lazyops to parents [run_process_replay] (#6091 )	2024-08-15 17:27:32 +03:00
test_search.py	rename lazyops to parents [run_process_replay] (#6091 )	2024-08-15 17:27:32 +03:00
test_setitem.py	setitem in-place operator tests (#4577 )	2024-05-14 01:28:02 -04:00
test_specific_conv.py	nv mockgpu (#4600 )	2024-05-15 23:46:08 +03:00
test_speed_v_torch.py	remove CUDACPU flag in tests [run_process_replay] (#5902 )	2024-08-04 16:06:38 -04:00
test_subbuffer.py	remove CUDACPU flag in tests [run_process_replay] (#5902 )	2024-08-04 16:06:38 -04:00
test_symbolic_jit.py	sort vars in jit when building expected input args (#4990 )	2024-06-16 15:55:51 -04:00
test_symbolic_ops.py	symbolic Tensor.var (#4843 )	2024-06-05 12:55:54 -04:00
test_symbolic_shapetracker.py	support symbolic reshape with non-contiguous (#4844 )	2024-06-05 16:01:19 -04:00
test_tensor.py	Ignore non-computational backends when overwriting the default (#5770 )	2024-08-10 09:23:29 -07:00
test_tensor_data.py	BEAM_COMPARE=2 validates the correctness of BEAM kernels (#5458 )	2024-07-13 13:53:43 -07:00
test_tensor_variable.py	Should this symbolic test fail? (#4501 )	2024-06-18 15:21:26 -04:00
test_to_numpy.py	…
test_transcendental.py	lower test_transcendental fuzz test threshold for sin float64 (#5956 )	2024-08-07 02:04:37 -04:00
test_uop_graph.py	UOp pattern DEFINE_VAR with min==max is also CONST (#6095 )	2024-08-15 12:09:44 -04:00
test_uops.py	spec for in order LOAD/STORE indexing (#6073 )	2024-08-14 19:18:00 +03:00
test_uops_stats.py	UOpGraph not in renderer or Program [run_process_replay] (#5867 )	2024-08-01 16:20:30 -07:00
test_verify_lazyop.py	pretty print lazy op per default (#5505 )	2024-07-18 09:34:08 -07:00
test_winograd.py	rename lazyops to parents [run_process_replay] (#6091 )	2024-08-15 17:27:32 +03:00
test_zero_copy.py	…