tinygrad

Commit Graph

Author	SHA1	Message	Date
George Hotz	b8c94a67c9	Simple chonker (#431 ) * chonker will make llvm fast * work * better speed tests, we will make them fast * with the cache add is the same speed * relu and neg are fast * fix sum speed * maximum maxnum? * hack for gemm opt * gemm very slow * zeros like * test_permute * shapetracker returns self * fix shapetracker factorization * err, int strides * permutes are faster now in tinygrad than pytorch * support -1 in expand * gemm unrolled * improve final test case * WIP GEMM * why isn't GEMM fast? * revert cache dim * ffp contract works on clang, not llvm? * ignore llvm ir * this makes fma work at least, but no faster * USE_4x4 * 63 GFLOPS * 87 GFLOPS * that wasn't matmul, 44 GFLOPS now * 82 GFLOPS permuted * this permute too * a little speed for the convs * 45 GFLOPS * speed tests pass again * clean up prints * fix FMA WHAT A WASTE OF TIME * colors * moar fair * GPU * useless on chonker * cleanups * improve factorized shapetracker * better threshold * label conv * work * ops test pass again * hot load the index * run the last view, no need to create * ZeroView needs a repr for the key to work * fix segfault on out of bounds * one more test * start amx, and llvm.initialize_native_asmparser * amx works * nice AMX class * nicer AMX class * refactor get_idxs * amx working * is slower... * useless flip * cache * SZ_X * AMX_SZ_X/Y work alone * Contiguous mlop * test gemm packed * PREPARE in packed * use_amx factor * prefetch isn't faster * loop * same 3ms * 2.24 ms * allow double on store in TG * amx reduce is the same speed as non amx reduce * include memory bandwidth * clean up shapetracker * flip returns stride * prepare for upstream * Update ops_llvm.py (#426) * permutes are yellow and green now * faster conv * llvm cleanups * Show optimised IR under debug 4 (#428) * ASTKernel class * Make tinygrad work with older python version (#427) * Make tinygrad work with older python version * Use partialmethod instead of partial * smiple chonker is chonking * remove junk from test speed vs torch * fix linker and types * AMX is only here now * add LLVM tests, it's a valid backend now * oops, run llvm test * contiguous_op * fix loadops compare * dedup reduceops Co-authored-by: calledit <1573053+calledit@users.noreply.github.com>	2022-11-10 23:17:09 -08:00
Liam	8dc28dd733	Create python-publish.yml (#163 )	2022-11-08 08:45:01 -08:00
George Hotz	d02f8f9bc0	can we lose the lines with E701 still there?	2022-10-28 08:36:03 -07:00
George Hotz	ef62db3186	cleanups, remove E701	2022-10-28 08:28:56 -07:00
George Hotz	b65b70812a	Exec AST (#404 ) * working exec ast * exec_ast is staticmethod * GenericExecAST * fold that sometimes * ExplicitExecAST * exec_ast for GPU * gpu working * get_lazyop_shape * now gpubuffer is ExplicitExecAST * dedup * add a type * RESHAPE in opencl code * fix linter * that too for linter * cleanups * remove dead code * GenericShape is less lines * add ALLOWED_KERNEL_COUNT to tests * fix mypy * that's gotta be recursive * fix opencl shape processing * remove unneeded lambda	2022-10-28 08:27:03 -07:00
George Hotz	3b9b7eda48	remove run_thneed dead code	2022-10-20 17:24:18 -07:00
YassineYousfi	ae0f9b17df	openpilot: new models and onnx ops (#401 ) * ngrl stuff * fngrl * fix typo in compile script * workflow dispatch * new models in tests * dont need to up this threshold Co-authored-by: HaraldSchafer <harald.the.engineer@gmail.com>	2022-10-20 11:49:19 -07:00
George Hotz	8382c51c12	always MATMUL, test the ops in OPENCL	2022-10-01 13:31:29 -04:00
George Hotz	e737513c52	external_test_opt	2022-09-28 23:29:41 -04:00
George Hotz	f215534a64	1100 lines, but sane linter rules	2022-09-06 13:47:45 -07:00
George Hotz	fa33ba716e	flip that	2022-08-28 11:27:33 -07:00
George Hotz	227937d6d2	fix test maybe	2022-08-22 09:50:14 -07:00
George Hotz	11626053b0	run_thneed with test	2022-08-22 09:45:46 -07:00
George Hotz	0aca848d10	enable the openpilot test	2022-08-21 12:05:48 -07:00
George Hotz	a8734df030	add openpilot tests to tinygrad	2022-08-21 12:03:37 -07:00
George Hotz	ccd539e93e	maybe that's a better way to do this	2022-08-20 08:25:12 -07:00
George Hotz	b7d782b921	hmm, with the new reduce, we have to opt 3 for memory usage	2022-08-20 08:16:15 -07:00
George Hotz	b132de677d	tinygrad.nn (#367 ) * tinygrad.nn * flake8 * working on pylint * more pylint * more pylint * pylint passes * networkx * mypy can't infer that type * junk	2022-08-18 07:41:00 -07:00
George Hotz	ef1100fdff	touchups	2022-07-19 09:30:06 -07:00
George Hotz	ef4afdb5d2	tests maybe	2022-07-18 08:24:14 -07:00
George Hotz	a2c4bcf313	disable opencl tests	2022-07-18 08:17:21 -07:00
George Hotz	762e859089	testopencl	2022-07-17 11:56:40 -07:00
George Hotz	608e2431f7	test opencl, commit to removing the crap conv code from GPU	2022-07-17 11:55:37 -07:00
George Hotz	34f43ea10e	LAZY and CLCACHE are defaults	2022-07-04 13:09:15 -07:00
George Hotz	d7aad46758	test lazy also, make TestMNIST faster	2022-07-03 15:19:19 -07:00
Nicklas Boman	64d986bc8b	add mypy to ci testing (#353 )	2022-07-03 15:11:35 -07:00
George Hotz	2ee85812f7	intel opencl (#342 ) * intel opencl * run clinfo * that fix it? * meh * think it's the same * basekit fix * it wasn't basekit * more minimal * no clinfo	2022-06-20 19:25:55 -07:00
Jacky Lee	81664baf64	Fix OpenCL installation (#301 )	2022-01-06 10:35:48 -05:00
George Hotz	b0511f9392	Revert "does pybind fix CI?" This reverts commit `d128e4fcae`.	2021-12-30 14:13:58 -05:00
George Hotz	d128e4fcae	does pybind fix CI?	2021-12-30 14:11:39 -05:00
George Hotz	6bee5bdb7d	add torch tests	2021-10-30 18:58:45 -07:00
George Hotz	7472a7ebe2	not forcing 3.9 for a stupid type	2021-10-30 16:52:40 -07:00
George Hotz	f193eeed25	bump all to python 3.9	2021-10-30 16:15:41 -07:00
George Hotz	ac8afd24fa	refactor accel	2021-10-30 16:10:59 -07:00
Dinesh Kumar Gnanasekaran	2146860307	fixed OpenCL installation while running tests (#262 ) Co-authored-by: dinesh <dinesh-GDK>	2021-06-12 11:14:21 -07:00
George Hotz	6842ad9ec8	minor cleanups, yolo work	2021-01-03 08:14:16 -08:00
Liam	ebd72ff437	Test split (#231 ) * Split tests Split tests into "Test CPU" and "Test GPU". Add test flag "TEST_DEVICES" which is a comma separated list of devices: CPU,GPU,ANE * Run tests based on provided TEST_DEVICES flag By default will run all "CPU,GPU,ANE" * fix bad quote * Revert changes and use GPU=1 This is done through setting the default Tensor Device to Device.CPU of GPU=1 is set. Run GPU tests: GPU=1 pytest -s -v	2021-01-01 09:19:03 -05:00
iainwo	56d44637f3	fixed pylint, formatted python files iwth cblack on localhost (#204 ) * fixed pylint, formatted python files iwth cblack on localhost * Revert "fixed pylint, formatted python files iwth cblack on localhost" This reverts commit 07e2b88466fa53399ad78d962ffb2ad55bc45344. * dedented 4-spaces added linter Co-authored-by: Iain Wong <iainwong@outlook.com>	2020-12-17 14:37:31 -08:00
Mufeed VH	e6a5c6c93e	Added indentation linter (#187 ) * Added indentation linter * pylint package latest	2020-12-12 17:15:09 -08:00
Liam	e79cda6dad	Add pyopencl to dependency installs (#174 ) * Add pyopencl to dependency installs OpenCL was not actually being tested as pyopencl was not installed. * Reduce installation to 1 liner	2020-12-10 09:24:08 -08:00
George Hotz	df64658a2c	weee, opencl tests in CI	2020-11-10 10:04:45 -08:00
George Hotz	d47a128812	pocl	2020-11-10 10:02:13 -08:00
George Hotz	c05401a9ca	sudo maybe	2020-11-10 09:53:49 -08:00
George Hotz	09bc8eddfe	clinfo	2020-11-10 09:51:38 -08:00
George Hotz	58e703d099	fix tests	2020-11-10 09:49:19 -08:00
George Hotz	23405cec43	intel opencl	2020-11-10 09:41:40 -08:00
George Hotz	33090c4b0d	install more	2020-11-10 09:34:56 -08:00
George Hotz	a52590e76c	cpu opencl maybe	2020-11-10 09:32:54 -08:00
George Hotz	2e7f16bf3f	the power of cheating	2020-11-02 07:42:11 -08:00
George Hotz	fc358a07ad	print sloccount	2020-11-02 07:38:13 -08:00
Omar Ben Brahim	ec7bdb87ca	clean up CI (#28 )	2020-10-29 07:43:31 -07:00
Liam	4a663d31da	less than 1000 lines (#24 ) * Actions sloccount check for < 1000 lines (tests and source) * Sudo lol * Test sloccount in actions ci * More tests for the expression * apt install test to do validation in cli * fuck test * try something that probably won't work * ofcourse not * test that test fails if lines greater than 1000 * more than 1000 results in 1,000 not 1000. Delete * remove test lines. Should pass now * Untouch file Untouched a file that was changed in testing	2020-10-27 08:07:15 -07:00
Adrian Garcia Badaracco	02ecdc86c3	remove install from workflow	2020-10-24 09:03:22 -05:00
Adrian Garcia Badaracco	5ed2bbfa0a	remove newline	2020-10-21 18:43:11 -05:00
Adrian Garcia Badaracco	81fe70a3c0	make single line	2020-10-21 18:42:40 -05:00
Adrian Garcia Badaracco	466a67f4dd	Add requirements.txt	2020-10-21 18:37:47 -05:00
Adrian Garcia Badaracco	02adb0ac3a	Make test_mnist runnable by pytest and directly	2020-10-21 11:30:08 -05:00
George Hotz	d5662e09e3	run both tests	2020-10-19 08:59:20 -07:00
grekiki	f394a5edee	Make unit-tests run in all branches	2020-10-18 23:03:51 +02:00
William Gibson	02729c7f2a	Add CI for tests	2020-10-18 21:26:01 +01:00

... 2 3 4 5 6

260 Commits