Commit Graph

2686 Commits

Author SHA1 Message Date
nimlgen 408c0a5e7f qcom match texture/sampler descriptors to OpenCL (#7622)
* qcom ioctl compare more regs

* bug fix
2024-11-11 15:57:42 -05:00
nimlgen 45db7d9045
fuzz qcom vs opencl (#7130)
* fuzz qcom vs opencl

* fix nv

* bettre?

* typo

* open both devs
2024-10-17 18:49:08 +03:00
chenyu 287a198c4f
increase test_strongly_connected_DAG threshold (#7131)
flaky
2024-10-17 11:08:50 -04:00
George Hotz ded1b38b84
minor dtype cleanup [pr] (#7124)
* minor dtype cleanup [pr]

* use ptr() function
2024-10-17 17:41:23 +08:00
George Hotz d990a16326
fix tests to use render (#7116) 2024-10-17 14:35:22 +08:00
chenyu 842fe444df
test case for valid only simplification (#7108) 2024-10-16 16:40:46 -04:00
Francis Lata 90eff347e2
tinytqdm write support (#6359)
* add write support

* add test

* update test case to compare write outputs

* assert final write output

* flush when using write

* update write logic

* Revert "update write logic"

This reverts commit 5e0e611b46cde7a22e41aa5770bc4ccad20de073.

---------

Co-authored-by: chenyu <chenyu@fastmail.com>
2024-10-16 14:51:41 -04:00
nimlgen 39ab67e9ef
beam capture and replay in fuzz (#7099)
* beam capture and reply in fuzz

* clean a bit
2024-10-16 20:26:58 +03:00
qazal 9d5342976e
skip TestSetitem.test_setitem_inplace_operator (#7103) 2024-10-16 18:59:08 +03:00
qazal 6acda43a2c
test a rewrite of permuted reduce [pr] (#7093)
* test a rewrite of permuted reduce [pr]

* addd rewrite tracker

* expected

* passes
2024-10-16 12:49:54 +03:00
qazal 40f33c110b
big graph var_vals as rewrite context (#7007)
* var_vals as rewrite context

* no default arg

* add st var_vals

* delete some stuff

* add the rewrite rule again

* extra

* this whole part is preschedule

* test with a second context

* redo

* i always forget tensor variable
2024-10-16 07:31:44 +03:00
qazal 390171d686
delete SAVE_SCHEDULE=1 [pr] (#7087) 2024-10-16 07:13:20 +03:00
George Hotz 3169cb386d
remove graph [pr] (#7085) 2024-10-16 11:40:07 +08:00
qazal fb29de6cc3
split schedule to view_left and view_right [pr] (#7077)
* split schedule to view_left and view_right [pr]

* move valid
2024-10-16 03:39:38 +03:00
qazal 545e79969f
always record matches in viz (#7073)
* always record matches in viz

* simpler
2024-10-15 23:03:12 +03:00
nimlgen b025495e5c
fuzz nv vs cuda (#7066)
* fuzz nv vs cuda

* fixes

* smth

* um

* cmp the same

* dnrt

* correct gpfifo scan

* fix
2024-10-15 22:22:40 +03:00
qazal 09de958855
move print_diff to test/helpers (#7071) 2024-10-15 22:00:39 +03:00
qazal 1a45e94f5d
viz late to_json [pr] (#7070) 2024-10-15 21:36:45 +03:00
qazal 52d8afde2b
new viz unittests, isolate the ctx bug (#7069)
* start new test_viz

* test_rewrite_twice

* test_rewrite_with_ctx

* add back some of the old tests

* lints
2024-10-15 18:53:56 +03:00
nimlgen 9f00eacde5
nv tagged memory + resnet failed kernel (#7061)
* nv tagged memory

* linter

* metal fix?
2024-10-15 18:19:58 +03:00
chenyu 2008bac6bf
use validhack logic to rewrite buffer idx (#6740)
* use validhack logic to rewrite buffer idx

saved a whopping one mod in the conv backward kernel...

* cleanup more
2024-10-14 16:47:31 -04:00
chenyu a99e42cf2f
clean up test_uop_symbolic.py (#7058)
enable more tests and remove dead tests
2024-10-14 15:35:58 -04:00
chenyu fbaab30fe3
add timing to fuzz_linearizer (#7056)
and applied smaller FUZZ_MAX_SIZE. this is getting quite slow in CI
2024-10-14 11:57:41 -04:00
qazal 0ef186d4be
scheduler internal api cleanups [pr] (#7052)
* delete external_benchmark_ast.py [pr]

* cleanup 2

* random
2024-10-14 15:56:10 +03:00
George Hotz f85c9ba00a
rewrite max to use cmplt + where (#7037) 2024-10-14 20:00:51 +08:00
George Hotz b20b22a738 hotfix: add test_tiny, because many times it's what you want 2024-10-14 16:32:33 +08:00
Louis Novy 2ac5aec66b
Fix exponential complexity in _is_padding_okay [pr] (#7008)
* preliminary test

* missed Optional

* don't check for cache during recursion

* match style from st_fixup... may be marginally faster?

* pathological test case: strongly connected DAG

* move to test_schedule as this isn't really a fusion

* oops this shouldn't be edited

* Revert "oops this shouldn't be edited"

This reverts commit 487cb027dc5120542755446d1595ec7b76c207e8.

* Revert "move to test_schedule as this isn't really a fusion"

This reverts commit 48d8c550ce84453e6fc0306e1c6c448fe1286f79.

* move to test_schedule as this isn't really a fusion

* ok no more merge error funny business
2024-10-14 02:34:47 +03:00
chenyu bd8ecf7fd6
remove NumNode (#7035) 2024-10-13 16:42:19 -04:00
chenyu c4c806a210
generate new kernel dataset (#7034)
* generate new kernel dataset

pre req to remove NumNode
```
extra/optimization/generate_dataset.sh
gzip -k /tmp/sops
mv /tmp/sops.gz extra/datasets/
```

* fix var range in fuzz_linearizer
2024-10-13 16:19:41 -04:00
chenyu 1a27417262
remove arbitrary multiplication case (#7033)
adds the wrongly simplified kernel in test_linearizer_failures
#7019
2024-10-13 15:06:05 -04:00
chenyu 13575f080a
remove bitcast backward in function.py (#7031)
bitcast cannot backward
2024-10-13 10:08:27 -04:00
qazal 13846930cd
hotfix: extract_dataset.py (#7029) 2024-10-13 11:18:23 +03:00
chenyu 04d9b46d51
derivative of softmax is indepedent of max (#7009)
* derivative of softmax is indepedent of max

* update test
2024-10-12 15:59:23 -04:00
chenyu cae1c41755
test case of softmax backward kernel count (#7022) 2024-10-12 15:46:32 -04:00
George Hotz 85a45164fb
remove pyint [pr] (#7016)
* remove pyint

* bump time on tp [pr]

* dont truncate in const fold

* remove dead code

* Revert "dont truncate in const fold"

This reverts commit 29c81db0f7880848b001c2728aa555a1ef17e7d3.

* remove define_var
2024-10-12 22:36:24 +08:00
George Hotz 38d45dfba5 hotfix: no rng in test/external/external_benchmark_schedule.py 2024-10-12 22:03:04 +08:00
George Hotz a71bb09ec3
remove symbolic file [pr] (#7012) 2024-10-12 18:44:44 +08:00
George Hotz 5ae2de9845
UOp.variable (#7010)
* UOp.variable [pr]

* fix tests

* clean

* improve name rendering

* last bug
2024-10-12 18:20:44 +08:00
Markiian Novosad 8831c691e2
Add slice parameter type checking to disallow Tensor usage for slices (#6967)
* add support for single el tensors for slices

* rm trailing spaces

* cleanup long lines

* remove tensor in slice support, add comprehensive err msg

* cleanup getitem, add slice type check

* Edit err message
2024-10-11 16:20:21 -04:00
qazal 7451812bbf
delete AST_REWRITE ctx var (#6995) 2024-10-11 11:33:16 +03:00
George Hotz e7a0ffe46a
break out linearization [pr] (#6994) 2024-10-11 15:27:33 +08:00
George Hotz e441794c4b
remove custom op support, we waste time maintaining this (#6991)
* remove custom op support, we waste time maintaining this

* customop is over
2024-10-11 14:31:09 +08:00
George Hotz c08521e823
minor cleanups from toonygrad (#6990) 2024-10-11 14:19:10 +08:00
George Hotz f50d0e0ee0
cloud device [pr] (#6964)
* first try at cloud device [pr]

* real separation

* we're free

* clang works

* unhappy with timeout

* better timeouts and free

* unrelated

* use http verbs + add test

* lines + better test

* fix DELETE

* shorter cloud

* split key

* fix sending renderer

* PTXRenderer serialization

* add sessions

* http.client

* minor timeout bump

* fix keep-alive

* inc server timeout

* real fix timeout

* that one too
2024-10-11 12:24:06 +08:00
Bhavya Gada 23c09f4b4c
add support for padding='same' in nn.conv (#6975)
* add support for padding='same' in nn.conv

* express concisely

* simplify loop

* test same padding with dilation and conv1d

* fix bad indentation

* make loop one liner
2024-10-11 11:39:07 +08:00
qazal 4ef5310039
track viz context even if rewrite errors [pr] (#6976) 2024-10-10 18:33:15 +03:00
chenyu 592e5f1df2
skip test_viz test_no_dedup_different_opts (#6979) 2024-10-10 11:10:24 -04:00
chenyu e3dc10f8f6
improve fold_unrolled_divs (#6977)
addressed #6935
the first few terms in fold_unrolled_divs might have been folded already, so the check should first try to add those terms back. there is a case that every but one term is folded which is not an add chain anymore, so just added as a failed test case for now
2024-10-10 10:52:05 -04:00
qazal 3481468702
bring viz to core (#6970)
* move viz to core

* pathfix

* move test_viz to core

* cleanup test_viz diff

* use contextvars
2024-10-10 16:56:26 +03:00
qazal 3724a66716
move test_viz to test/, prereq for tinygrad/viz [pr] (#6972) 2024-10-10 11:40:46 +03:00