mirror of https://github.com/commaai/tinygrad.git
12996d3a7d
* these asserts should pass
* fix that assert
* ALU dtypes
* acc dtype for group_for_reduce
* cast image ALUs to the base dtype
* remove all casts from linearizer
* fix argmax
* fix multinomial
* fix __getitem__
* Revert "fix __getitem__"
  This reverts commit 62ad719bfa5a2e1fcbfa931360f54897f8977602.
* fix MemBuffer outputs being wrong when there is an arange + ALU with a different dtype
  eg. fancy slicing (int, float), bert embeddings (int, long)
  this should be fixed in lazy instead of having to break the kernel
* cleanup argmax fix
* fix matmul in ints cast in the end
* fix llama
* skip wrong hardcoded asts in the worlds dataset
* fix llama p2
* cleanup missing parts of the diff

---------

Co-authored-by: George Hotz <geohot@gmail.com>

| ||
---|---|---|
accel | ||
assembly | ||
datasets | ||
dist | ||
gemm | ||
junk | ||
models | ||
optimization | ||
qcom_gpu_driver | ||
triton | ||
archprobe.py | ||
augment.py | ||
autopad.py | ||
dump_cache.py | ||
export_model.py | ||
gradcheck.py | ||
introspection.py | ||
lr_scheduler.py | ||
multitensor.py | ||
onnx.py | ||
onnx_ops.py | ||
thneed.py | ||
to_movement_ops.py | ||
training.py |