mirror of https://github.com/commaai/tinygrad.git
7c5729a3bd
* wmma: refactor to remove wmma_func and create TC funcs as needed * test_linearizer: disable bf16 CUDA during emulation testing * cstyle: clean up creation of CUDA vec dtypes * extra/gemm: add option to accumulate to bfloat16 * cleanups * benchmark: add CUDA bfloat16 matmul * more cleanups |
||
---|---|---|
.. | ||
benchmark.yml | ||
python-publish.yml | ||
szdiff.yml | ||
test.yml |