tinygrad/test/external
Casey Primozic aab9ee0fca
Add RDNA3 assembler `UOps.CAST` partial support + other fixes/improvements (#1012)
* Add support for one case of `UOps.CAST` for RDNA3 assembler

 * Adds support for casting from `bool` -> `float32`.  Seems like a very common operation that is required in many places.
 * Fix bool register definition for vector operations
   * Use `vcc_lo` instead of `vcc` which seems to be required since it's configured to use wavefront_size=32
 * Add vector support for some places that were scalar only in register definition and comparison ops
 * Fix some issues in what seems to be defunct `external_test_image.py`
   * Some tests still don't pass for other reasons, but it at least runs now and one broken test is now fixed

* Refactor RDNA3 assembler register definition

 * Unify multi-registor code between dtypes and combine with single-register allocation since they're all untyped registers at the end of the day
2023-06-20 11:34:10 -07:00
..
external_copy_benchmark.py good changes from the M1 Tensor Core project (#730) 2023-03-29 05:11:02 +04:00
external_hlb_cifar.py fix batchnorm at training (#753) 2023-04-19 08:01:04 -07:00
external_metal_uaf.py fix metal uaf (#964) 2023-06-09 21:28:06 -07:00
external_multi_gpu.py nocopy (#764) 2023-05-05 09:32:06 -07:00
external_osx_profiling.py multi cl_queue (#762) 2023-05-03 12:15:28 -07:00
external_test_gpu_ast.py move to shapetracker.py 2023-03-11 07:50:07 -08:00
external_test_image.py Add RDNA3 assembler `UOps.CAST` partial support + other fixes/improvements (#1012) 2023-06-20 11:34:10 -07:00
external_test_llvm.py fix up external tests 2023-03-06 06:52:28 -08:00
external_test_onnx_backend.py exclude a few more onnx tests 2023-06-19 08:51:29 -07:00
external_test_opt.py Adds tril & triu support (#936) 2023-06-09 22:13:20 -07:00
external_test_optim.py Add LAMB optimizer (#821) 2023-05-28 15:09:05 -07:00
external_test_yolo.py test cleanups 2023-03-03 06:36:06 -08:00
external_test_yolov8.py Add yolov8 implementation (#806) 2023-06-16 18:55:19 -07:00
fuzz_shapetracker.py Remove Zeroview (#748) 2023-04-17 08:21:46 -07:00
fuzz_symbolic.py Remove Zeroview (#748) 2023-04-17 08:21:46 -07:00
graph_batchnorm.py test cleanups 2023-03-03 06:36:06 -08:00