| mlperf_bert | New BERT dataloader (#5881) | 2024-08-02 15:12:23 -04:00 |
| mlperf_resnet | … | |
| mlperf_unet3d | [MLPerf] UNet3D dataloader (#4343) | 2024-04-28 22:34:18 -04:00 |
| openpilot | comma benchmark (#5518) | 2024-08-02 14:36:54 -07:00 |
| process_replay | fix qcom process_replay for kernel diff (#6079) | 2024-08-14 15:05:49 -04:00 |
| external_benchmark_hip_compile.py | … | |
| external_benchmark_load_stable_diffusion.py | … | |
| external_benchmark_multitensor_allreduce.py | add RING_ALLREDUCE_THRESHOLD (#5835) | 2024-07-31 16:13:09 +03:00 |
| external_benchmark_openpilot.py | comma benchmark (#5518) | 2024-08-02 14:36:54 -07:00 |
| external_benchmark_resnet.py | ruff: unnecessary-comprehension (#5174) | 2024-06-27 07:45:29 -04:00 |
| external_benchmark_schedule.py | MetaOps.KERNEL (#5543) | 2024-07-17 19:41:23 -07:00 |
| external_cl_half_max.py | … | |
| external_hip_compiler_bug.py | CompiledASTRunner -> CompiledRunner (#4148) | 2024-04-11 08:49:52 -07:00 |
| external_jit_failure.py | … | |
| external_llama_eval.py | … | |
| external_metal_compile_fail.py | metal compile fail | 2024-07-11 19:27:05 -07:00 |
| external_model_benchmark.py | ruff: unnecessary-comprehension (#5174) | 2024-06-27 07:45:29 -04:00 |
| external_multi_gpu.py | move disassemblers and openpilot (#4592) | 2024-05-14 19:30:02 -07:00 |
| external_osx_profiling.py | … | |
| external_slow_global_dim4_resnet.py | lowerer is kernel [run_process_replay] (#5437) | 2024-07-12 18:50:55 -07:00 |
| external_test_amd.py | amd doorbell size is 64bits (#4448) | 2024-05-06 16:59:59 +03:00 |
| external_test_datasets.py | clean up how preprocessed folder is defined (#5813) | 2024-07-30 12:35:26 -04:00 |
| external_test_embedding.py | … | |
| external_test_example.py | numpy device + pickle it (#4120) | 2024-04-09 13:19:30 -07:00 |
| external_test_hcq.py | add _alloc_signal/_free_signal to hcq (#5264) | 2024-07-02 23:35:39 +03:00 |
| external_test_hip_compile.py | lowerer is kernel [run_process_replay] (#5437) | 2024-07-12 18:50:55 -07:00 |
| external_test_hsa_driver.py | Rename tinygrad/runtime/driver to support (#5413) | 2024-07-12 11:06:42 -07:00 |
| external_test_image.py | … | |
| external_test_jit_on_models.py | Pulled CLIP and UNet into Seperate Files (#5253) | 2024-07-01 22:33:01 -04:00 |
| external_test_llama3_ff.py | work to make GEMV fast (#5824) | 2024-07-30 17:41:40 -07:00 |
| external_test_lm_head.py | isolate the 134ms kernel in train_gpt2.py (#4773) | 2024-05-29 17:26:24 -04:00 |
| external_test_losses.py | [MLPerf][UNet3D] Add DICE loss + metrics (#4204) | 2024-04-17 20:09:33 -04:00 |
| external_test_mamba.py | external that test | 2024-03-29 19:35:50 -07:00 |
| external_test_metrics.py | Convert BinaryOps.DIV to UnaryOps.RECIP and BinaryOps.IDIV (#4887) | 2024-06-14 02:43:46 -07:00 |
| external_test_mnist_data_select.py | … | |
| external_test_nv.py | lowerer is kernel [run_process_replay] (#5437) | 2024-07-12 18:50:55 -07:00 |
| external_test_onnx_backend.py | remove "no-nans-fp-math"="true" for LLVM (#5282) | 2024-07-03 17:52:50 -04:00 |
| external_test_opt.py | test: put conv in one reduce (#4441) | 2024-07-22 12:16:13 +03:00 |
| external_test_optim.py | improve test_dropout_on_shard (#4912) | 2024-06-11 11:36:02 -04:00 |
| external_test_speed_llama.py | all realize 2 (#4527) | 2024-05-10 22:43:09 -07:00 |
| external_test_speed_theoretical.py | test flops (and allow wide ALU in UOps) [run_process_replay] (#5749) | 2024-07-26 21:07:28 -07:00 |
| external_test_uops_graphing.py | lowerer is kernel [run_process_replay] (#5437) | 2024-07-12 18:50:55 -07:00 |
| external_test_whisper_librispeech.py | names shadowing builtins (#5179) | 2024-06-27 08:15:01 -04:00 |
| external_test_yolo.py | … | |
| external_test_yolov8.py | … | |
| fuzz_graph.py | graph fuzzer (#5082) | 2024-06-21 18:47:23 +03:00 |
| fuzz_kfd.py | add _alloc_signal/_free_signal to hcq (#5264) | 2024-07-02 23:35:39 +03:00 |
| fuzz_linearizer.py | pretty print lazy op per default (#5505) | 2024-07-18 09:34:08 -07:00 |
| fuzz_schedule.py | graph LBScheduleItem [run_process_replay] (#5960) | 2024-08-07 19:59:11 +03:00 |
| fuzz_shapetracker.py | … | |
| fuzz_shapetracker_math.py | tinytqdm.set_description and tinytrange (#5101) | 2024-06-22 14:45:06 -04:00 |
| fuzz_symbolic.py | … | |
| fuzz_uops.py | fuzz uops is simpler with List[UOp] [run_process_replay] (#5875) | 2024-08-02 17:28:15 +03:00 |
| graph_batchnorm.py | … | |
| speed_beam_v_hcopt.py | move graph/search to engine (#4596) | 2024-05-14 23:12:59 -07:00 |
| speed_compare_cuda_nv.py | move colorize_float to helpers.py (#5490) | 2024-07-15 11:29:03 -07:00 |
| speed_compare_cuda_ptx.py | move colorize_float to helpers.py (#5490) | 2024-07-15 11:29:03 -07:00 |
| verify_kernel.py | pretty print lazy op per default (#5505) | 2024-07-18 09:34:08 -07:00 |