.. |
accel
|
…
|
|
assembly
|
lowerer is kernel [run_process_replay] (#5437)
|
2024-07-12 18:50:55 -07:00 |
backends
|
move GraphException to jit.py (#5744)
|
2024-07-26 19:01:12 -04:00 |
datasets
|
New BERT dataloader (#5881)
|
2024-08-02 15:12:23 -04:00 |
disassemblers/adreno
|
move disassemblers and openpilot (#4592)
|
2024-05-14 19:30:02 -07:00 |
gemm
|
work to make GEMV fast (#5824)
|
2024-07-30 17:41:40 -07:00 |
hip_gpu_driver
|
amd cleanup sdma (#4796)
|
2024-06-01 17:06:44 +03:00 |
hiprtc
|
…
|
|
junk
|
…
|
|
mockgpu
|
optimize nv profiler (#5856)
|
2024-08-01 23:57:45 +03:00 |
models
|
shard kvcache (#5830)
|
2024-07-30 20:29:54 -07:00 |
nv_gpu_driver
|
nv driver (#4044)
|
2024-04-22 19:50:20 +04:00 |
optimization
|
MetaOps.KERNEL (#5543)
|
2024-07-17 19:41:23 -07:00 |
qcom_gpu_driver
|
fix opencl_ioctl on comma (#5814)
|
2024-07-30 20:44:06 -07:00 |
archprobe.py
|
…
|
|
augment.py
|
…
|
|
disk_read_speed.py
|
io_uring for copies from disk (#5035)
|
2024-06-21 11:36:51 +03:00 |
dump_cache.py
|
…
|
|
export_model.py
|
all realize 2 (#4527)
|
2024-05-10 22:43:09 -07:00 |
f16_w_uint32.py
|
fix various examples (#4691)
|
2024-05-22 20:43:21 -04:00 |
gradcheck.py
|
…
|
|
hip_events.py
|
…
|
|
introspection.py
|
bring buffer back to device (#4517)
|
2024-05-10 11:22:31 -07:00 |
lr_scheduler.py
|
use at least float32 for optim.lr (#4297)
|
2024-04-25 14:42:28 -04:00 |
mcts_search.py
|
parallel mcts (#5626)
|
2024-07-21 14:53:23 -07:00 |
multitensor.py
|
…
|
|
onnx.py
|
remove numpy from dtype (#4969)
|
2024-06-14 15:38:45 -04:00 |
onnx_ops.py
|
pow(2) -> square in RMSNorm [run_process_replay] (#5901)
|
2024-08-04 14:21:31 -04:00 |
ring_copy.py
|
…
|
|
thneed.py
|
…
|
|
threefry.py
|
docs: showcase remove mnist_gan and add conversation.py (#4757)
|
2024-05-28 11:09:26 -04:00 |
to_movement_ops.py
|
scheduleitem is not Tuple [run_process_replay] (#5425)
|
2024-07-12 15:13:19 -07:00 |
training.py
|
tinytqdm.set_description and tinytrange (#5101)
|
2024-06-22 14:45:06 -04:00 |
transfer_speed.py
|
…
|
|