Commit Graph

228 Commits

Author SHA1 Message Date
nimlgen 3c56aeee70
add Tensor.from_blob (#6765)
* draft tensor from pointer init

* some docs and types

* comment

* cleaner

* test

* malloc

* qcom cl interop

* jit example

* cleaner

* dealoc

* wording

* docs
2024-09-26 18:33:19 +08:00
George Hotz e015b41ce9
remove e( function just alu( [run_process_replay] (#6589)
* remove e( function just alu( [run_process_replay]

* missed two
2024-09-19 10:24:02 +08:00
George Hotz bdd0c06f29
add void type to uop (#6471)
* unwrap_dtype maybe

* uopgraph stuff that hardcoded None

* test_ops passes

* dtypes.py fixups

* update test_linearizer and friends

* more ast updates

* test_beam and test_schedule too

* add void type to uop [run_process_replay]

* remove dumb casts

* start making it green

* more cast cleanups

* more cls methods to fix

* regenerate dataset

* split UOp and NOp const

* maybe that too

* fix docs

* update test_uop_symbolic

* test_verify_ast

* new sops with no diff

* meh, type_ignore is alright

* remove that assert

---------

Co-authored-by: qazal <qazal.software@gmail.com>
2024-09-11 18:16:28 +08:00
Obada Khalili 0fbd141038
tinygrad Tensor Puzzles (#6315)
* Update index.md

* update readme

* Revert "update readme"

This reverts commit 8415a8e90cf3174e0668ea93e2901bdb635b052e.

* update readme

* remove mention

* update index.md
2024-09-09 09:32:38 +08:00
nimlgen bf645d62b3
qcom docs (#6338) 2024-09-02 20:42:20 +03:00
nimlgen 9b616cb33e
HCQArgsState lifetime docs (#6323) 2024-08-30 00:31:49 +03:00
qazal 8c50ef8b7c
start uop docs (#6291)
* start uop docs

* only need show_labels

* sink comes first

* hotfix: invalid

* touchups

* 2 space indent works

* limit some buffer uops

* better BARRIER doc, Op -> UOp when it makes sense.

* make KernelInfo optional

* more work

relative links don't work

* this can be local in multi reduce+pads

* add UOps.SHAPETRACKER details

* UOps.CONST both types

* nit: local buffer isn't device Buffer, habit

* nit2: dtype -> DType
2024-08-29 15:22:39 +03:00
wozeparrot ea5b7910b7
AMD support gfx103x (#5926) 2024-08-28 14:17:08 -07:00
George Hotz 5ed6c6ef3e hotfix: 220V 15A -> 220V 20A 2024-08-27 10:20:43 -07:00
wozeparrot a7bf20c7cd
feat: updated tinybox docs (#6261)
* feat: updated tinybox docs

* fix: grammar
2024-08-23 18:27:46 -07:00
chenyu 590c0922b6
Tensor.prod (#6250)
* Tensor.prod

a new reduce op!

* onnx ReduceProd
2024-08-23 10:06:32 -04:00
Alessandro Benetti 9328248610
support for std_mean and cross_entropy (#6181)
* support for std_mean and cross_entropy (#3)

* Cross entropy and std mean support

* remove extra examples
2024-08-19 12:06:44 -07:00
George Hotz 9bc81c6db4
UOps.SHAPETRACKER (#6129)
* UOps.SHAPETRACKER [run_process_replay]

* no process replay
2024-08-16 23:26:34 -07:00
George Hotz 89c7989659
no shapetracker in ops [run_process_replay] (#6117) 2024-08-16 17:23:27 -07:00
George Hotz 74ee9febec
remove iter from uopgraph (#6110)
* remove iter from uopgraph

* linearize returns uops

* fix tests

* linearize in linearize

* tests fix

* touchup

* test failures
2024-08-16 15:58:29 -07:00
qazal 28c75bf2a6
merge uops with ops (#6111)
Co-authored-by: chenyu <chenyu@fastmail.com>
2024-08-16 18:17:57 -04:00
qazal c23d44c779
AST is UOp (#6030)
* most of the work from the uops2 branch

* schedule

* realize

* kernel

* lowerer

* search

* green

* merge uops with ops

* Revert "merge uops with ops"

This reverts commit 1408a59f12c97e3466679884266b247cf9df46bc.

* fix benchmark

* remove extra dedup
2024-08-16 22:09:00 +03:00
George Hotz 64563abc90
add LSTMCell to nn (#6080)
* add LSTMCell to nn

* lstmcell works with no input on first

* fix no bias 0

* simpler
2024-08-14 12:08:42 -07:00
George Hotz 97c3563109 hotfix: clamp in docs 2024-08-13 16:06:30 -07:00
nimlgen fa84e6ec48
init hcq args state (#6046)
* init hcq args state

* cleaner

* amd

* fillargs

* fixes

* myoy

* docs

* fix

* not needed

* spacing
2024-08-13 17:11:58 +03:00
chenyu d82370f6ef
docs: fix broken links and update is_floating_point (#6023)
* docs: fix broken links and update is_floating_point

broken links would only show as INFO and not an error.

* make doc andhors warn
2024-08-10 15:58:48 -04:00
George Hotz 0a8668cf30 improvements to docs 2024-08-07 09:57:24 -07:00
Eitan Turok 39c8c9c00a
Add docs (#5942)
* init commit

* finish writing

* add to docs

* fix docs

* fix typo

* delete new line

* rename to tensor properties

---------

Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com>
2024-08-07 07:38:51 -07:00
nimlgen 71e1472290
hcq more types (#5791)
* mhcq more types

* linter

* pylint

* docs: bind
2024-07-29 18:03:23 +03:00
nimlgen fff19b961b
docs: user runtime docs (#5756) 2024-07-27 23:21:54 +03:00
nimlgen 32b0c07d5a
docs: fix synchronization example in hcq (#5604) 2024-07-20 10:52:06 +03:00
chenyu 3acd8559f4
doc: variable names in abstractions2.py (#5591) 2024-07-19 17:06:15 -04:00
nimlgen b1782e3fef
hcq refactor signal into class (#5575)
* hcq refactor signal into class

* fix amd

* amd do not use amd_signal_t

* cleanup

* signal setter

* fix linter

* docs

* more docs + types

* fix types
2024-07-19 23:23:05 +03:00
Francis Lata 2dc100c565
fix typo in runtime overview docs (#5588) 2024-07-19 22:00:15 +03:00
nimlgen 9d7edc9269
hcq rename HCQCompat -> HCQ (#5577) 2024-07-19 11:34:17 +03:00
kormann c951bc99af
fix abstracions2 printout (#5557) 2024-07-18 21:21:45 +03:00
kormann 2c4add6844
pretty print lazy op per default (#5505)
* pretty lop

* min diff

* walrus

* fix

* min diff

* simplify

* pretty helper function

* ws

* pretty uop upat

* tests

* stricter tests

* test passes

* ws

* stronger upat test

* delete print_tree

* min diff

* stricter exp test

* fix merge

* stronger uops eval test

* +readable and deep upat test

* +readable and deep upat test

* sort inv fix

* fix

* revert allowed_len
2024-07-18 09:34:08 -07:00
George Hotz fa7e734b49
MetaOps.KERNEL (#5543) 2024-07-17 19:41:23 -07:00
chenyu 0afcbfae84
docs: add Tensor.interpolate to doc page (#5510) 2024-07-16 14:17:19 -04:00
nimlgen 8dfd11c1d8
docs: hcq add types (#5495)
* docs: hcq add types

* linter
2024-07-15 22:14:48 +03:00
George Hotz 338b7590b9 hotfix: docs for BatchNorm 2024-07-15 12:04:17 -07:00
nimlgen c9ec7ce070
start hcq docs (#5411)
* start hcq docs

* more hcq docs

* docs

* docs

* linter

* correct args

* linter

* ts returns int
2024-07-15 21:31:11 +03:00
nimlgen 604fb60143
docs: fix link to jit in env_vars (#5474) 2024-07-14 16:08:16 +03:00
chenyu 28972418c4
s/get_linearizer/get_kernel [run_process_replay] (#5467) 2024-07-13 20:32:22 -04:00
Simone Margaritelli 03c3b14cc2
docs: addded JIT description to dos/env_vars.md (#5445)
* docs: addded JIT description to dos/env_vars.md

* docs: rephrased JIT=2 in env_vars.md
2024-07-13 07:07:11 -07:00
chenyu 4df63da190
clean up rest of the loadop [run_process_replay] (#5440)
to metaop and filter_sink
2024-07-12 23:38:51 -04:00
George Hotz 6707c778d0
scheduleitem is not Tuple [run_process_replay] (#5425)
* scheduleitem is not Tuple [run_process_replay]

* fix tests

* fix op + fuzzers

* fix mop test
2024-07-12 15:13:19 -07:00
George Hotz f6ef283e6a
s/loadops/metaops [run_process_replay] (#5421) 2024-07-12 13:26:50 -07:00
nimlgen 6604d2b2c3
amd/nv respect visible devs (#5409)
* nv/amd respect visible devices

* linter

* sort amd gpus

* env docs
2024-07-12 20:02:12 +03:00
wozeparrot 245d83a392
more tinybox docs (#5361) 2024-07-10 16:13:24 -07:00
Ian Paul d5a68ae6b3
Simple abstractions3.py fix (#5343)
* abstractions3.py fix

* Add abstractions3.py to CI tests
2024-07-09 13:48:42 +03:00
chenyu 6856f915d6
Tensor.any and Tensor.all (#5320)
does not work in ptx yet due to how boolean tensor is handled
2024-07-07 14:36:00 -04:00
chenyu c1e330f302
Tensor.int and Tensor.bool (#5317) 2024-07-07 11:52:58 -04:00
chenyu b2c3a28a5e
nn.RMSNorm (#5272)
the norm itself has no significant value to add to Tensor method, but we would want Tensor.normalize
2024-07-02 21:39:01 -04:00
George Hotz 146eb3a811 hotfix: add repeat_interleave docs 2024-06-30 15:25:18 -07:00