tinygrad/test/models
reddyn12 9b5e15db6e
Mamba Implementation (#3456)
* first commit

* state back to orig

* mamba comparisions

* rm file

* rename file

* use Tensor.einsum and mke default model 370M

* Cleaned code and made a comparision test

* Simplyfy pull request. Only has 1 mamba implementation now.

* Update prompt

* rm whitespaces

* last space

* remove Einops dependency

* rm unused code

* add tests

* rm print statement

* rm imports

* skip CLANG

* Update skipIf description

* skip model test in CI and add CLANG fix

* rm Device import

* don't be stupid

* Fix conv assign

When the prompt is too short, the logic for conv_state assign messes up. This can be fixed when padding the tokenized array to min length of 4. I padded using the empty string token, but idk if proper practice is to use the PAD token

* fix p1

* temp

* fix jit import

---------

Co-authored-by: schlimeszn <schlimeszn@gmail.com>
Co-authored-by: reddyn <nikidsniper@gmail.com>
Co-authored-by: George Hotz <72895+geohot@users.noreply.github.com>
2024-03-28 17:49:12 -07:00
..
efficientnet test cleanups 2023-03-03 06:36:06 -08:00
waifu2x waifu2x vgg7: testcase, auto-RGBA->RGB, function to grab pretrained models, training "fix" (#2117) 2023-10-19 22:07:15 -07:00
whisper whisper: support batch inference, add librispeech WER test (#2074) 2023-11-16 13:50:08 -08:00
__init__.py more test cleanups (#2631) 2023-12-05 16:17:57 -08:00
test_bert.py Apply ruff linting rules to tests (#2473) 2023-11-27 21:24:06 -08:00
test_efficientnet.py remove cpu prereqs (#3410) 2024-02-15 13:45:06 +01:00
test_end2end.py call ruff with --preview (#2522) 2023-11-30 13:59:00 -05:00
test_mamba.py Mamba Implementation (#3456) 2024-03-28 17:49:12 -07:00
test_mnist.py Remove pytest markers (#2831) 2023-12-18 18:53:28 -05:00
test_onnx.py fix some long lines in tests (#3006) 2024-01-03 23:53:33 -05:00
test_real_world.py create engine folder and move code (#3948) 2024-03-26 20:38:03 -07:00
test_rnnt.py move things, clean up extra (#2292) 2023-11-13 20:18:40 -08:00
test_train.py Remove pytest markers (#2831) 2023-12-18 18:53:28 -05:00
test_waifu2x.py Apply ruff linting rules to tests (#2473) 2023-11-27 21:24:06 -08:00
test_whisper.py move is_dtype_supported to test.helpers (#3762) 2024-03-15 14:33:26 -04:00