Yet another AI from scratch repo.
- The files are appropriately named.
- May contain AI slop.
- Decision Trees
- Transformers
- Encoder
- Decoder
- Encoder + Decoder
- Impact on Different tokenizers (i.e. BPE tokenizer vs Character level)
- Impact on number of heads
- Diffusion
- RL
- Neural Network primitives
- CNN - A guide to convolution arithmetic for deep learning
- Discrete convolutions
- Stride
- Padding
- Pooling
- Transposed
- Dilated convolutions
- Works with Gradients
- CNN - A guide to convolution arithmetic for deep learning
- Errors and residuals
- Abstract common functions? Might reduce readability :(
- Use Multiple Dispatch and Structs/Types
- Filename, Model name is Paper name
- Stateless? Memory allocations concerns if truely end-to-end stateless. We get history for free though. Maybe Stateless functions only?
- Remove Flux / NNlib dependancies in core code.
- Remove Zygote / AutoDiff? We lose flexibility in changing the model.
- Test from-scratch versions against established libraries for correctness.
- Einstien notation-like that better exposes the math instead of hiding behind API abstractions. Maybe start with Tullio?
- Improve training in examples/