This repository contains various demonstrations of the Flux machine learning library. Any of these may freely be used as a starting point for your own models.
The models are broadly categorised into the folders vision (e.g. large convolutional neural networks (CNNs)), text (e.g. various recurrent neural networks (RNNs) and natural language processing (NLP) models), games (Reinforcement Learning / RL). See the READMEs of respective models for more information.
Each model comes with its own Julia project. To use this, open Julia in the project folder, and enter
using Pkg; Pkg.activate("."); Pkg.instantiate()
This will install all needed packages, at the exact versions when the model was last updated. Then you can run the model code with include("<model-to-run>.jl")
, or by running the model script line-by-line.
Models may also be run with NVIDIA GPU support, if you have a CUDA installed. Most models will have this capability by default, pointed at by calls to gpu
in the model code.
Each model can be used in Gitpod, just open the repository by gitpod
- Based on Gitpod's policies, free access is limited.
- All of your work will place in the Gitpod's cloud.
- It isn't an officially maintained feature.
We welcome contributions of new models and documentation.
If you want to share a new model, we suggest you follow these guidelines:
- Models should be in a folder with a project and manifest file to pin all relevant packages.
- Models should include a README(.md) to explain what the model is about, how to run it, and what results it achieves (if applicable).
- Models should ideally be CPU/GPU agnostic and not depend directly on GPU functionality.
- Please keep the code short, clean, and self-explanatory, with as little boilerplate as possible.
You can contribute in one of the following ways
- Add or improve documentation to existing models: Write the following information:
- Give a brief introduction to the model’s architecture and the goal it archives.
- Describe the Flux API that the model demonstrates (high-level API, AD, custom operations, custom layers, etc.).
- Add literature background for the model. More specifically, add articles, blog posts, videos, and any other resource that is helpful to better understand the model.
- Mention the technique that is being demonstrated. Briefly describe the learning technique being demonstrated (Computer vision, regression, NLP, time series, etc.).
- Write in-depth tutorials for a model: You can further extend the documentation of a model and create a tutorial to explain in more detail the architecture, the training routine, use your own data, and so forth. After you write a tutorial, create a PR with it for the Tutorials section on the FluxML website.
Each example lists the version of Flux for which it was most recently updated. Bringing them up to the latest is a great way to learn! Flux has a NEWS page listing important changes. (For other packages, see their releses page: MLUtils, MLDatasets, etc.)
To run the old examples, Flux v0.11 can be installed and run on Julia 1.6, the LTS version.
Flux v0.12 works on Julia 1.8.
Flux v0.14 is the latest right now, this and v0.13 are marked with ☀️; models upgraded to use
explicit gradients (v0.13.9+ or v0.14) have a +
.
Vision
- MNIST
- Simple multi-layer perceptron ☀️ v0.13 +
- Simple ConvNet (LeNet) ☀️ v0.14
- Variational Auto-Encoder ☀️ v0.13 +
- Deep Convolutional Generative Adversarial Networks ☀️ v0.13 +
- Conditional Deep Convolutional Generative Adversarial Networks ☀️ v0.13
- Score-Based Generative Modeling (Diffusion Model) ☀️ v0.13
- Spatial Transformer ☀️ v0.13 +
- CIFAR10
- VGG 16/19 ☀️ v0.13 +
- ConvMixer "Patches are all you need?" ☀️ v0.13
Text
- CharRNN ☀️ v0.13 +
- NanoGPT ☀️ v0.14
- Character-level language detection ☀️ v0.13 +
- Seq2Seq phoneme detection on CMUDict ⛅️ v0.11
- Recursive net on IMDB sentiment treebank ⛅️ v0.11
Other & contributed models
- Logistic Regression Iris ☀️ v0.13 +
- Autoregressive Model ☀️ v0.13 +
- BitString Parity Challenge ⛅️ v0.11
- MLP on housing data (low level API) ⛅️ v0.11
- FizzBuzz ☀️ v0.13 +
- Meta-Learning ❄️ v0.7
- Speech recognition ❄️ v0.6
Tutorials
- A 60 Minute Blitz ⛅️ v0.11
- DataLoader example with image data ⛅️ v0.11
- Transfer Learning ☀️ v0.13 +
MLJFlux is a bridge to MLJ.jl, a package for mostly non-neural-network machine learning. They have some examples of interest, which like the model zoo's examples, each include a local Project & Manifest file: