This repository contains code to finetune and run Aurora-M, an open-source model based on StarCoderPlus, trained on 400B additional tokens of multilingual and multi-domain data and adapted for multimodal understanding using the BakLLaVA/LLaVA 1.5 codebase. The 400B additional tokens were trained with BigCode's Megatron fork. The model is intended for mixture-of-experts (MoE) adaptation using the M*DEL MoE adaptation method. See our M*DEL project page for more details.
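As a quick orientation, here is a minimal sketch of loading and prompting the text-only Aurora-M checkpoint with Hugging Face `transformers`. The model ID, dtype, and device settings are assumptions and should be replaced with the checkpoint you actually intend to use:

```python
# Minimal sketch: load Aurora-M (StarCoderPlus-based causal LM) and generate text.
# The model ID below is a placeholder assumption; point it at your local
# checkpoint or the Hub ID of the Aurora-M weights you want to run.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "aurora-m/aurora-m-base"  # assumption: replace with your checkpoint

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(
    MODEL_ID,
    torch_dtype=torch.bfloat16,  # assumption: bf16 inference on a recent GPU
    device_map="auto",
)

prompt = "Translate to Finnish: The northern lights are visible tonight."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64, do_sample=False)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

Multimodal finetuning and inference follow the BakLLaVA/LLaVA 1.5 training and serving scripts rather than this plain-text path.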
Compute was provided by the LUMI supercomputer center and the JUWELS supercomputer center. Thank you!
Also check out our BakLLaVA project, a collaboration between the open-source AI organizations LAION, Ontocord, Skunkworks OSS AI group, and AI Alignment Lab.