Lists (9)
Sort Name ascending (A-Z)
01_Successful CV PhD Journey
02_PyTorch Coding
03_Spatial Temporal Forecasting
04_Time Series Forecasting
05_Diffusion Model
06_Awesome Backbone
07_Awesome Loss
08_LLM and LLM Agent
Stars
Give us minutes, we give back a faster Mamba. The official implementation of "Faster Vision Mamba is Rebuilt in Minutes via Merged Token Re-training".
SEED-Voken: A Series of Powerful Visual Tokenizers
The paper collections for the autoregressive models in vision.
The official GitHub page for the survey paper "A Survey of Large Language Models".
A suite of image and video neural tokenizers
Tips for Writing a Research Paper using LaTeX
π A curated list of MIT faculty that tackle climate change with machine learning for applying students, undergraduates, or others
[Embodied-AI-Survey-2024] Paper list and projects for Embodied AI
A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 π and reasoning techniques.
π A Collection of Awesome Large Weather Models (LWMs) | AI for Earth (AI4Earth) | AI for Science (AI4Science)
A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery (EMNLP'24)
Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"
Implementation of Alphafold 3 from Google Deepmind in Pytorch
A concise but complete full-attention transformer with a set of promising experimental features from various papers
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)
Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.
code for learning trajectory dependencies for human motion prediction
Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone
OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]
Official repository for paper MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning(https://arxiv.org/abs/2406.17770).
Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".
Official Pytorch Implementation for "VidToMe: Video Token Merging for Zero-Shot Video Editing" (CVPR 2024)
This repo contains the code for 1D tokenizer and generator
Pruner-Zero: Evolving Symbolic Pruning Metric from scratch for LLMs
[NeurIPS2024 Spotlight] The official implementation of GrootVL: Tree Topology is All You Need in State Space Model