Overview Repositories 56 Projects 0 Packages 0 Stars 1.3k

Shida Wang radarFudan

Machine Learning and Math

76 followers · 63 following

NUS
Singapore
https://radarfudan.github.io
https://orcid.org/0009-0001-1457-2419
@SanderWangSD

Highlights

Developer Program Member
Pro

Curse-of-memory Public

Curse-of-memory phenomenon of RNNs in sequence modelling

rnn ssm long-term-memory

Jupyter Notebook 19 1 Updated Dec 23, 2024
radarFudan.github.io Public

HTML 3 Updated Dec 15, 2024
mamba-minimal-jax Public

Python 29 Updated Nov 22, 2024
flash-linear-attention Public
Forked from sustcsonglin/flash-linear-attention

Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton

Python MIT License Updated Nov 6, 2024
Awesome-state-space-models Public

Collection of papers on state-space models

562 20 Updated Nov 3, 2024
mamba Public
Forked from state-spaces/mamba

Python 15 Apache License 2.0 Updated Oct 26, 2024
snippets Public

Jupyter Notebook Updated Sep 19, 2024
pythia Public
Forked from EleutherAI/pythia

The hub for EleutherAI's work on interpretability and learning dynamics

Jupyter Notebook 1 Apache License 2.0 Updated Jul 12, 2024
llm.c Public
Forked from karpathy/llm.c

LLM training in simple, raw C/CUDA

Cuda MIT License Updated Jul 12, 2024
transformers Public
Forked from huggingface/transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 1 Apache License 2.0 Updated Jul 10, 2024
mamba2-minimal Public
Forked from tommyip/mamba2-minimal

Minimal Mamba-2 implementation in PyTorch

Python Apache License 2.0 Updated Jun 17, 2024
SSM_examples Public

Jupyter Notebook 1 1 Updated Apr 1, 2024
s4 Public
Forked from state-spaces/s4

Structured state space sequence models

Jupyter Notebook Apache License 2.0 Updated Mar 24, 2024
google-research Public
Forked from google-research/google-research

Google Research

Jupyter Notebook 1 Apache License 2.0 Updated Mar 11, 2024
flash-attention Public
Forked from Dao-AILab/flash-attention

Fast and memory-efficient exact attention

Python 1 BSD 3-Clause "New" or "Revised" License Updated Mar 11, 2024
t5-pegasus-pytorch Public
Forked from renmada/t5-pegasus-pytorch

Python 1 Updated Mar 11, 2024
attention_with_linear_biases Public
Forked from ofirpress/attention_with_linear_biases

Code for the ALiBi method for transformer language models (ICLR 2022)

Python 1 MIT License Updated Mar 11, 2024
EffHDC Public
Forked from zhangluyan9/EffHDC

Python 1 Updated Mar 11, 2024
in-context-operator-networks Public
Forked from LiuYangMage/in-context-operator-networks

ICON for in-context operator learning

Python MIT License Updated Mar 11, 2024
gateloop-transformer Public
Forked from lucidrains/gateloop-transformer

Implementation of GateLoop Transformer in Pytorch and Jax

Python MIT License Updated Mar 11, 2024
causal-conv1d Public
Forked from Dao-AILab/causal-conv1d

Causal depthwise conv1d in CUDA, with a PyTorch interface

Cuda 1 BSD 3-Clause "New" or "Revised" License Updated Mar 11, 2024
RWKV-CUDA Public
Forked from BlinkDL/RWKV-CUDA

The CUDA version of the RWKV language model ( https://github.com/BlinkDL/RWKV-LM )

Cuda 1 Updated Mar 7, 2024
profiling-cuda-in-torch Public
Forked from gpu-mode/profiling-cuda-in-torch

Python 3 Updated Mar 7, 2024
triton Public
Forked from triton-lang/triton

Development repository for the Triton language and compiler

Python 1 MIT License Updated Mar 3, 2024
annotated-mamba Public
Forked from srush/annotated-mamba

Annotated version of the Mamba paper

Jupyter Notebook 2 MIT License Updated Feb 28, 2024
lightning-hydra-template Public template
Forked from ashleve/lightning-hydra-template

PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡

Python 1 Updated Feb 27, 2024
flash-fft-conv Public
Forked from HazyResearch/flash-fft-conv

FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores

C++ 2 Apache License 2.0 Updated Feb 25, 2024
TinyLlama Public
Forked from jzhang38/TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 1 Apache License 2.0 Updated Feb 25, 2024
LongMamba Public
Forked from jzhang38/LongMamba

Some preliminary explorations of Mamba's context scaling.

Python Updated Feb 8, 2024
S5 Public
Forked from lindermanlab/S5

Python MIT License Updated Feb 3, 2024
