Popular repositories

  1. Tune-A-Video Public

    [ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

    Python, 4.3k stars, 386 forks

  2. Awesome-Video-Diffusion Public

    A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

    3.5k stars, 205 forks

  3. Show-1 Public

    [IJCV] Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

    Python, 1.1k stars, 62 forks

  4. Show-o Public

    Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

    Python, 1k stars, 44 forks

  5. MotionDirector Public

    [ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.

    Python, 853 stars, 54 forks

  6. Image2Paragraph Public

    [A toolbox for fun.] Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.

    Python, 796 stars, 54 forks

Repositories

Showing 10 of 74 repositories
  • ShowUI Public

    Repository for ShowUI: One Vision-Language-Action Model for GUI Visual Agent

    Python 241 MIT 9 0 0 Updated Nov 29, 2024
  • Awesome-Video-Diffusion Public

    A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

    3,530 205 4 1 Updated Nov 29, 2024
  • FQGAN Public

    FQGAN: Factorized Visual Tokenization and Generation

    30 0 0 0 Updated Nov 28, 2024
  • Awesome-MLLM-Hallucination Public

    📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).

    472 14 2 0 Updated Nov 28, 2024
  • ROICtrl Public

    Code for ROICtrl: Boosting Instance Control for Visual Generation

    76 0 0 0 Updated Nov 28, 2024
  • Awesome-GUI-Agent Public

    💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.

    296 12 0 0 Updated Nov 27, 2024
  • VideoLISA Public

    [NeurIPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos

    Python 68 Apache-2.0 2 3 0 Updated Nov 27, 2024
  • computer_use_ootb Public

    An out-of-the-box (OOTB) version of Anthropic Claude Computer Use for Windows and macOS

    Python 755 MIT 73 6 4 Updated Nov 26, 2024
  • MovieBench Public
    20 0 0 0 Updated Nov 26, 2024
  • Awesome-Unified-Multimodal-Models Public

    📖 A repository for organizing papers, code, and other resources related to unified multimodal models.

    225 9 0 0 Updated Nov 23, 2024