Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Does VISORGPT supports to generate multiple instances with different sizes in an image? #4

Open
zhangh0920 opened this issue Oct 30, 2023 · 1 comment

Comments

@zhangh0920
Copy link

I want to know if I want to generate objects with different sizes, such as a large building and lots of small windows in an image, can VISORGPT do it?

@Sierkinhane
Copy link
Collaborator

Sierkinhane commented Oct 30, 2023

Thank you for your interest. VisorGPT can generate objects of different sizes, and the flag (small, medium, large) indicates the average area of all instances in one sample. Since the training data involves a limited set of annotated classes (not including all open-world objects), the current model may not have the ability to handle some open-world situations (some novel classes). This work primarily validates that visual priors can be learned through generative pre-training. We are actively working on enhancing VisorGPT to make it capable of handling open-world scenarios.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants