Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

RGB image or RGB-D for inference ? #29

Open
Fizmath opened this issue Feb 16, 2021 · 2 comments
Open

RGB image or RGB-D for inference ? #29

Fizmath opened this issue Feb 16, 2021 · 2 comments

Comments

@Fizmath
Copy link

Fizmath commented Feb 16, 2021

Hello

Can we use simple RGB images taken by mobile phones in prediction ?

Thanks

@alando46
Copy link

You can but you will need to get the intrinsic matrix from your phone's camera system. You also need to finetune a 2d detector to generate 2d bounding boxes specific to the classes recognized by Total3DUnderstanding.

@Fizmath
Copy link
Author

Fizmath commented Mar 16, 2021

Thank you very much for the response.

I get camera intrinsic parameters Focal Length = 3.46 mm , Sensor Size=4.66*3.51 , Pixel Array Size=4160*3120 , Orintation = 90 degree in android by installing a device info app. So, from these we can construct the intrinsic matrix cam_K.txt, right ?

Now, I wonder how we build camera pose (extrinsic matrix) from this intrinsic matrix ? I searched the web but no straight forward answer. How did you do that in your work ?

Another question : after getting a result from your algorithm, what are the possible solutions for stitching room layouts from overlapping photos to build a complete room perimeter ? Is it possible ?

Thanks

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants