Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

nordland subset #16

Open
wants to merge 1 commit into
base: main
Choose a base branch
from
Open

nordland subset #16

wants to merge 1 commit into from

Conversation

xjh19971
Copy link

@xjh19971 xjh19971 commented Jul 5, 2024

Following [1-2], this script provides a subset of nordland dataset, which contains 2760 query images instead of 27k query images. Meanwhile, the subset requires THRESHOLD_FRAMES=1 instead of 10.

[1] Zaffar, M., Garg, S., Milford, M. et al. VPR-Bench: An Open-Source Visual Place Recognition Evaluation Framework with Quantifiable Viewpoint and Appearance Change. Int J Comput Vis 129, 2136–2174 (2021). https://doi.org/10.1007/s11263-021-01469-5
[2] Ali-bey, A., Chaib-draa, B., & Giguère, P. (2024). BoQ: A Place is Worth a Bag of Learnable Queries. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 17794-17803).

@xjh19971 xjh19971 closed this Jul 5, 2024
@xjh19971 xjh19971 reopened this Jul 5, 2024
@gmberton
Copy link
Owner

Hi, sorry for the late response. Did you try to test some VPR model on the dataset created by this code? Shouldn't the DISTANCE_BETWEEN_FRAMES be different between database and queries?

@xjh19971
Copy link
Author

xjh19971 commented Sep 28, 2024

No, I don't modify the distance_between_frames. Instead, I adjust the sampling frequency between the database and query sets: database images are sampled for every frame, while query images are sampled every 10 frames. Feel free to change this if you find a better way.

To ensure correctness, I compare the generated dataset with the one provided in the SALAD repository (https://github.com/serizba/salad?tab=readme-ov-file). Specifically, I cross-check the first 10 frames with the reference dataset available here: https://surfdrive.surf.nl/files/index.php/s/sbZRXzYe3l0v67W. My comparison shows that the first 10 frames in my dataset match those from this repo.

One additional observation I made is that the two datasets appear to use different image upsampling/downsampling methods. Initially, I wrote a script to compare the datasets pixel-by-pixel, but I noticed slight differences in pixel values. As a result, I had to verify the content manually.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants