Initial jump in seed finding #10

Open
tsybulkin opened this issue Aug 7, 2015 · 0 comments

We could maintain an extra data structure: a hash table whose keys are all possible short sequences and whose values are two pointers into the FM-index. If a particular sequence (a key) cannot be found in the FM-index, we can store 'not-found' instead of the two pointers.
This may speed up seed finding by at least 2x.
The table can be built at runtime: in effect, we cache the most common short sequences in a special table and reuse the stored results instead of computing them again and again.
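
A rough sketch of the idea in Python, not the project's actual code. It assumes the "two pointers" are the bounds of the backward-search interval for the short sequence, and that a seed is searched right to left, so the table is keyed by the last k characters of the seed. `TinyFMIndex`, `SeedJumpTable`, and the parameter `k` are hypothetical names made up for this example; the index itself is a naive toy version, only there to make the sketch runnable.

```python
class TinyFMIndex:
    """Toy FM-index (naive construction), for illustration only."""

    def __init__(self, text):
        text += "$"  # sentinel, assumed smaller than every other character
        sa = sorted(range(len(text)), key=lambda i: text[i:])
        self.bwt = "".join(text[i - 1] for i in sa)
        self.n = len(text)
        # C[c]: number of characters in the text strictly smaller than c
        self.C = {}
        total = 0
        for c in sorted(set(text)):
            self.C[c] = total
            total += text.count(c)
        # occ[c][i]: number of occurrences of c in bwt[:i]
        self.occ = {c: [0] for c in self.C}
        for ch in self.bwt:
            for c, counts in self.occ.items():
                counts.append(counts[-1] + (ch == c))

    def extend(self, interval, c):
        """One backward-search step: prepend c to the matched pattern."""
        if c not in self.C:
            return None
        lo, hi = interval
        lo = self.C[c] + self.occ[c][lo]
        hi = self.C[c] + self.occ[c][hi]
        return (lo, hi) if lo < hi else None

    def search(self, pattern, interval=None):
        """Return the half-open interval (lo, hi) for pattern, or None."""
        if interval is None:
            interval = (0, self.n)
        for c in reversed(pattern):
            interval = self.extend(interval, c)
            if interval is None:
                return None
        return interval


class SeedJumpTable:
    """Caches the interval of each k-mer suffix, filled lazily at runtime."""

    def __init__(self, index, k):
        self.index = index
        self.k = k  # assumed to be at least 1
        self.table = {}  # k-mer -> (lo, hi), or None meaning 'not-found'

    def find(self, seed):
        if len(seed) < self.k:
            return self.index.search(seed)
        suffix = seed[-self.k:]
        if suffix not in self.table:
            # First time we see this k-mer: compute once and remember it,
            # including the 'not-found' case.
            self.table[suffix] = self.index.search(suffix)
        start = self.table[suffix]
        if start is None:
            return None  # the cached 'not-found' ends the search early
        # Initial jump: resume backward search from the cached interval.
        return self.index.search(seed[:-self.k], start)


index = TinyFMIndex("ACGTACGTTACG")
jump = SeedJumpTable(index, k=3)
interval = jump.find("TACG")  # the cached entry for "ACG" skips 3 steps
```

The saving per seed is the k backward-search steps that the table lookup replaces, so the actual speedup would depend on k relative to the seed length and on how often the same k-mers recur. Filling the table lazily, as here, corresponds to the "created at runtime" variant; precomputing all 4^k keys up front would be the alternative.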
