Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

question about QUAL INFO #166

Open
kkamii opened this issue Sep 30, 2021 · 2 comments
Open

question about QUAL INFO #166

kkamii opened this issue Sep 30, 2021 · 2 comments

Comments

@kkamii
Copy link

kkamii commented Sep 30, 2021

Hi,

I want to use VarDict (version 1.6) to detect variants and decide the best quality option for my data.

To do that, Firstly I used vardict with default options.
And then, I adjusted -q option to 0 and compared two output.

One of the the newly detected mutation in the second output has QUAL INFO = 23.1, which exceeds the default base quality setting (22.5).

I don't understand why this mutation not detected in default option.

I tried more test ( with -q 5, 10, 15, 20 ) and the base quality of mutation is
-q option 0 -> QUAL = 23.1
-q option 5 -> QUAL = 22.4
-q option 10 -> QUAL = 21.5
-q option 15 -> QUAL = 21.5
-q option 20 -> QUAL = 21.5

why the QUAL info is continuely changed and how is the QUAL INFO calculated ?

plz answer me.

thank you very much :)

@kkamii
Copy link
Author

kkamii commented Oct 7, 2021

plus, I want to know how I can adjust -q option without QUAL INFO change.

@PolinaBevad
Copy link
Contributor

Hello @kkamii,

Sorry for the late reply, I'm quite busy at the moment with another project!

So the -q option really operates in two steps: when we iterate over reads and collect information about the variant, bases that are less than this threshold will not be considered - we skip them. And the second step: when all the information is collected, we filter out variants where mean quality was less than threshold. As you can see the first step can affect the variant base quality if you change threshold as some of the bases will not be included in final calculation for mean value.

It is good to not decrease threshold too hard because otherwise you can get some extra Complex/MNV variants, so keep it around default threshold (I recommend not less than 20 for Illumina).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants