You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I ran a whole bunch of samples on an older version of EHdn (0.6.2), and have since downloaded and been running the outlier analyses with the current version (0.9.0). However, I didn't immediately notice that the profile step that creates the individual jsons makes files that are actually pretty different than the older version's. I still have results, but I'm worried the older json data may have a bug or a huge reason for differences in the files. Many of the bams I've used are archived, and so it isn't trivial for me to just reprocess them individually. Any explanation between the releases would be helpful. Thanks!
Example differences run on the same bam, with the same settings and reference:
Older version -
"AAAAAAAAAAAAAAAAAAT": {
"AnchoredIrrCount": 6,
"IrrPairCount": 0,
"RegionsWithIrrAnchors": {
"12:47693944-47693945": 1,
"13:36128685-36128686": 1,
"2:200884347-200884348": 1,
"4:85302549-85302550": 1,
"5:19171777-19171778": 1,
"9:74481855-74481856": 1
},
"RepeatUnit": “AAAAAAAAAAAAAAAAAAT"
The algorithm for detecting in-repeat reads has changed between versions 0.6.2 and 0.9.0. So, it would be best to limit the analysis to STR profiles generated by the same version of EHdn. If you run the outlier analysis on a mixed dataset where most profiles were generated by, say, the older version of EHdn, the program may produce incorrect results.
I hope this answer is helpful. Please let me know if you have any other questions.
I ran a whole bunch of samples on an older version of EHdn (0.6.2), and have since downloaded and been running the outlier analyses with the current version (0.9.0). However, I didn't immediately notice that the profile step that creates the individual jsons makes files that are actually pretty different than the older version's. I still have results, but I'm worried the older json data may have a bug or a huge reason for differences in the files. Many of the bams I've used are archived, and so it isn't trivial for me to just reprocess them individually. Any explanation between the releases would be helpful. Thanks!
Example differences run on the same bam, with the same settings and reference:
Older version -
"AAAAAAAAAAAAAAAAAAT": {
"AnchoredIrrCount": 6,
"IrrPairCount": 0,
"RegionsWithIrrAnchors": {
"12:47693944-47693945": 1,
"13:36128685-36128686": 1,
"2:200884347-200884348": 1,
"4:85302549-85302550": 1,
"5:19171777-19171778": 1,
"9:74481855-74481856": 1
},
"RepeatUnit": “AAAAAAAAAAAAAAAAAAT"
Newest version-
"AAAAAAAAAAAAAAAAAAT": {
"AnchoredIrrCount": 1,
"IrrPairCount": 0,
"RegionsWithIrrAnchors": {
"12:47693944-47693945": 1
},
"RegionsWithIrrs": {
"2:97886466-97886467": 1
},
"RepeatUnit": "AAAAAAAAAAAAAAAAAAT"
The text was updated successfully, but these errors were encountered: