The limitation of the Lander-Green approach to multipoint linkage (<30 "bits") #3

Open
changebio opened this issue Jan 31, 2022 · 3 comments

Comments

@changebio
Contributor

I tried to test cstatgen on some large pedigrees and got the following errors.
[Screenshot: error output]

The errors come from merlin/MerlinFamily.h, so I searched for Merlin-related material. I found one PDF that says Merlin "Uses the Lander-Green approach to multipoint linkage, so not suitable for large pedigrees (>30 “bits”)" (https://genepi.qimr.edu.au/staff/davidD/Course/Slides/merlin.pdf).

@gaow
Member

gaow commented Jan 31, 2022

@changebio indeed, Lander-Green is good for many markers but not for huge pedigrees. This family has 8 founders and 20 descendants? Yes, that is a large family indeed. But this is a different issue from #2, right?
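
For reference, the "bits" figure in Lander-Green style programs is usually computed as 2n - f, where n is the number of non-founders and f is the number of founders. Below is a minimal sketch of that arithmetic for the family size mentioned above. The function name `lander_green_bits` is made up for illustration and is not part of cstatgen or Merlin, and the count the library actually reports may differ if uninformative individuals are trimmed first.

```python
def lander_green_bits(n_founders: int, n_nonfounders: int) -> int:
    """Pedigree complexity in bits, using the common 2n - f convention.

    Hypothetical helper for illustration only; it is not a cstatgen or
    Merlin function.
    """
    if n_founders < 0 or n_nonfounders < 0:
        raise ValueError("counts must be non-negative")
    # Each non-founder contributes two meioses (one per parent); the usual
    # founder-phase symmetry removes one bit per founder.
    return max(2 * n_nonfounders - n_founders, 0)


# Family discussed above: 8 founders, 20 descendants (assumed non-founders).
bits = lander_green_bits(n_founders=8, n_nonfounders=20)
print(bits)
```

Under this convention the family comes out at 32 bits, which is above both the hard-coded 24 and the 30-bit guideline quoted from the slides.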

@changebio
Contributor Author

I closed issue #2 and am moving your last comment from there to this issue ("bit 24 is hard coded. I wonder what bit you should set it to ... perhaps large enough to make it work?").
Maybe 30 bits is a good option. I tried setting maxBits to 36 with 64 GB of memory, but it still failed to phase haplotypes for these big families, which probably need about 100 GB. The computation and time required grow exponentially with the number of bits.
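
To make the exponential growth concrete, here is a rough sketch of how the number of inheritance vectors (2^bits) scales. The memory figure assumes one dense array with an 8-byte double per inheritance vector, which is only an assumption for illustration: Merlin stores these in sparse trees, so real usage depends on the data and can be much lower. `dense_vector_bytes` is a hypothetical helper, not a cstatgen or Merlin function.

```python
def dense_vector_bytes(bits: int, bytes_per_entry: int = 8) -> int:
    """Bytes for one dense likelihood array over all 2**bits inheritance vectors.

    Assumption for illustration: one 8-byte double per inheritance vector.
    """
    if bits < 0:
        raise ValueError("bits must be non-negative")
    return (1 << bits) * bytes_per_entry


for bits in (24, 30, 32, 36):
    n_vectors = 1 << bits
    gib = dense_vector_bytes(bits) / 2**30
    print(f"{bits} bits: {n_vectors:,} inheritance vectors, {gib:g} GiB dense")
```

Each extra bit doubles the state space, so raising the limit from 30 to 36 bits multiplies the dense cost by 64.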

@gaow
Member

gaow commented Feb 2, 2022

I see. We are stuck with Lander-Green. But this is a multi-marker, multi-sample issue. I initially picked Lander-Green because I thought the multi-marker situation (a gene has many markers) would be our bigger problem, and I did not think much about the big-family situation. @changebio you may be right after all that we need some additional trimming. I wonder if I should talk to Jurg again, or seriously study the Pseudomarker program. What do you think?
