Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add summary statistics automatically for VCF Files. #8

Open
laceysanderson opened this issue May 15, 2017 · 1 comment
Open

Add summary statistics automatically for VCF Files. #8

laceysanderson opened this issue May 15, 2017 · 1 comment

Comments

@laceysanderson
Copy link
Member

We have a script developed by @Jiu9Shen that calculates a variety of statistics on a VCF File: https://gist.github.com/Jiu9Shen/1709484e7bf9564a27de6f2c221314b5. We manually paste the results of this script into the description of a given VCF file to give guidance to the researcher on how they many want to filter it.
screen shot 2017-05-15 at 11 01 29 am

Since this is such critical information and can be useful for any VCF file, we should add automatic calculation of these stats and generation of the table to this module. This would streamline the process, ensure these statistics are available for every file and provide this functionality to other Tripal sites using this module.

@laceysanderson
Copy link
Member Author

The script for calculating these statistics can take some time depending on the size of the VCF file. As such it would be best run in a Tripal Job that is submitted when the VCF File is added or updated through the administrative interface. The results could then be stored in a generic VCF files metadata table as suggested for the ABH format in #2. The table on the VCF Filter form would then be built by querying this table.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant