Add format data to vcf object #28

Open: wants to merge 19 commits into base branch better-parsing-of-header
@MrCurtis (Contributor) commented Aug 31, 2023

This adds a format attribute to the returned VCF object. The attribute allows access to all the metadata entries which satisfy part 1.4.4 of the specification.

I imagine this isn't the best way to do things, either from an API or an implementation point of view, but hopefully it still moves things in the right direction.
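For readers unfamiliar with that part of the specification, here is a minimal sketch of what collecting the FORMAT metadata entries could look like. It is not the implementation in this PR: `FormatEntry` and `collect_format_entries` are hypothetical names, and the sketch assumes the VCF 4.3 section numbering, where 1.4.4 describes `##FORMAT=<ID=...,Number=...,Type=...,Description="...">` lines.

```python
import re
from dataclasses import dataclass

# Matches key=value pairs. A value is either a double-quoted string
# (which may contain commas) or a run of characters up to the next comma.
_PAIR = re.compile(r'(\w+)=("(?:[^"\\]|\\.)*"|[^,>]*)')

_PREFIX = "##FORMAT=<"


@dataclass(frozen=True)
class FormatEntry:
    """Hypothetical container for one FORMAT metadata line."""
    id: str
    number: str
    type: str
    description: str


def collect_format_entries(header_lines):
    """Return a dict mapping FORMAT IDs to FormatEntry objects."""
    entries = {}
    for line in header_lines:
        if not line.startswith(_PREFIX):
            continue  # not a FORMAT metadata line
        pairs = {}
        for key, value in _PAIR.findall(line[len(_PREFIX):]):
            # Drop the surrounding quotes from quoted values.
            if len(value) >= 2 and value[0] == '"' and value[-1] == '"':
                value = value[1:-1]
            pairs[key] = value
        entry = FormatEntry(
            id=pairs["ID"],
            number=pairs["Number"],
            type=pairs["Type"],
            description=pairs["Description"],
        )
        entries[entry.id] = entry
    return entries


header = [
    '##fileformat=VCFv4.3',
    '##FORMAT=<ID=GT,Number=1,Type=String,Description="Genotype">',
    '##FORMAT=<ID=GQ,Number=1,Type=Integer,Description="Genotype Quality">',
]
formats = collect_format_entries(header)
print(formats["GQ"].type)
```

Keying the entries by ID is only one possible design; a list would preserve duplicates, which this sketch silently overwrites.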

@MrCurtis MrCurtis changed the base branch from main to better-parsing-of-header August 31, 2023 21:58
@MrCurtis MrCurtis marked this pull request as ready for review September 5, 2023 13:13
Better parsing of header



This ensures that the text from nested headers is not split on commas that are enclosed in quotes.

In particular, it ensures that this line

##INFO=<ID=DB,Number=0,Type=Flag,Description="dbSNP membership, build 129">

from the example in section 1.1 of the specs can be parsed.

Note that section 1.2 lists the comma as one of the special characters which should always be 'represented with the capitalized percent encoding' when they are not used for their specific meaning. However, I assume this refers only to cases where they are not enclosed within quotes.
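
The quote-aware splitting described above can be sketched as follows. This is an illustration under stated assumptions, not the code in this commit: `split_unquoted_commas` and `parse_structured_header` are hypothetical names, and the sketch assumes a backslash escapes the character that follows it.

```python
def split_unquoted_commas(text):
    """Split text on commas that are not inside double quotes."""
    parts = []
    current = []
    in_quotes = False
    escaped = False
    for char in text:
        if escaped:
            # Character after a backslash is kept verbatim.
            current.append(char)
            escaped = False
        elif char == "\\":
            current.append(char)
            escaped = True
        elif char == '"':
            in_quotes = not in_quotes
            current.append(char)
        elif char == "," and not in_quotes:
            # An unquoted comma ends the current field.
            parts.append("".join(current))
            current = []
        else:
            current.append(char)
    parts.append("".join(current))
    return parts


def parse_structured_header(line):
    """Parse a '##KEY=<a=b,...>' line into (KEY, dict of fields)."""
    key, _, body = line[2:].partition("=")
    body = body.strip()
    if not (body.startswith("<") and body.endswith(">")):
        raise ValueError("not a structured header line: " + line)
    fields = {}
    for part in split_unquoted_commas(body[1:-1]):
        name, _, value = part.partition("=")
        # Drop the surrounding quotes; escape sequences are left untouched.
        if len(value) >= 2 and value[0] == '"' and value[-1] == '"':
            value = value[1:-1]
        fields[name] = value
    return key, fields


line = '##INFO=<ID=DB,Number=0,Type=Flag,Description="dbSNP membership, build 129">'
key, fields = parse_structured_header(line)
print(key, fields["Description"])
# prints: INFO dbSNP membership, build 129
```

A naive `str.split(",")` on the same line would cut the description into `"dbSNP membership` and ` build 129"`, which is the failure this commit addresses.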
@MrCurtis MrCurtis force-pushed the add-format-data-to-vcf-object branch from c18ad5c to 09b4e2d Compare September 5, 2023 13:31