Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Should bivalirudin and bivalirudin trifluoroacetate be cliqued together? #346

Open
gaurav opened this issue Sep 11, 2024 · 0 comments
Open

Comments

@gaurav
Copy link
Collaborator

gaurav commented Sep 11, 2024

We currently consider them to be two separate cliques, both of which have the preferred label "bivalirudin" (https://name-resolution-sri.renci.org/lookup?string=bivalirudin&autocomplete=false&highlighting=false&offset=0&limit=10)

Looking at these cliques (https://nodenormalization-sri.renci.org/1.5/get_normalized_nodes?curie=PUBCHEM.COMPOUND%3A19797045&curie=CHEBI%3A59173&curie=PUBCHEM.COMPOUND%3A78357798&conflate=true&drug_chemical_conflate=false&description=false&individual_types=false), we notice:

  1. The CHEBI:59173 "Bivalirudin" clique includes PUBCHEM.COMPOUND:16129704 "Bivalirudin" and most of its synonyms are "bivalirudin", although at least one identifier (MESH:C074619) has "" as a synonym.
  2. The PUBCHEM.COMPOUND:19797045 "Bivalirudin Trifluoacetate" clique ends up with a preferred label of "Bivalirudin" because it includes HMDB:HMDB0249283, whose label we prefer. However, this HMDB ID is no longer available on the website -- we're presumably getting this from PubChem.
  3. PUBCHEM.COMPOUND:78357798 "Bivalirudin (Trifluoroacetate)" shows up as a molecular mixture with a difference InChiKey from the other two cliques.
  4. These are not currently conflated if drug_chemical conflation is turned on.
  5. As far as I can tell, PUBCHEM.COMPOUND:78357798 and PUBCHEM.COMPOUND:19797045 are being split before the partials are generated.

Possible solutions:

  1. We could combine all three cliques into a single conflated clique. This is probably the simplest, fastest solution.
  2. We could try to split out a "Bivalirudin" clique and a "Bivalirudin Trifluoacetate" clique, but that would require some investigation as to why they aren't currently being combined (presumably because of those different InChiKeys).
  3. ???
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant