Mapping multimer protein components #46

ijpulidos · 2024-08-23T17:39:28Z

Describe the bug
When mapping gufe protein components that were built using multimeric PDBs, I'm observing that the map is only done to a part of the multimer, apparently only one of the monomers is mapped. I would expect kartograf to be able to map the components correctly, or complain if it doesn't.

To Reproduce

from kartograf import KartografAtomMapper
from gufe import ProteinComponent

# Create components from PDB Files
protein_comp = ProteinComponent.from_pdb_file("input.pdb")
mutated_comp = ProteinComponent.from_pdb_file("mutated.pdb")

mapper = KartografAtomMapper(atom_map_hydrogens=True)
mapping = next(mapper.suggest_mappings(protein_comp, mutated_comp))
print(len(mapping.componentA_to_componentB))

It seems to map only the chain "B" for some reason.

Expected behavior
I expect the length of the mapping to be the number of atoms of the protein components minus the mutated ones, which should be just a few of them.

Screenshots

Additional context
This would enable handling protein mutations in a more streamlined way. As far as I can tell, the way to do it right now would be to separate each monomer (each chain in the PDBs) to its own component and then mapping those independently, but that can be cumbersome for users.

PDB files to test in the following zip archive:
Archive.zip

The text was updated successfully, but these errors were encountered:

IAlibay · 2024-08-27T15:42:52Z

From today's call: a fix here would be a check for a ProteinComponent that checks for chain breaks and how to fix it.

RiesBen · 2024-08-28T23:41:29Z

@ijpulidos
I marked in the PR the code bits, where I think the new features need to be implemented to :)
let me know what you think? :)
P.s.: I implemented an initial suggestion for splitting the protein chains into components, can you test that one?

jameseastwood · 2024-10-28T16:01:10Z

Irfan's comments should be addressed, but this PR is not blocking any of Ivan's current work.

RiesBen mentioned this issue Aug 28, 2024

Mapping multi chain components #47

Merged

RiesBen linked a pull request Aug 28, 2024 that will close this issue

Mapping multi chain components #47

Merged

jameseastwood assigned RiesBen and ijpulidos Oct 21, 2024

RiesBen assigned jthorton and hannahbaumann Oct 22, 2024

IAlibay added the priority:high label Oct 28, 2024

jameseastwood unassigned RiesBen Oct 30, 2024

jthorton closed this as completed in #47 Nov 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mapping multimer protein components #46

Mapping multimer protein components #46

ijpulidos commented Aug 23, 2024 •

edited

Loading

IAlibay commented Aug 27, 2024

RiesBen commented Aug 28, 2024 •

edited

Loading

jameseastwood commented Oct 28, 2024

Mapping multimer protein components #46

Mapping multimer protein components #46

Comments

ijpulidos commented Aug 23, 2024 • edited Loading

IAlibay commented Aug 27, 2024

RiesBen commented Aug 28, 2024 • edited Loading

jameseastwood commented Oct 28, 2024

ijpulidos commented Aug 23, 2024 •

edited

Loading

RiesBen commented Aug 28, 2024 •

edited

Loading