P2M: Map protein identifiers to metabolites

Installation

Create and activate conda environment:

conda create -n p2m -c rdkit -c openbabel python rdkit openbabel pandas
conda activate p2m

Clone and setup p2m:

pip install git+https://github.com/pnnl/p2m

Run

p2m --input IDS_PATH --type IDS_TYPE [OPTIONS]

For instance, in the included example, the following command was used to query UniProt identifiers with R-group completion and SMILES cleaning:

p2m --input example_run/ecoli_k12_mg1655_uniprotkb.txt --type uniprot -rc

See --help for more details:

$ p2m --help
usage: p2m [--help] --input IDS_PATH --type IDS_TYPE [--output OUTPUT_PATH]
           [--complete_rgroups] [--clean_smiles] [--sleep SLEEP]

---
P2M
---

Identify metabolites (substrates and products) of proteins from UniProt identifiers or EC numbers. Requires internet access for external database queries. 

To set up local databases for your first run, be sure to run the 'download' command.

options:
  --help, -h            Show help documentation.
  --input IDS_PATH, -i IDS_PATH
                        Path to UniProt or EC identifiers; file must be a list of identifiers separated by newlines.
  --type IDS_TYPE, -t IDS_TYPE
                        Type of identifiers. One of: UniProt, EC
  --output OUTPUT_PATH, -o OUTPUT_PATH
                        Path to desired output folder.
  --complete_rgroups, -r
                        Externally query the ChEBI database for substructure searches of compounds with R-groups. Default False.
  --clean_smiles, -c    Pass SMILES through a set of standardizations. Default False.
  --sleep SLEEP, -s SLEEP
                        Sleep time in seconds between query calls.

Additional Resources

Example: Includes input, output, and command line logs for an example P2M run.
Substructure expansion: Partial SMILES/SMARTS can be expanded using included utilities in P2M. See the linked notebook for details.

Authors

Bryan Killinger (@brykpnl)
Christine Chang (@christinehc)
Ryan Renslow
Sean Colby (@smcolby)

License

P2M is licensed under the BSD 3-Clause License; see the LICENSE for details.

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
resources		resources
src/p2m		src/p2m
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

P2M: Map protein identifiers to metabolites

Installation

Run

Additional Resources

Authors

License

About

Releases

Packages

Contributors 3

Languages

License

pnnl/p2m

Folders and files

Latest commit

History

Repository files navigation

P2M: Map protein identifiers to metabolites

Installation

Run

Additional Resources

Authors

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages