BioMart request declined #248

zhenzuo2 · 2024-12-04T18:17:19Z

Description of feature

Hi,

Thank you for developing epitopeprediction! I got an error when running the pipeline, at line 1133:

transcriptProteinTable = ma.get_protein_ids_from_transcripts(transcripts, type=EIdentifierTypes.ENSEMBL)

If the input variant list is too long, BioMart declines my request (apparently because of too many requests). Is there a way I can run it locally? Thank you!

Best,

Zhen


jonasscheid · 2024-12-04T18:31:10Z

Hi! Thanks for using the Pipeline.

Currently, parsing a local BioMart version is not possible. Unfortunately, we rely on querying BioMart.

jonasscheid · 2024-12-04T18:35:33Z

Support for a local BioMart version might be a good addition to the pipeline, e.g. by implementing pyensembl in the variant prediction part.

zhenzuo2 · 2024-12-04T19:05:54Z

Thank you so much for your prompt response!

christopher-mohr · 2024-12-06T08:22:03Z

Hi @zhenzuo2,

Did you try using the "splitting functionality" that is implemented in the pipeline? You can get an overview of the parameters that can be used here under "Run optimisation": https://nf-co.re/epitopeprediction/2.3.1/parameters/

Not sure if it would help in your case but it's worth a try.

Best,
Chris
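
For readers unfamiliar with the idea, splitting simply means breaking one long input into smaller pieces that are processed separately. The sketch below shows that general idea on a list of transcript IDs; it is not the pipeline's implementation, and the `chunked` helper and the example IDs are hypothetical names made up for illustration.

```python
from typing import Iterator, List, Sequence


def chunked(items: Sequence[str], size: int) -> Iterator[List[str]]:
    """Yield consecutive slices of `items` with at most `size` elements each."""
    if size < 1:
        raise ValueError("size must be at least 1")
    for start in range(0, len(items), size):
        yield list(items[start:start + size])


# Hypothetical transcript IDs, for illustration only.
transcripts = [f"ENST{i:011d}" for i in range(1, 8)]

# Seven IDs split into batches of at most three.
batches = list(chunked(transcripts, size=3))
```

Each batch could then be handled on its own, which keeps any single request small. Whether this avoids a declined request depends on the limits the remote service applies.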

zhenzuo2 · 2024-12-10T15:49:25Z

Thank you for sharing this, Chris. I haven’t had a chance to try it yet. The issue is that, for security reasons, the computing servers we use are not allowed to connect to the internet. My current solution is to download the data frame from BioMart and use it as an input file to that function.

import pandas as pd

def get_protein_ids_from_transcripts_offline(transcripts, data_path="mart_export.txt"):
    # Load the table exported from BioMart (read_csv assumes comma-separated values).
    df = pd.read_csv(data_path)
    # Keep only the requested transcripts and the ID columns used downstream.
    result = df.loc[
        df["Transcript stable ID version"].isin(transcripts),
        ["Protein stable ID", "RefSeq peptide ID", "UniProtKB/Swiss-Prot ID", "Transcript stable ID version"],
    ]
    result.columns = ["ensembl_id", "refseq_id", "uniprot_id", "transcript_id"]
    print("Offline Now!")
    return result

In a similar way I changed a few other functions, such as generate_transcripts_from_variants() and generate_peptides_from_variants(). It works now. I will try the "Run optimisation" options you mentioned.
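
To make the offline lookup easier to try without a real export, here is a minimal, self-contained sketch of the same approach run against a few made-up rows. The column headers are the ones used in the function above. The `lookup_proteins_offline` name, the `sep` parameter, and the sample IDs are assumptions for illustration, not part of the pipeline.

```python
import io

import pandas as pd

# Export headers as used in this thread, mapped to the names expected downstream.
# A different BioMart attribute selection would produce different headers.
COLUMN_MAP = {
    "Protein stable ID": "ensembl_id",
    "RefSeq peptide ID": "refseq_id",
    "UniProtKB/Swiss-Prot ID": "uniprot_id",
    "Transcript stable ID version": "transcript_id",
}


def lookup_proteins_offline(transcripts, source, sep=","):
    """Return the ID rows for `transcripts` from a local BioMart export."""
    # Pass sep="\t" if the export was saved as tab-separated values.
    df = pd.read_csv(source, sep=sep, usecols=list(COLUMN_MAP))
    hits = df[df["Transcript stable ID version"].isin(transcripts)]
    return hits[list(COLUMN_MAP)].rename(columns=COLUMN_MAP).reset_index(drop=True)


# Made-up rows standing in for a real export file.
fake_export = io.StringIO(
    "Protein stable ID,RefSeq peptide ID,UniProtKB/Swiss-Prot ID,Transcript stable ID version\n"
    "ENSP_A,NP_A,P_A,ENST_A.1\n"
    "ENSP_B,NP_B,P_B,ENST_B.2\n"
)

table = lookup_proteins_offline(["ENST_B.2"], fake_export)
```

Making the separator a parameter is a deliberate choice here, since a file named mart_export.txt may be either comma- or tab-separated depending on how it was exported.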

zhenzuo2 added the "enhancement" (New feature or request) label on Dec 4, 2024
