Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

EpiNano output issue #148

Open
TTT16 opened this issue Jun 10, 2024 · 11 comments
Open

EpiNano output issue #148

TTT16 opened this issue Jun 10, 2024 · 11 comments

Comments

@TTT16
Copy link

TTT16 commented Jun 10, 2024

We are using the scripts which you published on this paper and repeat the m6A calling using EpiNano, with the datasets published on Nature Communication (2019) including Curlcakes and yeast datasets.

EpiNano: Detection of m6A RNA Modifications Using Oxford Nanopore Direct RNA Sequencing

· First Online: 04 June 2021

We are unable to get 2 output files: sample.per.site.var.csv and sample.per.site.5mer.csv

I think the Epinano_Variants.py was running since it could produce the analysis time, however, we got NO output files.

For yeast dataset:
Epinano_Variants.py \

-c 36
-r $ref
-b $folder_name.sorted.bam
analysis took 52.780353307724 seconds

For curlcakes dataset:
Epinano_Variants.py -c 36 -r $ref -b $folder_name.sorted.bam
analysis took 5.046196460723877 seconds

Could you please help with this issue?
Thank you,
Trinh

@enovoa
Copy link
Collaborator

enovoa commented Jun 10, 2024

Hi can you please try with the demo data? https://github.com/novoalab/EpiNano/tree/master/test_data/make_predictions Thank you

@SabeenRaza75
Copy link

SabeenRaza75 commented Jun 10, 2024

I've tried to use the demo data from (https://github.com/novoalab/EpiNano/tree/master/test_data/make_predictions) and the commands from run.sh which requires Epinano_Variants.py. I've downloaded and installed version 1.2.4 but it still gives me an error:

Commads:
python /apps/epinano/1.2.4/Epinano_Variants.py
-R /EpiNano_input-data/ref/ref.fa
-b /EpiNano_input-data/wt_data/wt.bam
-n 6
-T t
-s java -jar /sam2tsv/2.0/share/jvarkit-sam2tsv-1.0-0/sam2tsv.jar
##############
#It gave an error:
##################
usage: Epinano_Variants.py [-h] -b BAM -r REFERENCE [-c CPUS]
Epinano_Variants.py: error: the following arguments are required: -r/--reference

###############
#I corrected the -R to -r and re-ran the commands as mentioned in run.sh
################################################
-r /EpiNano_input-data/ref/ref.fa
-b /EpiNano_input-data/wt_data/wt.bam
-n 6
-T t
-s java -jar /sam2tsv/2.0/share/jvarkit-sam2tsv-1.0-0/sam2tsv.jar

################
#It says unrecognized arguments in the Epinano_Variants.py
#################
usage: Epinano_Variants.py [-h] -b BAM -r REFERENCE [-c CPUS]
Epinano_Variants.py: error: unrecognized arguments: -n 6 -T t -s java -jar /apps/sam2tsv/2.0/share/jvarkit-sam2tsv-1.0-0/sam2tsv.jar

@enovoa
Copy link
Collaborator

enovoa commented Jun 10, 2024

Please try removing the argument: "-s java -jar /apps/sam2tsv/2.0/share/jvarkit-sam2tsv-1.0-0/sam2tsv.jar" I believe that this is a mistake from a previous version

@SabeenRaza75
Copy link

yes but the version 1.2.4 doesnt allow other flags such as -n, -T either - those are from the previous version too
the only flags version 1.2.4 allows is: -c &-r & -b
everything else it doesn't allow.. but when I run Epinano_Variants.py with just these three:
Epinano_Variants.py
-c 36
-r $ref
-b $folder_name.sorted.bam

It doesn't give me anything - also do I use sample.sorted.bam file or JUST sample.bam ?
analysis took 52.780353307724 seconds

@enovoa
Copy link
Collaborator

enovoa commented Jun 11, 2024

Hi, have you tried with the DEMO data with the right command line parameters as described in the README?

@enovoa
Copy link
Collaborator

enovoa commented Jun 11, 2024

you seem to be using parameters required for older versions of EpiNano

@SabeenRaza75
Copy link

Hi, have you tried with the DEMO data with the right command line parameters as described in the README?

I'll try that today.

@SabeenRaza75
Copy link

Hi, have you tried with the DEMO data with the right command line parameters as described in the README?

I'm trying to run Epinano_Predict.py using the demo data as follows:
python $EPINANO_HOME/Epinano_Predict.py
--model $EPINANO_HOME/models/rrach.q3.mis3.del3.linear.dump
--predict ko.per.site.csv
--columns 7,9,11
--out_prefix ko_mod_prediction


This initially gave me an error saying sklearn is not installed so I installed it (version 0.24.2) in the $EPINANO_HOME//epinano1.2_venv/lib/python3.6/site-packages/ directory (I'm running Python version 3.6.8)
Now when I run the above command it says :

Traceback (most recent call last):
File "/epinano/1.2.4/Epinano_Predict.py", line 139, in
loaded_model = pickle.load (open (m,'rb'))
ModuleNotFoundError: No module named 'sklearn.svm.classes'


It seems that the model was pickled using a different version of scikit-learn ?

Please advise - also am I using the correct trained model ?

@enovoa
Copy link
Collaborator

enovoa commented Jun 14, 2024

Hi @SabeenRaza75 sorry it seems that you have problems with the versions installed and/or locations in which you have installed the dependencies, but I am unable to figure out why from the information given above. As a solution, I would suggest you to switch to using MasterOfPores, a Nextflow workflow that has EpiNano (and many other tools!) embedded in it. It does not require you to install any of the softwares nor their dependencies, and is also a great solution to ensure reproduciblity, traceability and monitoring of your computational resources used. Let me know if this solution worked for you. Thanks!

@Huanle
Copy link
Collaborator

Huanle commented Jun 19, 2024

Sorry for this late chime in.
@TTT16 , you might need scikit-learn 0.20.2 as indicated here.

@Huanle
Copy link
Collaborator

Huanle commented Jun 19, 2024

yes but the version 1.2.4 doesnt allow other flags such as -n, -T either - those are from the previous version too the only flags version 1.2.4 allows is: -c &-r & -b everything else it doesn't allow.. but when I run Epinano_Variants.py with just these three: Epinano_Variants.py -c 36 -r $ref -b $folder_name.sorted.bam

It doesn't give me anything - also do I use sample.sorted.bam file or JUST sample.bam ? analysis took 52.780353307724 seconds

The latest Epinano_Variants.py does not rely on the 3rd party sam2tsv.jar program required by its predecessors. Please check its help message.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants