Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

TreeShrink error #50

Open
sjfleck opened this issue Apr 26, 2020 · 8 comments
Open

TreeShrink error #50

sjfleck opened this issue Apr 26, 2020 · 8 comments

Comments

@sjfleck
Copy link

sjfleck commented Apr 26, 2020

I ran 3 PASTA commands, only 2 of which were successful:

  1. PASTA with default parameters (SUCCESS)
    run_pasta.py -d PROTEIN -i protein_sequences.txt -o pasta_job

  2. PASTA with RaxML (SUCCESS)
    run_pasta.py -d PROTEIN --raxml-search-after -i protein_sequences.txt -o pasta_job

  3. PASTA with RaxML & treeshrink (FAILURE)
    run_pasta.py -d PROTEIN --raxml-search-after --treeshrink-filter -i protein_sequences.txt -o pasta_job

I'm not sure why it failed exactly. these seem to be the relevant lines, but I'll include the entire failure message in my command line:

python: can't open file '/Users/albertlab/pasta-code/pasta/bin/tree_shrink/treeshrink.py': [Errno 2] No such file or directory
cp: input_treeshrink/input_shrunk_0.05.tre: No such file or directory
python: can't open file '/Users/albertlab/pasta-code/pasta/bin/utils/sample_from_subtree.py': [Errno 2] No such file or directory

Any help would be appreciated. Thank you!

NMT_RaxML_treeshrink_filter_error.txt

@smirarab
Copy link
Owner

smirarab commented Apr 27, 2020 via email

@sjfleck
Copy link
Author

sjfleck commented Apr 28, 2020

I believe that I should have the latest version. I followed the installation steps in the readme and on the github page. I used:

git clone https://github.com/smirarab/pasta.git

Would this have not downloaded the newest version? I also checked and I do not have a treeshrink directory in /Users/albertlab/pasta-code/pasta/bin/. I do have a treeshrink executable.

This executable is strange because it's also looking for /tree_shrink/treeshrink.py in its code and when I run it in terminal, I get this:

python: can't open file '/Users/albertlab/pasta-code/pasta/bin/tree_shrink/treeshrink.py': [Errno 2] No such file or directory
cp: input_treeshrink/input_shrunk_0.05.tre: No such file or directory
python: can't open file '/Users/albertlab/pasta-code/pasta/bin/utils/sample_from_subtree.py': [Errno 2] No such file or directory

Maybe I'll try downloading PASTA manually instead of using the git clone command.

@smirarab
Copy link
Owner

smirarab commented May 4, 2020 via email

@sjfleck
Copy link
Author

sjfleck commented May 24, 2020

Hi,
I'm just checking in about this error. If it helps I can share my exact installation steps:

  1. I made sure that I had DendroPy (your link is broken, so I followed the installation instructions on https://github.com/jeetsukumaran/DendroPy)
  2. mkdir ~/pasta-code
  3. cd ~/pasta-code
  4. git clone https://github.com/smirarab/pasta.git
  5. git clone https://github.com/smirarab/sate-tools-mac.git
  6. cd sate-tools-mac
  7. git clone https://github.com/koditaraszka/pasta-databases
  8. cd ..
  9. sudo python setup.py develop
  10. added "export CONTRALIGN_DIR=/Users/albertlab/pasta-code/sate-tools-mac" to my ~/.bashrc file

I also can't find "bin/tree_shrink/treeshrink.py" within the files on the code tab on this page. What you have on the code tab matches pretty well with what is on my computer locally. We both seem to have a "treeshrink" in our bin directory, but not a directory named "tree_shrink" or the "treeshrink.py" file. Is there a step that I'm skipping that generates that directory and .py file?

I also want to mention that PASTA on default settings works, I've been able to make a number of protein trees without any errors. I also, used the test files successfully, except in two cases, both of which are in step 6 of the pasta-tutorial.md in the "pasta-doc" directory:

run_seqtools.py -infile pastajob_temp_iteration_0_seq_alignment.txt -outfile iter0.phylip -informat COMPACT3 -outformat PHYLIP -masksites 5 -rename pastajob_temp_name_translation.txt

  • I get an error here saying that the infile isn't COMPACT3

run_seqtools.py -infile pastajob.marker001.small.aln -informat FASTA -outfile pata-masked-20.fasta -outformat FASTA -maskmin 20

  • i get an error here saying that -maskmin isn't in the usage of run_seqtools.py

I'm not sure why I'm having these issues, but any help would be appreciated

@smirarab
Copy link
Owner

smirarab commented May 28, 2020 via email

@RobBarringer
Copy link

Hi Siavash and sjfleck,

I'm getting the same error, and I might have spotted something but I'm not too sure. Below is the error returned by my system that seems identical to sjflecks:

treeshrink_error

And when I look in the pasta/bin directory, I find that treeshrink is a standalone file and isn't in it's own directory, which the error file seems to be searching for (in /bin/tree_shrink/treeshrink.py instead of /bin/treeshrink):

treeshrink_location

Not sure if it's just a simple problem of the program searching for treeshrink in the wrong directory, but I thought I'd highlight it.

Seems to work perfectly when treeshrink isn't used though!

Thanks a lot,

Rob Barringer

@smirarab
Copy link
Owner

smirarab commented Jun 3, 2020 via email

@RobBarringer
Copy link

Hi All, It seems we had intended to remove TreeShrink from within PASTA but we had failed to remove the option. Let me investigate further and either add it back or remove it completely. In the meantime, you can use TreeShrink on the output of PASTA: https://github.com/uym2/TreeShrink Please also let me know if you have a specific use case that would need the integration of TreeShrink within PASTA (as opposed to stand-alone). Thanks Siavash

Hi Siavash,

OK great, thanks for clarifying!

Cheers,

Rob

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants