Add table with number of clusters to Ewing clustering report #938

allyhawkins · 2024-12-12T21:08:21Z

Purpose/implementation Section

Please link to the GitHub issue that this pull request addresses.

Closes #935

What is the goal of this pull request?

Here I'm adding a table that summarizes the number of clusters for each set of parameters used for clustering in 01-clustering-metrics.Rmd.

Briefly describe the general approach you took to achieve this goal.

Before any stats are reported, there should now be a printout of the number of clusters for every set of parameters used. I also added a sentence stating that no distributions are shown when there is only one cluster.
The tables aren't super fancy or anything but I think they do the job and are readable.
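
For readers who want a concrete picture of this kind of summary, here is a minimal sketch of counting clusters per parameter set. It is not the code in 01-clustering-metrics.Rmd, which is R Markdown; it is written in Python purely for illustration. The column names (`algorithm`, `resolution`, `cluster`), the helper functions, and the sample rows are all hypothetical.

```python
from collections import defaultdict

# Hypothetical cluster assignments, one row per cell per parameter set.
# The column names and values are invented for illustration.
ASSIGNMENTS = [
    {"algorithm": "louvain", "resolution": 0.5, "cell": "A", "cluster": 1},
    {"algorithm": "louvain", "resolution": 0.5, "cell": "B", "cluster": 1},
    {"algorithm": "louvain", "resolution": 0.5, "cell": "C", "cluster": 2},
    {"algorithm": "louvain", "resolution": 1.0, "cell": "A", "cluster": 1},
    {"algorithm": "louvain", "resolution": 1.0, "cell": "B", "cluster": 2},
    {"algorithm": "louvain", "resolution": 1.0, "cell": "C", "cluster": 3},
    {"algorithm": "leiden", "resolution": 0.5, "cell": "A", "cluster": 1},
    {"algorithm": "leiden", "resolution": 0.5, "cell": "B", "cluster": 1},
    {"algorithm": "leiden", "resolution": 0.5, "cell": "C", "cluster": 1},
]

# Assumed names of the columns that identify one clustering run.
PARAM_COLS = ("algorithm", "resolution")


def count_clusters(rows, param_cols=PARAM_COLS):
    """Map each parameter combination to its number of distinct clusters."""
    clusters = defaultdict(set)
    for row in rows:
        key = tuple(row[col] for col in param_cols)
        clusters[key].add(row["cluster"])
    return {key: len(ids) for key, ids in clusters.items()}


def single_cluster_params(counts):
    """Parameter combinations with one cluster, i.e. no distribution to show."""
    return [key for key, n in counts.items() if n == 1]


def format_table(counts, param_cols=PARAM_COLS):
    """Render the counts as a plain-text table, one row per parameter set."""
    header = [*param_cols, "n_clusters"]
    body = [[*map(str, key), str(n)] for key, n in sorted(counts.items())]
    rows = [header, *body]
    widths = [max(len(r[i]) for r in rows) for i in range(len(header))]
    return "\n".join(
        "  ".join(cell.ljust(w) for cell, w in zip(r, widths)) for r in rows
    )


if __name__ == "__main__":
    counts = count_clusters(ASSIGNMENTS)
    print(format_table(counts))
    print("Single-cluster parameter sets:", single_cluster_params(counts))
```

Listing the single-cluster parameter sets separately mirrors the note in the report: those are the cases where per-cluster distributions have nothing to compare.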

If known, do you anticipate filing additional pull requests to complete this analysis module?

Yes

Results

What is the name of your results bucket on S3?

s3://researcher-211125375652-us-east-2/cell-type-ewings/results/clustering

What types of results does your code produce (e.g., table, figure)?

Rendered reports for each sample. Note that I will re-run the workflow and update the bucket once these changes are approved.

What is your summary of the results?

No changes to the results with this change other than the addition of the table. The conclusions are the same.

Provide directions for reviewers

What are the software and computational requirements needed to be able to run the code in this PR?

I rendered the example report locally, but running the full workflow takes ~24 hours on a laptop.

Are there particular areas you'd like reviewers to have a close look at?

Here's an example of a rendered report:
01-clustering-metrics.html.zip

Author checklists

Check all those that apply.
Note that you may find it easier to check off these items after the pull request is actually filed.

Analysis module and review

This analysis module uses the analysis template and has the expected directory structure.
The analysis module README.md has been updated to reflect code changes in this pull request.
The analytical code is documented and contains comments.
Any results and/or plots this code produces have been added to your S3 bucket for review.

Reproducibility checklist

Code in this pull request has been added to the GitHub Action workflow that runs this module.
The dependencies required to run the code in this pull request have been added to the analysis module Dockerfile.
If applicable, the dependencies required to run the code in this pull request have been added to the analysis module conda environment.yml file.
If applicable, R package dependencies required to run the code in this pull request have been added to the analysis module renv.lock file.

sjspielman

Woohoo!

allyhawkins · 2024-12-17T21:06:10Z

The updated clustering reports have been added to S3. CI already passed today; new changes unrelated to this module were then added to main, so I'm cancelling the latest run of the workflow.

allyhawkins added 2 commits December 12, 2024 14:46

fix join error with cluster column

9dd9475

add table with number of clusters

5e7f4cc

allyhawkins requested a review from jaclyn-taroni as a code owner December 12, 2024 21:08

allyhawkins requested review from sjspielman and removed request for jaclyn-taroni December 12, 2024 21:08

sjspielman approved these changes Dec 13, 2024

allyhawkins added 2 commits December 17, 2024 10:00

Merge branch 'main' into allyhawkins/cluster-numbers-table

c6978d6

Merge branch 'main' into allyhawkins/cluster-numbers-table

49f3b74

allyhawkins merged commit e1daebb into AlexsLemonade:main Dec 17, 2024
1 of 3 checks passed

allyhawkins deleted the allyhawkins/cluster-numbers-table branch December 17, 2024 21:07
