Load grid from URL #256

mogres · 2024-05-29T23:42:50Z

Problem

Closes #253

Solution

Updated grid file loading to use URLs
Moved cache to local repo clone
Added functionality to clean cache

Type of change

New feature (non-breaking change which adds functionality)

Steps to Verify:

Run Packing

pack -r cellpack/tests/recipes/v2/test_url_load.json -c cellpack/tests/packing-configs/test_url_load_config.json

Details

The packing should download and save a file test_s3_mesh_test_s3_mesh_config_1.0.0_grid.dat to .cache/grids/ folder at the top level of the repository

Clean Cache

Since some grid files can be large, we want to clean the cache periodically

Option 1

Set the clean_grid_cache option to true in the config file. This will delete the downloaded grid file after the packing is completed.

Option 2 (Destructive)

Run python cellpack/bin/clean.py.
WARNING: This will remove all the files in the cache!

Add download_file_from_s3 and parse_s3_uri functions Refactor download_file function to handle S3 URLs

github-actions · 2024-05-29T23:46:31Z

Packing analysis report

Analysis for packing results located at cellpack/tests/outputs/test_spheres/spheresSST

Ingredient name	Encapsulating radius	Average number packed
ext_A	25	236.0

Packing image

Distance analysis

Expected minimum distance: 50.00
Actual minimum distance: 50.01

Ingredient key	Pairwise distance distribution
ext_A

…d_mesh_from_s3

cellpack/autopack/__init__.py

meganrm

looks good, just some non blocking comments

…d_mesh_from_s3

rugeli · 2024-06-12T00:33:31Z

cellpack/autopack/__init__.py

@@ -261,8 +250,49 @@ def updateReplacePath(newPaths):
        REPLACE_PATH[w[0]] = w[1]


+def download_file_from_s3(s3_uri, local_file_path):


it seems like this function does the same thing as def download_file() and isn't being used. we can keep one of them and remove the other

Good point!

rugeli · 2024-06-12T00:50:46Z

cellpack/autopack/__init__.py

 def download_file(url, local_file_path, reporthook):
-    if url_exists(url):
+    if is_s3_url(url):
+        # download from s3


These steps you've commented out are indeed the correct way to initiate the s3 client, we have functionality in AWSHandler that handles the initiation of clients and manages multiple existing clients. I'd suggest moving either download_file or download_file_from_s3 to AWSHandler to keep aws related util functions more organized and avoid potential client conflicts in the future. What do you think?

Makes sense! It looks like you have moved these files in your branch already? In that case, I will leave these S3 related functions here so the refactoring in your branch doesn't have merge conflicts.

rugeli

The cache cleaning and grid file loading parts look great! For the S3 part, I created a branch to dig around and moved download_file_from_s3 to AWSHandler, and it works the same as this branch does. Here is the comparison link for you to take a look at. We can also chat about tomorrow if you like:)

Nit: the file name has some repeated parts in it: saved to out/test/test_s3_mesh/spheresSST/results_test_s3_mesh_test_s3_mesh_config_1.0.0_seed_0.simularium. We might want to reconstruct it.

mogres

Thanks for taking a look @rugeli!
I suggest we merge this branch with the s3 related functions in __init__.py and then move them to AWSHandler in a later PR from the branch you linked to earlier. Would that work? The refactoring from the other branch looks good to me!

Nit: the file name has some repeated parts in it: saved to out/test/test_s3_mesh/spheresSST/results_test_s3_mesh_test_s3_mesh_config_1.0.0_seed_0.simularium. We might want to reconstruct it.

This is because my recipe is called test_s3_mesh and the config is called test_s3_mesh_config. I'll rename these for clarity.

mogres · 2024-06-12T16:40:05Z

cellpack/autopack/__init__.py

@@ -261,8 +250,49 @@ def updateReplacePath(newPaths):
        REPLACE_PATH[w[0]] = w[1]


+def download_file_from_s3(s3_uri, local_file_path):


Good point!

mogres · 2024-06-12T16:45:54Z

cellpack/autopack/__init__.py

 def download_file(url, local_file_path, reporthook):
-    if url_exists(url):
+    if is_s3_url(url):
+        # download from s3


Makes sense! It looks like you have moved these files in your branch already? In that case, I will leave these S3 related functions here so the refactoring in your branch doesn't have merge conflicts.

mogres added 12 commits May 14, 2024 15:09

Update AWSHandler.py and __init__.py

2254ba6

Add download_file_from_s3 and parse_s3_uri functions Refactor download_file function to handle S3 URLs

Add S3 file download functionality

7abaaf1

Add grid cache directory

36d0ee1

simplify grid loading logic and allow loading from URL

9af4a19

add test recipe and config for URL loading

5071e06

Update cache directory to be created in local repo

9e7ec13

Add kwargs parameter to pack_grid method and clean_grid_cache option

70bef4b

Update Environment.py with grid cache cleaning functionality

03a1725

Add clean_grid_cache option to default_values

e085763

Add clean.py script to clean local cache directory

2e57241

Add clean_grid_cache option to test_url_load_config.json

a5b39d8

Update clean_grid_cache flag to false

c299197

mogres requested review from rugeli, meganrm and ascibisz May 29, 2024 23:42

Linting: remove unused imports

705a811

mogres changed the title ~~Load mesh from URL~~ Load grid from URL Jun 3, 2024

mogres added 2 commits June 3, 2024 13:07

Merge branch 'main' of github.com:mesoscope/cellpack into feature/loa…

df28667

…d_mesh_from_s3

add back sys import

f5677c3

meganrm reviewed Jun 11, 2024

View reviewed changes

cellpack/autopack/__init__.py Show resolved Hide resolved

meganrm reviewed Jun 11, 2024

View reviewed changes

cellpack/autopack/__init__.py Show resolved Hide resolved

meganrm reviewed Jun 11, 2024

View reviewed changes

cellpack/autopack/__init__.py Outdated Show resolved Hide resolved

meganrm approved these changes Jun 11, 2024

View reviewed changes

mogres added 3 commits June 11, 2024 14:03

Sort imports

db7d747

remove unused function

33c439a

Merge branch 'main' of github.com:mesoscope/cellpack into feature/loa…

867535e

…d_mesh_from_s3

rugeli reviewed Jun 12, 2024

View reviewed changes

mogres commented Jun 12, 2024

View reviewed changes

Update recipe and config

1156f61

rugeli approved these changes Jun 12, 2024

View reviewed changes

rugeli mentioned this pull request Jun 12, 2024

Refactor: load from s3 #267

Merged

mogres merged commit 6feaa9a into main Jun 13, 2024
7 checks passed

mogres deleted the feature/load_mesh_from_s3 branch June 13, 2024 16:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Load grid from URL #256

Load grid from URL #256

mogres commented May 29, 2024 •

edited

Loading

github-actions bot commented May 29, 2024 •

edited

Loading

meganrm left a comment

rugeli Jun 12, 2024

mogres Jun 12, 2024

rugeli Jun 12, 2024

mogres Jun 12, 2024

rugeli left a comment •

edited

Loading

mogres left a comment •

edited

Loading

mogres Jun 12, 2024

mogres Jun 12, 2024

		@@ -261,8 +250,49 @@ def updateReplacePath(newPaths):
		REPLACE_PATH[w[0]] = w[1]


		def download_file_from_s3(s3_uri, local_file_path):

Load grid from URL #256

Load grid from URL #256

Conversation

mogres commented May 29, 2024 • edited Loading

Problem

Solution

Type of change

Steps to Verify:

Run Packing

Details

Clean Cache

Option 1

Option 2 (Destructive)

github-actions bot commented May 29, 2024 • edited Loading

Packing analysis report

Analysis for packing results located at cellpack/tests/outputs/test_spheres/spheresSST

Packing image

Distance analysis

meganrm left a comment

Choose a reason for hiding this comment

rugeli Jun 12, 2024

Choose a reason for hiding this comment

mogres Jun 12, 2024

Choose a reason for hiding this comment

rugeli Jun 12, 2024

Choose a reason for hiding this comment

mogres Jun 12, 2024

Choose a reason for hiding this comment

rugeli left a comment • edited Loading

Choose a reason for hiding this comment

mogres left a comment • edited Loading

Choose a reason for hiding this comment

mogres Jun 12, 2024

Choose a reason for hiding this comment

mogres Jun 12, 2024

Choose a reason for hiding this comment

mogres commented May 29, 2024 •

edited

Loading

github-actions bot commented May 29, 2024 •

edited

Loading

rugeli left a comment •

edited

Loading

mogres left a comment •

edited

Loading