I'm experiencing an issue with Snakemake where workflows fail when there is no internet connectivity, even though the required files are already cached locally. The storage function with keep_local=True is supposed to cache files locally, but Snakemake still tries to verify the presence of these files online before deciding to use the cached version. This behavior is problematic in environments with intermittent internet access or when working offline.
It would be helpful to have a feature, or an additional argument to the storage function, that lets Snakemake use cached files automatically when they are present, without checking the remote source. This would make workflows robust to network outages and remove the need for internet access when the necessary data is already cached.
For instance, an argument like use_cache_if_available=True could be added to the storage function, which would make Snakemake check the local cache first and proceed if the file is available, only falling back to a remote check if it is not.
Possible workaround
As a workaround, I have considered manually writing custom logic to handle file checks and determine whether to use a local copy or download a new one. However, this approach requires additional boilerplate code and deviates from Snakemake’s streamlined workflow management.
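To make the workaround concrete, here is a minimal sketch of the kind of cache-first check I mean. It is plain Python and not part of Snakemake's API: the names resolve_input and download_from are hypothetical helpers invented for illustration, and the fetch step assumes the file is reachable over plain HTTP(S).

```python
from pathlib import Path
from typing import Callable
from urllib.request import urlretrieve


def resolve_input(local_path: Path, fetch: Callable[[Path], None]) -> Path:
    """Return local_path, calling fetch(local_path) only if no cached copy exists.

    Hypothetical helper, not a Snakemake function.
    """
    if local_path.is_file():
        # Cached copy found: use it without contacting the remote at all.
        return local_path
    # No cached copy: create the target directory and fall back to the remote.
    local_path.parent.mkdir(parents=True, exist_ok=True)
    fetch(local_path)
    if not local_path.is_file():
        raise FileNotFoundError(f"fetch did not create {local_path}")
    return local_path


def download_from(url: str) -> Callable[[Path], None]:
    """Build a fetch callback that downloads url to the given path (assumes HTTP(S))."""
    def fetch(target: Path) -> None:
        urlretrieve(url, str(target))
    return fetch
```

A rule's input function could then call something like resolve_input(Path("cache/data.csv"), download_from(url)) with whatever URL the workflow needs. Note that this never notices when the remote file changes, so a stale cached copy would be used silently. That trade-off is one reason I would prefer an explicit, opt-in option in Snakemake itself.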
The use of local caches without remote checks would greatly improve the efficiency and reliability of data-driven workflows, especially in computational environments with limited or unreliable internet access.