Long-Running Tasks on SaladCloud

This repository provides resources for molecular dynamics simulations and other long-running tasks (such as model fine-tuning and hyperparameter tuning) on SaladCloud. It includes blogs, reference designs, benchmarking code, demonstration applications, and test reports.

If you are new to SaladCloud, we recommend starting with the SCE Architectural Overview and Docker Run on SaladCloud. The tutorial Build High-Performance Applications shares best practices and insights from customers who have built large-scale AI inference applications and run molecular dynamics simulations on tens to thousands of Salad GPU nodes.

Solution Options

GROMACS Benchmark and Code

OpenMM Benchmark and Code

Transcription Benchmark, Guide and Code for 1 Million Hours of YouTube Videos

Long-Running Tasks - Demo App 1

Use Kelpie as the job queue along with its built-in data management.

Long-Running Tasks - Demo App 3

Use Kelpie solely as a job queue, while implementing custom data management (Cloudflare R2 + rclone).
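As a rough illustration of what custom data management with rclone can look like, the sketch below shells out to `rclone copyto` to move a single file between local disk and a remote. It is not the code from Demo App 3: the remote name `r2`, the helper names, and the example paths are all hypothetical, and it assumes rclone is installed and already configured with a Cloudflare R2 remote.

```python
import subprocess
from typing import List


def build_copyto_cmd(src: str, dst: str) -> List[str]:
    """Build the argument list for `rclone copyto`, which copies one file to an exact destination path."""
    return ["rclone", "copyto", src, dst]


def upload_file(local_path: str, remote_path: str, remote: str = "r2") -> None:
    """Upload one local file to the remote. The remote name "r2" is an assumption."""
    cmd = build_copyto_cmd(local_path, f"{remote}:{remote_path}")
    subprocess.run(cmd, check=True)  # raises CalledProcessError if rclone fails


def download_file(remote_path: str, local_path: str, remote: str = "r2") -> None:
    """Download one file from the remote to a local path."""
    cmd = build_copyto_cmd(f"{remote}:{remote_path}", local_path)
    subprocess.run(cmd, check=True)
```

A job could call `download_file` before starting work and `upload_file` whenever it has output worth keeping, leaving the job queue responsible only for scheduling.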

Demo App 3 improves on Demo App 2 (v2) in two key areas:

Simplified Architecture: It removes the need for the application to manage jobs and leases itself, which cuts the demo app's Python code by roughly 30% (from about 600 lines to about 400).
Enhanced Task Duration: It removes the 12-hour cap that AWS SQS places on a single job run (the maximum visibility timeout of a received message), so longer-running tasks are supported on SaladCloud.
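To make the 12-hour limit concrete, here is a minimal sketch of the lease arithmetic an SQS-based worker has to do. It is an illustration, not code from either demo app, and the function name is hypothetical. SQS measures the 12-hour maximum from the moment the message was received, so each extension has to be clamped to the time that is left.

```python
SQS_MAX_VISIBILITY_S = 12 * 60 * 60  # SQS limit: 12 hours, counted from message receipt


def next_visibility_timeout(elapsed_s: int, extend_by_s: int) -> int:
    """Return the longest lease extension (in seconds from now) that stays within the cap.

    elapsed_s is the time since the message was received. A result of 0 means
    the lease can no longer be extended and the job would become visible to
    other workers.
    """
    remaining = SQS_MAX_VISIBILITY_S - elapsed_s
    return max(0, min(extend_by_s, remaining))


# A real worker would pass the result to boto3's
# sqs.change_message_visibility(QueueUrl=..., ReceiptHandle=..., VisibilityTimeout=...)
# on a timer; that call is omitted here so the sketch runs without AWS access.
```

With a queue that has no such cap, this bookkeeping and the periodic renewal loop around it are no longer needed in the application.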

Long-Running Tasks - Demo App 2 (v2) (deprecated)

Use AWS SQS as a job queue, while implementing custom data management (Cloudflare R2 + boto3).

High-Performance Inference Server

This implementation utilizes separate threads for I/O operations (including health checks) and AI inference, enabling efficient handling of concurrent requests with batched inference processing. It can be used for image generation, transcription, and non-streaming LLM tasks.
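The threading pattern described above can be sketched with the standard library alone. This is a simplified illustration under stated assumptions, not the repository's implementation: `fake_model`, `MAX_BATCH`, and `MAX_WAIT_S` are made-up stand-ins, and the calling thread plays the role of the I/O thread that would normally serve HTTP requests and health checks.

```python
import queue
import threading
import time
from concurrent.futures import Future

MAX_BATCH = 4      # assumed maximum batch size
MAX_WAIT_S = 0.05  # assumed time to wait for a batch to fill

requests_q: "queue.Queue" = queue.Queue()


def fake_model(batch):
    """Stand-in for a real batched inference call."""
    return [text.upper() for text in batch]


def submit(payload: str) -> Future:
    """Called from the I/O thread: enqueue a request and return immediately."""
    fut: Future = Future()
    requests_q.put((payload, fut))
    return fut


def inference_worker(stop: threading.Event) -> None:
    """Runs in its own thread: gather requests into a batch, then run the model once."""
    while not stop.is_set():
        try:
            first = requests_q.get(timeout=0.1)
        except queue.Empty:
            continue
        batch = [first]
        deadline = time.monotonic() + MAX_WAIT_S
        while len(batch) < MAX_BATCH:
            remaining = deadline - time.monotonic()
            if remaining <= 0:
                break
            try:
                batch.append(requests_q.get(timeout=remaining))
            except queue.Empty:
                break
        outputs = fake_model([payload for payload, _ in batch])
        for (_, fut), out in zip(batch, outputs):
            fut.set_result(out)  # wakes whoever is waiting on this request


if __name__ == "__main__":
    stop = threading.Event()
    worker = threading.Thread(target=inference_worker, args=(stop,), daemon=True)
    worker.start()
    futures = [submit(s) for s in ["hello", "salad"]]
    print([f.result(timeout=2) for f in futures])
    stop.set()
    worker.join(timeout=2)
```

Because the model call happens only in the worker thread, the I/O side never blocks on inference, so a health check can still be answered while a batch is being processed.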

High-Performance Storage

Benchmarks and best practices for designing a high-performance and cost-effective storage solution for applications on SaladCloud.

High-Performance Applications

A summary of the common challenges when migrating workloads from hyperscalers to SaladCloud, and best practices for successful application deployments.
