This repo contains code for an example ML Pipeline (Kubeflow) on Vertex AI. Instead of using the default Artifact URI as provided by Vertex AI, the file location is set manually.
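As an illustration, here is a minimal sketch of what overriding the default URI can look like inside a KFP (v2 SDK) component; the component name, bucket path, and file contents are placeholders, not this repo's exact code:

```python
from kfp import dsl
from kfp.dsl import Dataset, Output

@dsl.component(base_image="python:3.10")
def preprocess(dataset: Output[Dataset]):
    # Vertex AI assigns a default URI under the pipeline root; overwriting
    # .uri before writing stores the file at a location of our choosing.
    dataset.uri = "gs://my-bucket/datasets/preprocessed.csv"  # placeholder path
    with open(dataset.path, "w") as f:  # .path mirrors .uri as a local mount
        f.write("col_a,col_b\n1,2\n")
```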
To set up the environment, create a new venv and install the requirements:

```sh
python3 -m venv pipeline-env
source pipeline-env/bin/activate
pip install -r requirements.txt
```
If you later encounter issues with protobuf, try uninstalling and reinstalling it manually:

```sh
pip uninstall protobuf
pip install protobuf
```
There are two pipelines: one where data preprocessing is done inside the pipeline (pipeline.py), and one where a preprocessed dataset is loaded from an existing Artifact (train_only_pipeline.py).
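One plausible way to load that existing Artifact is KFP's dsl.importer; the sketch below assumes that mechanism and uses illustrative names, so it is not necessarily the repo's exact code:

```python
from kfp import dsl
from kfp.dsl import Dataset

@dsl.pipeline(name="train-only-pipeline")
def train_only_pipeline(data_file_location: str):
    # Import the already-preprocessed dataset as a Dataset artifact
    # rather than recomputing it inside the pipeline.
    dataset = dsl.importer(
        artifact_uri=data_file_location,
        artifact_class=Dataset,
        reimport=False,
    )
    # train_op(dataset=dataset.output)  # hypothetical downstream training step
```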
To run a pipeline, first specify a few variables in the pipeline file you're running (a placeholder sketch follows this list):

- project_id: Google Cloud Project the pipeline will run on
- pipeline_root_path: Root location to store pipeline files; must be a path to a folder on Google Cloud Storage
- data_file_location (only for train_only_pipeline): URI of the preprocessed Dataset Artifact; must be a valid file on Google Cloud Storage
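For reference, the assignments at the top of a pipeline file might look like this (all values below are placeholders, not real resources):

```python
project_id = "my-gcp-project"                        # placeholder project name
pipeline_root_path = "gs://my-bucket/pipeline-root"  # placeholder GCS folder
# Only needed for train_only_pipeline:
data_file_location = "gs://my-bucket/data/preprocessed_dataset"
```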
Then run one of the pipeline files:
```sh
python pipeline.py
python train_only_pipeline.py
```
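Under the hood, a file like this typically compiles the pipeline and submits it to Vertex AI Pipelines. A hedged sketch of that flow, where the pipeline function, output filename, and display name are assumptions rather than the repo's exact code:

```python
from kfp import compiler
from google.cloud import aiplatform

# Compile the @dsl.pipeline function to a job spec (filename is illustrative).
compiler.Compiler().compile(pipeline_func=pipeline, package_path="pipeline.json")

# Submit the compiled spec to Vertex AI Pipelines.
aiplatform.init(project=project_id)
job = aiplatform.PipelineJob(
    display_name="example-pipeline",  # assumed display name
    template_path="pipeline.json",
    pipeline_root=pipeline_root_path,
)
job.submit()
```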