Add support for new types which store hyperparameter tuning related metadata #138

bharathappali · 2024-06-13T07:51:39Z

Is your feature request related to a problem? Please describe.
Tracking and managing hyperparameter tuning experiments for machine learning models can be challenging without detailed information storage capabilities. The current Model Registry (MR) does not support storing comprehensive metadata related to hyperparameter configurations, trials, and experiments. This functionality is critical for optimizing model performance through Hyperparameter Optimization (HPO). The absence of these features makes it more difficult to run and manage HPO experiments efficiently and to maintain organized records of various trials and their configurations.

Describe the solution you'd like

Credits: The solution was explored and developed by Kusuma Chalasani (@kusumachalasani) & Nicholas Caughey (@ncau)

Introduction

The current Model Registry (MR) is designed to save and manage model metadata efficiently, allowing users to track and maintain the lifecycle of their machine learning models. This service is essential for ensuring that model information is easily accessible and utilizable.

However, users often require more detailed information about a model, such as the hyperparameters and values used during its creation or training. This is especially critical for those performing hyperparameter tuning, where multiple trials with varying hyperparameter configurations are conducted to optimize model performance.

Goal

To address this need, we propose enhancing the Model Registry to support the storage of detailed experiment, trial, and Hyperparameter Optimization (HPO) configuration information. Specifically, we aim to introduce new data types: HPO Experiment, HPO Trial, and HPO Config into the Model Registry.

Run an HPO experiment

Using a Jupyter notebook where the user can keep their laptop/ server ON for the whole period of the experiment which might take days to weeks based on the trials. This can be two ways - running the HPO experiment on a notebook server or on the Rayclusters.
Using Data Science Pipelines which is automated.

Hyper Parameter Optimization (HPO) Model Registry Integration

Overview

In order to integrate Hyper Parameter Optimization (HPO) functionality into OpenShift AI, we need to include additional types in the Model Registry (MR) to save the metadata of each HPO experiment. This work is detailed in the ADR: ODH-ADR-0011-hpo-raytune. We have minimized the number of new types and unnecessary additions by reusing existing ML Metadata (MLMD) and Kubeflow (KF) types.

New Types for HPO Integration

kf.HPOConfig (Artifact)

Captures the hyperparameter configuration for all trials of HPO experiments. Links one-to-one with trials.

Field	Type	Description
hpoconfig_id	string	URI of the artifact
name	string	Name of the artifact
state	ArtifactState	State of the artifact
description	string	Description of the configuration
configuration	string[map]	Metadata of the configuration
trial_id	string	ID of the trial the config is linked to
trial_name	string	Name of the trial the config is linked to

kf.HPOTrial (Context)

Provides information about individual trials conducted within an experiment, aiding in experiment management.

Field	Type	Description
trial_id	string	URI of the context
context_state	ContextState	State of the context
trial_name	string	Name of the model associated with this version

kf.HPOExperiment (Context)

The parent context of the trials, having a one-to-many relationship with them. Serves as a parent context for trials, facilitating the organization and comparison of experiments.

Field	Type	Description
experiment_name	string	Name of the experiment associated with this version
description	string	Description of the trial, optional field

rareddy · 2024-07-04T13:16:58Z

I am supportive of this feature. Have already started a POC as to how this extends the MR? I also suggest you come to MR community meeting to advance this.

ncau · 2024-07-04T14:39:27Z

I have yeah, quite a bit through it. Happy to join the community meeting to further this

github-actions · 2024-10-03T04:08:46Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

rareddy · 2024-10-03T17:45:21Z

@bharathappali any update you can provide on the collaboration of this feature?

ncau · 2024-10-03T22:25:01Z

Hi @rareddy sorry got bogged down with some other bits and pieces. I am working on this, I have a working version with the new types that I can share very soon.

rareddy · 2024-10-04T02:15:00Z

Hi @ncau that's super great news! would love to hear about it in one of the community meetings.

tarilabs · 2024-10-05T10:03:16Z

/assign @ncau

github-actions · 2025-01-04T04:07:54Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

github-actions bot added the lifecycle/stale label Oct 3, 2024

rareddy removed the lifecycle/stale label Oct 3, 2024

google-oss-prow bot assigned ncau Oct 5, 2024

github-actions bot added the lifecycle/stale label Jan 4, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for new types which store hyperparameter tuning related metadata #138

Add support for new types which store hyperparameter tuning related metadata #138

bharathappali commented Jun 13, 2024 •

edited

Loading

rareddy commented Jul 4, 2024

ncau commented Jul 4, 2024

github-actions bot commented Oct 3, 2024

rareddy commented Oct 3, 2024

ncau commented Oct 3, 2024 •

edited

Loading

rareddy commented Oct 4, 2024

tarilabs commented Oct 5, 2024

github-actions bot commented Jan 4, 2025

Add support for new types which store hyperparameter tuning related metadata #138

Add support for new types which store hyperparameter tuning related metadata #138

Comments

bharathappali commented Jun 13, 2024 • edited Loading

Introduction

Goal

Run an HPO experiment

Hyper Parameter Optimization (HPO) Model Registry Integration

Overview

New Types for HPO Integration

kf.HPOConfig (Artifact)

kf.HPOTrial (Context)

kf.HPOExperiment (Context)

rareddy commented Jul 4, 2024

ncau commented Jul 4, 2024

github-actions bot commented Oct 3, 2024

rareddy commented Oct 3, 2024

ncau commented Oct 3, 2024 • edited Loading

rareddy commented Oct 4, 2024

tarilabs commented Oct 5, 2024

github-actions bot commented Jan 4, 2025

bharathappali commented Jun 13, 2024 •

edited

Loading

ncau commented Oct 3, 2024 •

edited

Loading