feat: add model source properties to store metadata about origin of a model artifact, fixes RHOAIENG-19885 #838

dhirajsb · 2025-02-27T03:15:29Z

Description

Add the following model source properties to store metadata about origin of a model artifact
ModelSourceKind
ModelSourceClass
ModelSourceGroup
ModelSourceId
ModelSourceName

Note that these properties are used to reference an external source where a model artifact was created.
A fuller lineage model for tracking ML Experiments and Runs will be added in the future as model registry API resource types.
These properties also support referencing arbitrary model artifact sources, without constraining it to ML models only.

Fixes RHOAIENG-19885

How Has This Been Tested?

Added new model source properties to existing unit tests for model artifacts in backend service as well as Python client API and tests.

Merge criteria:

All the commits have been signed-off (To pass the DCO check)

The commits have meaningful messages; the author will squash them after approval or in case of manual merges will ask to merge with squash.
Testing instructions have been added in the PR body (for PRs involving changes that are not immediately obvious).
The developer has manually tested the changes and verified that the changes work.
Code changes follow the kubeflow contribution guidelines.

If you have UI changes

The developer has added tests or explained why testing cannot be added.
Included any necessary screenshots or gifs if it was a UI change.
Verify that UI/UX changes conform the UX guidelines for Kubeflow.

… model artifact, fixes RHOAIENG-19885 Signed-off-by: Dhiraj Bokde <[email protected]>

google-oss-prow · 2025-02-27T03:15:33Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
Once this PR has been reviewed and has the lgtm label, please assign tarilabs for approval. For more information see the Kubernetes Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

tarilabs

thank you @dhirajsb , overall is a good information to add to the logical model.

do we really need to denormalize between (Kind, Class) and (Id, Name)? unless I'm missing anythin in the example in the case of a PipelineRun one field would have been enough and it's up to the producer+consumer (as you rightly say) to manage this field to their need, that is if they want to make it even more explicit they could use the "FQDN" of the resource
why on ModelArtifact and not on ModelVersion as discussed? A ML model is an ensemble of assets with one "trainer" source.

dhirajsb · 2025-02-28T04:35:01Z

do we really need to denormalize between (Kind, Class) and (Id, Name)? unless I'm missing anythin in the example in the case of a PipelineRun one field would have been enough and it's up to the producer+consumer (as you rightly say) to manage this field to their need, that is if they want to make it even more explicit they could use the "FQDN" of the resource

The idea is to allow clients to search by increasing smaller groups of model sources. So, we can support search by kind, class, group, etc.

why on ModelArtifact and not on ModelVersion as discussed? A ML model is an ensemble of assets with one "trainer" source.

A version could be a collection of multiple model artifacts all coming from separate sources, huggingface, pipelines, etc. to create a composite model like an agentic system.

feat: add model source properties to store metadata about origin of a…

25cb51d

… model artifact, fixes RHOAIENG-19885 Signed-off-by: Dhiraj Bokde <[email protected]>

google-oss-prow bot requested review from andreyvelich and tarilabs February 27, 2025 03:15

github-actions bot added Area/MR Python client Area/Go REST server Area/Documentation labels Feb 27, 2025

google-oss-prow bot added the size/XXL label Feb 27, 2025

tarilabs reviewed Feb 27, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add model source properties to store metadata about origin of a model artifact, fixes RHOAIENG-19885 #838

feat: add model source properties to store metadata about origin of a model artifact, fixes RHOAIENG-19885 #838

dhirajsb commented Feb 27, 2025 •

edited

Loading

google-oss-prow bot commented Feb 27, 2025

tarilabs left a comment

dhirajsb commented Feb 28, 2025 •

edited

Loading

feat: add model source properties to store metadata about origin of a model artifact, fixes RHOAIENG-19885 #838

Are you sure you want to change the base?

feat: add model source properties to store metadata about origin of a model artifact, fixes RHOAIENG-19885 #838

Conversation

dhirajsb commented Feb 27, 2025 • edited Loading

Description

How Has This Been Tested?

Merge criteria:

google-oss-prow bot commented Feb 27, 2025

tarilabs left a comment

Choose a reason for hiding this comment

dhirajsb commented Feb 28, 2025 • edited Loading

dhirajsb commented Feb 27, 2025 •

edited

Loading

dhirajsb commented Feb 28, 2025 •

edited

Loading