feat(py): Add support for storing models in S3 #765

syntaxsdev · 2025-02-05T21:57:45Z

Within the Python client, users will be able to store models directly in S3-compatible object storage.
AWS S3 connection settings available in the environment are picked up automatically.

Description

The bulk of the changes are in clients/python/src/model_registry/_client.py.
E2E tests were added that test the functionality by creating a Minio instance.

The implementation lives in the ModelRegistry class.

Example usage:

mr = ModelRegistry(...)
mr.save_to_s3(
  path="models",
  bucket_name="default",
  endpoint_url="xxx",
  access_key_id="xxx",
  secret_access_key="xxx"
)

You can pass either a single file or a directory to the path parameter.
If the path is nested (e.g. data/models), the s3_prefix parameter is required so that objects are stored under the proper folder name.
Environment variables are picked up, but values supplied directly as parameters take precedence over mounted environment variables.
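
The precedence rule described above (explicit parameters first, environment second) could be sketched roughly as follows. This is an illustration only, not the code in this PR: the helper name resolve_s3_settings and the environment variable names in ENV_NAMES are assumptions made for the example.

```python
import os
from typing import Dict, Mapping, Optional

# Assumed environment variable names, chosen for illustration only;
# the PR description does not list which variables are actually read.
ENV_NAMES = {
    "endpoint_url": "AWS_S3_ENDPOINT",
    "access_key_id": "AWS_ACCESS_KEY_ID",
    "secret_access_key": "AWS_SECRET_ACCESS_KEY",
}


def resolve_s3_settings(
    endpoint_url: Optional[str] = None,
    access_key_id: Optional[str] = None,
    secret_access_key: Optional[str] = None,
    env: Optional[Mapping[str, str]] = None,
) -> Dict[str, str]:
    """Return S3 settings, preferring explicit arguments over the environment."""
    env = os.environ if env is None else env
    explicit = {
        "endpoint_url": endpoint_url,
        "access_key_id": access_key_id,
        "secret_access_key": secret_access_key,
    }
    resolved = {}
    for name, value in explicit.items():
        # An explicit argument wins; otherwise fall back to the environment.
        resolved[name] = value if value is not None else env.get(ENV_NAMES[name])
    # Fail early if a setting is absent from both sources.
    missing = [name for name, value in resolved.items() if not value]
    if missing:
        raise ValueError(f"Missing S3 settings: {', '.join(missing)}")
    return resolved
```

Passing env as a plain dict keeps the helper easy to exercise in tests without touching the real process environment.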

How Has This Been Tested?

Tested against a local and a remote instance of Minio S3.

The Makefile was changed to add a local Minio instance to the same Kind cluster that the Makefile target deploy-latest-mr uses.
A .env file is generated so that its variables can be used by the added tests.
Three new tests were added, covering storing a single model to S3 and storing a path of models recursively.
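
As a rough picture of the recursive case these tests cover, the sketch below pairs every file under a local path with the object key it might receive under an optional prefix. It is a hypothetical illustration, not the implementation in _client.py: plan_s3_keys is an invented helper, and the key layout (prefix followed by the path relative to the uploaded directory) is an assumption.

```python
from pathlib import Path
from typing import List, Optional, Tuple


def plan_s3_keys(path: str, s3_prefix: Optional[str] = None) -> List[Tuple[Path, str]]:
    """Pair each local file under `path` with a candidate S3 object key."""
    root = Path(path)
    prefix = (s3_prefix or "").strip("/")
    if root.is_file():
        # A single file is keyed by its file name.
        files = [root]
        base = root.parent
    else:
        # A directory is walked recursively; keys keep the relative layout.
        files = sorted(p for p in root.rglob("*") if p.is_file())
        base = root
    plan = []
    for file in files:
        relative = file.relative_to(base).as_posix()
        key = f"{prefix}/{relative}" if prefix else relative
        plan.append((file, key))
    return plan
```

Each (file, key) pair could then be handed to an S3 client, for example boto3's upload_file(Filename, Bucket, Key); that step is left out here so the sketch runs without any storage backend.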

Tests are passing when completely rebuilding and testing the new e2e workflow

To test, run make test-e2e as per usual.

Merge criteria:

All the commits have been signed-off (To pass the DCO check)

The commits have meaningful messages; the author will squash them after approval or in case of manual merges will ask to merge with squash.
Testing instructions have been added in the PR body (for PRs involving changes that are not immediately obvious).
The developer has manually tested the changes and verified that the changes work.
Code changes follow the kubeflow contribution guidelines.

If you have UI changes

The developer has added tests or explained why testing cannot be added.
Included any necessary screenshots or gifs if it was a UI change.
Verify that UI/UX changes conform to the UX guidelines for Kubeflow.

Signed-off-by: syntaxsdev <[email protected]>

clients/python/src/model_registry/_client.py

tarilabs

thank you @syntaxsdev for this!

some initial comments below.
which type of test can we consider to make sure the functionality is covered?

I'm thinking we could have some dedicated e2e test by extending the current opt-in pytest mechanism and deploy minio in that "scenario" of e2e testing. Do you have some additional ideas?

clients/python/src/model_registry/_client.py


clients/python/tests/conftest.py

clients/python/tests/test_client.py


tarilabs

I would like to make sure the end result works as expected for KServe (not necessarily as a test in the PR itself; checking manually is also fine)

otherwise
/lgtm
already thanks @syntaxsdev

clients/python/src/model_registry/_client.py

clients/python/tests/conftest.py

tarilabs · 2025-02-17T08:16:15Z

( I also believe this PR needs rebasing to account for more recent openapi and generated changes )


syntaxsdev · 2025-02-18T21:47:22Z

Confirmed functionality works similarly by going through other kubeflow saving model demos


rareddy · 2025-02-25T13:19:10Z

scripts/deploy_minio_on_kind.sh

+#!/usr/bin/env bash
+
+set -e


@tarilabs @syntaxsdev one question I have is that I was assuming we are using pytest libraries for any cluster management too, is that a wrong assumption?

no we are not "orchestrating cluster setup from pytest" if that was the question, as the request was to stick to makefile and scripting; the downside is netting on a basic common ground, but I see the advantage as reuse from Go or else in the future if needed.

Correct for reuse and manageability, this was the topic in shift left. @lugi0 might have some comments on this. I am not asking to change this PR but maybe we can capture a follow-up if that is the overall direction we need to be writing the tests with, if not we can ignore.

I'm in favour of revising the setup; as you know, I personally prefer to reduce Makefile and scripts to the minimum possible, but we do need to account for multiple factors.

rareddy · 2025-02-25T14:10:53Z

@lucferbux can you look into the issues with the UI tests here?

lucferbux · 2025-02-25T14:12:53Z

Agh, this might happen due to latest config PR cc @christianvogt I'm gonna take a look

christianvogt · 2025-02-25T14:40:19Z

A fix for the tests has been posted here #832

dhirajsb

API looks good to me, we should verify Makefile changes on all platforms.

clients/python/src/model_registry/_client.py

clients/python/Makefile


tarilabs

thank you @syntaxsdev, and all, for this PR; this is a good foundational step to make life easier for the DS + MLOps Eng to orchestrate the Store-then-Register flow!

/lgtm
/approve

google-oss-prow · 2025-02-27T07:12:12Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: tarilabs

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [tarilabs]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

feat(py): add support storing models in S3 - draft

48b1043

Signed-off-by: syntaxsdev <[email protected]>

google-oss-prow bot added the do-not-merge/work-in-progress label Feb 5, 2025

google-oss-prow bot requested review from andreyvelich, tarilabs and zijianjoy February 5, 2025 21:57

github-actions bot added the Area/MR Python client label Feb 5, 2025

google-oss-prow bot added the size/L label Feb 5, 2025

syntaxsdev commented Feb 5, 2025

clients/python/src/model_registry/_client.py Outdated

tarilabs reviewed Feb 5, 2025

clients/python/src/model_registry/_client.py Outdated

clients/python/src/model_registry/_client.py Outdated

clients/python/src/model_registry/_client.py Outdated

feat: added init tests for storing to s3, created e2e pipeline for te…

afdc8cf

…sting with minio locally Signed-off-by: syntaxsdev <[email protected]>

syntaxsdev force-pushed the feat/store-on-s3-py branch from 176b6a8 to afdc8cf Compare February 7, 2025 19:56

syntaxsdev commented Feb 7, 2025

clients/python/tests/conftest.py

syntaxsdev commented Feb 7, 2025

clients/python/tests/test_client.py Outdated

Merge branch 'kubeflow:main' into feat/store-on-s3-py

a1556c8

tarilabs mentioned this pull request Feb 14, 2025

Export Models to Kubeflow Model Registry kubeflow/trainer#2438

Open

feat: updated e2e tests, added logic for recursively saving models/ar…

b7f28fb

…tifacts in paths Signed-off-by: syntaxsdev <[email protected]>

google-oss-prow bot added size/XL and removed size/L labels Feb 17, 2025

tarilabs reviewed Feb 17, 2025

clients/python/src/model_registry/_client.py Outdated

clients/python/tests/conftest.py

google-oss-prow bot assigned tarilabs Feb 17, 2025

google-oss-prow bot added the lgtm label Feb 17, 2025

syntaxsdev added 2 commits February 17, 2025 10:03

Merge branch 'fork-main' into feat/store-on-s3-py

e4c1404

fix: s3 uri for nested artifacts, added tests case around usecase

37fc994

Signed-off-by: syntaxsdev <[email protected]>

google-oss-prow bot removed the lgtm label Feb 17, 2025

syntaxsdev added 2 commits February 17, 2025 12:42

feat: added additional s3_prefix param so models dont save to root …

89eb913

…bucket, additional test coverage Signed-off-by: syntaxsdev <[email protected]>

fix: Makefile env var fix for monkeypatching

a889bcc

Signed-off-by: syntaxsdev <[email protected]>

Merge branch 'kubeflow:main' into feat/store-on-s3-py

677688c

syntaxsdev requested a review from tarilabs February 18, 2025 21:50

fix: workaround for env vars

624963f

Signed-off-by: syntaxsdev <[email protected]>

syntaxsdev force-pushed the feat/store-on-s3-py branch from c1c209d to 624963f Compare February 20, 2025 23:45

fix: pytest labels

2aeb44a

Signed-off-by: syntaxsdev <[email protected]>

tarilabs mentioned this pull request Feb 24, 2025

Add save_to_oci_registry python client method #800

Merged

7 tasks

syntaxsdev and others added 3 commits February 24, 2025 15:50

Merge branch 'main' into feat/store-on-s3-py

25d1d17

Signed-off-by: Sidney Glinton <[email protected]>

Merge branch 'main' into feat/store-on-s3-py

d9fc4f1

Signed-off-by: Matteo Mortari <[email protected]>

chore: rerun poetry lock --no-update

763cbde

Signed-off-by: Matteo Mortari <[email protected]>

rareddy reviewed Feb 25, 2025

rareddy mentioned this pull request Feb 25, 2025

update frontend to support module federation #798

Merged

7 tasks

dhirajsb reviewed Feb 26, 2025

clients/python/src/model_registry/_client.py Outdated

clients/python/Makefile

tarilabs added 3 commits February 26, 2025 10:21

Merge remote-tracking branch 'upstream/main' into feat/store-on-s3-py

d06e04a

Signed-off-by: Matteo Mortari <[email protected]>

make: add target likely lost on rebase

c899e31

Signed-off-by: Matteo Mortari <[email protected]>

test: add boto3 to nox

dcdf02e

Signed-off-by: Matteo Mortari <[email protected]>

github-actions bot added the Area/GitHub label Feb 26, 2025

tarilabs force-pushed the feat/store-on-s3-py branch 2 times, most recently from e2313b8 to c27af31 Compare February 26, 2025 12:30

test: add wiring of Minio to Nox

6d0aeab

Signed-off-by: Matteo Mortari <[email protected]>

tarilabs force-pushed the feat/store-on-s3-py branch from c27af31 to 6d0aeab Compare February 26, 2025 12:41

implement code review feedback

9883236

Signed-off-by: Matteo Mortari <[email protected]>

tarilabs force-pushed the feat/store-on-s3-py branch from f1c1d45 to 9883236 Compare February 26, 2025 19:18

test: remove case no longer needed

9a1d043

Signed-off-by: Matteo Mortari <[email protected]>

tarilabs approved these changes Feb 27, 2025

google-oss-prow bot added the lgtm label Feb 27, 2025

google-oss-prow bot added the approved label Feb 27, 2025

google-oss-prow bot merged commit 90ba800 into kubeflow:main Feb 27, 2025
17 checks passed

tarilabs mentioned this pull request Mar 3, 2025

py: add basic service URL resolver #421

Closed

7 tasks
