-
Notifications
You must be signed in to change notification settings - Fork 35
Use v1 DRA APIs #732
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Use v1 DRA APIs #732
Conversation
|
/cc @xieshenzh |
I think this should be updated as well: k8s-nim-operator/internal/webhook/apps/v1alpha1/nimservice_webhook_validation_helper.go Lines 513 to 518 in f628f6c
By the way, I believe the annotation should be changed to |
|
These test cases should be updated, since the default mode is supposed to be read from a configmap when the annotation is absent: k8s-nim-operator/internal/webhook/apps/v1alpha1/nimservice_webhook_validation_helper_test.go Lines 1002 to 1080 in f628f6c
|
|
These examples should be updated as well, since |
|
Overall looks good other than KServe comments. We need to update the sample here too: https://github.com/NVIDIA/k8s-nim-operator/blob/main/config/samples/nim/serving/advanced/dra/manual/llm.yaml |
@xieshenzh Do you mean that we need to setup a configmap with default deployment mode as KNative in these tests? What's the expected behavior if the configmap is empty and we don't set any annotation in the ISvc? this testcase indicates the configmap being empty is valid. |
Signed-off-by: Varun Ramachandra Sekar <[email protected]>
Signed-off-by: Varun Ramachandra Sekar <[email protected]>
Signed-off-by: Varun Ramachandra Sekar <[email protected]>
As far as I know, serverless deployment is no longer supported on RHOAI 3.x. So, the kserve community is changing the default deployment mode from serverless to standard/rawdeployment. With the current kserve code, if the configmap is empty, the deploymentMode will be empty. I think the expected behavior if the configmap is empty is to use the standard mode. Otherwise, it is also acceptable to return an error and avoid creating an inferenceservice, if the configmap is empty. |
I see here that the predictor reconciler defaults to knative even though we're expected to exit early in case of missing knative CRDs
isvcutils.GetDeploymentMode seems to provide the default deployment. Isn't this behavior sufficient? Or do you think we should set this as an annotation explicitly from the NIMService? |
For RHOAI, it is sufficient. The configmap is not supposed to be empty, if the kserve is installed with RHOAI. But I am not sure if the configmap could be empty, when installing the community version of kserve. |
|
@varunrsekar What will be upgrade story when user who are on < v1.34 want to just upgrade the NIM Operator? |
DRA is GA in k8s v1.34! This change bumps up the consumed DRA APIs from v1beta2 -> v1 api version.
Changes:
Note:
Testing: