1 | initial version |
Sure, here are a few troubleshooting steps you can take:
Check if the kserve deployment is running by running the following command in a new terminal window:
kubectl get pods
If the deployment is running, you should see an output similar to this:
NAME READY STATUS RESTARTS AGE
kserve-models-serving-<random-string>-<random-string> 2/2 Running 0 5m
Check if the kserve deployment has the correct port configured. The default port for kserve is 8080. You can check this by running the following command:
kubectl get services
Look for the service associated with your kserve deployment. The name will be similar to kserve-models-serving
. Check the value in the PORT(S)
column. It should be 8080/TCP
.
Ensure that the endpoint you are trying to access is correct. In the tutorial, you should be able to access the endpoint using curl
like this:
curl -X POST http://<kserve-service-url>/v1/models/sklearn-iris/:predict -d '{"data": {"ndarray": [[5.2, 4.1, 1.5, 0.1]]}}'
Make sure you replace <kserve-service-url>
with the actual URL of your kserve deployment.
Check the logs of your kserve deployment to see if there are any errors. You can use the following command to view the logs:
kubectl logs <kserve-pod-name>
Replace <kserve-pod-name>
with the name of the kserve pod that you retrieved using the kubectl get pods
command.
Hopefully, one of these steps will help you fix the 404 error you are encountering. Let me know if you need further assistance!