781 questions
- Bountied 0
- Unanswered
- Frequent
- Score
- Trending
- Week
- Month
- Unanswered (my tags)
0
votes
1
answer
109
views
How to get all non-registered models in MLflow?
I am using MLflow (3.7.0) with PyTorch (2.5.1 + CUDA).
How can I get model(NO REGISTERED) names from MLflow?
I have a function that saves a model whenever it improves compared to the previous epoch. I ...
2
votes
0
answers
64
views
How log model in mlflow using Spark Connect
I have the following setup:
Kubernetes cluster with Spark Connect 4.0.1 and
MLflow tracking server 3.5.0
MLFlow tracking server should serve all artifacts and is configured this way:
--backend-store-...
0
votes
0
answers
16
views
How to replace MlFlow model
I have an MLflow model deployed in SageMaker, but when I try to update it I get this error:
MlflowException: You are attempting to deploy an application with name: titanic-xgboost-endpoint in 'create' ...
Best practices
0
votes
0
replies
41
views
AWS sagemaker Model Registry Vs. MLflow
In the AWS environment, which of Sagemaker Registry or MLFlow do you suggest?
If we want to use grid search to experiment with different configurations and log all experiments to decide which model to ...
3
votes
0
answers
145
views
MLflow doesn’t log or show model artifacts after training run
I’m working on a machine learning project using MLflow for experiment tracking (on macOS, Python 3.12, scikit-learn, and DagsHub as the tracking server). The experiment runs successfully — I see the ...
0
votes
0
answers
44
views
How to register a saved and logged model in mlflow?
I've been running numerous experiments tuning hyperparameters for anomaly detection models. I've logged parameters, metrics, and artifacts, and at the end of the experiment, I've also logged models ...
0
votes
0
answers
29
views
how to link the mlflow model to the run when the model is used as input?
When a model is logged from a training run in mlflow, the model is shown in the UI, under the logged models tab. This should also work when doing an evaluation run, so when the model is used as input ...
0
votes
0
answers
85
views
How to load a model while ignoring unbuilt head layers? (`expected 2 variables, received 0`)
I’m loading a costume ViT backbone saved via MLflow’s TensorFlow flavor (Keras 3). The artifact includes backbone parts I want (patch_embed, encoder) a couple of layers in the encoder were saved in a ...
2
votes
0
answers
75
views
mlflow unable to generate input output schema when using log_model()
I am using the latest mlflow 3.3.2 with a simple PyTorch implementation of SRCNN. I am able to log the parameters as well as the metrics for the training process as well as generate custom ...
0
votes
0
answers
146
views
Why does mlflow.langchain.autolog() not capture traces when using a custom LangGraph StateGraph?
I’m trying to integrate MLflow autologging with a LangGraph-based agent in my project.
According to the MLflow docs https://mlflow.org/docs/latest/genai/tracing/integrations/listing/langgraph/?...
0
votes
0
answers
241
views
MLFlow hangs on pytorch log_model for large model file
When trying to upload a VLM model artifact to a remote tracking uri, the MLFlow run hangs and fails after a few hours. I am using Mlflow 3.1.1.
The model file is aroung 5GB, we've set the server ...
0
votes
0
answers
167
views
mlflow cannot fetch model registered on GitLab model registry
I’m trying to download artifacts and a model that has been saved using MLflow on GitLab model registry.
That's the working part of my code, where I set up the client, create model and version and ...
1
vote
1
answer
86
views
Missing variable inputs to LLM metric created by make_genai_metric_from_prompt()
I've defined a custom LLM metric with a couple of f-string formatted variables using the make_genai_metric_from_prompt() function, and when I pass this custom metric to mlflow.evaluate() with a pandas ...
0
votes
0
answers
79
views
Log Metrics in Azure using MLflow
I have a component registered within MLStudio that contains code to run a Promptflow pipeline. I am executing the flow using an AzureML pipeline, following the documentation and example notebook. The ...
0
votes
1
answer
217
views
filedescriptor out of range in select()
I am running a training job using MLflow and an MLflow recipe. In the recipe.train step, the training starts an experiment and runs for 350 epochs. After the 350 epochs are completed and I try to log ...