FAQ

What does deprecated mean?

A deprecated model is no longer actively maintained: it will not receive updates or regular maintenance. It remains fully functional for inference and fine-tuning (if applicable).

Deprecation typically occurs because of low usage. Customers can always continue using deprecated models, but we recommend migrating to actively maintained alternatives. For example, Meta Llama 3.1 may be deprecated while Meta Llama 3.3 is available.

If we see no usage of a deprecated model for multiple months, we remove it. For reference, here is the list of models we have removed from the catalog so far:

Model Name	Removal Date
Meta-Llama-3-70b	2025-10-05
google/gemma-7b (train only)	2025-10-05
mistralai/Mixtral-8x7B-v0.1 (train only)	2025-10-05
mistralai/Mistral-7B-v0.1 (train only)	2025-10-05
HuggingFaceH4/zephyr-7b-beta	2025-10-05
meta-llama/Meta-Llama-3.1-405B-Instruct	2025-10-05
meta-llama/meta-llama-3-70b-instruct	2025-10-05
google/paligemma2-3b-mix-448	2025-10-05
google/paligemma2-10b-mix-448	2025-10-05
google/paligemma2-28b-mix-448	2025-10-05

Why do I need to provide a HF Token?

Providing a Hugging Face token is the best practice for making sure your calls to the Hugging Face Hub are authenticated. It also gives you higher rate limits.
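As a rough illustration of what "authenticated" means here, the sketch below builds the HTTP header that carries a token on requests to the Hugging Face Hub. The helper name hub_auth_headers is hypothetical, and reading the token from the HF_TOKEN environment variable is an assumption made for this example.

```python
import os


def hub_auth_headers(token=None):
    """Return HTTP headers that authenticate a request to the Hugging Face Hub.

    Hypothetical helper for illustration. If no token is passed, it falls back
    to the HF_TOKEN environment variable (an assumption for this sketch).
    """
    token = token or os.environ.get("HF_TOKEN")
    if not token:
        # Without a token the request is anonymous: lower rate limits apply
        # and gated models cannot be downloaded.
        return {}
    # The Hub accepts user access tokens as a standard Bearer token.
    return {"Authorization": f"Bearer {token}"}
```

In practice the container and the Hugging Face client libraries handle this for you once HF_TOKEN is set, so the helper only shows what happens under the hood.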

How do I migrate to the new container system?

From 1st February 2026, Dell Enterprise Hub will only offer containers that do not include model weights. This reduces image size and better decouples the model lifecycle from the environment lifecycle. It is also the foundation for offering better security and compliance with Dell Enterprise Hub.

New Models

All new model images added to the Dell Enterprise Hub since October 2025 (i.e. those labeled as New in the catalog, such as ibm-granite/granite-4.0-h-micro or openai/gpt-oss-20b) do not contain pre-downloaded model weights in any of the offered Docker image tags.

Previous Models

For all previous model images (i.e. those not labeled as New), you can expect no breaking changes when using the latest image tag, since those images still contain model weights. However, the latest versions of these images will be replaced from 1st February 2026 onwards, so we encourage users to transition to the new container system.

To make this transition possible, images without weights are already available under specific version tags (e.g. registry.dell.huggingface.co/enterprise-dell-inference-google-gemma-3-12b-it:tgi-3.3.6).

Migration Steps

When using these images, model weights are downloaded when the container starts, so make sure to set both HF_TOKEN and MODEL_ID when running your Docker container. The token grants access to gated models (if applicable) and ensures you don't get rate-limited when pulling the weights from the Hugging Face Hub.
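A minimal sketch of such an invocation follows, written as a small Python helper that assembles the docker run command. The helper name build_docker_run is hypothetical, and the MODEL_ID value google/gemma-3-12b-it is an assumption inferred from the image name; check the model card in the catalog for the exact value. GPU, port and volume flags depend on your deployment and are deliberately left out.

```python
import shlex


def build_docker_run(image, model_id):
    """Assemble a `docker run` command for a weight-less image.

    Hypothetical helper for illustration; add the GPU, port and volume flags
    that your deployment needs.
    """
    return [
        "docker", "run", "--rm",
        # `-e HF_TOKEN` with no value forwards HF_TOKEN from the calling
        # shell, so the token itself never appears on the command line.
        "-e", "HF_TOKEN",
        "-e", f"MODEL_ID={model_id}",
        image,
    ]


IMAGE = "registry.dell.huggingface.co/enterprise-dell-inference-google-gemma-3-12b-it:tgi-3.3.6"
MODEL_ID = "google/gemma-3-12b-it"  # assumption: inferred from the image name

command = build_docker_run(IMAGE, MODEL_ID)
print(shlex.join(command))
```

Export HF_TOKEN in your shell before running the printed command, or pass the list to subprocess.run(command, check=True) from an environment where HF_TOKEN is set.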

What is a gated model?

To give more control over how models are used, the Hub allows model authors to enable access requests for their models. A model with access requests enabled is called a gated model.

As a user, if you want to use a gated model, you need to request access to it, which means you must be logged in to a Hugging Face user account. Access can only be requested from your browser: go to the model page on the Hub and request access from the model provider.