1. Ensure you have the necessary hardware and software, as outlined on the Prerequisites page.

  2. Follow the Installation guide to download the Cosmos-Predict2 repo and set up the environment.

  3. Generate a Hugging Face access token. Set the access token permission to ‘Read’ (the default permission is ‘Fine-grained’).

  4. Log in to Hugging Face with the access token:

    huggingface-cli login
    
  5. Review and accept the Llama-Guard-3-8B terms.

  6. Download the model weights for Cosmos-Predict2-2B-Text2Image and Cosmos-Predict2-14B-Text2Image from Hugging Face:

    python -m scripts.download_checkpoints --model_types text2image --model_sizes 2B 14B
    

    Tip

    Change the --model_sizes parameter as needed if you only need one of the 2B/14B models.