Between 1 and 8 GPUs with 27 to 80 GB of memory, depending on the desired training setup (assuming the default bfloat16 data type). See this FAQ on our project website for details. First, set up a conda environment (see ...