πŸ“„ Llama 3 1 405B Instruct Fp8ΒΆ

engines.rits.llama_3_1_405b_instruct_fp8

RITSInferenceEngine(
    model_name="meta-llama/llama-3-1-405b-instruct-fp8",
    max_tokens=2048,
    seed=42,
)
[source]

Read more about catalog usage here.