π Llama 3 1 405B Instruct Fp8ΒΆ
engines.rits.llama_3_1_405b_instruct_fp8
RITSInferenceEngine(
model_name="meta-llama/llama-3-1-405b-instruct-fp8",
max_tokens=2048,
seed=42,
)
[source]Read more about catalog usage here.
engines.rits.llama_3_1_405b_instruct_fp8
RITSInferenceEngine(
model_name="meta-llama/llama-3-1-405b-instruct-fp8",
max_tokens=2048,
seed=42,
)
[source]Read more about catalog usage here.