DeepSeek-R1-Distill-Llama-70B

Reasoning LLM

The DeepSeek-R1-Distill-Llama-70B model is a model trained via large-scale reinforcement learning. It was released by DeepSeek on January 20, 2025, and it is a distilled version of the Llama 3.3 70B model. The knowledge cutoff date for this model is July 1, 2024.

About DeepSeek-R1-Distill-Llama-70B model

Published on huggingface

20/01/2025


Input price

0.67 /Mtoken(input)

Output price

0.67 /Mtoken(output)


Supported Features
Function callingReasoningStreaming
Output Formats
raw_textjson_objectjson_schema
Context Sizes
131k
Parameters
70B

Try out the model by playing with it.