deepseek r1 quantized model

deepseek v3 benchmark