INFO:
1.58bit量化671B的DeepSeekR1模型,在CPU上缓慢推理或者2x H100 80GB