empirischtech/DeepSeek-R1-Distill-Qwen-32B-gptq-4bit

https://huggingface.co/empirischtech/DeepSeek-R1-Distill-Qwen-32B-gptq-4bit
Activeby empirischtech34715updated 1 month ago

A domain-optimized reasoning model built on DeepSeek-R1-Distill-Qwen-32B, refined through a multi-stage pipeline of GPTQ quantization-aware training and QLoRA fine-tuning. Achieves 84% on MedQA — within 4 points of GPT-4o — in a ~20GB package that fits on a single L40/L40s GPU.

Sourced from

  • HuggingFaceempirischtech/DeepSeek-R1-Distill-Qwen-32B-gptq-4bit

Related resources

Active454 months ago
Python

The MediPhi Model Collection comprises 7 small language models of 3.8B parameters from the base model Phi-3.5-mini-instruct specialized in the medical and clinical domains. The collection is designed in a modular fashion. Five MediPhi experts are fine-tuned on various medical corpora (i.e.

Active2K5 months ago
Python

In search enginers, rerankers are crucial for improving the accuracy of your retrieval system.

Active22.9K3 months ago
Python

This model had been created as part of joint research of HUMADEX research group (https://www.linkedin.com/company/101563689/) and has received funding by the European Union Horizon Europe Research and Innovation Program project SMILE (grant number 101080923) and Marie Skłodowska-Curie Actions…

Idle3361 year ago

Original code at https://github.com/Edoar-do/HuBERT-ECG

Active2.2K6 days ago
Python