empirischtech/DeepSeek-R1-Distill-Qwen-32B-gptq-4bit
https://huggingface.co/empirischtech/DeepSeek-R1-Distill-Qwen-32B-gptq-4bitA domain-optimized reasoning model built on DeepSeek-R1-Distill-Qwen-32B, refined through a multi-stage pipeline of GPTQ quantization-aware training and QLoRA fine-tuning. Achieves 84% on MedQA — within 4 points of GPT-4o — in a ~20GB package that fits on a single L40/L40s GPU.
Sourced from
- HuggingFace — empirischtech/DeepSeek-R1-Distill-Qwen-32B-gptq-4bit
Related resources
microsoft/MediPhi-Instruct
by microsoftThe MediPhi Model Collection comprises 7 small language models of 3.8B parameters from the base model Phi-3.5-mini-instruct specialized in the medical and clinical domains. The collection is designed in a modular fashion. Five MediPhi experts are fine-tuned on various medical corpora (i.e.
zeroentropy/zerank-1-small-reranker
by zeroentropyIn search enginers, rerankers are crucial for improving the accuracy of your retrieval system.
This model had been created as part of joint research of HUMADEX research group (https://www.linkedin.com/company/101563689/) and has received funding by the European Union Horizon Europe Research and Innovation Program project SMILE (grant number 101080923) and Marie Skłodowska-Curie Actions…
Original code at https://github.com/Edoar-do/HuBERT-ECG