Vividh-ASR: Diagnosing and Fixing Studio-Bias in Whisper for Indic Languages adalat-ai • about 13 hours ago • 11
Training-Free Reasoning at 88.89% on GPQA Diamond: How Darwin Family Hit Frontier Scores Without a Single Gradient Step FINAL-Bench • about 15 hours ago • 11
"OncoAgent: A Dual-Tier Multi-Agent Framework for Privacy-Preserving Oncology Clinical Decision Support" lablab-ai-amd-developer-hackathon • 6 days ago • 9
CyberSecQwen-4B: Why Defensive Cyber Needs Small, Specialized, Locally-Runnable Models lablab-ai-amd-developer-hackathon • 7 days ago • 7
Hugging Face on JFrog Artifactory: An Enterprise Guide (and What Changes in June 2026) jeffboudier • 7 days ago • 4
How to Comply with SOC 2 and ISO 27001 with Hugging Face: A Practical Guide to AI Model Supply Chain Governance jeffboudier • about 22 hours ago • 4
A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond karina-zadorozhny • Jan 19 • 18
QVAC MedPsy: State-of-the-Art Medical and Healthcare Language Models for Edge Devices qvac • 8 days ago • 15
🧠 I trained my own French LLM from scratch — alone, with a 1080 Ti, and the power went out ⚡🇫🇷 RDTvlokip • 10 days ago • 6
makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch AviSoori1x • May 7, 2024 • 121
Vividh-ASR: Diagnosing and Fixing Studio-Bias in Whisper for Indic Languages adalat-ai • about 13 hours ago • 11
Training-Free Reasoning at 88.89% on GPQA Diamond: How Darwin Family Hit Frontier Scores Without a Single Gradient Step FINAL-Bench • about 15 hours ago • 11
"OncoAgent: A Dual-Tier Multi-Agent Framework for Privacy-Preserving Oncology Clinical Decision Support" lablab-ai-amd-developer-hackathon • 6 days ago • 9
CyberSecQwen-4B: Why Defensive Cyber Needs Small, Specialized, Locally-Runnable Models lablab-ai-amd-developer-hackathon • 7 days ago • 7
Hugging Face on JFrog Artifactory: An Enterprise Guide (and What Changes in June 2026) jeffboudier • 7 days ago • 4
How to Comply with SOC 2 and ISO 27001 with Hugging Face: A Practical Guide to AI Model Supply Chain Governance jeffboudier • about 22 hours ago • 4
A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond karina-zadorozhny • Jan 19 • 18
QVAC MedPsy: State-of-the-Art Medical and Healthcare Language Models for Edge Devices qvac • 8 days ago • 15
🧠 I trained my own French LLM from scratch — alone, with a 1080 Ti, and the power went out ⚡🇫🇷 RDTvlokip • 10 days ago • 6
makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch AviSoori1x • May 7, 2024 • 121