Community Blog & Articles

Community Articles

Training-Free Reasoning at 88.89% on GPQA Diamond: How Darwin Family Hit Frontier Scores Without a Single Gradient Step

KV Caching Explained: Optimizing Transformer Inference Efficiency

"OncoAgent: A Dual-Tier Multi-Agent Framework for Privacy-Preserving Oncology Clinical Decision Support"

lablab-ai-amd-developer-hackathon

•

6 days ago

• 9

CyberSecQwen-4B: Why Defensive Cyber Needs Small, Specialized, Locally-Runnable Models

lablab-ai-amd-developer-hackathon

•

7 days ago

• 7

NEO-unify: Building Native Multimodal Unified Models End to End

Introducing the agentic robotics appstore for 10,000 Reachy Minis

Hugging Face on JFrog Artifactory: An Enterprise Guide (and What Changes in June 2026)

How to Comply with SOC 2 and ISO 27001 with Hugging Face: A Practical Guide to AI Model Supply Chain Governance

From GRPO to DAPO and GSPO: What, Why, and How

A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond

QVAC MedPsy: State-of-the-Art Medical and Healthcare Language Models for Edge Devices

Pallas for people who know JAX but not kernels yet

🧠 I trained my own French LLM from scratch — alone, with a 1080 Ti, and the power went out ⚡🇫🇷

Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality

May 14, 2026

transformerspytorchoptimization

Unlocking asynchronicity in continuous batching

May 14, 2026

Building Blocks for Foundation Model Training and Inference on AWS

May 11, 2026

EMO: Pretraining mixture of experts for emergent modularity

May 8, 2026

vLLM V0 to V1: Correctness Before Corrections in RL

May 6, 2026

audiospeechleaderboard

Adding Benchmaxxer Repellant to the Open ASR Leaderboard

May 6, 2026

Granite 4.1 LLMs: How They’re Built

April 29, 2026

llmsinference-providersdeepinfra

DeepInfra on Hugging Face Inference Providers 🔥

April 29, 2026

Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents

April 28, 2026

openaiprivacy-filterweb-apps

How to build scalable web apps with OpenAI's Privacy Filter

April 27, 2026

llmmoelong-context

DeepSeek-V4: a million-token context that agents can actually use

April 24, 2026

guidetransformers.jsjavascript

How to Use Transformers.js in a Chrome Extension

April 23, 2026

QIMMA قِمّة ⛰: A Quality-First Arabic LLM Leaderboard

April 21, 2026

cybersecurityopen-sourcecommunity

AI and the Future of Cybersecurity: Why Openness Matters

April 21, 2026

Community Articles

NEW Articles from Team or Enterprise organizations will get promoted to the main section.

Two Years of Local AI on a Laptop: When Open Models Outpaced Moore's Law

Vividh-ASR: Diagnosing and Fixing Studio-Bias in Whisper for Indic Languages

Training-Free Reasoning at 88.89% on GPQA Diamond: How Darwin Family Hit Frontier Scores Without a Single Gradient Step

KV Caching Explained: Optimizing Transformer Inference Efficiency

"OncoAgent: A Dual-Tier Multi-Agent Framework for Privacy-Preserving Oncology Clinical Decision Support"

lablab-ai-amd-developer-hackathon

•

6 days ago

• 9

CyberSecQwen-4B: Why Defensive Cyber Needs Small, Specialized, Locally-Runnable Models

lablab-ai-amd-developer-hackathon

•

7 days ago

• 7

NEO-unify: Building Native Multimodal Unified Models End to End

Introducing the agentic robotics appstore for 10,000 Reachy Minis

Hugging Face on JFrog Artifactory: An Enterprise Guide (and What Changes in June 2026)

How to Comply with SOC 2 and ISO 27001 with Hugging Face: A Practical Guide to AI Model Supply Chain Governance

From GRPO to DAPO and GSPO: What, Why, and How

A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond

QVAC MedPsy: State-of-the-Art Medical and Healthcare Language Models for Edge Devices

Pallas for people who know JAX but not kernels yet

🧠 I trained my own French LLM from scratch — alone, with a 1080 Ti, and the power went out ⚡🇫🇷

Safety Evals Should Project Test-Time Compute

You do the work. Big Tech takes the model.

Self Evolving is the Endgame or final destiny

Uncensor any LLM with abliteration

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch