𧬠OncoAgent v1.0 β 27B (Tier 2)
Advanced Reasoning Model for Complex Oncology Cases
AMD Developer Hackathon 2026 Β· Deployed on AMD Instinctβ’ MI300X Β· ROCm 7.2
Model Description
OncoAgent v1.0 27B is the Tier 2 (advanced reasoning) model in the OncoAgent multi-agent oncology triage system. It leverages the full capacity of Qwen/Qwen3.6-27B with a specialized clinical oncology system prompt and RAG-grounded inference.
This model is activated for complex cases that require deeper reasoning:
- Multi-line therapy planning (Stage III/IV cancers)
- Rare tumor types with limited guideline coverage
- Cases requiring cross-guideline synthesis (NCCN + ESMO)
- Differential diagnosis with conflicting biomarkers
Architecture Role
In the OncoAgent dual-tier architecture, the 27B model is the "deep thinker":
Clinical Case β Router Agent
β
βββ Simple/Common β [Tier 1: 9B LoRA] β Fast Triage
β
βββ Complex/Rare β [Tier 2: 27B] β Deep Analysis
β
β
Specialist Agent
β
β
Critic (Reflexion Loop)
β
β
Validated Recommendation
Routing Criteria (Tier 1 β Tier 2 Escalation)
| Trigger | Example |
|---|---|
| Stage III/IV disease | Metastatic breast cancer |
| Rare tumor types | Merkel cell carcinoma |
| Multi-drug regimens | Combination immunotherapy |
| Conflicting data | HER2-low with BRCA mutation |
| Low RAG confidence | Cross-encoder score < 0.70 |
Configuration
This model uses the base Qwen3.6-27B with OncoAgent's specialized system prompt and Corrective RAG pipeline. The configuration includes:
| Parameter | Value |
|---|---|
| Base Model | Qwen/Qwen3.6-27B |
| Precision | BF16 (native MI300X Matrix Cores) |
| Context Window | 32,768 tokens |
| Serving Engine | vLLM with PagedAttention |
| GPU Memory | ~55% of MI300X 192GB HBM3 |
| Tensor Parallelism | 1 (single MI300X) |
System Prompt
You are OncoAgent-Specialist, a board-certified oncologist AI assistant.
You provide evidence-based treatment recommendations grounded EXCLUSIVELY
in the retrieved clinical guidelines (NCCN/ESMO).
RULES:
1. NEVER invent treatments. If the evidence is not in the provided context,
state: "InformaciΓ³n no concluyente en las guΓas provistas."
2. Always cite the guideline source (NCCN/ESMO) and evidence category.
3. Structure your response with: Clinical Summary, Diagnostic Findings,
Treatment Recommendation, and Evidence Level.
4. Consider comorbidities, contraindications, and patient-specific factors.
5. For Stage IV cases, include discussion of clinical trial eligibility.
vLLM Deployment (AMD MI300X)
# Serve Tier 2 on MI300X
python -m vllm.entrypoints.openai.api_server \
--model Qwen/Qwen3.6-27B \
--dtype bfloat16 \
--tensor-parallel-size 1 \
--gpu-memory-utilization 0.55 \
--max-model-len 32768 \
--port 8001
Dual-Model Deployment
# Run both tiers simultaneously on MI300X (192GB HBM3)
# Tier 1 (9B): ~45% GPU memory β Port 8000
# Tier 2 (27B): ~55% GPU memory β Port 8001
bash deploy/start_vllm.sh both
Safety Features
OncoAgent v1.0 27B operates within a multi-layered safety framework:
- Anti-Hallucination Policy β Model is constrained to RAG-retrieved context only
- Reflexion Critic Loop β Output is validated by a dedicated Critic agent
- Diagnostic Rigor Check β Treatment recommendations require confirmed pathology
- PHI Sanitization β Zero patient health information in logs
- HITL Gate β Stage IV cases can trigger human-in-the-loop review
Links
- π Demo: HF Space
- π GitHub: maximolopezchenlo-lab/OncoAgent
- π Tier 1 Model: OncoAgent-v1.0-9B
- π Dataset: OncoAgent-Clinical-266K
Citation
@misc{oncoagent2026,
title={OncoAgent: Multi-Agent Oncology Triage System},
author={Lopez Chenlo, Maximo},
year={2026},
howpublished={AMD Developer Hackathon 2026},
url={https://github.com/maximolopezchenlo-lab/OncoAgent}
}
License
Apache 2.0 β This model configuration is for research and educational purposes only. Not intended for direct clinical use without professional medical oversight.
Model tree for lablab-ai-amd-developer-hackathon/OncoAgent-v1.0-27B
Base model
Qwen/Qwen3.6-27B