🇦🇺 AusCyberBench Evaluation Dashboard
Australia's First LLM Cybersecurity Benchmark • 13,449 Tasks • 25 Open Models
Evaluate proven open language models on Australian cybersecurity knowledge, including the
Essential Eight, ISM controls, the Privacy Act, the SOCI Act, and ACSC threat intelligence.
✅ Tested, recommended models: Qwen2.5-3B (55.6%), DeepSeek (55%), TinyLlama (33%)