Best Results per System
This view filters all results by System, Workload, and LLM, then shows the fastest tested setup for each quantization (Quant).
For more context, see Systems and All Results.
Each bar shows the total time taken to process a prompt of the selected workload length and then generate 500 tokens. Shorter bars are faster. Click a bar to open its detail page.
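The selection described above can be sketched in a few lines of Python. This is only an illustration: the page does not document its data model, so the `Result` class, its field names, and the `best_per_quant` helper are all hypothetical. The sketch assumes each result records the prompt-processing time and the time to generate 500 tokens separately, and that the bar length is their sum.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Result:
    # Hypothetical record layout; the real site's schema is not documented here.
    system: str
    workload: str
    llm: str
    quant: str
    setup: str
    prompt_seconds: float      # time to process the prompt
    generation_seconds: float  # time to generate 500 tokens

    @property
    def total_seconds(self) -> float:
        # Assumed bar length: prompt processing plus generation time.
        return self.prompt_seconds + self.generation_seconds


def best_per_quant(results, system, workload, llm):
    """Return {quant: fastest Result} for the selected System, Workload, and LLM."""
    best = {}
    for r in results:
        # Skip results that do not match all three filters.
        if (r.system, r.workload, r.llm) != (system, workload, llm):
            continue
        current = best.get(r.quant)
        # Keep the result with the smallest total time for each quant.
        if current is None or r.total_seconds < current.total_seconds:
            best[r.quant] = r
    return best
```

Calling `best_per_quant(results, system, workload, llm)` yields one entry per Quant, which corresponds to one bar each. An empty dictionary means that no results matched the selected filters.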