Arena Score

A single score (0-100) that summarizes the quality of a backtest — so you don't have to interpret every individual metric.

Example: how the Arena Score appears in your backtest results.

Laden...

Why 4 dimensions?

A single metric is always misleading. 40% CAGR sounds great — not if the strategy only made 5 trades and Buy & Hold would have done 60%. Arena Score deliberately weighs Return, Risk, Consistency, and statistical foundation.

Sample Size as its own dimensionthat's the unique contribution over other tools. A 5-trade backtest can never score more than 25 points, no matter how good CAGR and Win Rate look. This automatically encourages statistically sound tests.

The 4 dimensions

📈

Return (0-30)

How much your strategy outperforms the average Buy & Hold investor (not just the day-1 buyer).

max 30
ConditionPointsMeaning
Outperformance < 0%0Schlechter als Ø B&H
Outperformance < 5%5–10Leicht besser
Outperformance < 15%10–20Deutlich besser
Outperformance < 30%20–27Stark besser
Outperformance ≥ 30%30Außergewöhnlich
⚖️

Efficiency (0-25)

Risk-adjusted return via MAR Ratio: CAGR ÷ |Max Drawdown|. Higher = less pain per percent of gain.

max 25
ConditionPointsMeaning
MAR < 0.30Sehr volatil
MAR < 0.55Schwach
MAR < 1.010Akzeptabel
MAR < 1.517Gut
MAR < 2.021Sehr gut
MAR ≥ 2.025Exzellent
🎯

Consistency (0-25)

Win Rate (0-10 pts) + Profit Factor (0-15 pts). Rewards strategies where winners reliably dominate losers.

max 25
ConditionPointsMeaning
PF < 1.0PF: 0Verliert Geld
PF < 1.3PF: 3Knapp profitabel
PF < 1.5PF: 7Solide
PF < 2.0PF: 11Stark
PF ≥ 2.0PF: 15Exzellent — €2 Gewinn je €1 Verlust
WR < 40%WR: 0Schwache Trefferquote
WR 50–55%WR: 8Über Zufall
WR ≥ 55%WR: 10Hohe Trefferquote
📊

Sample Size (0-20)

Statistical significance — two dimensions combined: absolute trade count (0-10) + test duration (0-10). Both must be solid to reach max. Hard floor: <5 trades or <1 year = 0. This accounts for fact that weekly/monthly strategies naturally generate fewer trades but each trade spans more market time.

max 20
ConditionPointsMeaning
<5 Trades ODER <1 Jahr0Hard Floor — Score automatisch F
Trades 5-9+2Minimal (Trade-Count)
Trades 10-19+5Basic (Trade-Count)
Trades 20-29+7Solide (Trade-Count)
Trades 30-49+9Stark (Trade-Count)
Trades ≥50+10Robust (Trade-Count max)
Jahre 1-2+2Kurz (Zeit)
Jahre 2-3+5Mittelfristig (Zeit)
Jahre 3-4+7Mehrjährig (Zeit)
Jahre 4-6+9Langfristig (Zeit)
Jahre ≥6+10Multi-Cycle (Zeit max)

The grades

S

Excellent

85-100

A

Very Good

70-84

B

Good

55-69

C

Mixed

40-54

D

Weak

25-39

F

Not Recommended

0-24

Where do you see the Score?

Important

Arena Score only evaluates the past backtest. A high score does not guarantee future performance — but a low score is a reliable warning signal. Use it as a first filter, not as a decision engine.