Parameters of calculation and conception of graphic
Number of events, network length, thresholds, and road user
1️⃣ Number of events. We indicate 3,402 as the number of road fatalities in France in 2023 (ONISR), but we could also include the number of severe, minor injuries, or uninjured cases.
2️⃣ Road network length. Here, we can specify 1 or 2,300,000 km of roads, depending on the official count or that of OpenStreetMap. We could also focus on the entire network or a specific part (department, region).
3️⃣ Thresholds. Next, we specify the upper quantiles in red for high-risk roads and lower quantiles in green for safe roads. We use 90 and 10%, but 99 and 1% are also possible.
4️⃣ Road user. Since the accident probability varies by user type, we can also change the user to among car driver, motorcyclist, cyclist, and pedestrian.
Then we derive the number of road segments (making ~100 m each), the average score (log-normalized), and the ratios to the baseline for high and low-risk thresholds.
➡️ The x-axis shows the range (min-max) of scores obtained across the French road network, composed of 23 million segments. Since the probabilities are small, the scores are logarithmically scaled.
⬆️ The y-axis represents the cumulative distribution (CDF), ranging from 0% to 100%. The CDF follows a sigmoid (normal) shape.
☰ Horizontal lines indicate the thresholds. The area under the curve is shaded green for the lower decile (low risk) and red for the upper decile (high risk).
━ The thick curve represents the theoretical distribution of the risk score, while the background curve shows the actual empirical distribution across the French network.
📌 Note that the empirical distribution is divided into 1,000 steps, with each step representing therefore 0.1% of the values.