How we source & present this data
Travel-safety numbers are easy to find and easy to get wrong. A rate quoted without its definition, its year, or its exact geography can mislead more than it informs. This project is built around a single rule: a figure is only as trustworthy as its documentation trail. This page explains what that means in practice — including the things we choose not to show you.
The four questions behind every number
Before any figure appears on a city page, it must be able to answer four questions. If it can’t, we don’t present it as a confident value.
- How was it counted? The definition and collection method used.
- By whom? The named source, and where it sits in our hierarchy.
- When? The reference year — the year the data describes, not when it was published.
- For what exact place? The precise geographic unit — city, municipality, metro area, state, or country.
Every figure on the site is drillable: expand “What’s behind this number” on any cell to see all of this, for each contributing source.
Where our numbers come from
We work top-down through a source hierarchy and only fall to a lower tier when a higher one doesn’t cover the place or metric.
- Primary government statistical agencies — the bodies legally responsible for counting (e.g. national statistics offices, city police departments). Least filtered, most traceable.
- Intergovernmental bodies — UNODC, PAHO, UN Women. They standardize across countries but add a layer of processing.
- Academic & peer-reviewed — analytically rigorous, but often lags two to three years.
- NGOs with transparent methodology — used only when they publish their raw data and methods.
- Aggregators — used for discovery and cross-checking only; we always trace back to the cited primary source before storing a figure.
What we exclude: figures sourced from travel blogs, tourism boards, real-estate sites, security-company marketing, or news articles that don’t cite a primary source. These have incentives to skew, or no documented method at all.
Every figure carries its provenance
For each source behind a figure we store and display:
- Source name & tier
- Link to the source (and an archived copy)
- Access date
- Reference year
- Exact geographic unit & place name
- The definition used
- Collection method
- The dataset version we pulled
- The verbatim text the number came from
We store both the raw count and the rate per 100,000 wherever a source provides them, so a population-normalized figure can always be checked against the underlying count.
Some sources publish only a count (e.g. a city’s police records), which isn’t comparable between places of different sizes. In those cases we compute the rate ourselves — count divided by an official population for the exact same area — and label it as a computed figure. The drill-down shows both sources (the count and the population) and the math, and we only do this when the population is for the same geographic unit; we never divide by a different area’s population to manufacture a rate.
When sources disagree, we show the disagreement
Crime is measured differently by different institutions — a criminal-justice source and a public-health source can count different things. When two credible sources give different values for the same figure, we do not silently pick a winner. We show the range and flag it as a conflict, because the disagreement is often more informative than a single tidy number. You can open the drill-down to see each source’s value side by side.
Comparing cities fairly
Most cross-city comparisons fail in predictable ways. We build a guard against each:
- Geographic-unit mismatch
- A “city” figure and a “municipality” or “metro-area” figure aren’t the same thing. We store the exact unit with every figure and surface it in the drill-down. When several figures exist for a place we show the one closest to the city itself; many official sources only report a wider area (the surrounding municipality, metro, or whole country), so when that’s all we have we still show it but badge it as covering a larger area — a figure to read with that in mind, especially when comparing against a true city-level number.
- Reference-year mismatch
- Comparing a 2024 figure to a 2020 one is a false comparison. When two compared cities’ figures are more than one year apart, the comparison row is visibly flagged.
- Definitional mismatch
- “Femicide,” “sexual assault,” and even “homicide” are defined differently across legal systems. We record the definition each source used and align to the UNODC ICCS standard as a reference point.
- Reporting-rate differences
- Police-recorded crime and victimization surveys measure different populations of incidents. A low reported rate in a place with weak institutions may mean low reporting, not low crime. We tag each figure’s collection method so these are never treated as interchangeable.
What we deliberately don’t show
- Unsourced numbers. If a figure can’t be traced to a retrievable source and the exact passage it came from, it doesn’t appear as a value.
- “No reliable data” is shown as itself — not as a zero, a blank, or a guess. Missing data is information; many cities genuinely lack vetted figures for some categories, and we say so plainly.
- False precision. We never present a number as more exact than its source supports. Where methodology is weak or definitions differ, we show a caveat alongside the figure rather than a clean-looking but misleading value.
- Resolved conflicts. We don’t average away or hide disagreement between sources.
How current the data is
Different metrics go stale at different rates, so we refresh them on different schedules. Travel advisories can change quickly and are checked frequently. Homicide, robbery, and femicide statistics are released roughly annually, so they’re refreshed on a slower cycle. Government datasets are sometimes revised after the fact, so we record exactly which release we pulled, and every city page shows when its data was last generated.
The badges you’ll see
- conflict Sources disagree on this figure — the range is shown, and each source is listed in the drill-down.
- year mismatch The figures being compared come from reference years more than one year apart.
- wider area The figure shown covers a larger area than the city itself (e.g. its municipality, metro, or country) because no city-level source was available — the exact unit is in the drill-down.
- computed rate We calculated this per-100,000 rate ourselves from a count and an official population for the same area; the drill-down shows both sources and the math.
- Tier 1 Where a source sits in the hierarchy above (1 = primary government agency).
- fixture Sample/illustrative data used while the project is in early development — not a verified figure.
- No reliable data — no vetted source exists for this figure yet; deliberately distinct from a value of zero.
- Coming soon An entire metric category we haven’t built the data pipeline for yet — distinct from “no reliable data,” which means we looked and found none for that city. These flip on automatically as their sources come online.
In short: we’d rather show you a documented range, a caveat, or an honest “we don’t know yet” than a confident number you can’t check.