Files
covid19-age-stratified-ifr/README.md
T
2020-06-10 09:45:08 -07:00

75 lines
3.6 KiB
Markdown
Raw Blame History

This file contains invisible Unicode characters
This file contains invisible Unicode characters that are indistinguishable to humans but may be processed differently by a computer. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.
# Calculating the age-stratified infection fatality ratio (IFR) of COVID-19
*Updated: 10 June 2020*
Author: Marc Bevand
The largest serological prevalence survey of COVID-19 was conducted by Spain
during the second round of a study that analyzed 63 564 samples between 18 May
2020 and 01 June 2020. We used its [provisional results][sero] published on 03
June to calculate the overall and age-stratified IFR of COVID-19 with the
Python script `calc_ifr.py`:
```
$ ./calc_ifr.py
Ages 0 to 9: 115013 infected, 4 deaths, 0.003% IFR
Ages 10 to 19: 177929 infected, 7 deaths, 0.004% IFR
Ages 20 to 29: 212099 infected, 32 deaths, 0.015% IFR
Ages 30 to 39: 281290 infected, 86 deaths, 0.030% IFR
Ages 40 to 49: 447942 infected, 287 deaths, 0.064% IFR
Ages 50 to 59: 410213 infected, 874 deaths, 0.213% IFR
Ages 60 to 69: 334709 infected, 2404 deaths, 0.718% IFR
Ages 70 to 79: 270572 infected, 6451 deaths, 2.384% IFR
Ages 80 to 89: 131703 infected, 11150 deaths, 8.466% IFR
Ages 90 to 199: 46631 infected, 5827 deaths, 12.497% IFR
Ages 0 to 199: 2428102 infected, 27121 deaths, 1.117% IFR
```
The average IFR for Spain is **1.117%**. However the true IFR may be higher due
to right-censoring and under-reporting of deaths.
The Spanish serological study remains the largest published study available to
this day. The age-stratified IFR was calculated from three sources:
1. Detailed *prevalence data for age brackets*, from the [serosurvey][sero] (table 1)
1. *Total deaths* and *deaths per age bracket* from the [Ministry of Health's daily report for 29 May][daily] (table 2 and table 3)
1. *Population pyramid* for Spain, from [worldpopulationreview.com][wpop]
In order to minimize right-censoring, the parameters *total deaths* and *deaths
per age bracket* should be obtained from a point in time as close as possible
to when the serosurvey was conducted (18 May to 01 June.) We found only two
Ministry of Health reports in this time period that document deaths per age
bracket: [18 May][dailyalt], [29 May][daily]. However the Ministry of Health
has made significant corrections to deaths statistics on 25 May by subtracting
approximately 2 000 deaths. Therefore we trusted the statistics from 29 May over
those of 18 May.
Important detail to note: there were 27 121 total deaths, however age information
was only available for 20 585 deaths, and was missing for 6 536 deaths.
We assume that these 6 536 deaths were distributed proportionally—not equally—among age
brackets, which seems to be a reasonable assumption.
# Applying the age-stratified IFR to other countries
The script `calc_ifr.py` is also able to apply the age-stratified IFR to
another population pyramid, thus calculating the expected average IFR for other
countries.
In the second half of the script, edit `pyramid_target` with the demographics data.
As an example, we supply pyramid data for the United States and calculate an IFR of **0.658%**:
```
$ ./calc_ifr.py
[...]
IFR on target country assuming disease prevalence equal among ages: 0.658%
```
However IFR is highly dependent on factors other than age: availability
of healthcare, population health, etc, so this estimate should be interpreted
with caution.
[sero]: https://portalcne.isciii.es/enecovid19/ene_covid19_inf_pre2.pdf
[daily]: https://www.mscbs.gob.es/profesionales/saludPublica/ccayes/alertasActual/nCov-China/documentos/Actualizacion_120_COVID-19.pdf
[dailyalt]: https://www.mscbs.gob.es/profesionales/saludPublica/ccayes/alertasActual/nCov-China/documentos/Actualizacion_109_COVID-19.pdf
[wpop]: https://worldpopulationreview.com/countries/spain-population/