Data-Based Analysis, Modelling and Forecasting of the COVID-19 outbreak
Cleo Anastassopoulou, Lucia Russo, Athanasios Tsakris, Constantinos Siettos
doi: https://doi.org/10.1101/2020.02.11.20022186
Abstract
Since the first suspected case of coronavirus disease-2019 (COVID-19) on December 1st, 2019, in Wuhan, Hubei Province, China, a total of 40,235 confirmed cases and 909 deaths have been reported in China up to February 10, 2020, evoking fear locally and internationally. Here, based on the publicly available epidemiological data for Hubei, China from January 11 to February 10, 2020, we provide estimates of the main epidemiological parameters. In particular, we provide estimation of the the case fatality and case recovery ratios, along with their 90% confidence intervals and on the basis of a Susceptible-Infected-Recovered-Dead (SIRD) model we provide estimations of the basic reproduction number (R0), the infection rate. By calibrating the parameters of the SIRD model to the reported data, we also attempt to forecast the evolution of the of the outbreak at the epicenter three weeks ahead, i.e. until February 29. As the number of infected individuals, especially of those with asymptomatic or mild courses, is suspected to be much higher than the official numbers, which can be considered only as a subset of the actual numbers of infected and recovered cases in the population, we have repeated the calculations under a second scenario that considers twenty times the number of confirmed infected cases and forty times the number of recovered, leaving the number of deaths unchanged. Based on the reported data, the expected value of R0 as computed considering the period from the 11th of January until the 18th of January, using the official counts of confirmed cases was found to be ~4.6, while the one computed under the second scenario was found to be ~3.2. Thus, based on the SIRD simulations, the estimated average value of R0 was found to be ~2.6. Our forecasting flashes a note of caution for the presently unfolding outbreak in China. Based on the official counts for confirmed cases, the simulations suggest that the cumulative number of infected could reach 140,000 (with an upper bound of 700,000) by February 29. Regarding the number of deaths, simulations forecast that on the basis of the up to the 10th of February data and estimations of the actual numbers of infected and recovered in the population, the death toll might exceed 3,200 by February 29. However, our analysis further reveals a significant decline of the case fatality ratio to which various factors may have contributed, such as the severe control measures taken in Hubei, China (e.g. quarantine and hospitalization of infected individuals), but mainly because of the fact that if the actual cumulative numbers of infected and recovered cases in the population are significantly higher (here of the order of twenty times) for the infected and ten times higher for the recovered than reported, thus resulting in a much lower case fatality ratio significantly below 1%.
*注,本文为预印本论文手稿,是未经同行评审的初步报告,其观点仅供科研同行交流,并不是结论性内容,请使用者谨慎使用.