This document details the data engineering, cleaning, and merging operations performed to construct the computational indicators for disaster resilience, primarily focusing on the Disaster Impact Index (DII) and the Resilience Recovery Score (RRS). These operations prepared the final analytical datasets used in the visualisations.
The operations for DII were executed in DII/DataCleaningDII.ipynb to standardise natural disaster records and integrate them with macroeconomic and demographic indicators.
Year, Start Month/Day, and End Month/Day into unified startDate and endDate columns.coordinates column by combining Latitude and Longitude.affected_population, derived by calculating the percentage of Total Deaths relative to the total national population.DII_v5.csv) contained fully integrated event, economic, and demographic data.The operations for RRS were executed in RRS/DataCleaningRRS.ipynb to model the post-disaster GDP rebound dynamics and integrate them with institutional and human development scores.
GovIndex by adding 2.5 and scaling by 20. Converted standard errors into percentage values (ErrorPercent).HDI_percent).pycountry library.recovery_years, representing the number of years required for post-disaster GDP growth to return to or exceed the pre-disaster baseline. Non-recovering states were capped at the current year differential.GDP_GrowthPre (3-year mean before disaster) and GDP_GrowthPost (3-year mean after disaster).The final Resilience Recovery Score (RSS) was computed using the formal project equation:
RSS = ((GDP_GrowthPost - GDP_GrowthPre) / recovery_years) + ((HDI + GovIndex_ErrorPercent) / 2)
Missing values across mandatory macroeconomic and institutional features were dropped, resulting in RSS_v9.csv.
To produce the final master dataset for interactive visual analysis, DII_v5.csv and RSS_v9.csv were row-synchronised:
ID) to account for multiple disasters per country per year.Year, Country, and ID, culminating in RSS_v11.csv (7,038 valid disaster-year observations).Explore the results of these data operations in our interactive visualisations: