Google Introduces Groundsource: Extracting Disaster Data from News Reports with Gemini
Key Takeaways
- ▸Groundsource uses Gemini to automatically extract flood data from news reports, creating a dataset 260x larger than existing humanitarian databases like GDACS
- ▸The open-access flash floods dataset covers 150+ countries with 2.6 million records, providing crucial training data for improved forecasting models
- ▸The methodology addresses the "data desert" problem where traditional satellite and sensor networks fail to capture localized, quick-moving disasters
Summary
Google Research has unveiled Groundsource, a new scalable methodology that leverages Gemini to transform unstructured global news reports into structured, actionable historical data on natural disasters. The first application focuses on urban flash floods, resulting in an open-access dataset comprising 2.6 million historical flood events spanning more than 150 countries from 2000 to present. This addresses a critical "data desert" in flood forecasting, where traditional monitoring systems like satellites and sensor networks struggle with cloud interference, revisit times, and the tendency to only capture large, long-lasting events. The same Groundsource methodology could potentially be applied to build historical datasets for other natural hazards, accelerating global crisis resilience and disaster preparedness efforts worldwide.
- The same AI-powered extraction framework can be adapted to build historical datasets for other natural hazards like earthquakes, hurricanes, and droughts
Editorial Opinion
Groundsource represents a pragmatic and potentially transformative application of large language models—leveraging Gemini to unlock value from existing but scattered news data rather than relying solely on expensive infrastructure. By democratizing access to historical disaster data, this approach could meaningfully improve forecasting for underserved regions and help communities prepare for increasingly severe climate-related hazards. The open-source methodology's extensibility to other disaster types hints at Google's ambition to reshape how we gather and analyze critical societal data.



