Downsampling Geotiff using summation - Gdal/Numpy

Question 1

I am trying to downsample a 1 square km raster dataset to a much larger (.5 degree x .66 degree) dataset by summing all of the pixel values within this large grid cell.

gdal_warp does not contain a summation resampling method so I'm wondering if anyone has figured this out before.

Question 2

Depending on the version of GDAL, there are a few different resample options available; see gdalwarp.

GDAL 1.10 or later using `-r average`

average resampling, computes the weighted average of all non-NODATA contributing pixels

This isn't tested, but should look something like:

gdalwarp -t_srs EPSG:4326 -tr 0.5 0.66 -r average fine_one_sq_km.tif coarse_average.tif

Then to get the sum, multiply the average by the number of pixels of the fine resolution raster in one pixel of the coarse resolution raster, which hopefully is constant (you could assume it is).

GDAL 3.1 or later using `-r sum`

compute the weighted sum of all non-NODATA contributing pixels

This should look like this:

gdalwarp -t_srs EPSG:4326 -tr 0.5 0.66 -r sum fine_one_sq_km.tif coarse_sum.tif

Otherwise, scipy.ndimage.measurements.sum_labels (or sum for older versions) can be used to aggregate multidimensional sums. But this may rely on perfect matchings between grids.

Question 3

That would work but I would have to create a labeling matrix the same size as my larger resolution grid. I can do that but I was hoping there was something in numpy/scipy/gdal that looked like: gdalwarp ... -r sum ...

Question 4

Multiplying the average by the number of 1 square kilometers cells could work for this (accuracy isn't my top priority). Thanks!

Question 5

Yes: avg = sum / count, so sum = avg * count.

Question 6

...I guess I over thought the problem...

Question 7

Note: careful with rasters with null values. The average is calculated without considering nulls, so then to multiply that value by the number of input pixels contributing to a larger output pixel would give you an incorrect sum of the input. Setting regions of no data to 0 in the source dataset is probably a sensible idea in most cases where summation is meaningful, so that it influences the average value appropriately. Having a sum resampling method would actually be an excellent contribution to gdalwarp, since it would consider no data appropriately and automatically.

Question 8

Apparently, gdalwarp got a new sum method in GDAL release 3.1.0, see release notes. Adapting from the above solution:

gdalwarp -t_srs EPSG:4326 -tr 0.5 0.66 -r sum fine_one_sq_km.tif coarse_sum.tif

Mike T Mike T 42.7k10 gold badges131 silver badges194 bronze badges · Accepted Answer · 2015-06-30 03:00:43Z

Depending on the version of GDAL, there are a few different resample options available; see gdalwarp.

GDAL 1.10 or later using `-r average`

average resampling, computes the weighted average of all non-NODATA contributing pixels

This isn't tested, but should look something like:

gdalwarp -t_srs EPSG:4326 -tr 0.5 0.66 -r average fine_one_sq_km.tif coarse_average.tif

Then to get the sum, multiply the average by the number of pixels of the fine resolution raster in one pixel of the coarse resolution raster, which hopefully is constant (you could assume it is).

GDAL 3.1 or later using `-r sum`

compute the weighted sum of all non-NODATA contributing pixels

This should look like this:

gdalwarp -t_srs EPSG:4326 -tr 0.5 0.66 -r sum fine_one_sq_km.tif coarse_sum.tif

Otherwise, scipy.ndimage.measurements.sum_labels (or sum for older versions) can be used to aggregate multidimensional sums. But this may rely on perfect matchings between grids.

That would work but I would have to create a labeling matrix the same size as my larger resolution grid. I can do that but I was hoping there was something in numpy/scipy/gdal that looked like: gdalwarp ... -r sum ...
Multiplying the average by the number of 1 square kilometers cells could work for this (accuracy isn't my top priority). Thanks!
Note: careful with rasters with null values. The average is calculated without considering nulls, so then to multiply that value by the number of input pixels contributing to a larger output pixel would give you an incorrect sum of the input. Setting regions of no data to 0 in the source dataset is probably a sensible idea in most cases where summation is meaningful, so that it influences the average value appropriately. Having a sum resampling method would actually be an excellent contribution to gdalwarp, since it would consider no data appropriately and automatically.

Stack Exchange Network

Downsampling Geotiff using summation - Gdal/Numpy

2 Answers 2

GDAL 1.10 or later using `-r average`

GDAL 3.1 or later using `-r sum`

Your Answer

Sign up or log in

Post as a guest

Post as a guest

Hot Network Questions

Downsampling Geotiff using summation - Gdal/Numpy

2 Answers 2

GDAL 1.10 or later using -r average

GDAL 3.1 or later using -r sum

Your Answer

Sign up or log in

Post as a guest

Post as a guest

Related

Hot Network Questions

GDAL 1.10 or later using `-r average`

GDAL 3.1 or later using `-r sum`