Opening Raster Stack without 'No Data Values' in GDAL with Python

Question 1

When I open a raster stack with GDAL and call it as a numpy array, lines with 'no data values' also appear. Since I do not want to include these 'no data values' (mine is 128) in the calculations I will make, I am looking for a way to prevent.

Is there a way to prevent 'no data values' from getting into numpy arrays when opening the raster stack? Or what would you recommend?

My codes are here:

outvrt = ('result/raster_stack_vrt.tif')
outtif = ('result/raster_stack.tif')
tifs = glob.glob('data/*.tif')
outds = gdal.BuildVRT(outvrt, tifs, separate = True)
outds = gdal.Translate(outtif, outds)

Question 2

> import rasterio 
> import numpy as np

You can create a mask with numpy:

Open the raster like a numpy array then run this code and plot the raster.

Blockquote

raster = rasterio.open(inputpath_raster)
raster = raster.read(1)
value = 0
raster = raster.astype('float32') # You can change the format
raster_copy = copy.copy(raster)
raster_copy[raster == value] = np.nan # Value equal 'nan value'
raster_copy[raster > value] = 1
raster_nan = raster_copy * raster

The process is something like this, I think you have to repeat the process por each band in your stack

enter image description here

Question 3

No data value is 128

Question 4

The most efficient way to do this is with numpy and masked arrays:

import numpy as np
import rasterio as rio
src = rio.open(file.tif)
arr = src.read(1)
masked_arr = np.where(arr==128, np.nan, arr)

Then you can treat the array as you usually would when dealing with nan values. You can mask the array if you like:

https://numpy.org/devdocs/reference/maskedarray.generic.html

But if your calculations are simple you could use the numpy built in nan calculations:

numpy.nanmean(masked_arr)

Helios Helios 4472 silver badges9 bronze badges · Answer 1 · 2022-03-19 23:12:55Z

> import rasterio 
> import numpy as np

You can create a mask with numpy:

Open the raster like a numpy array then run this code and plot the raster.

Blockquote

raster = rasterio.open(inputpath_raster)
raster = raster.read(1)
value = 0
raster = raster.astype('float32') # You can change the format
raster_copy = copy.copy(raster)
raster_copy[raster == value] = np.nan # Value equal 'nan value'
raster_copy[raster > value] = 1
raster_nan = raster_copy * raster

The process is something like this, I think you have to repeat the process por each band in your stack

enter image description here

No data value is 128

GeoMonkey
– GeoMonkey

2023年06月04日 11:12:20 +00:00
Commented Jun 4, 2023 at 11:12

GeoMonkey GeoMonkey 1,55913 silver badges29 bronze badges · Answer 2 · 2023-06-04 11:10:17Z

The most efficient way to do this is with numpy and masked arrays:

import numpy as np
import rasterio as rio
src = rio.open(file.tif)
arr = src.read(1)
masked_arr = np.where(arr==128, np.nan, arr)

Then you can treat the array as you usually would when dealing with nan values. You can mask the array if you like:

https://numpy.org/devdocs/reference/maskedarray.generic.html

But if your calculations are simple you could use the numpy built in nan calculations:

numpy.nanmean(masked_arr)

Stack Exchange Network

Opening Raster Stack without 'No Data Values' in GDAL with Python

2 Answers 2

Your Answer

Sign up or log in

Post as a guest

Post as a guest

Linked

Hot Network Questions

Opening Raster Stack without 'No Data Values' in GDAL with Python

2 Answers 2

Your Answer

Sign up or log in

Post as a guest

Post as a guest

Linked

Related

Hot Network Questions