I have a zarr store of weather data with 1 hr time interval for the year 2022. So 8760 chunks. But there are data only for random days. How do i check which are the hours in 0 to 8760, the data is available? Also the store is defined with "fill_value": "NaN",
I am iterating over each hour and checking for all nan as below (using xarray
) to identify if there is data or not. But its a very time consuming process.
hours = 8760
for hour in range(hours):
if not np.isnan(np.array(xarrds['temperature'][hour])).all():
print(f"data available in hour: {i}")
is there a better way to check the data availablity?
source https://stackoverflow.com/questions/75521500/better-way-to-identify-chunks-where-data-is-available-in-zarr
Comments
Post a Comment