SciPy 2024 - xCDAT (Xarray Climate Data Analysis Tools)

SciPy 2024 - xCDAT (Xarray Climate Data Analysis Tools)#

“A Python Package for Simple and Robust Analysis of Climate Data”

Presented by Tom Vo

This work is performed under the auspices of the U. S. DOE by Lawrence Livermore National Laboratory under contract No. DE-AC52-07NA27344.

Notebook Setup#

Create an Anaconda environment for this notebook using the command below:

conda create -n xcdat_scipy_2024 -c conda-forge xcdat=0.7.0 nco matplotlib ipython ipykernel cartopy nc-time-axis gsw-xarray jupyter jupyter_contrib_nbextensions rise wget

conda activate xcdat_scipy_2024

Then run:

jupyter contrib nbextension install --user

jupyter nbextension enable splitcell/splitcell

To open Jupyter Notebook GUI:

jupyter notebook

To print notebook as PDF:

jupyter nbconvert docs/demos/24-07-11-scipy-2024/scipy-2024.ipynb --to html
# Then "Print HTML to PDF" in your browser

This presentation is available on xCDAT’s Read The Docs Page#

Presentation QR Code

https://xcdat.readthedocs.io/en/latest/demos/24-07-11-scipy-2024/scipy-2024.html #

Before we start…#

A little about me#

Software Engineer at Lawrence Livermore National Laboratory (LLNL)
Energy Exascale Earth System Model (E3SM) and Simplifying ESM Analysis Through Standards (SEATS)
Lead developer of xCDAT
Contributor and DevOps engineer for various E3SM tools

Tom portrait

A big shoutout to the xCDAT core development team including#

Stephen Po-Chedley
Jason Boutte
Jill Zhang
Jiwoo Lee

With thanks to Peter Gleckler, Paul Durack, Karl Taylor, and Chris Golaz.

An Overview of this Talk#

Objective: Learn about the grounds-up development of an open-source Python package targeted at a specific scientific domain

Driving force behind xCDAT
Scope and mission of xCDAT
Design philosophy and key features
Technical demo of an end-to-end analysis workflow
Parallelism with Dask
How to get involved

The Driving Force Behind xCDAT#

Analysis of climate data frequently requires a number of core operations. For example:
- Reading and writing netCDF files
- Regridding
- Spatial and temporal averaging
Highly performant operations to handle the growing volume of climate data
- Larger pool of data products
- Increasing spatiotemporal resolution of model and observational data

CDAT, the predecessor to xCDAT#

CDAT (Community Data Analysis Tools) library provided open-source climate data analysis and visualization packages for over 20 years

The present-day challenge: CDAT is end-of-life as of December 2023#

A big issue for users and packages that depend on CDAT
A driving need for new analysis software

xCDAT addresses this need by combining the power of Xarray with geospatial analysis features inspired by CDAT.

What are the general goals of xCDAT?#

Offer similar core capabilities to CDAT
- e.g., geospatial averaging, temporal averaging, regridding
Use modern technologies in the library’s stack
- Capable of handling large datasets (e.g., parallelism, lazy operations)
Maintainable, extensible, and easy-to-use
- Python Enhancement Proposals (PEPs)
- Reproducible science
Foster open-source community
- Software sustainability
- Serve the needs of the climate community in the long-term
- Community engagement efforts (e.g., Pangeo, ESGF)

“N-D labeled arrays and datasets in Python”

Why is Xarray the core technology of xCDAT?

Mature widely adopted
Fiscal funding from NumFocus
Introduces labels in the form of dimensions, coordinates, and attributes on top of raw NumPy-like arrays
Intuitive, more concise, and less error-prone user experience

Key features of Xarray#

File I/O, indexing and selecting, interpolating, grouping, aggregating, parallelism (Dask), plotting (matplotlib wrapper)
Supports various file formats netCDF, Iris, OPeNDAP, Zarr, and more
Interoperability with scientific Python ecosystem such as NumPy, Dask, Pandas, and Matplotlib

Xarray Climate Data Analysis Tools

Team of climate scientists and software engineers from:

Focused on routine climate research analysis operations
Leverages other powerful Xarray-based packages such as xESMF, xgcm, and cf-xarray

CF xarray logo

xCDAT features for simple, robust, and less error-prone analysis code

Extension of xr.open_dataset() and xr.open_mfdataset() with post-processing options
Robust interpretation of Climate and Forecast (CF) Compliant metadata using cf-xarray
Generate missing bounds, center time coords, convert lon axis orientation
Geospatial weighted averaging
Temporal averaging, climatologies, departures
Horizontal regridding (extension of xesmf and Python port of regrid2)
Vertical regridding (extension of xgcm)

Spatial average chart Temporal average chart Vertical regridding chart

The Software Design Philosophy of xCDAT

Encourage software sustainability and reproducible science
Well-documented and configurable features allow scientists to rapidly develop robust, reusable, less-error prone, more maintainable code
Contribute to Pangeo’s effort of fostering an ecosystem of mutually compatible geoscience Python packages

xCDAT simplifies Xarray code for specific analysis operations#

A comparison of code to calculate global-mean, monthly anomalies.#

HTTP download link: https://esgf-data04.diasjp.net/thredds/fileServer/esg_dataroot/CMIP6/CMIP/E3SM-Project/E3SM-2-0/historical/r1i1p1f1/Amon/tas/gr/v20220830/tas_Amon_E3SM-2-0_historical_r1i1p1f1_gr_200001-201412.nc

[1]:

import matplotlib.pyplot as plt
from pathlib import Path

dpath = Path(
    "docs/demos/24-07-11-scipy-2024/tas_Amon_E3SM-2-0_historical_r1i1p1f1_gr_200001-201412.nc"
)
if not dpath.is_file():
    dpath = "http://esgf-data04.diasjp.net/thredds/dodsC/esg_dataroot/CMIP6/CMIP/E3SM-Project/E3SM-2-0/historical/r1i1p1f1/Amon/tas/gr/v20220830/tas_Amon_E3SM-2-0_historical_r1i1p1f1_gr_200001-201412.nc"

Code Comparison (Xarray vs. xCDAT)#

xCDAT uses less code, is more flexible, and is easier to read/write.

[2]:

import numpy as np
import xarray as xr

# 1. Open the dataset.
ds = xr.open_dataset(dpath)

# 2. Calculate monthly departures.
tas_mon = ds.tas.groupby("time.month")
tas_mon_clim = tas_mon.mean(dim="time")
tas_anom = tas_mon - tas_mon_clim

# 3. Compute global average.
coslat = np.cos(np.deg2rad(ds.lat))
tas_anom_wgt = tas_anom.weighted(coslat)
tas_anom_global = tas_anom_wgt.mean(dim="lat").mean(dim="lon")

# 4. Calculate annual averages.
# ncar.github.io/esds/posts/2021/yearly-averages-xarray/
mon_len = tas_anom_global.time.dt.days_in_month
mon_len_by_year = mon_len.groupby("time.year")
wgts = mon_len_by_year / mon_len_by_year.sum()

temp_sum = tas_anom_global * wgts
temp_sum = temp_sum.resample(time="YS").sum(dim="time")
denom_sum = (wgts).resample(time="YS").sum(dim="time")

tas_anom_global_ann = temp_sum / denom_sum

[3]:

import xcdat as xc

# 1. Open the dataset.
ds = xc.open_dataset(dpath)

# 2. Calculate monthly departures.
ds_anom = ds.temporal.departures("tas", freq="month")

# 3. Compute global average.
ds_anom_glb = ds_anom.spatial.average("tas")

# 4. Calculate annual averages
ds_anom_glb_ann = ds_anom_glb.temporal.group_average("tas", freq="year")

Plot the outputs#

[4]:

fig, (ax1, ax2) = plt.subplots(nrows=1, ncols=2)
fig.set_size_inches(12, 6)
fig.suptitle("Global Mean Monthly Anomalies", fontsize=16)

# Add the subplots
tas_anom_global_ann.plot(ax=ax1)
ds_anom_glb_ann.tas.plot(ax=ax2)

# Configure subplots
ax1.set_ylabel("Near-Surface Temperature Anomalies [K]")
ax2.set_ylabel("Near-Surface Temperature Anomalies [K]")
ax1.title.set_text("Xarray")
ax2.title.set_text("xCDAT")

../../_images/demos_24-07-11-scipy-2024_scipy-2024_24_0.png

Getting Started with xCDAT#

xCDAT is available for installation through Anaconda on the conda-forge channel
- Install command: conda install -c conda-forge xcdat
Check out xCDAT’s Read the Docs, which we strive to keep up-to-date
- https://xcdat.readthedocs.io/en/stable/

xCDAT docs

How to use xCDAT#

xCDAT extends Xarray Dataset objects via “accessor” classes.

accessor api

In the example above, custom spatial functionality is exposed by chaining the spatial accessor attribute to the Dataset object. This chaining enables access to the underlying spatial.average() method.

Accessors classes include:#

spatial – .average(), .get_weights()
temporal – .average(), .group_average(), .climatology(), .depatures()
regridding – horizontal(), vertical()
bounds – .get_bounds(), .add_bounds(), .add_missing_bounds()

xCDAT also provides general utilities as Python functions#

open_dataset(), open_mfdataset()
center_times(), decode_time()
swap_lon_axis()
create_axis()
create_grid()
get_dim_coords()
get_dim_keys()

Visit the API Reference page for a complete list: https://xcdat.readthedocs.io/en/latest/api.html

End-to-End Analysis and Visualization of CMIP Data using xCDAT#

Overview#

Use xcdat to perform computation and analysis on CMIP6 data from the E3SM v2 model.

Sections#

Setup Code
I/O
Horizontal Regridding
Vertical Regridding
Spatial Averaging
Temporal Computations

1. Setup Code#

[5]:

# This style import is necessary to properly render Xarray's HTML output with
# the Jupyer RISE extension.
# GitHub Issue: https://github.com/damianavila/RISE/issues/594
# Source: https://github.com/smartass101/xarray-pydata-prague-2020/blob/main/rise.css

from IPython.core.display import HTML

style = """
<style>
.reveal pre.xr-text-repr-fallback {
    display: none;
}
.reveal ul.xr-sections {
    display: grid
}

.reveal ul ul.xr-var-list {
    display: contents
}
</style>
"""


HTML(style)

[5]:

[6]:

import numpy as np
from xarray.coding.calendar_ops import _datetime_to_decimal_year as dt2decimal
import xcdat as xc
import cartopy.crs as ccrs
import matplotlib.pyplot as plt

2. I/O#

Use xc.open_dataset() to a single netCDF dataset as an xr.Dataset object.
API Documentation: https://xcdat.readthedocs.io/en/stable/generated/xcdat.open_dataset.html

Analyzing monthly tas (near-sea surface air temperature) data from 2000 to 2014.

[7]:

ds = xc.open_dataset(dpath, chunks={"time": "auto"})

ds.tas

[7]:

<xarray.DataArray 'tas' (time: 180, lat: 180, lon: 360)> Size: 47MB
dask.array<open_dataset-tas, shape=(180, 180, 360), dtype=float32, chunksize=(180, 180, 360), chunktype=numpy.ndarray>
Coordinates:
  * lat      (lat) float64 1kB -89.5 -88.5 -87.5 -86.5 ... 86.5 87.5 88.5 89.5
  * lon      (lon) float64 3kB 0.5 1.5 2.5 3.5 4.5 ... 356.5 357.5 358.5 359.5
    height   float64 8B ...
  * time     (time) object 1kB 2000-01-16 12:00:00 ... 2014-12-16 12:00:00
Attributes:
    standard_name:  air_temperature
    long_name:      Near-Surface Air Temperature
    comment:        near-surface (usually, 2 meter) air temperature
    units:          K
    cell_methods:   area: time: mean
    cell_measures:  area: areacella
    history:        2022-08-31T00:41:19Z altered by CMOR: Treated scalar dime...

4. Vertical Regridding#

xCDAT can also regrid in the vertical dimension.

Here we’ll grab some cloud fraction data (cl) and regrid it from model hybrid coordinate to pressure levels.

Documentation: https://xcdat.readthedocs.io/en/stable/examples/regridding-vertical.html#4:-Remap-cloud-fraction-from-model-hybrid-coordinate-to-pressure-levels

[13]:

data_url = "http://esgf-data04.diasjp.net/thredds/dodsC/esg_dataroot/CMIP6/CMIP/E3SM-Project/E3SM-2-0/historical/r1i1p1f1/Amon/cl/gr/v20220830/cl_Amon_E3SM-2-0_historical_r1i1p1f1_gr_200001-201412.nc"

[14]:

ds_3d = xc.open_dataset(data_url, chunks={"time": "auto"})

ds_3d.cl

[14]:

<xarray.DataArray 'cl' (time: 180, lev: 72, lat: 180, lon: 360)> Size: 3GB
dask.array<open_dataset-cl, shape=(180, 72, 180, 360), dtype=float32, chunksize=(7, 72, 180, 360), chunktype=numpy.ndarray>
Coordinates:
  * lev      (lev) float64 576B 0.9985 0.9938 0.9862 ... 0.0001828 0.0001238
  * lat      (lat) float64 1kB -89.5 -88.5 -87.5 -86.5 ... 86.5 87.5 88.5 89.5
  * lon      (lon) float64 3kB 0.5 1.5 2.5 3.5 4.5 ... 356.5 357.5 358.5 359.5
  * time     (time) object 1kB 2000-01-16 12:00:00 ... 2014-12-16 12:00:00
Attributes:
    standard_name:  cloud_area_fraction_in_atmosphere_layer
    long_name:      Percentage Cloud Cover
    comment:        Percentage cloud cover, including both large-scale and co...
    units:          %
    cell_methods:   area: time: mean
    cell_measures:  area: areacella
    history:        2022-08-30T15:06:03Z altered by CMOR: Inverted axis: lev.
    _ChunkSizes:    [  1  36  90 180]

Regrid the cl variable using regridder.vertical() with:

new_pressure_grid as the output grid
"linear" method
pressure_data as the target data.

[15]:

def hybrid_coordinate(p0, a, b, ps, **kwargs):
    return a * p0 + b * ps


pressure_data = hybrid_coordinate(**ds_3d.data_vars)
new_pressure_grid = xc.create_grid(z=xc.create_axis("lev", np.linspace(100000, 1, 13)))

ds_vr = ds_3d.regridder.vertical(
    "cl", new_pressure_grid, method="linear", target_data=pressure_data
)
ds_vr_res = ds_vr.compute()

/opt/miniconda3/envs/xcdat_scipy_2024/lib/python3.11/site-packages/xgcm/grid.py:989: FutureWarning: From version 0.8.0 the Axis computation methods will be removed, in favour of using the Grid computation methods instead. i.e. use `Grid.transform` instead of `Axis.transform`
  warnings.warn(

Finally, we plot the result:#

[16]:

ds_vr_zonal = ds_vr.isel(time=0).spatial.average("cl", axis=["X"]).squeeze()

ds_vr_zonal.cl.plot(cmap=plt.cm.RdBu_r)
plt.gca().invert_yaxis()

../../_images/demos_24-07-11-scipy-2024_scipy-2024_55_0.png

5. Spatial Averaging#

Area-weighted spatial averaging is a common technique to reduce dimensionality in geospatial datasets.
xCDAT can perform this calculation over full domains or regions of interest.

Calculate the spatial average of `tas` and store the results in a Python variable.#

API Documentation: https://xcdat.readthedocs.io/en/stable/generated/xarray.Dataset.spatial.average.html

[17]:

ds_global = ds.spatial.average("tas")

ds_global

[17]:

<xarray.Dataset> Size: 19kB
Dimensions:    (time: 180, bnds: 2, lat: 180, lon: 360)
Coordinates:
  * lat        (lat) float64 1kB -89.5 -88.5 -87.5 -86.5 ... 86.5 87.5 88.5 89.5
  * lon        (lon) float64 3kB 0.5 1.5 2.5 3.5 4.5 ... 356.5 357.5 358.5 359.5
    height     float64 8B ...
  * time       (time) object 1kB 2000-01-16 12:00:00 ... 2014-12-16 12:00:00
Dimensions without coordinates: bnds
Data variables:
    time_bnds  (time, bnds) object 3kB dask.array<chunksize=(180, 2), meta=np.ndarray>
    lat_bnds   (lat, bnds) float64 3kB dask.array<chunksize=(180, 2), meta=np.ndarray>
    lon_bnds   (lon, bnds) float64 6kB dask.array<chunksize=(360, 2), meta=np.ndarray>
    tas        (time) float64 1kB dask.array<chunksize=(180,), meta=np.ndarray>
Attributes: (12/48)
    Conventions:            CF-1.7 CMIP-6.2
    activity_id:            CMIP
    branch_method:          standard
    branch_time_in_child:   0.0
    branch_time_in_parent:  36500.0
    creation_date:          2022-08-31T00:41:19Z
    ...                     ...
    variant_label:          r1i1p1f1
    license:                CMIP6 model data produced by E3SM-Project is lice...
    cmor_version:           3.6.1
    tracking_id:            hdl:21.14100/5e95fb4d-19ba-4037-8000-818fe3955601
    version:                v20220830
    references:             Golaz, J.-C., L. P. Van Roekel, X. Zheng and co-a...

Now let’s plot the results (first 120 timesteps)#

Note that the spatial averager returns a dataset object so we still need to specify tas to plot the dataarray.

[18]:

ds_global.tas.isel(time=slice(0, 120)).plot()
plt.title("Global Average Surface Temperature")
plt.xlabel("Year")
plt.ylabel("Near Surface Air Temperature [K]")

[18]:

Text(0, 0.5, 'Near Surface Air Temperature [K]')

../../_images/demos_24-07-11-scipy-2024_scipy-2024_61_1.png

Calculate the near-surface air temperature (`tas`) in the Niño 3.4 region.#

Users can also specify their own bounds for a region. In this case, we specified keep_weights=True.

Full weight for grid cells entirely in the region
Partial weights for grid cells partially in the region

[19]:

ds_nino34 = ds_xesmf.spatial.average(
    "tas", lat_bounds=(-5, 5), lon_bounds=(190, 240), keep_weights=True
).compute()

Plot the Niño 3.4 region time series#

[20]:

dtime = dt2decimal(ds_nino34.time)  # decimal time

plt.figure(figsize=(10, 2))
plt.subplot(1, 2, 1)
plt.plot(dtime, ds_nino34["tas"].values)
plt.xlabel("Year")
plt.ylabel("Near-Surface Air Temperature [K]")
plt.title("Niño 3.4 time series")

# show the weights
map_proj = ccrs.PlateCarree(central_longitude=180)
ax = plt.subplot(1, 2, 2, projection=map_proj)
plt.pcolor(
    ds_nino34.lon,
    ds_nino34.lat,
    ds_nino34.lat_lon_wts.T,
    transform=ccrs.PlateCarree(),
    cmap=plt.cm.GnBu,
)
ax.set_extent([120, 300, -30, 30], crs=ccrs.PlateCarree())
ax.coastlines()
plt.colorbar(orientation="horizontal")
plt.title("Nino 3.4 Weights")

[20]:

Text(0.5, 1.0, 'Nino 3.4 Weights')

../../_images/demos_24-07-11-scipy-2024_scipy-2024_66_1.png

6. Temporal Computations with xCDAT#

In the examples below, we will performing temporal computations on the xarray.Dataset object using xCDAT.

6.1 Seasonal cycle mean#

In the global mean time series above, there are large seasonal swings in global near-surface air temperature. Here we compute the seasonal mean climatology.

Calculate the seasonal mean climatology for the `tas` variable.#

API Documentation: https://xcdat.readthedocs.io/en/stable/generated/xarray.Dataset.temporal.climatology.html

[21]:

ds_clim = ds.temporal.climatology("tas", freq="season")

Now we plot the season means#

[22]:

map_proj = ccrs.Robinson()
titles = ["DJF", "MAM", "JJA", "SON"]
plt.figure(figsize=(12, 10))
for i in range(4):
    plt.subplot(2, 2, i + 1, projection=map_proj)
    p = ds_clim.tas[i].plot(
        transform=ccrs.PlateCarree(),
        subplot_kws={"projection": map_proj},
        cbar_kwargs={"orientation": "horizontal"},
        cmap=plt.cm.RdBu_r,
        vmin=220,
        vmax=310,
    )
    ax = plt.gca()
    ax.coastlines()
    plt.title(titles[i])

../../_images/demos_24-07-11-scipy-2024_scipy-2024_73_0.png

6.2 Departures#

It can also be useful to show the departures (“anomalies”) from the climatological average.

In climatology, anomalies refer to the difference between the value during a given time interval and the long-term average value for that time interval.

For example, the difference between a January average surface air temperature and the average surface temperature over the last 30 Januaries.

Calculate the monthly mean departures for the `tas` variable.#

In this case, xcdat will operate on the global mean time series we calculated above.

Note that you can set the climatological reference period for historical era departures. We use reference_period=("2000-01-01", "2009-12-31") below.

[23]:

ds_global_anomaly = ds_global.temporal.departures(
    "tas", freq="month", reference_period=("2000-01-01", "2009-12-31")
)

/opt/miniconda3/envs/xcdat_scipy_2024/lib/python3.11/site-packages/xarray/core/indexing.py:1617: PerformanceWarning: Slicing with an out-of-order index is generating 15 times more chunks
  return self.array[key]

API Documentation: https://xcdat.readthedocs.io/en/stable/generated/xarray.Dataset.temporal.departures.html

Now let’s plot the departures from the climatological average.#

[24]:

plt.plot(dtime, ds_global_anomaly.tas.values)
plt.xlabel("Year")
plt.ylabel("Global Mean Near-Surface Air Temperature Anomaly [K]")

[24]:

Text(0, 0.5, 'Global Mean Near-Surface Air Temperature Anomaly [K]')

../../_images/demos_24-07-11-scipy-2024_scipy-2024_79_1.png

6.3 Group averages#

xcdat also allows you to calculate group averages.

For example, annual or seasonal mean from monthly data or monthly mean from daily data

Calculate the annual mean from anomaly time series.#

API Documentation: https://xcdat.readthedocs.io/en/stable/generated/xarray.Dataset.temporal.group_averages.html

[25]:

ds_global_anomaly_annual = ds_global_anomaly.temporal.group_average("tas", freq="year")

Now let’s plot the results.#

[26]:

# plot data
dtime_annual = dt2decimal(ds_global_anomaly_annual.time) + 0.5
plt.plot(dtime, ds_global_anomaly.tas.values, label="Monthly departure", color="gray")
plt.plot(
    dtime_annual,
    ds_global_anomaly_annual.tas.values,
    color="k",
    linestyle="",
    marker="_",
    label="Annual Mean",
)
plt.legend(frameon=False)
plt.xlabel("Year")
plt.ylabel("Global Mean Near-Surface Air Temperature [K]")

[26]:

Text(0, 0.5, 'Global Mean Near-Surface Air Temperature [K]')

../../_images/demos_24-07-11-scipy-2024_scipy-2024_84_1.png

7. General Dataset Utilities#

xCDAT includes various utilities for data manipulation including:

Reorient the longitude axis
Add missing bounds
Centering of time coordinates using time bounds*
Add and get bounds*

not shown

7.1. Reorient the longitude axis#

Longitude can be represented from 0 to 360 E or as 180 W to 180 E.
xcdat allows you to convert between these axes systems using xc.swap_lon_axis.

API Documentation: https://xcdat.readthedocs.io/en/stable/generated/xcdat.swap_lon_axis.html

[27]:

print(f"Original lon axis: ({ds.lon[0].item()}, {ds.lon[-1].item()})")

Original lon axis: (0.5, 359.5)

[28]:

ds_swap = xc.swap_lon_axis(ds, to=(-180, 180))

print(f"Swapped lon axis: ({ds_swap.lon[0].item()}, {ds_swap.lon[-1].item()})")

Swapped lon axis: (-179.5, 179.5)

7.2. Add missing bounds#

Bounds are critical to many ``xcdat`` operations.

For example, they are used in determining the weights in spatial or temporal averages and in regridding operations.
add_missing_bounds() will attempt to produce bounds if they do not exist in the original dataset.

API documentation: https://xcdat.readthedocs.io/en/stable/generated/xarray.Dataset.bounds.add_missing_bounds.html

[30]:

# We are dropping the existing bounds to demonstrate adding bounds.
ds_no_bnds = ds.drop_vars("time_bnds")

try:
    ds_no_bnds.bounds.get_bounds("T")
except KeyError as e:
    print(e)

"No bounds data variables were found for the 'T' axis. Make sure the dataset has bound data vars and their names match the 'bounds' attributes found on their related time coordinate variables. Alternatively, you can add bounds with `ds.bounds.add_missing_bounds()` or `ds.bounds.add_bounds()`."

[31]:

ds_w_bnds = ds_no_bnds.bounds.add_missing_bounds(axes=["T"])

ds_w_bnds.bounds.get_bounds("T")

[31]:

<xarray.DataArray 'time_bnds' (time: 180, bnds: 2)> Size: 3kB
array([[cftime.DatetimeNoLeap(2000, 1, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2000, 2, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2000, 2, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2000, 3, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2000, 3, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2000, 4, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2000, 4, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2000, 5, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2000, 5, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2000, 6, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2000, 6, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2000, 7, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2000, 7, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2000, 8, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2000, 8, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2000, 9, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2000, 9, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2000, 10, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2000, 10, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2000, 11, 1, 0, 0, 0, 0, has_year_zero=True)],
...
        cftime.DatetimeNoLeap(2014, 4, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2014, 4, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2014, 5, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2014, 5, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2014, 6, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2014, 6, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2014, 7, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2014, 7, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2014, 8, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2014, 8, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2014, 9, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2014, 9, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2014, 10, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2014, 10, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2014, 11, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2014, 11, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2014, 12, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2014, 12, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2015, 1, 1, 0, 0, 0, 0, has_year_zero=True)]],
      dtype=object)
Coordinates:
    height   float64 8B ...
  * time     (time) object 1kB 2000-01-16 12:00:00 ... 2014-12-16 12:00:00
Dimensions without coordinates: bnds
Attributes:
    xcdat_bounds:  True

xarray.DataArray

'time_bnds'

time: 180
bnds: 2

2000-01-01 00:00:00 2000-02-01 00:00:00 ... 2015-01-01 00:00:00

array([[cftime.DatetimeNoLeap(2000, 1, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2000, 2, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2000, 2, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2000, 3, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2000, 3, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2000, 4, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2000, 4, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2000, 5, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2000, 5, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2000, 6, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2000, 6, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2000, 7, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2000, 7, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2000, 8, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2000, 8, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2000, 9, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2000, 9, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2000, 10, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2000, 10, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2000, 11, 1, 0, 0, 0, 0, has_year_zero=True)],
...
        cftime.DatetimeNoLeap(2014, 4, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2014, 4, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2014, 5, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2014, 5, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2014, 6, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2014, 6, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2014, 7, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2014, 7, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2014, 8, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2014, 8, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2014, 9, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2014, 9, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2014, 10, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2014, 10, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2014, 11, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2014, 11, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2014, 12, 1, 0, 0, 0, 0, has_year_zero=True)],
       [cftime.DatetimeNoLeap(2014, 12, 1, 0, 0, 0, 0, has_year_zero=True),
        cftime.DatetimeNoLeap(2015, 1, 1, 0, 0, 0, 0, has_year_zero=True)]],
      dtype=object)

Coordinates: (2)

height
()
float64
...
units :
m
axis :
Z
positive :
up
long_name :
height
standard_name :
height
```
[1 values with dtype=float64]
```

time

(time)

object

2000-01-16 12:00:00 ... 2014-12-...

bounds :: time_bnds
axis :: T
long_name :: time
standard_name :: time

array([cftime.DatetimeNoLeap(2000, 1, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2000, 2, 15, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2000, 3, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2000, 4, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2000, 5, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2000, 6, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2000, 7, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2000, 8, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2000, 9, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2000, 10, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2000, 11, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2000, 12, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2001, 1, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2001, 2, 15, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2001, 3, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2001, 4, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2001, 5, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2001, 6, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2001, 7, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2001, 8, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2001, 9, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2001, 10, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2001, 11, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2001, 12, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2002, 1, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2002, 2, 15, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2002, 3, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2002, 4, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2002, 5, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2002, 6, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2002, 7, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2002, 8, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2002, 9, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2002, 10, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2002, 11, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2002, 12, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2003, 1, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2003, 2, 15, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2003, 3, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2003, 4, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2003, 5, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2003, 6, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2003, 7, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2003, 8, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2003, 9, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2003, 10, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2003, 11, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2003, 12, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2004, 1, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2004, 2, 15, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2004, 3, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2004, 4, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2004, 5, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2004, 6, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2004, 7, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2004, 8, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2004, 9, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2004, 10, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2004, 11, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2004, 12, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2005, 1, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2005, 2, 15, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2005, 3, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2005, 4, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2005, 5, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2005, 6, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2005, 7, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2005, 8, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2005, 9, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2005, 10, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2005, 11, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2005, 12, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2006, 1, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2006, 2, 15, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2006, 3, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2006, 4, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2006, 5, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2006, 6, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2006, 7, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2006, 8, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2006, 9, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2006, 10, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2006, 11, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2006, 12, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2007, 1, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2007, 2, 15, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2007, 3, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2007, 4, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2007, 5, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2007, 6, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2007, 7, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2007, 8, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2007, 9, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2007, 10, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2007, 11, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2007, 12, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2008, 1, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2008, 2, 15, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2008, 3, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2008, 4, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2008, 5, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2008, 6, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2008, 7, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2008, 8, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2008, 9, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2008, 10, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2008, 11, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2008, 12, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2009, 1, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2009, 2, 15, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2009, 3, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2009, 4, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2009, 5, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2009, 6, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2009, 7, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2009, 8, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2009, 9, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2009, 10, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2009, 11, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2009, 12, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2010, 1, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2010, 2, 15, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2010, 3, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2010, 4, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2010, 5, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2010, 6, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2010, 7, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2010, 8, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2010, 9, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2010, 10, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2010, 11, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2010, 12, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2011, 1, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2011, 2, 15, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2011, 3, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2011, 4, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2011, 5, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2011, 6, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2011, 7, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2011, 8, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2011, 9, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2011, 10, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2011, 11, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2011, 12, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2012, 1, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2012, 2, 15, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2012, 3, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2012, 4, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2012, 5, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2012, 6, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2012, 7, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2012, 8, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2012, 9, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2012, 10, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2012, 11, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2012, 12, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2013, 1, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2013, 2, 15, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2013, 3, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2013, 4, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2013, 5, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2013, 6, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2013, 7, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2013, 8, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2013, 9, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2013, 10, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2013, 11, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2013, 12, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2014, 1, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2014, 2, 15, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2014, 3, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2014, 4, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2014, 5, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2014, 6, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2014, 7, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2014, 8, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2014, 9, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2014, 10, 16, 12, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2014, 11, 16, 0, 0, 0, 0, has_year_zero=True),
       cftime.DatetimeNoLeap(2014, 12, 16, 12, 0, 0, 0, has_year_zero=True)],
      dtype=object)

Indexes: (1)

time

PandasIndex

PandasIndex(CFTimeIndex([2000-01-16 12:00:00, 2000-02-15 00:00:00, 2000-03-16 12:00:00,
             2000-04-16 00:00:00, 2000-05-16 12:00:00, 2000-06-16 00:00:00,
             2000-07-16 12:00:00, 2000-08-16 12:00:00, 2000-09-16 00:00:00,
             2000-10-16 12:00:00,
             ...
             2014-03-16 12:00:00, 2014-04-16 00:00:00, 2014-05-16 12:00:00,
             2014-06-16 00:00:00, 2014-07-16 12:00:00, 2014-08-16 12:00:00,
             2014-09-16 00:00:00, 2014-10-16 12:00:00, 2014-11-16 00:00:00,
             2014-12-16 12:00:00],
            dtype='object', length=180, calendar='noleap', freq=None))

Attributes: (1)
xcdat_bounds :
True

Parallelism with Dask#

“Nearly all existing xarray methods have been extended to work automatically with Dask arrays for parallelism”

Parallelized xarray methods include indexing, computation, concatenating and grouped operations
xCDAT inherits Xarray’s support for Dask parallelism by operating on Xarray objects.
Dask arrays are loaded into memory only when absolutely required (e.g., generating weights for averaging)

— https://docs.xarray.dev/en/stable/user-guide/dask.html#using-dask-with-xarray

Example of Parallelizing xCDAT with Dask#

Open ~11 GB dataset using xc.open_mfdataset()
Chunked on time dimension by 10 (parallel) and by 1 (serial)
Compare run times of global spatial averaging

This exercise requires data to be downloaded locally using this command: ``bash docs/demos/24-07-11-scipy-2024/tas_day_E3SM-2-0_historical_r1i1p1f1_gr/tas_day_wget_script_2024-6-25_11-31-0.sh -s``

[32]:

import os

import xcdat as xc

FILENAMES = [
    "tas_day_E3SM-2-0_historical_r1i1p1f1_gr_18500101-18591231.nc",
    "tas_day_E3SM-2-0_historical_r1i1p1f1_gr_18600101-18691231.nc",
    "tas_day_E3SM-2-0_historical_r1i1p1f1_gr_18700101-18791231.nc",
    "tas_day_E3SM-2-0_historical_r1i1p1f1_gr_18800101-18891231.nc",
    "tas_day_E3SM-2-0_historical_r1i1p1f1_gr_19000101-19091231.nc",
    "tas_day_E3SM-2-0_historical_r1i1p1f1_gr_19100101-19191231.nc",
    "tas_day_E3SM-2-0_historical_r1i1p1f1_gr_19300101-19391231.nc",
    "tas_day_E3SM-2-0_historical_r1i1p1f1_gr_19400101-19491231.nc",
    "tas_day_E3SM-2-0_historical_r1i1p1f1_gr_19600101-19691231.nc",
    "tas_day_E3SM-2-0_historical_r1i1p1f1_gr_19700101-19791231.nc",
    "tas_day_E3SM-2-0_historical_r1i1p1f1_gr_19800101-19891231.nc",
    "tas_day_E3SM-2-0_historical_r1i1p1f1_gr_19900101-19991231.nc",
    "tas_day_E3SM-2-0_historical_r1i1p1f1_gr_20000101-20091231.nc",
    "tas_day_E3SM-2-0_historical_r1i1p1f1_gr_20100101-20141231.nc",
]

LOCAL_DIR = "docs/demos/24-07-11-scipy-2024/tas_day_E3SM-2-0_historical_r1i1p1f1_gr"
LOCAL_FILEPATHS = [os.path.join(LOCAL_DIR, file) for file in FILENAMES]

ds_parallel = xc.open_mfdataset(LOCAL_FILEPATHS, chunks={"time": 10})
ds_serial = xc.open_mfdataset(LOCAL_FILEPATHS, chunks={"time": 1})

Now let’s compare the performance of xCDAT `spatial.average()`#

[33]:

%%time
ds_parallel_avg = ds_parallel.spatial.average("tas")
ds_parallel_avg = ds_parallel.compute()

CPU times: user 1min 6s, sys: 9.39 s, total: 1min 16s
Wall time: 1min 59s

[34]:

%%time
ds_serial_avg = ds_serial.spatial.average("tas")
ds_serial_avg = ds_serial.compute()

CPU times: user 1min 31s, sys: 14.1 s, total: 1min 45s
Wall time: 2min 37s

xCDAT’s spatial averager is faster when chunking on the time dimension by 10 and executing in parallel.#

xCDAT also outperforms the older CDAT library by much large margins in some cases.#

For example, for global spatial averaging across various file sizes:

JOSS Perf Metrics

CDAT only runs functions serially and crashes with 22 GB+ datasets.

Source: https://joss.theoj.org/papers/10.21105/joss.06426

Further Dask Guidance#

Visit these pages for more guidance (e.g., when to parallelize):

Parallel computing with Dask (xCDAT): https://xcdat.readthedocs.io/en/latest/examples/parallel-computing-with-dask.html
Parallel computing with Dask (Xarray): https://docs.xarray.dev/en/stable/user-guide/dask.html
Xarray with Dask Arrays: https://examples.dask.org/xarray.html

What are some things in the works with xCDAT?#

Collaborate with UXarray for interoperation of two tools to support end-to-end and more streamlined operation on unstructured (i.e. E3SM native output) datasets.
Continue assisting DOE funded projects including PCMDI Metrics Package, E3SM Diags
Explore other DOE funded projects to integrate xCDAT for analysis capabilities (e.g., ARM Diags and ILAMB)
Explore xarray-datatree package for analyzing ensembles with xCDAT capabilities

Get involved and join the xCDAT community!

QR Code

Code contributions are welcome and appreciated
- GitHub Repository: xCDAT/xcdat
- Contributing Guide: https://xcdat.readthedocs.io/en/latest/contributing.html
Submit and/or address tickets for feature suggestions, bugs, and documentation updates
- GitHub Issues: xCDAT/xcdat#issues
Participate in forum discussions on version releases, architecture, feature suggestions, etc.
GitHub Discussions: xCDAT/xcdat#discussions

Key takeaways

xCDAT is an extension of Xarray for climate data analysis on structured grids
Focused on routine climate research analysis operations
Designed to encourages software sustainability and reproducible science
Parallelizable through Xarray’s support for Dask, which enables efficient processing of large datasets