NetCDF¶
A very efficient way to read, analyze and write NetCDF files is to use the xarray Python library, which can be viewed as an N-dimensional counterpart of pandas.
Reading NetCDF¶
Reading single file¶
Reading NetCDF files is done by using the xarray.open_dataset method, which returns an xarray.Dataset object.
import xarray as xr
import numpy as np
data = xr.open_dataset('data/UV500storm.nc')
data
<xarray.Dataset> Dimensions: (lat: 33, lon: 36, timestep: 1) Coordinates: * lat (lat) float32 20.0 21.25 22.5 23.75 25.0 ... 56.25 57.5 58.75 60.0 * lon (lon) float32 -140.0 -137.5 -135.0 -132.5 ... -57.5 -55.0 -52.5 * timestep (timestep) int32 372 Data variables: reftime |S20 b'1996 01 05 00:00' v (timestep, lat, lon) float32 nan nan nan nan ... 54.32 49.82 44.32 u (timestep, lat, lon) float32 nan nan nan nan ... 7.713 8.213 9.713 Attributes: history: Thu Jul 21 16:12:12 2016: ncks -A U500storm.nc V500storm.nc\nTh... NCO: 4.4.2
Reading multiple files¶
Often, a variable is stored in multiple NetCDF files (one file per year, for instance). The xarray.open_mfdataset method allows all the files to be opened at once and concatenated along the record dimension (the UNLIMITED dimension, which is usually time) and the spatial dimensions.
Below, the four ISAS13 files are opened at once and are automatically concatenated along the record dimension, leading to a dataset with 4 time steps.
data = xr.open_mfdataset("data/*ISAS*nc", combine='by_coords')
data
<xarray.Dataset> Dimensions: (time: 4, depth: 152) Coordinates: * depth (depth) float32 0.0 3.0 5.0 10.0 ... 1.96e+03 1.98e+03 2e+03 * time (time) datetime64[ns] 2012-01-15 2012-02-15 2012-03-15 2012-04-15 Data variables: TEMP (time, depth) float32 dask.array<chunksize=(1, 152), meta=np.ndarray> Attributes: script: extract_prof.py history: Thu Jul 14 11:32:10 2016: ncks -O -3 ISAS13_20120115_fld_TEMP.n... NCO: "4.5.5"
Furthermore, complex models are often parallelized using the Message Passing Interface (MPI), in which each processor manages a subdomain. If each processor saves the output for its own sub-region, there will be as many output files as there are processors. xarray makes it possible to reconstruct the global dataset by concatenating the subregional files according to their coordinates.
data = xr.open_mfdataset("data/GYRE_OOPE*", combine='by_coords', engine='netcdf4')
data['OOPE']
<xarray.DataArray 'OOPE' (time: 10, y: 22, x: 32, community: 3, weight: 100)> dask.array<concatenate, shape=(10, 22, 32, 3, 100), dtype=float32, chunksize=(10, 11, 16, 3, 100), chunktype=numpy.ndarray> Coordinates: * x (x) int64 0 1 2 3 4 5 6 7 8 9 10 ... 22 23 24 25 26 27 28 29 30 31 * y (y) int64 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 Dimensions without coordinates: time, community, weight
In the two previous examples, a chunksize attribute appeared in the outputs. This is because opening multiple files automatically generates dask arrays, which are ready for parallel computing. These are discussed in a dedicated section.
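For instance, the chunk layout of a variable can be inspected through its chunks attribute and changed with the chunk method (a minimal sketch; the target chunk sizes below are arbitrary):
# inspect the dask chunk layout of the OOPE variable
print(data['OOPE'].chunks)
# merge the spatial chunks into a single block (illustrative sizes)
oope = data['OOPE'].chunk({'y': 22, 'x': 32})
print(oope.chunks)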
Accessing dimensions, variables, attributes¶
data = xr.open_dataset("data/UV500storm.nc")
data
<xarray.Dataset> Dimensions: (lat: 33, lon: 36, timestep: 1) Coordinates: * lat (lat) float32 20.0 21.25 22.5 23.75 25.0 ... 56.25 57.5 58.75 60.0 * lon (lon) float32 -140.0 -137.5 -135.0 -132.5 ... -57.5 -55.0 -52.5 * timestep (timestep) int32 372 Data variables: reftime |S20 b'1996 01 05 00:00' v (timestep, lat, lon) float32 nan nan nan nan ... 54.32 49.82 44.32 u (timestep, lat, lon) float32 nan nan nan nan ... 7.713 8.213 9.713 Attributes: history: Thu Jul 21 16:12:12 2016: ncks -A U500storm.nc V500storm.nc\nTh... NCO: 4.4.2
Dimensions¶
Recovering dimensions is done by accessing the dims attribute of the dataset, which returns a dictionary whose keys are the dimension names and whose values are the number of elements along each dimension.
data.dims
Frozen({'lat': 33, 'lon': 36, 'timestep': 1})
data.dims['lat']
33
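Note that the sizes attribute returns the same mapping; it also works on DataArray objects, for which dims only gives a tuple of dimension names. A short sketch:
data.sizes['lon']   # 36
data['u'].sizes['lat']   # 33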
Variables¶
Variables can be accessed by using the data_vars attribute, which returns a dictionary whose keys are the dataset variable names.
data.data_vars
Data variables:
reftime |S20 b'1996 01 05 00:00'
v (timestep, lat, lon) float32 nan nan nan nan ... 54.32 49.82 44.32
u (timestep, lat, lon) float32 nan nan nan nan ... 7.713 8.213 9.713
data.data_vars['u']
<xarray.DataArray 'u' (timestep: 1, lat: 33, lon: 36)> array([[[ nan, nan, ..., nan, nan], [ nan, nan, ..., nan, nan], ..., [ 0.213205, -0.036795, ..., 10.463205, 10.463205], [-2.036795, -1.536795, ..., 8.213205, 9.713205]]], dtype=float32) Coordinates: * lat (lat) float32 20.0 21.25 22.5 23.75 25.0 ... 56.25 57.5 58.75 60.0 * lon (lon) float32 -140.0 -137.5 -135.0 -132.5 ... -57.5 -55.0 -52.5 * timestep (timestep) int32 372
Note that data variables can also be accessed by using the variable name as a key to the dataset object, as follows:
data['v']
<xarray.DataArray 'v' (timestep: 1, lat: 33, lon: 36)> array([[[ nan, nan, ..., nan, nan], [ nan, nan, ..., nan, nan], ..., [-7.183777, -8.183777, ..., 48.816223, 42.316223], [-7.683777, -9.683777, ..., 49.816223, 44.316223]]], dtype=float32) Coordinates: * lat (lat) float32 20.0 21.25 22.5 23.75 25.0 ... 56.25 57.5 58.75 60.0 * lon (lon) float32 -140.0 -137.5 -135.0 -132.5 ... -57.5 -55.0 -52.5 * timestep (timestep) int32 372
Note that variables are returned as xarray.DataArray objects. To recover a variable as a numpy array, the values attribute can be used. In this case, missing values are set to NaN.
v = data['v']
v = v.values
v
array([[[ nan, nan, nan, ..., nan,
nan, nan],
[ nan, nan, nan, ..., nan,
nan, nan],
[ nan, nan, nan, ..., nan,
nan, nan],
...,
[ -7.183777, -7.683777, -8.183777, ..., 53.316223,
47.816223, 40.816223],
[ -7.183777, -8.183777, -9.183777, ..., 53.816223,
48.816223, 42.316223],
[ -7.683777, -9.683777, -11.683777, ..., 54.316223,
49.816223, 44.316223]]], dtype=float32)
In order to obtain a masked array instead, use the to_masked_array() method:
v = data['v']
v = v.to_masked_array()
v
masked_array(
data=[[[--, --, --, ..., --, --, --],
[--, --, --, ..., --, --, --],
[--, --, --, ..., --, --, --],
...,
[-7.18377685546875, -7.68377685546875, -8.18377685546875, ...,
53.31622314453125, 47.81622314453125, 40.81622314453125],
[-7.18377685546875, -8.18377685546875, -9.18377685546875, ...,
53.81622314453125, 48.81622314453125, 42.31622314453125],
[-7.68377685546875, -9.68377685546875, -11.68377685546875, ...,
54.31622314453125, 49.81622314453125, 44.31622314453125]]],
mask=[[[ True, True, True, ..., True, True, True],
[ True, True, True, ..., True, True, True],
[ True, True, True, ..., True, True, True],
...,
[False, False, False, ..., False, False, False],
[False, False, False, ..., False, False, False],
[False, False, False, ..., False, False, False]]],
fill_value=1e+20,
dtype=float32)
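If the missing values are to be replaced rather than masked, the fillna method can be used directly on the DataArray (a minimal sketch; the fill value of 0 is arbitrary):
# replace missing values by 0 (arbitrary illustrative value)
v_filled = data['v'].fillna(0.)
v_filled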
Time management¶
By default, the time variable is detected by xarray from the NetCDF attributes and is converted into human-readable dates. This is done by using the cftime module.
data = xr.open_mfdataset("data/*ISAS*", combine='by_coords')
data['time']
<xarray.DataArray 'time' (time: 4)> array(['2012-01-15T00:00:00.000000000', '2012-02-15T00:00:00.000000000', '2012-03-15T00:00:00.000000000', '2012-04-15T00:00:00.000000000'], dtype='datetime64[ns]') Coordinates: * time (time) datetime64[ns] 2012-01-15 2012-02-15 2012-03-15 2012-04-15
Then, the user can access the year, month, day, hour, minute, second, microsecond, nanosecond, date, time, dayofyear, weekofyear, dayofweek and quarter attributes as follows:
data['time.year']
<xarray.DataArray 'year' (time: 4)> array([2012, 2012, 2012, 2012]) Coordinates: * time (time) datetime64[ns] 2012-01-15 2012-02-15 2012-03-15 2012-04-15
data['time.month']
<xarray.DataArray 'month' (time: 4)> array([1, 2, 3, 4]) Coordinates: * time (time) datetime64[ns] 2012-01-15 2012-02-15 2012-03-15 2012-04-15
data['time.day']
<xarray.DataArray 'day' (time: 4)> array([15, 15, 15, 15]) Coordinates: * time (time) datetime64[ns] 2012-01-15 2012-02-15 2012-03-15 2012-04-15
data['time.dayofyear']
<xarray.DataArray 'dayofyear' (time: 4)> array([ 15, 46, 75, 106]) Coordinates: * time (time) datetime64[ns] 2012-01-15 2012-02-15 2012-03-15 2012-04-15
If the user does not want xarray to convert time into human-readable dates, set the decode_times argument to False.
data = xr.open_mfdataset("data/*ISAS*", combine='by_coords', decode_times=False)
data['time']
<xarray.DataArray 'time' (time: 4)> array([22659., 22690., 22719., 22750.]) Coordinates: * time (time) float64 2.266e+04 2.269e+04 2.272e+04 2.275e+04 Attributes: units: days since 1950-01-01T00:00:00Z
In this case, years, months, etc. cannot be extracted directly.
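They can still be decoded afterwards from the CF units attribute, for instance with the xarray.decode_cf function (a short sketch):
# decode the raw time axis a posteriori using the CF attributes
decoded = xr.decode_cf(data)
decoded['time.year']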
Attributes¶
To get attributes, use the attrs attribute, which exists for both xarray.Dataset objects (global attributes) and xarray.DataArray objects (variable attributes). It returns a dictionary containing the attribute names and values.
data.attrs
{'script': 'extract_prof.py',
'history': 'Thu Jul 14 11:32:10 2016: ncks -O -3 ISAS13_20120115_fld_TEMP.nc ISAS13_20120115_fld_TEMP.nc',
'NCO': '"4.5.5"'}
data.attrs['NCO']
'"4.5.5"'
time = data['time']
time.attrs
{'units': 'days since 1950-01-01T00:00:00Z'}
time.attrs['units']
'days since 1950-01-01T00:00:00Z'
Indexing¶
As in pandas, there are two ways to extract part of a dataset. Let’s consider the ISAS dataset, which contains 152 vertical levels unevenly spaced between 0 and 2000 m.
data = xr.open_mfdataset('data/*ISAS*', combine='by_coords')
data
<xarray.Dataset> Dimensions: (time: 4, depth: 152) Coordinates: * depth (depth) float32 0.0 3.0 5.0 10.0 ... 1.96e+03 1.98e+03 2e+03 * time (time) datetime64[ns] 2012-01-15 2012-02-15 2012-03-15 2012-04-15 Data variables: TEMP (time, depth) float32 dask.array<chunksize=(1, 152), meta=np.ndarray> Attributes: script: extract_prof.py history: Thu Jul 14 11:32:10 2016: ncks -O -3 ISAS13_20120115_fld_TEMP.n... NCO: "4.5.5"
Extracting using indexes¶
To extract the first ten depth levels and the leading time steps, the isel method should be used, which can be applied to either a Dataset or a DataArray.
data.isel(depth=range(10), time=0)
<xarray.Dataset> Dimensions: (depth: 10) Coordinates: * depth (depth) float32 0.0 3.0 5.0 10.0 15.0 20.0 25.0 30.0 35.0 40.0 time datetime64[ns] 2012-01-15 Data variables: TEMP (depth) float32 dask.array<chunksize=(10,), meta=np.ndarray> Attributes: script: extract_prof.py history: Thu Jul 14 11:32:10 2016: ncks -O -3 ISAS13_20120115_fld_TEMP.n... NCO: "4.5.5"
data.isel(time=slice(0, 2), depth=slice(0, 10))
<xarray.Dataset> Dimensions: (time: 2, depth: 10) Coordinates: * depth (depth) float32 0.0 3.0 5.0 10.0 15.0 20.0 25.0 30.0 35.0 40.0 * time (time) datetime64[ns] 2012-01-15 2012-02-15 Data variables: TEMP (time, depth) float32 dask.array<chunksize=(1, 10), meta=np.ndarray> Attributes: script: extract_prof.py history: Thu Jul 14 11:32:10 2016: ncks -O -3 ISAS13_20120115_fld_TEMP.n... NCO: "4.5.5"
data['TEMP'].isel(time=slice(0, 2), depth=range(0, 10))
<xarray.DataArray 'TEMP' (time: 2, depth: 10)> dask.array<getitem, shape=(2, 10), dtype=float32, chunksize=(1, 10), chunktype=numpy.ndarray> Coordinates: * depth (depth) float32 0.0 3.0 5.0 10.0 15.0 20.0 25.0 30.0 35.0 40.0 * time (time) datetime64[ns] 2012-01-15 2012-02-15
Extracting using values¶
To extract the data between two depths (100 m and 1000 m in the example below) and over a given time period, the sel method should be used, which can be applied to either a Dataset or a DataArray. It uses coordinate values rather than indexes.
data.sel(time=slice('2012-01-15', '2012-02-15'))
<xarray.Dataset> Dimensions: (time: 2, depth: 152) Coordinates: * depth (depth) float32 0.0 3.0 5.0 10.0 ... 1.96e+03 1.98e+03 2e+03 * time (time) datetime64[ns] 2012-01-15 2012-02-15 Data variables: TEMP (time, depth) float32 dask.array<chunksize=(1, 152), meta=np.ndarray> Attributes: script: extract_prof.py history: Thu Jul 14 11:32:10 2016: ncks -O -3 ISAS13_20120115_fld_TEMP.n... NCO: "4.5.5"
zmin = 100
zmax = 1000
data.sel(time=slice('2012-01-15', '2012-02-15'), depth=slice(zmin, zmax))
<xarray.Dataset> Dimensions: (time: 2, depth: 81) Coordinates: * depth (depth) float32 100.0 110.0 120.0 130.0 ... 940.0 960.0 980.0 1e+03 * time (time) datetime64[ns] 2012-01-15 2012-02-15 Data variables: TEMP (time, depth) float32 dask.array<chunksize=(1, 81), meta=np.ndarray> Attributes: script: extract_prof.py history: Thu Jul 14 11:32:10 2016: ncks -O -3 ISAS13_20120115_fld_TEMP.n... NCO: "4.5.5"
Plotting¶
As for pandas, xarray comes with plotting functions. The plot type depends on the dimensionality of the field:
1D: curve
2D: pcolormesh
3D, 4D, … : histogram
data = xr.open_dataset('data/UV500storm.nc')
data
<xarray.Dataset> Dimensions: (lat: 33, lon: 36, timestep: 1) Coordinates: * lat (lat) float32 20.0 21.25 22.5 23.75 25.0 ... 56.25 57.5 58.75 60.0 * lon (lon) float32 -140.0 -137.5 -135.0 -132.5 ... -57.5 -55.0 -52.5 * timestep (timestep) int32 372 Data variables: reftime |S20 b'1996 01 05 00:00' v (timestep, lat, lon) float32 nan nan nan nan ... 54.32 49.82 44.32 u (timestep, lat, lon) float32 nan nan nan nan ... 7.713 8.213 9.713 Attributes: history: Thu Jul 21 16:12:12 2016: ncks -A U500storm.nc V500storm.nc\nTh... NCO: 4.4.2
l = data['u'].isel(timestep=0).plot()
l = data['u'].isel(timestep=0, lat=15).plot()
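The plot methods forward keyword arguments to matplotlib, so figures can be customized (a minimal sketch; the colormap and color limits are arbitrary choices):
# pass matplotlib options through the xarray plot method
l = data['u'].isel(timestep=0).plot(cmap='RdBu_r', vmin=-40, vmax=40)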
Mathematical operations¶
As for pandas, xarray comes with mathematical operations.
data = xr.open_mfdataset('data/*ISAS*', combine='by_coords')
To compute the mean over the entire dataset:
data.mean()
<xarray.Dataset> Dimensions: () Data variables: TEMP float32 dask.array<chunksize=(), meta=np.ndarray>
To compute the mean along time dimension:
data.mean(dim='time')
<xarray.Dataset> Dimensions: (depth: 152) Coordinates: * depth (depth) float32 0.0 3.0 5.0 10.0 ... 1.96e+03 1.98e+03 2e+03 Data variables: TEMP (depth) float32 dask.array<chunksize=(152,), meta=np.ndarray>
Mean over the depth dimension:
data.mean(dim='depth')
<xarray.Dataset> Dimensions: (time: 4) Coordinates: * time (time) datetime64[ns] 2012-01-15 2012-02-15 2012-03-15 2012-04-15 Data variables: TEMP (time) float32 dask.array<chunksize=(1,), meta=np.ndarray>
Contrary to numpy's eager evaluation, xarray performs lazy operations. As indicated on the xarray website:
Operations queue up a series of tasks mapped over blocks, and no computation is performed until you actually ask values to be computed (e.g., to print results to your screen or write to disk)
To force the computation, the compute and/or load methods must be used. Let’s compare the outputs below:
data['TEMP'].mean(dim='time')
<xarray.DataArray 'TEMP' (depth: 152)> dask.array<mean_agg-aggregate, shape=(152,), dtype=float32, chunksize=(152,), chunktype=numpy.ndarray> Coordinates: * depth (depth) float32 0.0 3.0 5.0 10.0 ... 1.96e+03 1.98e+03 2e+03
data['TEMP'].mean(dim='time').compute()
<xarray.DataArray 'TEMP' (depth: 152)> array([28.292002 , 28.27325 , 28.25 , 28.232248 , 28.216 , 28.1895 , 28.14475 , 28.1115 , 28.089 , 28.061497 , 27.9785 , 27.86825 , 27.706749 , 27.5455 , 27.37575 , 27.187498 , 27.0145 , 26.845251 , 26.65725 , 26.47975 , 26.300251 , 26.0765 , 25.755249 , 25.4485 , 25.126 , 24.78375 , 24.461 , 24.11975 , 23.7695 , 23.343248 , 22.948502 , 22.54125 , 22.131 , 21.701248 , 21.247501 , 20.817501 , 20.380499 , 19.909 , 19.467499 , 19.02 , 18.53825 , 18.0675 , 17.5955 , 17.04475 , 16.568 , 16.05925 , 15.553499 , 15.09725 , 14.63025 , 14.14725 , 13.635249 , 13.13725 , 12.675499 , 12.21725 , 11.77075 , 11.3205 , 10.8885 , 10.518499 , 10.15325 , 9.795 , 9.4395 , 9.1145 , 8.85225 , 8.580999 , 8.3245 , 8.07675 , 7.8415 , 7.63525 , 7.43375 , 7.23325 , 7.0390005, 6.8665 , 6.71575 , 6.5675 , 6.43225 , 6.2975 , 6.17 , 6.0575 , 5.9509997, 5.8425 , 5.7382503, 5.6480002, 5.5719995, 5.4975 , 5.42175 , 5.34825 , 5.27975 , 5.21525 , 5.1465 , 5.07925 , 5.01725 , 4.94625 , 4.8315 , 4.7165 , 4.6045 , 4.4805 , 4.36025 , 4.2535 , 4.1505 , 4.0602503, 3.9720001, 3.8920002, 3.81175 , 3.738 , 3.6685 , 3.59775 , 3.53025 , 3.4675 , 3.409 , 3.3554997, 3.3115 , 3.2615001, 3.2140002, 3.1664999, 3.12425 , 3.0805001, 3.038 , 2.99825 , 2.9597497, 2.92275 , 2.88575 , 2.8477502, 2.81425 , 2.7842498, 2.7525 , 2.72125 , 2.69175 , 2.66325 , 2.63925 , 2.61625 , 2.5945 , 2.5705 , 2.549 , 2.527 , 2.50675 , 2.4875 , 2.46925 , 2.4507499, 2.43275 , 2.4145 , 2.39675 , 2.3785 , 2.3617501, 2.3445 , 2.32725 , 2.3095002, 2.2925 , 2.27625 , 2.25875 , 2.2437499, 2.2265 , 2.22025 ], dtype=float32) Coordinates: * depth (depth) float32 0.0 3.0 5.0 10.0 ... 1.96e+03 1.98e+03 2e+03
In the first output, no values are displayed: the mean has not been computed yet. In the second output, the effective mean values are shown because the computation has been forced using compute.
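The load method can be used in the same way; contrary to compute, it also loads the values in place (a short sketch):
mean_temp = data['TEMP'].mean(dim='time')
mean_temp.load()   # mean_temp now holds computed numpy values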
Group-by operations¶
The groupby method allows operations to be easily performed on independent groups. For instance, to compute temporal (yearly, monthly, seasonal) means:
data.groupby('time.month').mean(dim='time')
<xarray.Dataset> Dimensions: (month: 4, depth: 152) Coordinates: * depth (depth) float32 0.0 3.0 5.0 10.0 ... 1.96e+03 1.98e+03 2e+03 * month (month) int64 1 2 3 4 Data variables: TEMP (month, depth) float32 dask.array<chunksize=(1, 152), meta=np.ndarray>
data.groupby('time.year').mean(dim='time')
<xarray.Dataset> Dimensions: (year: 1, depth: 152) Coordinates: * depth (depth) float32 0.0 3.0 5.0 10.0 ... 1.96e+03 1.98e+03 2e+03 * year (year) int64 2012 Data variables: TEMP (year, depth) float32 dask.array<chunksize=(1, 152), meta=np.ndarray>
data.groupby('time.season').mean(dim='time')
<xarray.Dataset> Dimensions: (season: 2, depth: 152) Coordinates: * depth (depth) float32 0.0 3.0 5.0 10.0 ... 1.96e+03 1.98e+03 2e+03 * season (season) object 'DJF' 'MAM' Data variables: TEMP (season, depth) float32 dask.array<chunksize=(1, 152), meta=np.ndarray>
Discrete bins (for depth intervals, for instance) can be defined by using the groupby_bins method.
depth_bins = np.arange(0, 1000 + 250, 250)
depth_bins
array([ 0, 250, 500, 750, 1000])
zmean = data.groupby_bins('depth', depth_bins).mean(dim='depth')
zmean
<xarray.Dataset> Dimensions: (depth_bins: 4, time: 4) Coordinates: * depth_bins (depth_bins) object (0, 250] (250, 500] (500, 750] (750, 1000] * time (time) datetime64[ns] 2012-01-15 2012-02-15 ... 2012-04-15 Data variables: TEMP (depth_bins, time) float32 dask.array<chunksize=(1, 1), meta=np.ndarray>
import matplotlib.pyplot as plt
plt.rcParams['text.usetex'] = False
cs = zmean['TEMP'].plot()
Let’s reload the ISAS dataset:
data = xr.open_mfdataset('data/*ISAS*', combine='by_coords').isel(time=0)
data
<xarray.Dataset> Dimensions: (depth: 152) Coordinates: * depth (depth) float32 0.0 3.0 5.0 10.0 ... 1.96e+03 1.98e+03 2e+03 time datetime64[ns] 2012-01-15 Data variables: TEMP (depth) float32 dask.array<chunksize=(152,), meta=np.ndarray> Attributes: script: extract_prof.py history: Thu Jul 14 11:32:10 2016: ncks -O -3 ISAS13_20120115_fld_TEMP.n... NCO: "4.5.5"
It is also possible to compute rolling means along the depth dimension as follows:
datar = data.rolling({'depth': 31}, center=True).mean(dim='depth')
datar
<xarray.Dataset> Dimensions: (depth: 152) Coordinates: * depth (depth) float32 0.0 3.0 5.0 10.0 ... 1.96e+03 1.98e+03 2e+03 time datetime64[ns] 2012-01-15 Data variables: TEMP (depth) float64 dask.array<chunksize=(151,), meta=np.ndarray> Attributes: script: extract_prof.py history: Thu Jul 14 11:32:10 2016: ncks -O -3 ISAS13_20120115_fld_TEMP.n... NCO: "4.5.5"
data['TEMP'].plot(label='original')
datar['TEMP'].plot(label='rolling', marker='o', linestyle='none')
plt.legend()
<matplotlib.legend.Legend at 0x7fd623d7ccd0>
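The NaN values near the profile edges come from windows that extend beyond the data; they can be limited with the min_periods argument (a hedged sketch):
# allow windows with fewer valid points near the edges
datar2 = data.rolling({'depth': 31}, center=True, min_periods=1).mean()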
Creating NetCDF¶
An easy way to write a NetCDF file is to create a Dataset object. First, let’s define some dummy variables:
import numpy as np
import cftime
nx = 10
ny = 20
ntime = 5
x = np.arange(nx)
y = np.arange(ny)
data = np.random.rand(ntime, ny, nx) - 0.5
data = np.ma.masked_where(data < 0, data)
# converts time into date
time = np.arange(ntime)
date = cftime.num2date(time, 'days since 1900-01-01 00:00:00')
date
array([cftime.DatetimeGregorian(1900, 1, 1, 0, 0, 0, 0, has_year_zero=False),
cftime.DatetimeGregorian(1900, 1, 2, 0, 0, 0, 0, has_year_zero=False),
cftime.DatetimeGregorian(1900, 1, 3, 0, 0, 0, 0, has_year_zero=False),
cftime.DatetimeGregorian(1900, 1, 4, 0, 0, 0, 0, has_year_zero=False),
cftime.DatetimeGregorian(1900, 1, 5, 0, 0, 0, 0, has_year_zero=False)],
dtype=object)
data.shape
(5, 20, 10)
First, initialize an empty Dataset object by calling the xarray.Dataset constructor.
ds = xr.Dataset()
Then, add the variables and coordinates to the dataset. Note that each should be provided as a tuple containing two elements (an alternative one-step construction is sketched after the example below):
A list of dimension names
The numpy array
ds['data'] = (['time', 'y', 'x'], data)
ds['x'] = (['x'], x)
ds['y'] = (['y'], y)
ds['time'] = (['time'], date)
ds
<xarray.Dataset> Dimensions: (time: 5, y: 20, x: 10) Coordinates: * x (x) int64 0 1 2 3 4 5 6 7 8 9 * y (y) int64 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 * time (time) object 1900-01-01 00:00:00 ... 1900-01-05 00:00:00 Data variables: data (time, y, x) float64 nan nan nan 0.1934 ... 0.07791 0.2112 nan nan
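Alternatively, the same dataset can be built in a single call by passing the variables and coordinates directly to the constructor (a sketch equivalent to the steps above):
# equivalent one-step construction
ds2 = xr.Dataset(
    data_vars={'data': (('time', 'y', 'x'), data)},
    coords={'time': date, 'y': y, 'x': x},
)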
Then, add global and variable attributes to the dataset as follows:
import os
from datetime import datetime
# Set file global attributes (file directory name + date)
ds.attrs['script'] = os.getcwd()
ds.attrs['date'] = str(datetime.today())
ds['data'].attrs['description'] = 'Random draft'
ds
<xarray.Dataset> Dimensions: (time: 5, y: 20, x: 10) Coordinates: * x (x) int64 0 1 2 3 4 5 6 7 8 9 * y (y) int64 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 * time (time) object 1900-01-01 00:00:00 ... 1900-01-05 00:00:00 Data variables: data (time, y, x) float64 nan nan nan 0.1934 ... 0.07791 0.2112 nan nan Attributes: script: /home/barrier/Work/scientific_communication/seminaires_pomm/pyt... date: 2021-10-21 16:12:37.555231
Finally, create the NetCDF file by using the to_netcdf method.
ds.to_netcdf('data/example.nc', unlimited_dims='time', format='NETCDF4')
Note that xarray automatically writes the _FillValue and time:units attributes.
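The on-disk encoding can also be controlled explicitly through the encoding argument of to_netcdf (a minimal sketch; the fill value, compression settings and time units below are illustrative):
# control the on-disk encoding explicitly (illustrative values)
encoding = {
    'data': {'_FillValue': 1e20, 'zlib': True, 'complevel': 4},
    'time': {'units': 'days since 1900-01-01 00:00:00'},
}
ds.to_netcdf('data/example_encoded.nc', format='NETCDF4', encoding=encoding)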