Format Change - HDF4 to netCDF4
The Level 2 and Level 3 products produced by the Ocean
Biology Processing Group will change file formats from HDF4 to
netCDF4 with the multi-mission reprocessing planned for the
summer of 2014. There are several reasons for making
this significant change to the data products.
-
With the advent of HDF5, the current HDF4 format, while still funded,
is no longer under active development.
-
Data interoperability improves.
-
The application of accepted international conventions
improves metadata consistency.
Why netCDF4?
-
netCDF4 was a NASA-funded effort to improve:
- interoperability among scientific data representations
- integration of observations and model outputs
- I/O for high-performance computing.
-
netCDF4 combines the netCDF and HDF5 data models, drawing on the
separate strengths of each:
-
netCDF is popular and easy to use, and includes many tools and
multiple implementations.
-
HDF5 is powerful, high-performance, storage-efficient, and
extensible.
-
The goal of netCDF4 is to make netCDF more suitable for
high-performance computing and large datasets, and to provide
a simple high-level application programming interface (API) for HDF5.
-
ESA has chosen netCDF4 as the format for the sensors on the upcoming
Sentinel-3 mission (the ENVISAT follow-on).
-
Efforts are underway to recognize netCDF and the Climate and Forecast
(CF) Conventions as international standards.
-
Backward compatibility with previous versions of netCDF will be
maintained in future versions.
-
netCDF is a recognized data format for a large number of data
visualization and manipulation packages (e.g. GIS packages, ENVI,
IDL, Matlab, GrADS, Panoply), so using it could facilitate opening
up our data sets to a wider community.
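Because netCDF4 files use HDF5 as their on-disk storage layer, the first few bytes of a file are usually enough to tell an older HDF4 product from a newer netCDF4 one. The following is a minimal sketch using only the Python standard library; the function name `sniff_format` is illustrative and not part of any distributed tool. It assumes the signature sits at the start of the file (HDF5 also permits a user block before the signature), and an HDF5 signature alone does not prove the file follows the netCDF4 data model.

```python
# Leading bytes ("magic numbers") of the formats discussed above.
HDF4_MAGIC = b"\x0e\x03\x13\x01"
HDF5_MAGIC = b"\x89HDF\r\n\x1a\n"   # netCDF4 files are stored as HDF5
NETCDF_CLASSIC_PREFIX = b"CDF"       # followed by a version byte: 1, 2, or 5


def sniff_format(path):
    """Return a best-guess format label from the first 8 bytes of a file.

    Illustrative helper (hypothetical name); it only inspects offset 0.
    """
    with open(path, "rb") as f:
        head = f.read(8)
    if head.startswith(HDF5_MAGIC):
        return "HDF5 (possibly netCDF4)"
    if head.startswith(HDF4_MAGIC):
        return "HDF4"
    if head[:3] == NETCDF_CLASSIC_PREFIX and head[3:4] in (b"\x01", b"\x02", b"\x05"):
        return "netCDF classic"
    return "unknown"
```

A script that processes a mixed archive could call `sniff_format(path)` on each file and route HDF4 and netCDF4 products to the appropriate reader.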
Metadata Changes
With the switch to netCDF there is a corresponding update
to the product metadata. Our updated schema follows these
standards and conventions:
- ISO 19115
-
Defines the schema required for describing geographic
information and services. It provides information about
the identification, the extent, the quality, the spatial and
temporal schema, spatial reference, and distribution of digital
geographic data.
- Climate and Forecast (CF)
-
The netCDF-CF conventions are a set of codified recommendations
for practices built around published specifications. While CF
is a convention rather than an established metadata standard, CF
is a critically important step towards better interoperability.
- Attribute Convention for Data Discovery (ACDD)
-
The netCDF Group at Unidata has recommended netCDF attributes
for data discovery. Many of these attributes correspond to
general discovery metadata content, so they are available in
many metadata standards.
NASA's Earth Science Data and Information System Standards Office provides a
summary of the standards and conventions that it approves.
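As an illustration of how the discovery attributes described above can be checked, here is a minimal sketch that reports which highly recommended ACDD global attributes are missing from a file's metadata. It assumes the ACDD 1.3 list (`title`, `summary`, `keywords`, `Conventions`), which may differ from the version a given product follows. The helper names are hypothetical, and `read_global_attrs` requires the third-party `netCDF4` Python package.

```python
# Highly recommended global attributes, assuming ACDD version 1.3.
ACDD_HIGHLY_RECOMMENDED = ("title", "summary", "keywords", "Conventions")


def read_global_attrs(path):
    """Read global attributes from a netCDF file into a dict.

    Hypothetical helper; needs the third-party netCDF4 package.
    """
    import netCDF4  # imported lazily so the checker below runs without it
    with netCDF4.Dataset(path) as ds:
        return {name: ds.getncattr(name) for name in ds.ncattrs()}


def missing_discovery_attrs(attrs):
    """Return the highly recommended ACDD attributes absent or empty in attrs."""
    return [name for name in ACDD_HIGHLY_RECOMMENDED
            if not str(attrs.get(name, "")).strip()]


# Example with made-up attribute values (not taken from a real product).
example_attrs = {"title": "Example Level 3 product", "Conventions": "CF-1.6"}
print(missing_discovery_attrs(example_attrs))
```

For a file on disk, `missing_discovery_attrs(read_global_attrs(path))` performs the same check against the file's actual global attributes.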