| Title: | Cloud-Native Access to Copernicus Marine ARCO Zarr Stores |
|---|---|
| Description: | Build and query a catalog of Copernicus Marine Service (CMEMS) Analysis-Ready Cloud-Optimized (ARCO) Zarr datasets. Provides GDAL-ready data source names for direct access to 'timeChunked' and 'geoChunked' Zarr stores without file downloads or directory listings. Default DSNs use '/vsicurl/' for zero-configuration access. Catalog is built by walking the CMEMS STAC API at <https://stac.marine.copernicus.eu>. |
| Authors: | Michael Sumner [aut, cre] (ORCID: <https://orcid.org/0000-0002-2471-7511>) |
| Maintainer: | Michael Sumner <[email protected]> |
| License: | MIT + file LICENSE |
| Version: | 0.1.0 |
| Built: | 2026-06-10 09:33:27 UTC |
| Source: | https://github.com/hypertidy/cmemsarco |
Convenience wrapper combining cmems_arco_url() and cmems_gdal_dsn().
cmems_arco_dsn(product_id, dataset_id, version, chunk_type = c("time", "geo"), bucket_version = "045", array = NULL, use_s3 = FALSE)
product_id |
Product identifier |
dataset_id |
Dataset identifier (without version) |
version |
6-digit version string (YYYYMM) |
chunk_type |
"time" or "geo" |
bucket_version |
Bucket suffix (e.g., "045", "042"). Varies by product. |
array |
Optional array/variable name |
use_s3 |
Use |
GDAL DSN string
Removes datasets that don't have Zarr URLs (static/native-only datasets).
cmems_arco_only(catalog)
catalog |
A catalog tibble from |
Filtered tibble with only ARCO datasets
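As a rough illustration of this kind of filter, the base R sketch below drops rows that have no Zarr URL. It is not the package's implementation: the toy data frame, the example.org URLs, and the choice to keep a row when either the timeChunked or the geoChunked URL is present are assumptions made for the example. Only the column names `timeChunked_url` and `geoChunked_url` come from this manual.

```r
# Toy catalog (hypothetical rows, not real CMEMS entries)
catalog <- data.frame(
  dataset_id = c("dataset_a", "dataset_b", "dataset_c"),
  timeChunked_url = c("https://example.org/a/timeChunked.zarr", NA,
                      "https://example.org/c/timeChunked.zarr"),
  geoChunked_url = c("https://example.org/a/geoChunked.zarr", NA, NA),
  stringsAsFactors = FALSE
)

# Assumption: a dataset counts as ARCO if at least one Zarr URL is present
has_zarr <- !is.na(catalog$timeChunked_url) | !is.na(catalog$geoChunked_url)
arco <- catalog[has_zarr, , drop = FALSE]

print(arco$dataset_id)
```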
Build a Zarr URL from known identifiers without querying STAC. Requires knowing the bucket version suffix which varies by product family.
cmems_arco_url(product_id, dataset_id, version, chunk_type = c("time", "geo"), bucket_version = "045")
product_id |
Product identifier |
dataset_id |
Dataset identifier (without version) |
version |
6-digit version string (YYYYMM) |
chunk_type |
"time" or "geo" |
bucket_version |
Bucket suffix (e.g., "045", "042"). Varies by product. |
HTTPS URL to the Zarr store
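To make the URL layout concrete, here is a minimal base R sketch of how such a URL could be assembled. It is inferred from the example URL shown in this manual rather than taken from the package source: the helper name `sketch_arco_url()` is hypothetical, and the assumption that the dataset folder is `dataset_id` and `version` joined by an underscore may not hold for every product.

```r
# Hypothetical helper: assemble a Zarr store URL from its parts.
# Assumed layout:
#   https://s3.waw3-1.cloudferro.com/mdl-arco-<time|geo>-<bucket_version>/
#     arco/<product_id>/<dataset_id>_<version>/<time|geo>Chunked.zarr
sketch_arco_url <- function(product_id, dataset_id, version,
                            chunk_type = c("time", "geo"),
                            bucket_version = "045") {
  chunk_type <- match.arg(chunk_type)
  # The version is documented as a 6-digit YYYYMM string
  stopifnot(grepl("^[0-9]{6}$", version))
  sprintf(
    "https://s3.waw3-1.cloudferro.com/mdl-arco-%s-%s/arco/%s/%s_%s/%sChunked.zarr",
    chunk_type, bucket_version, product_id, dataset_id, version, chunk_type
  )
}

url <- sketch_arco_url("PRODUCT", "dataset", "202311", chunk_type = "geo")
print(url)
```

Because the bucket suffix varies by product, a URL built this way should be checked against the catalog before it is relied on.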
Walks the STAC catalog to retrieve all products, datasets, and Zarr URLs.
cmems_catalog(product_ids = NULL, progress = TRUE)
product_ids |
Character vector of product IDs, or NULL for all products |
progress |
Show progress messages |
A tibble with columns: product_id, dataset_version_id, dataset_id, version, timeChunked_url, geoChunked_url, native_url, and GDAL/S3 variants
## Not run:
# Single product
cat <- cmems_catalog("SEALEVEL_GLO_PHY_L4_NRT_008_046")
# All products (takes a few minutes)
full <- cmems_catalog()
## End(Not run)
Cached catalog of Copernicus Marine ARCO Zarr datasets with URLs for
timeChunked and geoChunked stores. Updated periodically via
data-raw/update_catalog.R.
cmems_catalog_data
A tibble with columns:
product_id |
CMEMS product identifier |
dataset_version_id |
Full dataset identifier with version suffix |
dataset_id |
Dataset identifier without version |
version |
6-digit version string (YYYYMM), NA for static datasets |
timeChunked_url |
HTTPS URL to timeChunked.zarr (spatial slice access) |
geoChunked_url |
HTTPS URL to geoChunked.zarr (time series access) |
native_url |
URL to native files (if available) |
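GDAL DSN for the timeChunked store using /vsicurl/ (no setup needed)
GDAL DSN for the geoChunked store using /vsicurl/ (no setup needed)
GDAL DSN for the timeChunked store using /vsis3/ (needs cmems_setup())
GDAL DSN for the geoChunked store using /vsis3/ (needs cmems_setup())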
S3 URI for timeChunked store
S3 URI for geoChunked store
STAC catalog at https://stac.marine.copernicus.eu/metadata/catalog.stac.json
cmems_catalog() to refresh, cmems_latest() to filter to latest versions,
cmems_arco_only() to remove non-ARCO datasets
Creates a GDAL DSN using /vsicurl/ which works without environment
variable configuration.
cmems_gdal_dsn(url, array = NULL)
url |
HTTPS URL to a Zarr store |
array |
Optional array/variable name to access directly |
GDAL DSN string
url <- "https://s3.waw3-1.cloudferro.com/mdl-arco-time-045/arco/PRODUCT/dataset/timeChunked.zarr"
cmems_gdal_dsn(url)
cmems_gdal_dsn(url, array = "sla")
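For readers unfamiliar with GDAL connection strings, the sketch below shows one way a /vsicurl/ DSN for a Zarr store can be formed in base R. It uses the GDAL Zarr driver's `ZARR:"<path>":/<array>` syntax; whether the package returns exactly this string is an assumption, and `sketch_gdal_dsn()` is a hypothetical name used only here.

```r
# Hypothetical helper: wrap an HTTPS Zarr URL in a GDAL Zarr connection string
sketch_gdal_dsn <- function(url, array = NULL) {
  # /vsicurl/ lets GDAL read over plain HTTPS, so no credentials are needed
  dsn <- sprintf('ZARR:"/vsicurl/%s"', url)
  if (!is.null(array)) {
    # A specific array is selected with a ":/<array>" suffix
    dsn <- paste0(dsn, ":/", array)
  }
  dsn
}

url <- "https://s3.waw3-1.cloudferro.com/mdl-arco-time-045/arco/PRODUCT/dataset/timeChunked.zarr"
whole_store <- sketch_gdal_dsn(url)
one_array <- sketch_gdal_dsn(url, array = "sla")
print(whole_store)
print(one_array)
```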
Creates a GDAL DSN using /vsis3/ which requires cmems_setup() to
configure AWS environment variables first.
cmems_gdal_dsn_s3(url, array = NULL)
url |
HTTPS URL to a Zarr store |
array |
Optional array/variable name to access directly |
GDAL DSN string
url <- "https://s3.waw3-1.cloudferro.com/mdl-arco-time-045/arco/PRODUCT/dataset/timeChunked.zarr"
cmems_setup()
cmems_gdal_dsn_s3(url)
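The /vsis3/ form addresses the store by bucket and key rather than by full URL. A minimal base R sketch of that rewrite follows, assuming a path-style URL in which the bucket is the first path segment after the host. The helper name `sketch_gdal_dsn_s3()` is hypothetical and the exact string the package emits may differ.

```r
# Hypothetical helper: build a /vsis3/ DSN from a path-style HTTPS URL
sketch_gdal_dsn_s3 <- function(url, array = NULL) {
  # Strip scheme and host, leaving "<bucket>/<key>"
  bucket_key <- sub("^https?://[^/]+/", "", url)
  dsn <- sprintf('ZARR:"/vsis3/%s"', bucket_key)
  if (!is.null(array)) {
    dsn <- paste0(dsn, ":/", array)
  }
  dsn
}

url <- "https://s3.waw3-1.cloudferro.com/mdl-arco-time-045/arco/PRODUCT/dataset/timeChunked.zarr"
dsn_s3 <- sketch_gdal_dsn_s3(url)
print(dsn_s3)
```

Since the host is dropped from the DSN, GDAL has to learn the endpoint from configuration, which is why this form needs setup before use.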
Generate gdalinfo command
cmems_gdalinfo_cmd(dsn)
dsn |
GDAL DSN string |
Shell command string
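Because a Zarr DSN contains double quotes, it has to be quoted again before it is handed to a shell. The sketch below shows one way to do that with base R's `shQuote()`. It is an illustration only; the exact command string the package returns is not documented here, and the DSN shown assumes the GDAL Zarr connection-string syntax.

```r
# An example DSN (assumed form) that contains embedded double quotes
dsn <- 'ZARR:"/vsicurl/https://s3.waw3-1.cloudferro.com/mdl-arco-time-045/arco/PRODUCT/dataset/timeChunked.zarr"'

# Single-quote the DSN for a POSIX shell so the inner double quotes survive
cmd <- paste("gdalinfo", shQuote(dsn, type = "sh"))
cat(cmd, "\n")
```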
Filter catalog to latest version per dataset
cmems_latest(catalog)
catalog |
A catalog tibble from |
Filtered tibble with only the latest version of each dataset
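The idea of keeping the latest version per dataset can be sketched in base R as below. The toy rows are hypothetical and the grouping by `dataset_id` alone is an assumption. The sketch relies on YYYYMM strings sorting chronologically as text and does not handle the NA versions that static datasets carry.

```r
# Toy catalog (hypothetical rows) with two versions of dataset_a
catalog <- data.frame(
  dataset_id = c("dataset_a", "dataset_a", "dataset_b"),
  version = c("202112", "202311", "202211"),
  stringsAsFactors = FALSE
)

# Latest YYYYMM string per dataset (text order equals date order here)
latest_by_dataset <- tapply(catalog$version, catalog$dataset_id, max)

# Keep rows whose version equals the latest one for their dataset
keep <- catalog$version == as.vector(latest_by_dataset[catalog$dataset_id])
latest <- catalog[keep, , drop = FALSE]

print(latest)
```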
Convert S3 HTTPS URL to s3:// URI
cmems_s3_uri(url)
url |
HTTPS URL to a Zarr store |
S3 URI string
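For a path-style URL, the conversion amounts to replacing the scheme and host with `s3://`. A minimal base R sketch, with the hypothetical name `sketch_s3_uri()` and the assumption that the bucket is the first path segment:

```r
# Hypothetical helper: turn a path-style HTTPS URL into an s3:// URI
sketch_s3_uri <- function(url) {
  sub("^https?://[^/]+/", "s3://", url)
}

url <- "https://s3.waw3-1.cloudferro.com/mdl-arco-time-045/arco/PRODUCT/dataset/timeChunked.zarr"
uri <- sketch_s3_uri(url)
print(uri)
```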
Sets AWS environment variables required for /vsis3/ access to CMEMS.
Not needed if using the default /vsicurl/ DSNs.
cmems_setup()
Invisible TRUE
cmems_setup()
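For orientation, the sketch below sets the GDAL configuration options that are typically used for anonymous, path-style access to a non-AWS S3 endpoint. Which variables `cmems_setup()` actually sets is not stated in this manual, so the three names and values here are assumptions. The endpoint is taken from the example URLs above.

```r
# Assumed configuration for unsigned, path-style access via /vsis3/
Sys.setenv(
  AWS_NO_SIGN_REQUEST = "YES",                  # public bucket, no credentials
  AWS_S3_ENDPOINT = "s3.waw3-1.cloudferro.com", # non-AWS endpoint host
  AWS_VIRTUAL_HOSTING = "FALSE"                 # bucket in the path, not the host
)

# Confirm what the current R session will pass on to GDAL
env <- Sys.getenv(c("AWS_NO_SIGN_REQUEST", "AWS_S3_ENDPOINT", "AWS_VIRTUAL_HOSTING"))
print(env)
```

Environment variables set this way last only for the current R session.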