mows conversion to Zarr

Checklist for Workflow associated with dataset conversion:

Dataset Name: Mean Monthly Evaporation Atlas for the Contiguous 48 United States (1956-1970) and Normal Incident Solar Radiation Atlas

https://cida.usgs.gov/thredds/catalog.html?dataset=cida.usgs.gov/mows/pe
https://cida.usgs.gov/thredds/catalog.html?dataset=cida.usgs.gov/mows/sr

Both are small, simple conversions; reading straight from OPeNDAP should be fine.
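
A minimal sketch of such a direct conversion, assuming `xarray` with Dask and Zarr support is installed. The function name `convert_opendap_to_zarr` and its parameters are hypothetical, and the OPeNDAP endpoint URL is not stated in this issue, so it has to be taken from the THREDDS catalog pages above.

```python
def convert_opendap_to_zarr(opendap_url, zarr_target, chunks=None):
    """Read a dataset over OPeNDAP and write it out as a Zarr store.

    opendap_url: OPeNDAP endpoint (assumption: looked up in the THREDDS catalog).
    zarr_target: local path or s3:// URL of the output store.
    chunks:      optional dict of dimension name -> chunk length.
    """
    # Imported lazily so this module still loads where xarray is not installed.
    import xarray as xr

    # chunks={} requests lazy Dask arrays with the backend's preferred chunking.
    ds = xr.open_dataset(opendap_url, chunks=chunks or {})

    # mode="w" overwrites an existing store; consolidated metadata speeds up
    # later opens, especially on object storage.
    ds.to_zarr(zarr_target, mode="w", consolidated=True)
    return ds
```

Writing to a local path first is an easy way to try this before pointing `zarr_target` at S3, which additionally needs `s3fs` and valid credentials.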

Identify Source Data location and access (check the dataset spreadsheet)
- Farnsworth evaporation
- NREL solar radiation
Collect ownership information (whom do we ask if we have problems?)
Create new workflow notebook from template; stash in the ./workflows folder tree in an appropriate spot.
- Identify landing spot on S3 (currently somewhere in: https://s3.console.aws.amazon.com/s3/buckets/nhgf-development?prefix=workspace/&region=us-west-2)
  - s3://nhgf-development/workspace/DataConversion/farnsworth_evaporation_atlas.zarr
  - s3://nhgf-development/workspace/DataConversion/nrel_solar_radiation.zarr
- Calculate chunking, layout, compression, etc.
- Run notebook
- Read test (pattern to be determined by the dataset)
Create STAC catalog entry
- Verify all metadata
- Create entry
Reportage
- Add the notebook and the Dask performance report to the repo
- Calculate summary statistics on output (compression ratio, total size)
- Save STAC JSON snippet to repo
Merge and close the issue.
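
The "Calculate chunking" and "Calculate summary statistics" steps can be sketched in plain Python. This is only an illustration: the helper names, the halving heuristic, and the 32 MiB default target are assumptions, not values prescribed by this checklist.

```python
import math


def chunk_shape(shape, itemsize, target_bytes=32 * 1024**2):
    """Halve the largest dimension until one chunk fits in target_bytes.

    shape:    full array shape, e.g. (time, y, x)
    itemsize: bytes per element (4 for float32)
    """
    chunks = list(shape)
    while math.prod(chunks) * itemsize > target_bytes and max(chunks) > 1:
        i = chunks.index(max(chunks))          # first occurrence of the largest dim
        chunks[i] = math.ceil(chunks[i] / 2)   # halve it, rounding up
    return tuple(chunks)


def compression_ratio(uncompressed_bytes, stored_bytes):
    """Ratio of in-memory size to on-disk size (higher means better compression)."""
    return uncompressed_bytes / stored_bytes


# Example: 12 monthly float32 grids of 1000 x 2000 cells, 8 MiB chunk target.
example_chunks = chunk_shape((12, 1000, 2000), itemsize=4, target_bytes=8 * 1024**2)
print(example_chunks)
```

For a dataset opened with xarray, the uncompressed size is available as `ds.nbytes`; the stored size has to be summed from the objects under the Zarr prefix.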

Edited Nov 01, 2023 by Parker A Norton
