cooper Conversion to Zarr

Checklist for the dataset conversion workflow:

Dataset Name: SnowModel Output, McKenzie and Upper Deschutes River Basins, Cascades, UTM Zone 10N NAD83 (2014)

https://cida.usgs.gov/thredds/catalog.html?dataset=cida.usgs.gov/cooper/UpperDeschutes
https://cida.usgs.gov/thredds/catalog.html?dataset=cida.usgs.gov/cooper/McKenzie

https://cida.usgs.gov/thredds/catalog/demo/thredds/cooper/catalog.html

Direct OPeNDAP conversion should be possible.

Identify the source data location and access method (check the dataset spreadsheet)
Collect ownership information (whom do we ask if we run into problems?)
Create new workflow notebook from template; stash in the ./workflows folder tree in an appropriate spot.
- Identify landing spot on S3 (currently somewhere in: https://s3.console.aws.amazon.com/s3/buckets/nhgf-development?prefix=workspace/&region=us-west-2)
- Calculate chunking, layout, compression, etc.
- Run notebook
- Read test (pattern to be determined by the dataset)
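
For the chunking calculation, a minimal sketch of the arithmetic might look like the following. The array shape, chunk shape, and dtype size are hypothetical placeholders, not the actual dimensions of the cooper datasets, and `chunk_plan` is an illustrative helper name rather than part of the workflow template.

```python
from math import ceil, prod


def chunk_plan(shape, chunks, itemsize):
    """Return (uncompressed bytes per chunk, number of chunks) for one variable."""
    if len(shape) != len(chunks):
        raise ValueError("shape and chunks must have the same number of dimensions")
    chunk_bytes = prod(chunks) * itemsize
    # Partial chunks at the array edges still count as whole chunks.
    n_chunks = prod(ceil(s / c) for s, c in zip(shape, chunks))
    return chunk_bytes, n_chunks


# Hypothetical float32 variable with dims (time, y, x); replace with real values.
shape = (8766, 1000, 1200)
chunks = (96, 250, 300)

chunk_bytes, n_chunks = chunk_plan(shape, chunks, itemsize=4)
print(f"{chunk_bytes / 2**20:.1f} MiB per chunk, {n_chunks} chunks")
```

Trying a few candidate chunk shapes this way, before running the notebook, shows whether the per-chunk size and chunk count are in the range the project is aiming for.
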
Create STAC catalog entry
- Verify all metadata
- Create entry
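
As a rough illustration of the STAC step, the sketch below builds a minimal STAC Collection as a plain dictionary and checks that the top-level fields required by STAC 1.0.0 are present. Every value passed in (id, bounding box, dates, license, S3 path) is a placeholder assumption and must be replaced with verified metadata; `build_collection` and `missing_fields` are hypothetical helper names, and the `application/vnd+zarr` media type is a common convention rather than a registered type.

```python
import json

# Top-level fields a STAC 1.0.0 Collection is required to carry.
REQUIRED_FIELDS = ("type", "stac_version", "id", "description", "license", "extent", "links")


def build_collection(coll_id, description, license_id, bbox, start, end, zarr_href):
    """Assemble a minimal STAC Collection dict pointing at one Zarr store."""
    return {
        "type": "Collection",
        "stac_version": "1.0.0",
        "id": coll_id,
        "description": description,
        "license": license_id,
        "extent": {
            "spatial": {"bbox": [bbox]},
            "temporal": {"interval": [[start, end]]},
        },
        "links": [],
        "assets": {
            "zarr": {"href": zarr_href, "type": "application/vnd+zarr", "roles": ["data"]},
        },
    }


def missing_fields(collection):
    """List required fields that are absent or empty strings/None."""
    return [k for k in REQUIRED_FIELDS if k not in collection or collection[k] in (None, "")]


# All values below are placeholders, not verified metadata for this dataset.
collection = build_collection(
    coll_id="cooper",
    description="SnowModel output (placeholder description)",
    license_id="proprietary",
    bbox=[-123.0, 43.0, -121.0, 45.0],
    start="2000-01-01T00:00:00Z",
    end="2001-01-01T00:00:00Z",
    zarr_href="s3://example-bucket/workspace/cooper.zarr",
)

print(missing_fields(collection))
print(json.dumps(collection, indent=2))
```

A presence check like this only catches missing fields; whether the values are correct still has to be verified against the source metadata.
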
Reportage
- Add the notebook and the Dask performance report to the repo
- Calculate summary statistics on output (compression ratio, total size)
- Save STAC JSON snippet to repo
Merge and close the issue.
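
The summary statistics in the Reportage step (total size and compression ratio) could be computed along these lines. This is a sketch that assumes the Zarr store is a directory on a local filesystem; a store on S3 would need an object listing instead of `os.walk`. The file sizes and the uncompressed byte count in the demo are made up, and `stored_bytes` and `compression_ratio` are illustrative names.

```python
import os
import tempfile


def stored_bytes(store_path):
    """Sum the size of every file under a directory-based store (data and metadata)."""
    total = 0
    for root, _dirs, files in os.walk(store_path):
        for name in files:
            total += os.path.getsize(os.path.join(root, name))
    return total


def compression_ratio(uncompressed_bytes, compressed_bytes):
    """Ratio of in-memory size to on-disk size; larger means better compression."""
    if compressed_bytes <= 0:
        raise ValueError("compressed_bytes must be positive")
    return uncompressed_bytes / compressed_bytes


# Demo on a throwaway directory standing in for a Zarr store.
with tempfile.TemporaryDirectory() as tmp:
    var_dir = os.path.join(tmp, "var")
    os.makedirs(var_dir)
    with open(os.path.join(var_dir, "0.0"), "wb") as f:
        f.write(b"\0" * 1000)
    with open(os.path.join(var_dir, "0.1"), "wb") as f:
        f.write(b"\0" * 500)
    total = stored_bytes(tmp)

# Hypothetical uncompressed size, e.g. taken from the dataset's in-memory byte count.
ratio = compression_ratio(6000, total)
print(f"total size: {total} bytes, compression ratio: {ratio:.1f}")
```

Because the walk counts metadata files as well as chunk files, the ratio is slightly conservative for stores with many small variables.
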

Edited Oct 23, 2023 by Blodgett, David L.
