Skip to content

'new_gmo' conversion workflow

Checklist for Workflow associated with dataset conversion:

Dataset Name: GMO_new


  • Identify Source Data location and access
  • Collect ownership information (Who do we ask questions of if we have problems?)
  • Create new workflow notebook from template; stash in the ./workflows folder tree in an appropriate spot.
    • Identify landing spot on S3
    • Calculate chunking, layout, compression, etc
    • Run notebook
    • Read test (pattern to be determined by the dataset)
  • Create STAC catalog entry;
    • Verify all metadata
    • Create entry
  • Reportage
    • add notebook and the dask performance report to the repo
    • Calculate summary statistics on output (compression ratio, total size)
    • Save STAC JSON snippet to repo
  • Merge and close the issue.
Edited by Gene Trantham