STACD: STAC Extension with DAGs for Geospatial Data and Algorithm Management
This program is tentative and subject to change.
Geospatial datasets have complex lineages that are crucial for reproducibility and understanding data provenance, yet current metadata standards like STAC (SpatioTemporal Asset Catalog) provide limited support for capturing complete processing workflows. We propose STACD (STAC extension with DAGs), an extension to STAC specifications that incorporates Directed Acyclic Graph (DAG) representations along with defining algorithms and version changes in the workflows. We also provide a reference implementation on Apache Airflow to demonstrate STACD capabilities such as selective recomputation when some datasets or algorithms in a DAG are updated, complete lineage construction for a dataset, and opportunities for improved collaboration and distributed processing that arise with this standard.
Note: this paper will be presented in a single talk alongside the “GPU accelerated hydrology” paper at the conference.
This program is tentative and subject to change.
Mon 13 OctDisplayed time zone: Perth change
16:00 - 17:40 | |||
16:00 15mTalk | Challenges in Practice: Building a Usable Library for Planetary-Scale Embeddings PROPL Sadiq Jaffer University of Cambridge, Frank Feng University of Cambridge, Robin Young University of Cambridge, Srinivasan Keshav University of Cambridge, Anil Madhavapeddy University of Cambridge, UK, Robin Young University of Cambridge | ||
16:15 15mPaper | STACD: STAC Extension with DAGs for Geospatial Data and Algorithm Management PROPL Saharsh Laud Indian Institute Of Technology Delhi, Saurabh Joshi Indian Institute Of Technology Delhi, Tarun Mangla Indian Institute Of Technology Delhi, Abhilash Jindal IIT Delhi, India, Aaditeshwar Seth Indian Institute Of Technology Delhi | ||
16:30 15mTalk | Spatial Programming for Environmental Monitoring PROPL Josh Millar Imperial College London, Ryan Gibb University of Cambridge, Roy Ang University of Cambridge, Hamed Haddadi Imperial College London, Anil Madhavapeddy University of Cambridge, UK | ||
16:45 15mPaper | Yirgacheffe: a declarative approach to geospatial data PROPL Michael Dales University of Cambridge, UK, Alison Eyres University of Cambridge, Patrick Ferris University of Cambridge, UK, Anil Madhavapeddy University of Cambridge, UK, Francesca A. Ridley Newcastle University, Simon Tarr IUCN | ||
17:00 15mTalk | Large Language Models for computational climate analysis PROPL Jay Torry University of Cambridge | ||
17:15 15mTalk | Scaling the Urban Forest: An Integrated Framework for Managing Cities by Fusing Raster and Vector Data PROPL Andrés C. Zúñiga-González University of Cambridge, Anil Madhavapeddy University of Cambridge, UK, Ronita Bardhan University of Cambridge | ||
17:30 10mDay closing | Closing thoughts from the chairs PROPL Anil Madhavapeddy University of Cambridge, UK, KC Sivaramakrishnan IIT Madras and Tarides, Dominic Orchard University of Cambridge; University of Kent |