Data management at Getech is an integral part of our working processes: from ensuring accurate and efficient data compilation, through to effective and well-presented data delivery.
The data we manage on a daily basis is hugely varied in purpose and format. This requires a range of expertise and an understanding of data workflows, storage and presentation.
Getech’s databases comprise the following:
- A central ESRI ArcSDE™ spatial database containing all of our spatial (GIS) data
- Image libraries
- A publication reference database
- Developmental databases
Each one requires a slightly different workflow.
One of Getech’s main areas of expertise is the modelling and mapping of palaeo-reconstructed data. This includes reconstructed data for each one of the 60 geological Stages from Present Day through to the start of the Permian. This is set to increase as we continue to map further back in time.
Because our data has the added dimension of time, managing it is a complex task. Any update to a Present Day dataset or to our plate model, however minor, means that the palaeo-reconstructed versions must also be updated. We manage this through a versioning system, which ensures that changes are cascaded down through our palaeodatasets and keeps them consistent through time. The scale and time frame of these updates differ depending on the data.
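The cascade described above can be illustrated with a minimal sketch. It is not Getech's versioning system: the class names, fields and functions below are hypothetical, and it simply assumes that each palaeo-reconstructed version records the Present Day dataset version and plate model version it was built from, so that out-of-date reconstructions can be detected and refreshed.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class PalaeoVersion:
    """One reconstructed copy of a dataset for a single geological Stage (hypothetical)."""
    stage: str
    source_version: int       # Present Day dataset version it was built from
    plate_model_version: int  # plate model version it was built with


@dataclass
class Dataset:
    """A Present Day dataset plus its palaeo-reconstructed versions (hypothetical)."""
    name: str
    version: int = 1
    reconstructions: List[PalaeoVersion] = field(default_factory=list)


def stale_stages(dataset: Dataset, plate_model_version: int) -> List[str]:
    """Return the Stages whose reconstruction predates either current input."""
    return [
        r.stage
        for r in dataset.reconstructions
        if r.source_version < dataset.version
        or r.plate_model_version < plate_model_version
    ]


def cascade(dataset: Dataset, plate_model_version: int) -> List[str]:
    """Bring every stale reconstruction up to date; return the Stages touched."""
    updated = stale_stages(dataset, plate_model_version)
    for r in dataset.reconstructions:
        if r.stage in updated:
            # A real workflow would re-run the reconstruction here;
            # this sketch only records the new input versions.
            r.source_version = dataset.version
            r.plate_model_version = plate_model_version
    return updated


# Usage: a minor edit to the Present Day dataset flags every Stage for update.
coastlines = Dataset(
    "coastlines",
    reconstructions=[
        PalaeoVersion("Maastrichtian", 1, 1),
        PalaeoVersion("Asselian", 1, 1),
    ],
)
coastlines.version = 2
touched = cascade(coastlines, plate_model_version=1)
```

Recording both input versions on each reconstruction means a change to either the Present Day data or the plate model is enough to mark a Stage as stale, which matches the two update triggers named above.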
All datasets have associated metadata (the spatial databases also include comprehensive attribute tables), which gives users an audit trail that both they and Getech can use to assess data confidence. This is a huge task in itself, so a large part of our data management involves ensuring clear and accurate attribution by developing data models and scripts for each of our datasets.
The middle pane shows current data organisation and naming convention; the right pane shows enterprise search results based on the metadata.