Scope available biological datasets, select suitable ones, acquire metadata, organise data in relational database, extract data in format suitable for surrogates and prediction analyses (0.01° grid).