Wild Data

Finding and sourcing data ‘in the wild’ can feel overwhelming at first. With a few considerations, however, we can drastically cut down on the uncertainty that surrounds this process. For starters, the vast majority of the environmental and biological data that we may want can be found in just a handful of places (at least at a global scale; local-scale data can require more specific sources). The methods for accessing these data are also not as varied as they may first appear. The two presentations and exercises in this unit cover 1) where we may find the majority of the wild data we may need, and 2) the most common methods of loading those data into R in a tidy way.
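To make "loading data in a tidy way" concrete, here is a minimal sketch in base R. The table, its column names, and its values are invented for illustration and simply stand in for a downloaded file. In practice we would more likely reach for `readr::read_csv()` and `tidyr::pivot_longer()`, but the idea is the same: one row per observation, one column per variable.

```r
# Hypothetical example: a small wide-format table standing in for a downloaded file.
raw_csv <- "site,lon,lat,temp_2020,temp_2021
A,18.4,-34.1,16.2,16.8
B,25.6,-33.9,18.1,18.4"

# Read the text as if it were a CSV file on disk
wide <- read.csv(text = raw_csv, stringsAsFactors = FALSE)

# Find the columns that hold one year of temperature each
temp_cols <- grep("^temp_", names(wide), value = TRUE)

# Reshape to long (tidy) format: one row per site and year
long <- do.call(rbind, lapply(temp_cols, function(col) {
  data.frame(
    site = wide$site,
    lon  = wide$lon,
    lat  = wide$lat,
    year = as.integer(sub("^temp_", "", col)),  # year taken from the column name
    temp = wide[[col]]
  )
}))
```

The resulting `long` data frame has one temperature value per row, which is the shape that most summarising and plotting functions expect.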

Slides and application exercises

Wild 1: Where data roam free

Slides

Source

Home on the range

Source

Wild 2: The local data shop

Slides

Source

Orders up!

Source

DIY wild data

By the end of this unit we should be equipped with the tools we need to source, download, tidy, analyse, and visualise data from a number of free data sources out in the wild. For this DIY session let’s select a wild dataset that has particular importance to the work that we do. But rather than downloading it via a web interface, let’s write a script containing the full pipeline that can, with the push of one button, download, tidy, analyse, and visualise the data, and then save a faceted/combined figure to our local computer, preferably with a map if that makes sense for the dataset in question.
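One possible skeleton for such a one-button script is sketched below in base R. Everything specific in it is an assumption made for illustration: the URL is a placeholder, the function names are hypothetical, and the data are assumed to arrive as a CSV with columns `site`, `lon`, `lat`, `date`, and `value`. A real dataset will need its own tidying and analysis steps, and the "map" here is only a crude longitude/latitude scatterplot.

```r
# All names, paths, and the URL below are hypothetical placeholders.
data_url <- "https://example.org/wild_data.csv"  # replace with a real source
raw_file <- file.path(tempdir(), "wild_data.csv")
fig_file <- file.path(tempdir(), "wild_data.png")

# Download the file only if we do not already have a local copy
download_data <- function(url, dest) {
  if (!file.exists(dest)) download.file(url, dest, mode = "wb")
  dest
}

# Read the CSV, drop incomplete rows, and give the date column a proper type
tidy_data <- function(path) {
  df <- read.csv(path, stringsAsFactors = FALSE)
  df <- df[complete.cases(df), ]
  df$date <- as.Date(df$date)
  df
}

# A deliberately simple analysis: mean value per site
analyse_data <- function(df) {
  aggregate(value ~ site, data = df, FUN = mean)
}

# Save a combined two-panel figure: a crude map and a per-site summary
visualise_data <- function(df, summary_df, out_file) {
  png(out_file, width = 1200, height = 600)
  on.exit(dev.off())
  par(mfrow = c(1, 2))
  plot(df$lon, df$lat, pch = 19, asp = 1,
       xlab = "Longitude", ylab = "Latitude", main = "Sampling sites")
  barplot(summary_df$value, names.arg = summary_df$site,
          ylab = "Mean value", main = "Mean value per site")
  invisible(out_file)
}

# The "one button": run every step in order
run_pipeline <- function(url, raw, fig) {
  df <- tidy_data(download_data(url, raw))
  visualise_data(df, analyse_data(df), fig)
}

# run_pipeline(data_url, raw_file, fig_file)  # uncomment once data_url is real
```

Wrapping each step in its own function keeps the script readable and lets us rerun or swap out a single stage. With the tidyverse loaded, the plotting step could instead use `ggplot2` with `facet_wrap()` and `ggsave()`.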