Hypothesis
Here, the objective is to compare the speed and ease of use with either R or python to selct, tidy, and analyze medium sized datasets. The dataset that I will be using is the ‘Earth Surface Temperature Data’ available from Kaggle, which combines 1.6 billion temperature reports from various datasets such as NOAA’s MLOST, NASA’s GISTEMP and the UK’s HadCrut. All temperatures are in units of \(^oC\) and the dataset is roughly 533Mb.