Want to run faster? Run more!

There are differing schools of thought when it comes to training for distance running: on one end of the spectrum, you have the low volume, high intensity advocates who believe that relatively short repetitions at race pace or faster is the path to success, and on the other end, we have those who believe that total running volume, or mileage, is the most important training factor, regardless of intensity. While the optimal training load will no doubt fall somewhere between these two extremes, I am of the view that it is better to err on the side of volume rather than intensity.

"Mileage - that's the key. To try and get as much mileage as we can." - Lewis Hamilton

Okay, so Lewis Hamilton may not even compete in the correct sport, but I'm not going to let that dissuade me.

Despite much anecdotal evidence, both for and against, there is limited research available on the effect of high volume on distance running performance. The most detailed paper I found was by Tanda (2011), where he manages to successfully predict marathon finish time as a function of training volume and average training pace. However the study was performed on a small sample of just 22 runners.

As in the case in many areas, modern times have seen a vast influx in the quantity of data available and so we should be able to use this investigate our question. Strava, "the social network for athletes" provides information about the training of a huge number of athletes.

I decided to analyse the training of athletes in the build up to the 2015 Leeds Abbey Dash, a 10 kilometer road race. Not only was this one of the largest 10k races in the country, but it doubled as the national championships, ensuring data on all standards of athletes.

Continue reading "Want to run faster? Run more!"

'Twas the night before BUCS: An algorithmic approach for predicting cross country performances.

'Twas the night before BUCS, when all through Gloucester
Not an athlete was drinking, not even one beer.
The spikes were stood by the door with care,
In hopes that some medals soon would be theirs.

The greatest day of the year is almost upon us: tomorrow is the British Universities (BUCS) Cross Country championships. Obviously being too excited to do any work I decided to see if it were possible to predict the results based on previous performances.

PowerOf10 provides a fantastic source of athletics data and a few Scrapy spiders later I had a large dataset to play with.

The race entry lists provided a list of names and cross referencing with PowerOf10 allowed me to obtain a complete set of historical race results. Unfortunately several names were insufficiently unique or misspelt (some universities were more prone to this mistake than others... no comment) which meant obtaining performances was impossible.

Based on analysis of every cross country race from 1st of January 2015, my algorithm predicted the following top 20 mens' team results based on a 6 to run, 4 to score system:
Continue reading "'Twas the night before BUCS: An algorithmic approach for predicting cross country performances."