The Grid 2025 Challenge

A dataset representing an entire year of high resolution electricity demand and renewable energy production for Great Britain in 2025.

The Synthetic Dataset

This page describes what the dataset represents, and how it was produced, including the assumptions made and the sources of underlying data.

Download the dataset ->

Scope

Map of Great Britain highlighted within Europe

Great Britain highlighted on a map of Europe. The three countries within Great Britain - England, Scotland and Wales - are provided with electricity by the National Grid. This includes the isles of Orkney and Shetland, but not the Isle of Man and other crown dependencies.

Licence: Public Domain, via Wikimedia Commons

The dataset is intended to represent a potential energy production scenario combined with current demand information, for a whole year, in Great Britain (namely England, Scotland and Wales) in the distant future (for instance the year 2025).

The dataset is a collection of demand data along with two renewable energy technologies which can potentially play a key role in the future energy scenario.

  • Electricity demand, the total power required at a given moment to operate all devices using electricity that are 'plugged in' to the National Grid, either in consumers' homes, in factories or in street lighting and other applications
  • Wind power production, the total power produced at a given moment by wind turbines distributed across the country
  • Solar power production, the total power produced at a given moment by five independent photovoltaic power production facilities distributed across the country
The electricity productions from wind and solar sources are assumed to be identical when averaged over a full year. Also the total electricity production is scaled to equal the total energy demand across the year.

Resolution

The resolution of the data provided is 15 minutes. There are 35,040 values (each representing a 15 minute period in 2025) across 4 columns, forming a 140,160 element matrix. The values are in order, forming a time series from 00:00 on January 1st 2025 to 23:45 on December 31st 2025.

Demand

National Grid demand in megawatts on 23rd October 2011

National Grid demand on 23/10/2011. Demand increases in the morning, before dipping slightly in the afternoon and then increasing to its maximum at around 18:00. After this point, demand drops gradually until around midnight. A slight spike occurs shortly after midnight, as low-tier Economy 7 tariffs take effect.

Data source: National Grid

It is assumed that demand will, in the distant future, follow similar patterns to those of today. The dataset uses real demand data for 2011 with 30 minute resolution, interpolated with a linear algorithm to represent 15 minute time steps.

The underlying data is provided by the National Grid on their website, in megawatts (MW).

Demand in Great Britain follows a reasonably predictable shape throughout each day, with a morning spike, an afternoon saddle and an evening spike around 18:00. Demand gradually decreases after this point until around midnight where it remains low until the next morning. At the weekend, demand is lower but follows a similar pattern.

Wind

As a conservative estimate, we have used Irish wind farm production data as a basis for the wind power information, as the fluctuations in wind speeds are at least the same if not greater in magnitude than those found in Great Britain. The Irish grid operators, EirGrid, make available a facility to obtain Irish wind power production information, with 15 minute resolution, via their website.

In Ireland in 2011, the installed capacity of wind turbines was approximately 1.5GW. The average wind power produced throughout the year in Ireland was 485MW, representing a capacity factor of just over 30%. This is similar to the estimate made by David MacKay for the United Kingdom in his book, Without the Hot Air.[1] Seeing as Ireland shares much of the same weather patterns as Great Britain, we have used the Irish wind power production data for 2011 as a basis for our own dataset. The rate of change between adjacent values in the Irish data has been left unmodified, but the power outputs have been scaled so as to, on average, represent 50% of the total British power requirement for 2025. This requires that the individual Irish power output values across the full year are each multiplied by a factor of approximately 36. This results in an average output of about 17GW, which is roughly half of average British power demand, as intended.

Solar

The solar power data was generated using real solar data provided by the Met Office. To estimate the effect and frequency of effects that clouds have on sunlight reaching the ground, Met Office data for Glasgow was compared against a model of the 'clear sky' irradiance -- that is, to say, the irradiance at ground level from the Sun without the effect of clouds and other atmospheric effects -- incident upon Glasgow.

Clear Sky Irradiance Model

Theoretical irradiance in Glasgow across 2011 according to Loughborough model

Theoretical Sun irradiance incidence on Glasgow (coordinates: 51.485018N, -3.179569E) across 2011, according to the Loughborough model. Each data point represents 2.4 hours. The irradiance is highest in summer, when the Northern Hemisphere is tilted towards the Sun, and it is lowest in winter when the Northern Hemisphere is tilted away from the Sun.

The model used to calculate the irradiance of the Sun in Glasgow was developed by Ian Richardson and Murray Thomson of Loughborough University's Centre for Renewable Energy Systems Technology in their paper "Domestic electricity use: a high-resolution energy demand model".[2] This paper involved the development and use of a solar model which uses the extraterrestrial solar constant, along with factors to compensate for day of year, atmospheric refraction and Sun position, to calculate the total power incident on a piece of ground at a given set of coordinates. An macro-enabled Excel spreadsheet accompanying the article is available here.

Met Office Irradiance Data

The Met Office provided hourly Glasgow sunlight irradiance measurements for 2011. As this data was measured from a sensor at ground level, it already included the full effects of clouds, atmospheric refraction and other factors preventing the full irradiance from the Sun reaching the ground.

Atmospheric Attenuation

Using the coordinates for Glasgow (55.871811N, -4.291798E), the 'clear sky' irradiance was calculated using the Loughborough model. The Met Office's sensor data was then divided by the 'clear sky' model's prediction for the irradiance at that time, to provide an attenuation factor between 0 and 1 of how much of an influence the atmosphere had on the sunlight reaching the ground.

In addition, the rate of change of these attenuation factors was also calculated. This is useful for showing how quickly atmospheric attenuation factors, such as clouds, come and go. This is important to know, because without taking account of such information it could be entirely possible for the model to predict frequent periods of completely overcast skies followed by complete clear skies. A more realistic model would clearly involve a slower transition between the two extremes.

  • Atmospheric attenuation histogram of Met Office sensor data against theoretical clear sky model

    Histogram of atmospheric attenuation between 'clear sky' model and real Met Office Glasgow irradiance data. The 'cloudiness' -- the fraction of the extraterrestrial Sun irradiance absorbed by the clouds -- is a gradual slope towards complete absorption, showing that higher attenuations are favoured over lower ones, except for factors above around 0.9 where far less measured attenuations were present.

  • Atmospheric attenuation rate of change histogram of Met Office sensor data against theoretical clear sky model

    Histogram of the rate of change of the atmospheric attenuation shown in a histogram to the left. This shows that the rate at which the sky becomes more cloudy is very similar to the rate at which the sky becomes less cloudy.

Creation of Synthetic Data

The calculated attenuation and rate of change of attenuation factors across the year were then used to produce a new synthetic solar irradiance data set with the same characteristics as the measured solar irradiance data.

Histograms of the atmospheric attenuation factors, and the rate of change, were produced (see above) so as to show the characteristics of the clouds for the whole year.

Using an initial set of random numbers, a random walk algorithm using a statistical distribution resembling the real attenuation factors and their rates of change allowed for the creation of any arbitrary set of Sun irradiance values. Such values, on average, showed similar characteristics to the real attenuation factors, as well as their rate of change. The synthetic model was able to accept different random number generator seeds as input, to allow for the same randomly generated data to be reproduced as required.

  • Atmospheric attenuation histogram of generated synthetic data

    Histogram of the generated synthetic solar atmospheric attenuation.

  • Atmospheric attenuation rate of change histogram of generated synthetic data

    Histogram of the generated synthetic solar atmospheric attenuation rate of change.

Final Solar Model

The synthetic solar irradiance model was applied to five regions of Great Britain, using their respective coordinates as inputs to the 'clear sky' model and using a different random number generator seed for each:

  1. Cardiff (51.485018N, -3.179569E)
  2. Glasgow (55.871811N, -4.291798E)
  3. Southampton (50.914939N, -1.402187E)
  4. Sheffield (53.384762N, -1.471195E)
  5. London (51.516434N, -0.126457E)
The individual regions were multiplied by their place on this list, so for example Glasgow's solar power production was multiplied by 2 and London's by 5. This was an attempt to account for regional population variations, which would be expected in a real solar panel production scenario in the future.

Finally, the generated irradiance values for each time step were multiplied by a scaling factor (similar to the way it was applied to the wind data) to make it represent 50% of the total year's demand.

Combined 100% Renewable Grid Model

The time series for each of the demand, wind and solar models were combined within one matrix, with one column designated as the time vector. Therefore, for each 15 minute interval across the year, there exists estimated values for the electricity demand, wind production and solar production for that period.

Results

A 3-day average across the year of the three parameters: demand, wind and solar.

Plot of demand, wind and solar power across the full dataset, with each point representing a 3 day average. Solar production peaks in the summer, while demand peaks in the winter. Wind varies substantially across the whole year.

Across the whole year, there are large periods of electricity deficit during winter and large periods of electricity surplus during summer. On individual days, fluctuations can be induced by factors such as whether the Sun is above the horizon and whether or not it is windy.

The scaling factors used to make the wind and solar production large enough to cover demand scale well enough so as to only leave 2.3kJ of demand 'unsupplied'. This is perhaps due to the precision of the variables used during the scaling mathematical operation. Contrast this to the total energy demand for the year of 1.1×1018J.

The greatest supply deficit in the dataset, almost equivalent to total demand requirement

Plot of the day in which the greatest supply deficit occurs. The supply deficit (53.8GW) is almost as large as the demand at that point in time (54.5GW). This is because the Sun has gone down, resulting in a lack of solar production, and the wind production being very low due to a large scale weather system.

The maximum supply deficit (electricity demand minus electricity production) present across the year in the model is 53.8GW, which is almost equal to the demand at that time (54.5GW). This occurs on January 21st at 17:00, which is also the time of maximum demand that day. By 17:00 on this day, the Sun had gone below the horizon and so solar production was nil. Additionally, wind production also happened to be very low at this time. This is an example of maximum demand coinciding with minimum production, which is one of the major problems that will challenge future grid designs.

The minimum supply deficit occurs on 17th July at 12:00. At this point in time, high wind and midday sunshine provides a power deficit of -150.5GW, i.e. a surplus. The midsummer day coincides with clear skies and high winds, resulting in high power generation and low power consumption.

The challenge for a future grid is to somehow find a way of using the surplus to cover the deficits...

Potential Improvements

The data set is by no means fully representative of reality. Here are some potential improvements that could be made to the data in the future:

  • Currently, the model does not account for ambient light from light sources at night such as moonlight. Solar production after the Sun has dropped below the horizon is zero by design. A small contribution could be made by the moon, the effect of which is currently unaccounted for.
  • The dataset uses a resolution of 15 minutes. This is high enough to witness large scale fluctuations from weather events, night and day, morning and evening and so on, but it is too large to accurately resolve quick fluctuations such as those that exist with TV pickups. An increase of the resolution of the data set would involve the acquisition of higher resolution data from the Met Office, the National Grid and EirGrid.

Footnotes

  1. D.J.C. MacKay. Sustainable Energy - Without the Hot Air. UIT, 2009. ISBN: 9780954452933. Also available for free online.
  2. Ian Richardson, Murray Thomson, David Infield, Conor Clifford. Domestic electricity use: A high-resolution energy demand model, Energy and Buildings, Volume 42, Issue 10, October 2010, Pages 1878-1887, ISSN 0378-7788, DOI: 10.1016/j.enbuild.2010.05.023.