Jack's blog

Not currently working on the energy disaggregation competition

Towards the end of last year, I was lucky enough to have a short postdoc paid for by EDF Energy. The main focus of the postdoc was on looking at ways to design a competition to compare the performance of different disaggregation algorithms. This postdoc finished in January 2017 so I am not currently working on the disaggregation competition (although I strongly believe that finding a good way to compare NILM algorithms is one of the most important unsolved problems in NILM).

Very briefly: the main challenge in designing a NILM competition is getting enough clean, private testing data. It turns out that the performance of NILM algorithms can be quite inconsistent across houses: an algorithm might work well on some houses; but on other houses that same algorithm might work badly. Also, one of the promising uses of NILM is to identify "extreme" energy behaviour (such as leaving your electric oven on constantly just in case you fancy doing some baking). Identifying "extreme" behaviour is useful because users can save large sums of money with a single, simple change in behaviour. But - by definition - "extreme" behaviour is rare. Hence we need a large testing dataset (maybe 100 houses) to be confident that we're accurately capturing the performance of each algorithm; and that each algorithm can recognise "extreme" energy behaviour. Recording this quantity of real data would be very expensive and time consuming. Hence we could consider building a high-quality simulator to generate realistic data. But this raises a whole host of additional challenges!

Making a "haunted staircase" for Halloween using a Touch Board & Ableton Live


For Halloween 2016, we made a "haunted staircase". Spooky sounds were triggered as unsuspecting trick-or-treaters walked up the stairs outside our house. This project won the Bare Conductive Halloween competition! This blog post describes how we made our staircase...

(the photo above was taken by my wife, Ginnie)

Here's a video of the final result, as demonstrated by my five year-old daughter, Olive:

Simulating disaggregated electricity data

To do rigorous NILM research, we need lots of high-quality disaggregated electricity data. This is especially true if we want to run a good NILM competition.

There are now 20 public datasets listed on the NILM wiki. But all real data suffers from problems which make it problematic for use in a NILM competition. These problems include:

Survey launched: Please help us to design a competition for energy disaggregation algorithms!

We are working on a competition for energy disaggregation algorithms. Please help us to design this competition by filling in this survey!

A competition for energy disaggregation algorithms

Now that I've (finally!) submitted my PhD thesis, I can focus on designing and implementing a competition for energy disaggregation algorithms. EDF Energy have kindly given me post-doc funding from now until the end of December 2016 to work on the NILM competition.

The broad plan is to first consult with the NILM community and create a specification for the NILM competition which works for everyone. Then I plan to implement a web application which can run the NILM competition.

Right now, I'm writing a survey on the design of a competition for energy disaggregation algorithms. The aim of the survey is to systematically collect feedback about the design of the competition. I plan to launch the survey soon. Prior to the launch, I'm really eager to hear feedback on the survey itself. For example: is the survey missing any vital questions? Do some questions not provide sufficient options? Do some questions not make sense?!

Please note that, prior to the launch of the survey, my aim is to get feedback on the design of the survey itself. So please don't actually submit any answers yet! Feel free to select options and click "next" but just please don't click "submit" at the end of the survey. I'll write another blog post when the survey is ready to accept answers.

It's probably best to provide feedback about the survey in public on the relevant thread on the Energy Disaggregation Google Group. If you want your feedback to be private then, by all means, email me directly at jack.kelly@imperial.ac.uk!

And please do get in touch if you have feedback on any aspect of the proposed NILM competition.

PhD thesis submitted!

Last night I submitted my PhD thesis on "Disaggregation of Domestic Smart Meter Energy Data"! Hurray!

May 2016 release of UK-DALE

I've just updated my UK-DALE dataset with the latest data. House 1 now has 3.5 years of data!

Please help design a competition for energy disaggregation algorithms!

Has disaggregation accuracy improved since the 1980s? Which algorithms are most accurate for a given use-case? Which (if any) use-cases are well served by NILM already?

It's pretty much impossible to answer any of these questions with confidence (unless you only consider the tiny number of algorithms for which you have access to executable code). We can't directly compare published results across papers because, when testing the disaggregation accuracy of NILM algorithms, each paper uses different datasets, different metrics, different pre-processing, etc.

This means that we can't measure progress over time. Nor can we decide which NILM algorithms are most promising and which might be dead-ends.

These are bad problems. Let's work towards fixing them.

Some other machine learning communities have had great success running yearly competitions. For example, the ImageNet "Large Scale Visual Recognition Challenge" has been running yearly since 2010. Some regard this competition as having played a crucial role in the recent dramatic increase in the accuracy of image classification algorithms.

The idea of running a NILM competition has been rumbling around for several years. But designing and implementing a NILM competition is hard. The community uses sample rates ranging from monthly to MHz. No single metric is informative for all use-cases. Collecting ground truth data (the power demand of individual appliances) is expensive and time-consuming.

Maybe we can pull this off. The first step is to decide on a design which will work for everyone.

To give us something concrete to debate, we'll outline one way this could work. This is not meant to be definitive! Think of this as the DNA for a clumsy, inefficient animal 500 million years ago. Together, we need to evolve this design into an elegant, efficient beast, well adapted to its environment.

Please shoot holes in this proposal! What won't work for you? What's impractical? What's unfair? What opens the competition up to cheating? How can we make the competition more attractive to researchers? How can we make the competition more informative for the community? How can we simplify the process?

The draft proposal is available on Google Docs. I've linked to a Google Doc rather than copying-and-pasting the proposal into this post so that we can update the proposal as the discussion develops. Please add your comments either to the mailing list discussion; or to the Google Doc (please sign your comment with your name; unless you deliberately want to be anonymous); or if you want to keep your comment private then email me.

Thanks, (in no particular order) Jack, Mario, Oli, Stephen, Grant, Marco, Peter

Books with strong female characters suitable for 4 to 5 yearolds

A couple of weeks ago I wrote a rant on facebook where I asked for recommendations for "books with strong female characters which are suitable for 4 to 5 yearolds. Characters who can think for themselves. And get shit done. Without a prince. Or without trying to impress a prince."

I got loads of great suggestions from my friends. And these suggestions came in several different ways (chatting over coffee, facebook comments etc). So this blog post an attempt to maintain a list of links on the topic of "books with strong female characters".

Portable air quality monitors for cyclists

I cycle 14 miles each day in London (7 miles each way). I'm getting increasingly worried that I'm breathing in lots of bad stuff. I love my cycle ride into work: It's the best way to get my brain working in the morning. So I'm not planning to stop cycling. But I would like to figure out which route into work is least polluted.


Subscribe to RSS - Jack's blog