Time Series Data Compression
In a previous post I talked about compressing time-series data using a co-linearity test. That approach worked OK, but recently I came across a better one and have implemented it to reduce the number of data points collected for any sensor without losing the salient features of the time series.
What's needed is an algorithm that preserves the shape of the signal: each local maximum or minimum must be kept and each inflection point in the curve should be represented, while the overall RMS difference between the compressed data and the actual data is kept small.
I came across this great summary paper, 'An Overview of Moving Object Trajectory Compression Algorithms', which provided some of the ideas behind my latest data compression scheme, in particular the scan or sweep approach to selecting which points to keep and which to drop.
The entire code is below, and as you can see on the graph above it does a good job of capturing the salient points in the data whilst eliminating as many other points as possible.
To use the code you provide a percentage (e.g. 0.05) which expresses how far above or below its actual value a stored point is allowed to stray. At each step the algorithm maintains two lines anchored at the current start point: an upper-limit line and a lower-limit line, with slopes derived from the up or down delta that the percentage allows each incoming point. As new points arrive, any point that falls within this 'cone' replaces the previous value and uses its own error band to tighten the cone. Because the allowed band stays the same size while the distance from the start point grows, the cone narrows as points get further away. As long as points stay within the percentage threshold around some line from the start point, the algorithm keeps removing and replacing the last data point. But as soon as one strays outside the cone, the algorithm keeps the last point before it, writes out the current point, and restarts the process with the kept point as the new baseline.
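To make that concrete: with percentage = 0.05, a segment starting at (time 0, value 100) and a new point (time 10, value 102), the point's allowed band runs from 102 * 0.95 = 96.9 up to 102 * 1.05 = 107.1. That gives candidate slopes of (107.1 - 100) / 10 = 0.71 and (96.9 - 100) / 10 = -0.31, and whichever of those is tighter than the current cone replaces the corresponding slope. A later point at time 20 must then fall between 100 - 0.31 * 20 = 93.8 and 100 + 0.71 * 20 = 114.2, or it starts a new segment.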
At each inflection point, local minimum, or local maximum, the algorithm therefore saves the current point and starts looking for a new line segment. It turned out to be really simple code and it works really well. To use it you provide two callbacks: one to write a value to the database and one to update the last value in the database (see the usage sketch after the code below).
There are other, better offline algorithms for trajectory compression, but the beauty of this one is that it works incrementally and only ever updates the last data point. As such it doesn't need a large buffer, can run very efficiently, and doesn't change the graph as you are looking at it.
using System;

/// <summary>
/// Trajectory compression using narrowing cones
/// </summary>
/// <remarks>
/// Can update the last value but can't go further back; resets when the curve
/// changes direction. Timestamps are assumed to be monotonically increasing.
/// </remarks>
public class TrajectoryCompressor
{
    private readonly double percentage;
    private double startValue;
    private long startTime;
    private double previousValue;
    private long previousTime;
    private bool hasPrevious = false;
    private double lowerSlope = -1.0;
    private double upperSlope = 1.0;

    public TrajectoryCompressor(double percentage, long startTime, double startValue)
    {
        this.percentage = percentage;
        this.startTime = startTime;
        this.startValue = startValue;
    }

    public void Add(long timestamp, double value,
        Action<long, double> write,
        Action<long, long, double> update)
    {
        if (!hasPrevious)
        {
            // First point: always write it out
            write(timestamp, value);
            hasPrevious = true;
            previousTime = timestamp;
            previousValue = value;
            return;
        }
        if (timestamp == previousTime)
        {
            // Ignore simultaneous values
            return;
        }
        // Apply the allowed error either side of the incoming value
        double upperEstimate = (value > 0) ? value * (1.0 + percentage) : value * (1.0 - percentage);
        double lowerEstimate = (value > 0) ? value * (1.0 - percentage) : value * (1.0 + percentage);
        // Project the current cone out to this timestamp
        double upperBound = (timestamp - startTime) * upperSlope + startValue;
        double lowerBound = (timestamp - startTime) * lowerSlope + startValue;
        bool outside = (value > upperBound || value < lowerBound);
        if (outside)
        {
            // The new point is outside the cone of allowed values,
            // so we let previousValue stand and start a new segment
            startTime = previousTime;
            startValue = previousValue;
            previousTime = timestamp;
            previousValue = value;
            write(timestamp, value); // and write out the current value
            // Calculate the new min and max slopes from the new segment start
            upperSlope = (upperEstimate - startValue) / (timestamp - startTime);
            lowerSlope = (lowerEstimate - startValue) / (timestamp - startTime);
            return;
        }
        // The value is within the expected range, so it can replace the previous point
        update(previousTime, timestamp, value);
        // Keep track of it so we can keep updating
        previousTime = timestamp;
        previousValue = value;
        // Narrow the allowed range: the slopes only ever tighten, never widen
        if (upperEstimate < upperBound) upperSlope = (upperEstimate - startValue) / (timestamp - startTime);
        if (lowerEstimate > lowerBound) lowerSlope = (lowerEstimate - startValue) / (timestamp - startTime);
    }
}
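To show how the two callbacks fit together, here is a minimal usage sketch. The List of (timestamp, value) tuples standing in for the database, and the sample data, are my own illustration rather than part of the original code; in practice write would insert a new row and update would rewrite the most recent one.

using System;
using System.Collections.Generic;

class Example
{
    static void Main()
    {
        // A list of (timestamp, value) pairs standing in for the real database
        var stored = new List<(long Time, double Value)>();

        // The constructor's start time and value seed the apex of the first cone
        var compressor = new TrajectoryCompressor(0.05, 0, 100.0);

        Action<long, double> write = (t, v) => stored.Add((t, v));
        Action<long, long, double> update = (oldTime, newTime, v) =>
        {
            // Only the most recently written point is ever revised
            stored[stored.Count - 1] = (newTime, v);
        };

        long[] times = { 0, 10, 20, 30, 40, 50 };
        double[] values = { 100, 101, 102, 103, 90, 80 };
        for (int i = 0; i < times.Length; i++)
            compressor.Add(times[i], values[i], write, update);

        foreach (var (t, v) in stored)
            Console.WriteLine($"{t}: {v}");
    }
}

With this input the six samples collapse to two stored points, (30, 103) at the top of the ramp and (50, 80) at the end of the drop, since every other sample fell inside the cone and was replaced by a later one.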