The Blog of Ian Mercer.

A Bayesian Spam / Relevance Filter for Seesmic / Twitter

Cover Image for A Bayesian Spam / Relevance Filter for Seesmic / Twitter

9/21/2010, 2:39 AM

Twitter is a great resource but it's just so full of noise that sometimes I question its value to me. In between the awesome information I get from it I have to wade through check-ins, mealtimes and childcare woes. I'm sorry but what you had for lunch really isn't that interesting to me unless it's some amazing new restaurant right near me.

I wanted an easy way to see just the relevant bits so this weekend I knocked up a quick Seesmic plugin that uses a Bayesian filter to analyze each Tweet and score it for relevance. You can right click on any Tweet to say "Relevant" or "Not relevant" and it soon learns to categorize tweets using a red - green color dot and a score. There's a cut-off value too below which it simply hides the Tweet (but for the screen-shot I turned this off).

The Seesmic API was easy to learn (if somewhat lacking in documentation and complete working examples). The Visual Studio Templates provided by Tim Heuer came in handy too. The Managed Extensibility Framework made it very easy to write the plug-in - I just wish Seesmic could auto re-load it each time you drop a new version in there. Restarting Seesmic each time was annoying and caused Twitter to rate-limit me more than once.

Information about what you like and don't like is stored in IsolatedStorage in a simple flat file. The file is reloaded on startup and the probabilities for each word are recalculated.

If I get time and if there's sufficient interest I might take this a bit further but for now I'm happy to have less to read.

[Apologies to anyone who is red-dotted in the screen-shot, I'm sure you have lots of good Tweets too which is why I'm following you, I just happened upon this one which is noise to me.]

Related Stories

Cover Image for My love/hate relationship with Stackoverflow

My love/hate relationship with Stackoverflow

2/22/2020, 6:21 PM

Stackoverflow is a terrific source of information but can also be infuriating.

Ian Mercer

Ian Mercer

Logging with Xamarin Insights (but not on Unified App)

12/7/2014

Ian Mercer

Ian Mercer

Cover Image for Xamarin Forms Application For Home Automation

Xamarin Forms Application For Home Automation

11/5/2014, 7:57 AM

Building a Xamarin Forms application to control my home automation system

Ian Mercer

Ian Mercer

Cover Image for A strongly-typed, RegEx-based parser for handling input strings

A strongly-typed, RegEx-based parser for handling input strings

6/26/2014, 8:00 PM

Ian Mercer

Ian Mercer

Websites should stop using passwords for login!

5/29/2013, 12:09 AM

A slightly radical idea to eliminate passwords from many of the websites you use just occasionally

Ian Mercer

Ian Mercer

VariableWithHistory - making persistence invisible, making history visible

2/4/2013, 2:00 PM

A novel approach to adding history to variables in a programming language

Ian Mercer

Ian Mercer

Neo4j Meetup in Seattle - some observations

10/24/2012, 1:49 PM

Some observations from a meetup in Seattle on graph databases and Neo4j

Ian Mercer

Ian Mercer

Updated Release of the Abodit State Machine

7/11/2012, 2:28 PM

A hierarchical state machine for .NET

Ian Mercer

Ian Mercer

My first programme [sic]

4/21/2012, 10:38 PM

At the risk of looking seriously old, here's something found on a paper tape

Ian Mercer

Ian Mercer

Building a better .NET State Machine

4/15/2012, 2:38 PM

A state machine for .NET that I've released on Nuget

Ian Mercer

Ian Mercer

A simple state machine in C#

1/7/2012, 4:31 PM

State machines are useful in many contexts but especially for home automation

Ian Mercer

Ian Mercer

Dynamic persistence with MongoDB - look, no classes! Multiple inheritance in C#!

9/7/2011, 7:58 AM

Ian Mercer

Ian Mercer

Home network crawler - cataloging every file on the home LAN with C# and MongoDB

8/24/2011, 3:53 AM

Ian Mercer

Ian Mercer

Stop writing rude software! Use LASTINPUTINFO instead.

8/20/2011, 11:20 AM

Ian Mercer

Ian Mercer

Class-free persistence and multiple inheritance in C# with MongoDB

5/5/2011, 3:53 AM

Ian Mercer

Ian Mercer

Cover Image for Extending C# to understand the language of the semantic web

Extending C# to understand the language of the semantic web

2/6/2011, 3:53 AM

Ian Mercer

Ian Mercer

Custom Serialization for MongoDB - Hashset with IBsonSerializable

1/8/2011, 4:26 AM

Ian Mercer

Ian Mercer

File and image upload security considerations and best practices

1/8/2011, 1:57 AM

Ian Mercer

Ian Mercer

Cover Image for Algorithm Complexity and the 'Which one will be faster?' question

Algorithm Complexity and the 'Which one will be faster?' question

1/8/2011, 1:40 AM

Ian Mercer

Ian Mercer

A Semantic Web ontology / triple Store built on MongoDB

1/6/2011, 3:53 AM

Ian Mercer

Ian Mercer

Task Parallel Library: A scheduler with priority, apartment state and maximum degree of parallelism

11/26/2010, 3:53 AM

Ian Mercer

Ian Mercer

MongoDB Map-Reduce - Hints and Tips

11/13/2010, 2:32 PM

Ian Mercer

Ian Mercer

Continuous Link and SEO Testing - Announcing LinkCheck2

9/22/2010, 12:44 AM

Ian Mercer

Ian Mercer

Constrained parallelism for the Task Parallel Library

9/2/2010, 2:29 AM

Ian Mercer

Ian Mercer

Sequential Logic Blocks - compared to the Reactive Framework

7/20/2010, 8:22 AM

Ian Mercer

Ian Mercer

Why functional programming and LINQ is often better than procedural code

4/16/2010, 1:09 AM

Ian Mercer

Ian Mercer

10 reasons my O(n²) algorithm is better than your O(n) algorithm

4/9/2010, 5:20 AM

Ian Mercer

Ian Mercer

Why don't you trust your build system?

4/1/2010, 9:20 AM

Ian Mercer

Ian Mercer

Elliott 803 - An Early Computer

3/7/2010, 6:28 AM

Ian Mercer

Ian Mercer

Continuous Integration -> Continuous Deployment

12/30/2009, 6:27 AM

What is "quality" in terms of a released software product or website?

Ian Mercer

Ian Mercer

Making a bootable Windows 7 USB Memory Stick

12/11/2009, 4:17 PM

Here's how I made a bootable USB memory stick for Windows 7

Ian Mercer

Ian Mercer

Tip: getting the index in a foreeach statement

12/11/2009, 6:28 AM

A tip on using LINQ's Select expression with an index

Ian Mercer

Ian Mercer

SQL Server - error: 18456, severity: 14, state: 38 - Incorrect Login

11/17/2009, 6:28 AM

A rant about developers using the same message for different errors

Ian Mercer

Ian Mercer

WCF and the SYSTEM account

11/15/2009, 11:06 PM

Namespace reservations and http.sys, my, oh my!

Ian Mercer

Ian Mercer

Mixed mode assembly errors after upgrade to .NET 4 Beta 2

10/21/2009, 9:35 PM

Fixing this error was fairly simple

Ian Mercer

Ian Mercer

Shortened URLs should be treated like a Codec ...

10/11/2009, 12:14 PM

Expanding URLs would help users decide whether or not to click a link

Ian Mercer

Ian Mercer

Tagging File Systems

10/11/2009, 6:33 AM

Isn't it time we stopped knowing which drive our file is on?

Ian Mercer

Ian Mercer

A great site for developing and testing regular expressions

10/10/2009, 6:28 AM

Just a link to a site I found useful

Ian Mercer

Ian Mercer

Introducing Jigsaw menus

8/28/2009, 4:19 AM

A novel UI for menus that combines a breadcrumb and a menu in one visual metaphor

Ian Mercer

Ian Mercer

Fix for IE's overflow:hidden problem

6/11/2009, 5:32 AM

Ian Mercer

Ian Mercer

A better Tail program for Windows

5/22/2009, 3:49 AM

A comparison of tail programs for Windows

Ian Mercer

Ian Mercer

Measuring website browser performance

5/20/2009, 1:02 AM

Found this great resource on website performance

Ian Mercer

Ian Mercer

Amazon Instance vs Dedicated Server comparison

5/19/2009, 2:21 PM

Some benchmark performance for Amazon vs a dedicated server

Ian Mercer

Ian Mercer

Generate a SQL Compact Database from your Entity Model (EDMX)

5/18/2009, 11:55 PM

Ian Mercer

Ian Mercer

Agile Software Development is Like Sailing

4/26/2009, 10:27 PM

You cannot tack too often when sailing or you get nowhere. Agile is a bit like that.

Ian Mercer

Ian Mercer

Javascript error reporting

4/21/2009, 4:08 AM

Sending client-side errors back to a server for analysis

Ian Mercer

Ian Mercer

AntiVirus Software is the Worst Software!

4/21/2009, 3:11 AM

When your anti-virus software starts stealing your personal data, it's time to remove it!

Ian Mercer

Ian Mercer

ASP.NET Custom Validation

4/18/2009, 12:43 AM

How to solve a problem encountered with custom validation in ASP.NET

Ian Mercer

Ian Mercer

Optimization Advice

4/5/2009, 1:17 AM

Some advice on software optimization

Ian Mercer

Ian Mercer

Google Chart API

6/9/2008, 11:27 AM

Ian Mercer

Ian Mercer

Cache optimized scanning of pairwise combinations of values

1/9/2008, 4:41 PM

Using space-filling curves to optimize caching

Ian Mercer

Ian Mercer

Threading and User Interfaces

7/31/2006, 3:40 AM

A rant about how few software programs get threading right

Ian Mercer

Ian Mercer

Take out the trash!

6/24/2006, 9:34 AM

Why Windows shutdown takes so long

Ian Mercer

Ian Mercer

Dell upgrades - a pricey way to go

9/19/2005, 12:32 AM

Ian Mercer

Ian Mercer

Programming mostly C#

Ian's advice on programming

Ian Mercer

Ian Mercer