The paper “Increasing Campaign Effectiveness”, abbreviated ICE, is out. You can find the paper here. ICE is not the successor to Value of a Fan, abbreviated VOAF. We asked different questions. Last year, in response to VOAF, many of my cohorts came forward with brilliant follow up questions, and the dialogue that ensued contributed to the subsequent study and model design. Work continues. I welcome, in the spirit laid out by Tsang, engagement on the topic. What do you think about Increasing Campaign Effectiveness using social media? What would you consider and explore?

Intelligence means selective ignorance. Imagine how intelligent and ignorant we used to be as a people, just 120 years ago. Some of the first uses of sampling techniques in quantitative methods centered around the use of alcohol in society. They really didn’t have very much machine readable data back then (the first use was for the 1890 US Census), so, the practices of data mining weren’t possible. The entire purpose of sampling, and sample statistics, was precisely because no machines could be used to quantify the entire population against some policy question. You try calculating Chi Square on a very large dataset without a calculator or a spreadsheet! Indeed, sampling continues to be used to this day as a cost[…]

Have you heard about Data Market? It is one of the largest (and free) curated repository of public data. Benefits: It has internal search that doesn’t suck, so you can find what you’re looking for and get out. It offers the ability to preview the data in tables and charts before you export. It offers the ability to export in popular formats. It’s freemium. (API and LIVE data have a cost). Why am I excited about this? These data sets are very clean, and some of the data has direct uses for analysts in their social-professional lives. They’re there, and you should register and check them out.

The goal of a forecast is to make an accurate prediction about the future state of a system based on the best available evidence. The goal of target setting is to make a statement about a desired future state – with or without a forecast. Targets are political artifacts. You can read all about such dynamics in public policy here. Forecasts, ideally, are scientific artifacts. The interplay between forecasts and targets is particularly interesting. Those who produce sophisticated forecasts should understand that the motivation of those probing models is to assess whether or not a future state is possible, or, in certain situations, just how probable a given scenario could be. Don’t become trapped into the mindset that a trend[…]

There are at least five types of error related to analytics. These are instrumentation error, algorithm error, transposition error, statistical error, interpretation error. 1. Instrumentation Error When the instrument is measuring a phenomenon incorrectly. This is not to be confused with a human mistaking what an instrument really actually measures. Rather, this is when the instrument itself is only recording half of something. Or not measuring something at all. It’s akin to saying that thermometer is broken. Instrumentation has varying degrees of accuracy. For instance, the unique http cookie is subjected to fault as a result of a deteriorating cookie retention curve. The instrument continues to work just fine – it’s just that user behavior has changed, affecting its accuracy.[…]

An excellent analysis done by Allan Engelhardt, back in 2006 I suppose, talks about the 3/2 rule of employee productivity. The Coles notes version is that when you triple the number of employees, you cut their productivity in half. Check out the diagram below. Pretty scary right? Naturally, the story is much more complex than portrayed. Some sectors have mild slopes, like technology companies. Arguably, they’re using technology to flatten out the productivity slope. But it’s still slightly negative. Naturally, larger companies scale, so they still make more profit overall. Small companies are very good at doing many things. They become less good as they become large. And then ultimately, they stop being really, really good at anything at all.[…]

Konrad von Finckenstein, chairman of the CRTC, went before committee yesterday and made the remark: “The vast majority of Internet users should not be asked to subsidize a small minority of heavy users.” I take issue with Finckenstein’s statement. For one, the vast majority of Internet users subsidize a number minority. Urban customers, who are comparatively cheaper to connect with bandwidth, pay more to subsidize rural customers, who are comparatively more expense to connect. Didn’t the CRTC pass fees last year, forcing the vast majority of us to subsidize the viewing habits of the small minority of people who watch the CBC, Flashpoint, and DeGrassi? Isn’t the CRTC mandating the subsidization of something called “Canadian New Media”? Seems to me[…]

If you’re following a Canadian tech entrepreneur or scientist on Twitter, you might be noticing the #stopthemeter hashtag and statements questioning something called the #CRTC. The CRTC is the regulatory body responsible for regulating radio, television, and Internet Service Providers in Canada. On paper, Canada has six or seven major teleco’s. These are divided up by region. Telus and Shaw compete in the West. Rogers and Bell compete in Ontario. Videotron and whoever competes in Quebec. There are regional variants and government monopolies in the smaller provinces. Canada is peppered with duopolies – which are, in effect what economists might call natural monopolies. A complicating factor is that Canada is a massive country with a very dispersed rural population. These[…]

If you’re a web analyst or a data scientist, developer, or otherwise deeply involved with web analytics, you should strongly consider pledging to the Code of Ethics. This Code of Ethics is a social compact. Choices were made in authoring this code. Among the biggest challenges Lovett and Peterson had in steering this through a large group of people was balancing several of them – between specificity and generalization, one set of values and another set of values, and fending off the natural tendency of analysts to enumerate 55,000 technologies and their appropriate uses. It’s something that is relatively easy to internalize and has a timeless quality. Anything that is worth pledging has to stand for something. It doesn’t bind[…]

The space is tremendously fragmented because social itself if fragmented, unstructured, and ill behaved. Broadly, there’s ‘listening’, which has its origins in the PR space, and then there’s marketing performance, which has its origins in the analytics space. Although there are nearly 250 (+) ‘listening’ companies out there, none of them will have a solution that fits your unique set of circumstances, biases, and needs. You are simply not a PR person. Web analysts entering into social should be prepared to confront fragmentation and complexity on a level that they have yet to experience. If you work in an enterprise with more than 150 people, you will rapidly reach a stage where you will not be able to keep pace[…]