We’ll Be Right Back…

This morning a can of Red Bull punctured somehow in my bag … while my laptop and camera were in it. As you can imagine, this was a Bad Thing™ and I would not recommend it. Amazingly my Powerbook ran even while wet (before I realized what had happened). The screen is busted, but I am carbon cloning now and will be getting it serviced tomorrow. The RAPS did a decent job keeping my camera dryer, it seems to have come out of it a bit sticky, but otherwise unscathed.

This, along w/ the work, work work, and collab projects I’m swamped with will probably set back personal writing and coding projects a bit.

[note to self: in future, put liquids on other side of waterproof lining]

Facets and Freetags

I’ve been playing around with categorization for a while, and have been watching the recent rise of freetagging with great interest. A friend, Nick Mote, wrote a recent paper that does a good job both to summarize developments from a close-to information sciences perspective and to outline several near-term issues.

Among them, I believe that both disambiguation and synonym merging are relative non-issues. For the former, the ease of intersections almost makes it moot from the practical perspective of searching. For the latter issue, we are already beginning to see automated solutions (related tags).

One of the reasons for the relative ease of solving these problems is that the applicable relevance algorithms are already quite familiar to lay web practitioners (i.e., people like me, without a CS Ph.D) from their long-time use in e-commerce (collaborative filtering), spam filters (mathematical filters), and now social networks (web of trust). [my faves: clustering, adaptive resonance, context graphs, CRM114]

Anyway, what I wanted to ruminate on was the non-hierarchical freetag model of unions, intersections, and differences and see if there’s a way to to build a practical (both in terms of backend implementation and user interface) bridge with (more) traditional hierarchical faceted classification.

The first hurdle is, I suppose coming up with a convincing argument that hierarchies are worthwhile. I think quite obvious that in everyday life, we categorize and subcategorize often and that being a first-class object isn’t completely out of the realm of sense. The real question is if there’s a way of reintroducing hierarchy that doesn’t reintroduce the problems they caused in the first place.

First lets talk a bit about data structures. Traditional structures will explicitly delineate parent/child relationships, either via pointers or relational structures. Note that this can be generalized into the generic subject/predicate/object triplets that we see in RDF tuples. While I’m very partial to typed links (and late binding and dynamic properties… keep on target), I think we can see that this will lead to a level of complexity that will work against both ease of use (first rule of getting user participation) and social/corpus relevance matching (although spam filtering engines like CRM114 are built for sparse data).

Before we get to something I’m throwing out, I’d also like to mention that in our faceted hierarchies, Celko’s set/adjacency models (he recently published a whole book on trees in sql) won’t directly work as we’re dealing with what will likely be very bushy graphs (think overlapping possibly-cyclic digraphs). A real mess huh?

So, my 3AM brainfart last night was to try attacking from the point of view of using traditional tagging structures, and taking the idea of separators for hierarchies and improving on that. For example:

tag: foo
tag: foo.bar
tag: foo.bar.baz
tag: foo.qux

If we do a search for foo[\.]*, we will everything within tree ‘foo’ inclusive. This relieves us of many of the disadvantages of traditional hierarchical representation, and does not marginally increase complexity of either searches or of tag-renaming (the former can be globbed relatively inexpensively and the latter is costly either way).

Now, the main crux of the matter comes with the user interface end. ‘foo.bar.baz’ is a pain in the ass to type. Sure, your non-hierarchical option is to type ‘foo’ and ‘bar’ and ‘baz’, but this, at least from the input side, removes one of the advantages of hierarchical input.

In this case, then, why not do masking? When storing/searching, take both the entire ‘foo.bar.baz’ as well as the most specific child identifier ‘baz’. This creates a new disambiguation issue:

tag: baz
tag: baz (foo.bar.baz)
tag; baz (foo.qux.quux.baz)

From a search/aggregation perspective this might not be necessarily bad as it’d combat the sparseness issue, but from an entry perspective it again minimizes the hierarchy usefulness quotient (at this point, one begins to ask, are intersections that bad? The answer of course is in most cases no, however in some yes).

A UI solution for this is to have an auto-completing combobox that recognizes hierarchies. This widget is also useful for traditional freetagging as well, so is a worthwhile avenue to persue regardless.

[Err, revisions, less half-bakedness forthcoming, as well as some code. Well, might as put this out for commenting. Finishing my trackback/comment code might be good.. ah, screw it, will procrastinate later. Back to real work for now]

On Jenkins and Jennings

I’ve been vaguely following Jason’s Sony travails, but Scott Andrew’s post, Harrassing fans for being fans really hits the nail on the head.

For those who haven’t read Henry Jenkins’ excellent 2002 essay, Interactive Audiences? The ‘Collective Intelligence” of Media Fans, I would definitely recommend it, as it illustrates the transformation of fandom and the consumption cycle.

For all their talk of going viral or guerilla, fundamentally, it seems broadcast media has yet to understand or even care about collaborating and taking advantage of the genuine excitement and support of their best fans.

What a waste of resources huh? Here’s a good whack of the clue-by-four for marketers: you’re burning your brand value by the minute here. And for the bean counters, a good rule of thumb: IT’S BAD BUSINESS PRACTICE TO SUE YOUR BEST CUSTOMERS.

Here’s some work I did last year on New Media and the Consumption Cycle. Also, as part of my academia wrap-up, I’ll have a paper written in the next few months comparing how the game industry and old Hollywood differ on the use of the inverted pyramid, and comparing the relative successes.

New Site, New Focus

In celebrating a new round of way too many projects, I’m finally pushing to kick out something to help me organize all that’s going on.

Currently, I’m aggregating all sorts of stuff that’s already lying about, and actively replacing and augmenting pieces that aren’t up to snuff.

I’ll be renewing my focus on KM, relationship management and collaboration, and trying to work also on productivity, aggregation and integration (especially now that I’m also working on a variety of civic-oriented projects). Lets get this life-site on the road.

(full writeup coming soon, code going into random($code))

Voting machine error gives Bush 3,893 extra votes in Ohio – AP Breaking News

Franklin County’s unofficial results had Bush receiving 4,258 votes to Democrat John Kerry’s 260 votes in a precinct in Gahanna. Records show only 638 voters cast ballots in that precinct. Bush’s total should have been recorded as 365.

OK, lets see intra-state margin of error correlation of exit polls in paper/evoting districts. Not that anything’s going to chante… No one is going to contest because they’ll just be smeared by the press as sore losers. Rove just had to get away with it long enough for Kerry to conceed, Bush to proclaim a mandate, and the media to kiss their asses. Obviously Palast is following up and no one will care. Again.

4 more years America – from the LJ of a soldier

If you voted for Bush, didn’t vote, or voted no on gay marriage, I hope you get drafted. I hope they stick you in my unit, and you go with me to Iraq when my unit goes back in September. I will laugh when you see what soldiers in that country face on a daily basis. I hope you work with gay soldiers too. I did. One of them saved my life. Think he shouldn’t have the right to get married? Fuck you. He fought just as hard as I did and on most days, did his job better than me. Don’t tell me gays don’t have the same rights you do. Think the war in Iraq is a good thing? I’ll donate my M-16 to you and you can go in my place.