Monday 28 July 2008

The Tetris Model of Information Seeking

The more and more I've been reading about other models of information seeking (such as Marchionini 1995 and Kuhlthau 1993 and many more), the more I've been annoyed by how limited to a sequential flow they are. In Marchionini's, for example, there's a clear progression from problem identification, to specification, to seeking action, result viewing and resolving the problem. The model has this nice step towards the end that says 'refinement' and the text has a clause to say that people may drop back to almost any previous point. I believe a text clause like that is an indication that there should be a better way to model Information Seeking.

The thing I did like, was that each step was a different rectangular shape, based on how much time and computer involvement it required, as the two dimensions. These two observations about the model have led me to my tetris model of search, which I'm going to blog about here for a bit to test the water. You'll probably see followup blogs! I've got a lot to say about it!

Now, in Tetris, different shapes fall from the top of the screen, and success is modelled by organising them so that entire horizontal lines are made, removed from the display and converted to points. Let's first take the analogy that resolving an information seeking problem is like clearing a line of the board and that solving a bigger problem is like clearing multiple lines of the board, and finally that your score is representative of the overall knowledge you have on that topic.

Let us then imagine that the pieces that fall down from the top of the screen are then any one of the stages that are found in models like those mentioned above, where the ideal is that you get a series of simple pieces, representing a simple problem, a simple spec, a simple query, and a simple answer. BAM one line, problem solved.

BUT we all know that life is not like that, and regularly you get a nice simple first block (or you think you have a simple problem to solve) and then you get a + shape answer when you view the results that tells you your problem is a little more complicated than that. What we begin to see is that the complexity of a problem is actually represented not by the pieces, but by the current depth of the board. Each piece, therefore, represents an action, such as realising a problem, performing a query, etc.

So, a simple lookup on google is represented by a series of easy bits (specing, querying, viewing, etc) fitting together nicely and a line clearing. If you have a complex problem, however, the first bit you get is complex, like a +, and then you may need a combination of queries, and results to resolve your problem, and shift the 3 lines built by the +. Exploratory search can also be modelled with this analogy. If a user starts with a simple problem and starts off by querying for 'classical music' and then the first resullt says well there are lots of types of classical music:.... this means the next piece you got was a + and so getting an answer to your first query broadens the work you have to do to better understand classical music. Then, over time, you can resolve bits of information, find new problems you need to learn about. get some simple answers to fill in the gaps. Over time you may find that there are always rows with holes in, that you might take years to get back to them and fill them.

that was long, but think about it. I think its a pretty good analogy. Comments?

Friday 25 July 2008

When should users be made to think?

I've been a little quiet, I know. I've been working hard consulting for an interesting new client on a project that has, yet again, completely consumed my interests with new challenges. In this case, its amazed me that one of the primary concerns of this project has not been to make the interaction as quick and simple as possible, but to produce software that is a) intuitive and b) coerces users into thinking about the appropriate things at the appropriate times.

I've noticed this has become a recurring theme in many scenarios. The first time I heard something along this line was an argument against automating the jobs of pilots, but for making the jobs easier. If pilots get used to the plane doing most of the jobs for them, they may become less capable when the plane malfunctions. However, if the actions are made easier to perform, then the skill is maintained, but the usability has improved.

Search can be considered in a similar way. While lots of search designs are focused on letting users express the knowledge or known constraints that they do have quickly, this can leave users with problems when they have to choose between their results using facets that they have not considered. In this case, we do not at all want to make assumptions, but at the same time, we do not want to leave them to make a decision with only a list of options to do so with.

In some of our previous work we have investigated how giving users example (ideally multimedia) result items that would be associated with each of the items in their new decision. This means that users are become aware of things that the should think about before they buy, AND give them the means to understand the effect of their decisions. Another approach mSpace has taken is to be subjunctive, by allowing users to easily change their mind, or considered another way: rapidly tryout different options by minimising the costs of reversing their decision. To do this mSpace maintains all of the options a user was given at each step, so that the user, with a single click, can switch between different items in the same facet, and see the effect is has on the results.

Wednesday 16 July 2008

viacom aint all bad - perhaps i love viacom?

I've been a bit quiet recently, having been travelling to institutions around the UK, and then vacationing a bit. But i was pleased to see the news on the row between YouTube and Viacom. Thankfully they have agreed to let Google anonymise the usage histories that will be sent to Viacom as part of the ongoing copyright legal battle.

This only leads me to think that Viacom has created an even more perfect dataset for us to run user analysis over, like the longditudinal study we recently presented at JCDL2008 this year. I assume each user will get an anonymous ID, so that anonymous individual activity can be followed?

Can we have it after you Viacom? We'd love you!

Saturday 5 July 2008

the world is connected



actually, from the map it looks like the 1st world is connected at least.

interesting visualisation, though, of twitter conversations.

compared to some of the other network visualisations, this one shows how being grounded in a known 2.5D space makes it a lot more accessible.

Friday 4 July 2008

best search dataset ever?

In a current lawsuit between Google's YouTube and Viacom, Google have been instructed to hand over their logs of user viewing habits. Google are upset about it, but at least the judge overruled the request for Google to hand over their source code for filtering copyrighted material! Google have requested, although its not been confirmed yet I believe, that they get to anonymise the logs first, to respect the users' privacy. i personally hope this is allowed.

Anyway, despite this interesting privacy issue, i can't help but thinking that 12 terabytes of usage logs from youtube would be an AMAZING research resource for investigating user behaviour. Sadly they dont have facets to chat about, but it could tell us how people have used query refinements, spelling corrections, categories, filters, similar clips, recommended clips and so much more!

Google might as well do something useful with it, if the data is going to shown to at least one third party.