Blog Category: Uncategorized

RT @twitter: Project retweet

on Comments (1)

One effective way to understand where to put paved paths is to look at places where the grass has been trodden. Twitter has adopted this approach by offering a minimal interface and looking at how people use it.The idea was to allow basic messaging, and not to worry too much about fancy functionality. The tweeters responded by overlaying a number of conventions onto the simple message body. RT, the equivalent of forwarding in e-mail, is a popular convention for cascading news along the social graph, and is one of the central mechanisms responsible for the effectiveness of Twitter as a news dissemination channel.

Continue Reading

SIGIR Twitter Archives

on

We’ve created some archives of twitter conversations for SIGIR 2009 and for some of the workshops associated with the conference. These archives are useful because Twitter messages tend to evaporate after a while.

I know of the following archives:

If the other workshops had significant traffic, I am happy to archive & update the list above.  TwapperKeeper is a service that archives twitter searches based on a specified hashtag. The data is then available through the web site and for download in tab- or semicolon-separated format. Saving your own copy means that you can refer to it later, and also makes it easier to do data mining or other research on the use of Twitter. I encourage people to download archives (although as new tweets come in the archives will get updated on TwapperKeeper) to make sure they persist even if TwapperKeeper doesn’t. Archive early, archive often.

Pattern matching

on

Once a month I drive up to Oakland to attend the SF Bay Groovy and Grails Meetup organized by Chris Richardson. It’s a fun group of people and conversation covers a lot of ground. During Monday’s meeting we chatted about Scala, among other things, and how it was good for pattern matching in exactly the way that object-oriented solutions weren’t.

Chris gave the example of dispatching requests in a web server by matching URI patterns to discover what internal methods to call to handle requests. Scala’s switch statement allows a URI to be split into a list of tokens (as do many other languages) and then makes it easy to match against this list, even if the list is not homogeneous. It furthermore makes the matching values available to the closure handling the case, making it more expressive than (for example) Java.

This made me reflect on when pattern matching is used versus more explicit representations of structure. For example, when making a function call, we rely on an explicit (simple) declared structure rather than on a sequential examination of possible patterns to decide what function to call. (Overloaded methods don’t really require serious pattern matching.) So pattern matching is typically used when the input is poorly structured, natural language processing being a good example. While the utility of using plain text URIs as a lingua franca among heterogeneous systems is compelling, but I found it ironic that, in a sense, that the URI, the basis of internet communication, winds up being treated more like natural language than a function call.

Tweeting at JCDL

on Comments (4)

I attended JCDL 2009 this week, and had the opportunity to do some live tweeting of several papers and panel sessions. It was an interesting experience that I thought was worth summarizing here. Overall, it was difficult to get the messages right, it was a challenge to listen and type at the same time, the 140 character constraint was an issue some of the time, and my tweeting had a couple of effects on my Twitter network. And of course there is the question of utility of this endeavor.

Continue Reading

Apply to be a CI fellow at FXPAL

on

The  NSF is funding postdoc positions through the new Computing Innovation Fellows Project. Four of us here at FXPAL are offering our services as mentors. Work with

  • Francine Chen on the application of machine learning to information access tasks, or
  • Jeremy Pickens on problem of designing algorithms and interfaces to aid small teams to finding information addressing an explicitly-shared information need, or
  • Eleanor Rieffel on image-based approaches to creating virtual models, or
  • Larry Rowe on flexible media streaming systems and applications, such as multiple camera collaboration or mixed reality spaces.

If you received your Ph.D. (or  completed all of the requirements) between May 1, 2008 and August 31, 2009, and have expertise relevant to one of these areas, please contact us! To be eligible for the program, you do not have to be a U.S. citizen (though U.S. citizens are given preference).

Act quickly! Applications are due June 9th.

Making Wolfram | Alpha usable

on

Daniel Tunkelang, among others, has pointed out (here and here) that Wolfram | Alpha is making its life (and potentially its users’) more difficult by focusing on an NLP interface. Instead, they should expose their data through YQL to leverage all of the development happening in that platform. This would allow mashups to leverage Wolfram | Alpha content and provide the context disambiguation that will make the data more useful with less interaction.

While some licensing issues related to the volume of requests may need to be resolved, it seems plausible that the two technologies will complement each other and increase their respective value.

JCDL 2009 preview

on Comments (2)

The 2009 ACM/IEEE Joint Conference on Digital Libraries (JCDL 2009) will be held in Austin, TX June 15-19. The program includes tutorials, workshops, and three days of paper presentations. The schedule is just being made available online, and I found some interesting papers worth watching for when they appear in the ACM DL or on the authors’ pages. Here are some of the papers I am looking forward to when I go to the conference:

  • Generalized Formal Models for Faceted User Interfaces (Clarkson, Navathe, Foley)
    I wonder if this is related to the “Exploring Websites through Contextual Facets” paper I blogged about earlier.
  • How Do You Feel about Dancing Queen? Deriving Mood & Theme Annotations from User Tags (Bischoff, Firan, Nejdl, Paiu)
    How could you not?
  • No Bull, No Spin: a comparison of tags with other forms of user metadata (Marshall)
    Cathy Marshall is always entertaining and has an almost magical ability to generate unexpected insights from mundane data.
  • What Do Exploratory Searchers Look at in a Faceted Search Interface? (Kules, Capra, Banta, Sierra)
    Will this topic be enough to bring Daniel Tunkelang to Austin?

Shared notetaking tools?

on Comments (6)

I am about to start a significant research and writing activity with a co-worker. We will have to organize a bunch of documents, take and share notes, do some searching, etc. The tool should allow us to group and organize notes associated with each document, and perhaps generate some overviews of the collection.

I am looking for recommendations, including pros and cons. We would consider spending a bit of money on this, but obviously free is better. On the other hand, a well-designed, usable interface is worth some investment. Also, I would prefer simplicity to functionality.

Openness is a virtue

on

This is a bit of a rant. Bear with me.

We are working on a whiteboard capture and retrieval project (more blogging on this later). The goal is to capture whiteboard images in some unobtrusive manner and then store and index them for future retrieval and browsing. We currently capture with a video camera, but thought it would be good to be agnostic with respect to the source of capture.

mimio makes a whiteboard ink capture tool that uses an IR receiver to track positions of pens on the whiteboard. You still use the same dry-erase markers, but house them in an IR-transmitting enclosure. We thought that would be perfect to tell us when and where inking was taking place.

Continue Reading