{"id":4774,"date":"2010-10-06T07:16:31","date_gmt":"2010-10-06T14:16:31","guid":{"rendered":"http:\/\/palblog.fxpal.com\/?p=4774"},"modified":"2010-10-05T22:04:20","modified_gmt":"2010-10-06T05:04:20","slug":"4774","status":"publish","type":"post","link":"https:\/\/blog.fxpal.net\/?p=4774","title":{"rendered":"Soylent is food for thought"},"content":{"rendered":"<p><a title=\"Michael Bernstein | MIT\" href=\"http:\/\/people.csail.mit.edu\/msbernst\/\" target=\"_blank\">Michael Bernstein<\/a> and <a title=\"i.e., et al.\" href=\"#\">a cast of thousands<\/a> published an <a title=\"Bernstein, M. S., Little, G., Miller, R. C., Hartmann, B., Ackerman, M. S., Karger, D. R., Crowell, D., and Panovich, K. 2010. Soylent: a word processor with a crowd inside. In Proc. UIST 2010. ACM Press, 313-322.\" href=\"http:\/\/doi.acm.org\/10.1145\/1866029.1866078\" target=\"_blank\">interesting paper<\/a> at UIST 2010, which was honored with the Best Student Paper award. The paper describes and evaluates Soylent, a tool that uses Mechanical Turk to generate corrections and suggestions to improve writing. (The name Soylent is not a substitute for dairy in the weeks leading up to Easter; rather, it is derived from the film <a title=\"Soylent Green | Wikipedia\" href=\"http:\/\/en.wikipedia.org\/wiki\/Soylent_Green\" target=\"_blank\">Soylent Green<\/a>.)<\/p>\n<p>This work is interesting in a number of ways: it automates the distribution and collection of Mechanical Turk tasks and then integrates the results into an interactive system, it recognizes the limitations of fully-automated approaches, and it suggests a design pattern that can be applied in other contexts .<\/p>\n<blockquote><p>The main contribution of this paper is <em>the idea of embedding paid crowd workers in an interactive user interface to support complex cognition and manipulation tasks on demand<\/em>. These crowd workers do tasks that computers cannot reliably do automatically and the user cannot easily script.<\/p><\/blockquote>\n<p><!--more--><\/p>\n<p>The paper implements three different components that use Mechanical Turk input: Shortn to shorten text, Crowdproof to do proofreading, and The Human Macro to specify repeated tasks. The Find-Fix-Verify pattern is used to mitigate errors by splitting the identification (find), generation (fix) and validation (verify) parts among different Turkers.<\/p>\n<p>One challenge with this approach is response time and cost. For example, the authors report that most actual work times were under four minutes per stage for the Shortn task, whereas the overall response times were closer to 45-60 minutes for most tasks. The authors argue that as the number of Turkers increases, wait times will decrease and approach the actual work times. It&#8217;s not clear to me whether the rate at which Turkers accept these tasks will keep pace with the rate at which writers will submit them. On the other hand, the paper reports anecdotally that decreasing the payout for each stage of the job resulted in comparable quality but took longer. This suggests that it may be possible to pay more for faster service; how to set price points, and whether the potential availability of higher-paying HITs will cause Turkers to hold out for them in lieu of completing the lower-cost ones is an open issue.<\/p>\n<p>The other, related, issue is cost: the Shortn tasks cost roughly between $4.50 and $9.50 for a few paragraphs; Crowdproof cost $2 to $5 per paragraph, and don&#8217;t report costs for The Human Macro. These numbers are not cheap, particularly if help is needed throughout a paper. For example, this would generally not be an effective technique for correcting significant mistakes in the writing of non-native English speakers, the kinds of mistakes that often lead to a paper being rejected because it is too hard to understand.<\/p>\n<p>Nonetheless, this is an interesting, provocative, and well-written paper that breaks new ground in crowdsourcing and in interactive system design. It&#8217;ll be interesting to see the evolution of design patterns for crowdsourcing over the next few year. One possible challenge for the long-term stability of crowdsourcing design patterns is the human factor. Uunlike computer systems whose behavior doesn&#8217;t change over time (MVC works just the same now as it did in the 1980s), some patterns designed to work around undesired behaviors by Turkers may not remain effective if Turkers understand how they are applied and how to game them. And, as ever, there is the challenge of scale: the challenge of finding enough qualified Turkers to sustain the demand for their labor.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Michael Bernstein and a cast of thousands published an interesting paper at UIST 2010, which was honored with the Best Student Paper award. The paper describes and evaluates Soylent, a tool that uses Mechanical Turk to generate corrections and suggestions to improve writing. (The name Soylent is not a substitute for dairy in the weeks [&hellip;]<\/p>\n","protected":false},"author":4,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[24],"tags":[269],"jetpack_featured_media_url":"","_links":{"self":[{"href":"https:\/\/blog.fxpal.net\/index.php?rest_route=\/wp\/v2\/posts\/4774"}],"collection":[{"href":"https:\/\/blog.fxpal.net\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blog.fxpal.net\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blog.fxpal.net\/index.php?rest_route=\/wp\/v2\/users\/4"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.fxpal.net\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=4774"}],"version-history":[{"count":7,"href":"https:\/\/blog.fxpal.net\/index.php?rest_route=\/wp\/v2\/posts\/4774\/revisions"}],"predecessor-version":[{"id":4778,"href":"https:\/\/blog.fxpal.net\/index.php?rest_route=\/wp\/v2\/posts\/4774\/revisions\/4778"}],"wp:attachment":[{"href":"https:\/\/blog.fxpal.net\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=4774"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blog.fxpal.net\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=4774"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blog.fxpal.net\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=4774"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}