{"id":5909,"date":"2014-10-23T04:41:13","date_gmt":"2014-10-23T11:41:13","guid":{"rendered":"http:\/\/palblog.fxpal.com\/?p=5909"},"modified":"2014-10-20T15:16:31","modified_gmt":"2014-10-20T22:16:31","slug":"video-text-retouch","status":"publish","type":"post","link":"https:\/\/blog.fxpal.net\/?p=5909","title":{"rendered":"video text retouch"},"content":{"rendered":"<p>Several of us just returned from ACM <a href=\"http:\/\/www.acm.org\/uist\/uist2014\/\">UIST 2014<\/a> where we presented some new work as part of the\u00a0<a href=\"http:\/\/www.fxpal.com\/research-projects\/cemint\/\">cemint<\/a> project. \u00a0One vision of the cemint project is to build\u00a0applications for multimedia content\u00a0manipulation and reuse that are as powerful as their analogues for text content. \u00a0We are\u00a0working towards this goal\u00a0by exploiting two key tools. \u00a0First, we want to use real-time content analysis to expose useful structure within multimedia content. \u00a0Given some\u00a0decomposition of the content, which can be spatial, temporal, or even semantic, we then allow users to interact with these sub-units or segments via direct manipulation. \u00a0Last year, we began exploring these ideas in our work on content-based\u00a0<a href=\"http:\/\/www.fxpal.com\/publications\/content-based-copy-and-paste-from-video-documents\/\">video copy and paste<\/a>.<\/p>\n<p>As another\u00a0embodiment of these ideas, we demonstrated video text retouch at UIST last week. \u00a0Our browser-based system\u00a0performs real-time\u00a0text detection on streamed video frames to locate both\u00a0words and lines. \u00a0When a user clicks on a frame, a live cursor appears next to the nearest word. \u00a0At this point, users can alter text directly using the keyboard. 
\u00a0When they do so, a video overlay is created to capture and display their edits.<\/p>\n<p><iframe loading=\"lazy\" src=\"\/\/www.youtube.com\/embed\/6JDnWcrD5lo\" width=\"560\" height=\"315\" frameborder=\"0\" allowfullscreen=\"allowfullscreen\"><\/iframe><\/p>\n<p>Because we perform text detection on every frame, we can track the corresponding line&#8217;s location as the edited text shifts vertically or horizontally over the course of the original (unedited source) video, and update the overlaid content appropriately.<\/p>\n<p>By\u00a0leveraging\u00a0our familiarity with manipulating text, this work\u00a0exemplifies the larger goal of using interaction metaphors rooted in content creation to enhance\u00a0both the consumption and reuse of live multimedia streams. \u00a0We believe that integrating real-time content analysis and interaction design can help us create improved tools for multimedia content usage.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Several of us just returned from ACM UIST 2014 where we presented some new work as part of the\u00a0cemint project. \u00a0One vision of the cemint project is to build\u00a0applications for multimedia content\u00a0manipulation and reuse that are as powerful as their analogues for text content. \u00a0We are\u00a0working towards this goal\u00a0by exploiting two key tools. 
\u00a0First, we [&hellip;]<\/p>\n","protected":false},"author":50,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":[],"categories":[24,128,7,1],"tags":[],"jetpack_featured_media_url":"","_links":{"self":[{"href":"https:\/\/blog.fxpal.net\/index.php?rest_route=\/wp\/v2\/posts\/5909"}],"collection":[{"href":"https:\/\/blog.fxpal.net\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blog.fxpal.net\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blog.fxpal.net\/index.php?rest_route=\/wp\/v2\/users\/50"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.fxpal.net\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=5909"}],"version-history":[{"count":6,"href":"https:\/\/blog.fxpal.net\/index.php?rest_route=\/wp\/v2\/posts\/5909\/revisions"}],"predecessor-version":[{"id":5917,"href":"https:\/\/blog.fxpal.net\/index.php?rest_route=\/wp\/v2\/posts\/5909\/revisions\/5917"}],"wp:attachment":[{"href":"https:\/\/blog.fxpal.net\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=5909"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blog.fxpal.net\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=5909"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blog.fxpal.net\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=5909"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}