{"id":2638,"date":"2010-01-05T07:48:23","date_gmt":"2010-01-05T15:48:23","guid":{"rendered":"http:\/\/palblog.fxpal.com\/?p=2638"},"modified":"2010-01-04T23:00:57","modified_gmt":"2010-01-05T07:00:57","slug":"2638","status":"publish","type":"post","link":"https:\/\/blog.fxpal.net\/?p=2638","title":{"rendered":"Measuring Scholarometer"},"content":{"rendered":"<p>The ability to manage references to papers is an extremely useful tool for academics. As I see it, the tools divide into two classes: one for managing references while writing, and the other for managing references (often your own) for bibliographic purposes such as putting together your CV. Tools such as <a title=\"EndNote | Thompson Reuters\" href=\"http:\/\/www.endnote.com\/\" target=\"_blank\">EndNote<\/a> and <a title=\"Mendely.com\" href=\"http:\/\/www.mendeley.com\/\" target=\"_blank\">Mendeley<\/a> are designed to manage a database of references that can be embedded in documents (such as MS Word) without the need to re-enter all the metadata. The tools work, but are brittle and prone to corrupting the manuscript.<\/p>\n<p>Recently, a number of tools (often based on Google Scholar as the search\/data mining engine) have been released. I <a title=\"In pursuit of impact | FXPAL Blog\" href=\"http:\/\/palblog.fxpal.com\/?p=1917\" target=\"_blank\">reviewed<\/a> <a title=\"CitationTracker\" href=\"http:\/\/citation-tracker.com\/\" target=\"_blank\">CitationTracker<\/a> earlier, and have now gotten around to looking at <a title=\"Scholarometer | Indiana University\" href=\"http:\/\/scholarometer.indiana.edu\/\" target=\"_blank\">Scholarometer<\/a>.<\/p>\n<p><!--more--> Scholarometer is a Firefox plug-in for issuing and parsing Google Scholar queries. It takes a query consisting of one or more authors&#8217; names and a tag from one of several <a title=\"What tags can I use? Scholarometer FAQ\" href=\"http:\/\/scholarometer.indiana.edu\/faqs.html#what-tags\" target=\"_blank\">classification schemes<\/a>. 
Based on the results returned by Google, it displays a sortable list of publication references, and allows the user to merge duplicates and to remove erroneous references. The plug-in approach is clever because Google does not provide an API for querying Google Scholar, making it difficult to implement a server-based approach to managing these data.<\/p>\n<p>The plug-in uses the papers returned by Google Scholar to compute <a title=\"Universal h-index | Scholarometer FAQ\" href=\"http:\/\/scholarometer.indiana.edu\/faqs.html#universal-h-index\" target=\"_blank\">h-index<\/a> and g-index scores, and generates a couple of visualizations (a Zipfian citation frequency plot or a bar chart of the number of publications in the last few years). They are pretty, but not particularly informative for one&#8217;s own work, and it&#8217;s not clear whether they actually provide actionable information for others&#8217; publications either. Overall, though, the interface is well put together and does not require any registration (although it does require a Firefox plug-in download).<\/p>\n<p>The main limitation of the tool is that it relies on Google Scholar for its initial data, and Google Scholar data is pretty noisy. In a standard vanity search, it came up with over 150 hits for my last name, some of which were papers by my father, some of which were patents in which I wasn&#8217;t interested for this purpose, some of which were junk or poorly-parsed entries, and of course, some were actual papers I wrote and co-wrote. In the end, I whittled the 150+ papers down to a more <a title=\"&quot;golovchinsky&quot;  in &quot;science &gt; computer scinece, information systems&quot; | Scholarometer\" href=\"http:\/\/scholarometer.indiana.edu\/cgi-bin\/index.cgi?func=query&amp;expr=f7772854e760e24ca68358868ccaebcc\" target=\"_blank\">manageable list<\/a> of 83 entries. 
I am sure I could reduce it further, but I had a post to write.<\/p>\n<p>While the duplicate merging process is straightforward &#8212; select two or more entries and press &#8220;merge&#8221; &#8212; it suffers from at least two problems. First, the metadata is merged in a seemingly arbitrary way, without respecting the number of citations of the alternate versions. Thus, rather than incorporating two stray references on which the parser choked (putting an author&#8217;s name in the title, for example) into a nicely-parsed entry with hundreds of citations, it does the reverse, throwing out perfectly good data. And the second part of its one-two punch is that there appears to be no undo operation.<\/p>\n<p>The site is a work in progress, so the UI limitations will likely be worked out. It will be interesting to see what quality of data the operators of this service collect. They have promised to make it public, but no details of the plan have been disclosed on the Scholarometer web site. For now, they have made some statistics available on their web site. You can select a field from their taxonomy (e.g., &#8220;science &gt; computer science&#8221;) and a measure (e.g., h-score), and it will display a list of the top few people. While mildly interesting, this is not as informative as a distribution chart showing how many people have particular scores.<\/p>\n<p>Other features I would like to see on this site include the sharing of information in a structured manner. If I have done a vanity search and cleaned up references, I should be able to share that information with others, and others should be able to find that information about me. While it appears that I can link to specific queries (although I don&#8217;t know how long these URLs will persist), I have no way of publishing my results through the site. 
This seems like an oversight on the part of the designers of the system, because one sure way to motivate people to create content is to create a mechanism through which that information <em>about them<\/em> is made public.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The ability to manage references to papers is an extremely useful tool for academics. As I see it, the tools divide into two classes: one for managing references while writing, and the other for managing references (often your own) for bibliographic purposes such as putting together your CV. Tools such as EndNote and Mendeley are [&hellip;]<\/p>\n","protected":false},"author":4,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[1],"tags":[155],"jetpack_featured_media_url":"","_links":{"self":[{"href":"https:\/\/blog.fxpal.net\/index.php?rest_route=\/wp\/v2\/posts\/2638"}],"collection":[{"href":"https:\/\/blog.fxpal.net\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blog.fxpal.net\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blog.fxpal.net\/index.php?rest_route=\/wp\/v2\/users\/4"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.fxpal.net\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=2638"}],"version-history":[{"count":8,"href":"https:\/\/blog.fxpal.net\/index.php?rest_route=\/wp\/v2\/posts\/2638\/revisions"}],"predecessor-version":[{"id":2646,"href":"https:\/\/blog.fxpal.net\/index.php?rest_route=\/wp\/v2\/posts\/2638\/revisions\/2646"}],"wp:attachment":[{"href":"https:\/\/blog.fxpal.net\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=2638"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blog.fxpal.net\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=2638"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blog.fxpal.net\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=2638"}],"curies":[{"name":"wp","
href":"https:\/\/api.w.org\/{rel}","templated":true}]}}