{"id":10966,"date":"2014-05-15T21:16:11","date_gmt":"2014-05-15T21:16:11","guid":{"rendered":"http:\/\/jurnsearch.wordpress.com\/?p=10966"},"modified":"2014-05-15T21:16:11","modified_gmt":"2014-05-15T21:16:11","slug":"odysci","status":"publish","type":"post","link":"https:\/\/jurn.link\/jurnsearch\/index.php\/2014\/05\/15\/odysci\/","title":{"rendered":"Odysci"},"content":{"rendered":"<p><a href=\"http:\/\/academic.odysci.com\/\">Odysci Academic Search<\/a> is aimed at allowing&#8230;<\/p>\n<blockquote><p>&#8220;technical professionals and companies to find and use the relevant technical information&#8221; in &#8220;computer science, electrical engineering and math-related areas&#8221;<\/p><\/blockquote>\n<p>Their blog entries tail off and stop in 2011, so it&#8217;s been around for a while, but the developers <a href=\"http:\/\/www.dlib.org\/dlib\/may14\/bergamaschi\/05bergamaschi.html\">have a new paper<\/a> which describes the technical infrastructure and gives the algorithms.  I was interested to learn that&#8230;<\/p>\n<blockquote><p>&#8220;This framework is able to import, de-duplicate and persist 200K papers in the database (and all their entities) in 16 hours on an i7-based workstation with 32GB of RAM.&#8221;<\/p><\/blockquote>\n<p>My broad test search for&#8230;<\/p>\n<p>&nbsp;&nbsp;&nbsp;&#8220;energy conservation&#8221; organizations<\/p>\n<p>&#8230; gave me 44 results which included three fulltext links. That suggests that when Odysci imports records, there might be PDF links on less than 10% of those records?<\/p>\n<p>On that basis I would guesstimate an ability to ingest, strip and process perhaps 20,000 fulltext PDF papers every 16 hours, give or take?  So in terms of making a standalone JURN, give me six such PCs and the bulk of the humanities journal indexing might be done in&#8230; six months?  Keep in mind that processing power is increasing (the Core i7 CPU line was introduced in 2008).  If Odysci&#8217;s i7 is a circa-2008 CPU then more modern processors will do the job faster, and superfast broadband would speed up the actual PDF downloading.<\/p>\n<p>The same search in JURN tended to foreground papers on the role of human behaviours\/attitudes and public policy in organizational energy conservation &mdash; rather than the technical aspects of electrical implementation.  That suggests that &mdash; despite the recent science additions &mdash; JURN will tend to veer toward &#8216;the human element&#8217; of topics.  I also ran the test in Google Scholar, which proved to have the same veer, though with a heavier emphasis on articles from Psychology.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Odysci Academic Search is aimed at allowing&#8230; &#8220;technical professionals and companies to find and use the relevant technical information&#8221; in &hellip;<\/p>\n<p><a href=\"https:\/\/jurn.link\/jurnsearch\/index.php\/2014\/05\/15\/odysci\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[2,16],"tags":[],"class_list":["post-10966","post","type-post","status-publish","format-standard","hentry","category-academic-search","category-spotted-in-the-news"],"_links":{"self":[{"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/posts\/10966","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/comments?post=10966"}],"version-history":[{"count":0,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/posts\/10966\/revisions"}],"wp:attachment":[{"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/media?parent=10966"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/categories?post=10966"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/tags?post=10966"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}