{"id":16763,"date":"2016-03-12T08:02:22","date_gmt":"2016-03-12T07:02:22","guid":{"rendered":"https:\/\/jurnsearch.wordpress.com\/?p=16763"},"modified":"2016-03-12T08:02:22","modified_gmt":"2016-03-12T07:02:22","slug":"a-i-uh-oh","status":"publish","type":"post","link":"https:\/\/jurn.link\/jurnsearch\/index.php\/2016\/03\/12\/a-i-uh-oh\/","title":{"rendered":"A.I.? Uh oh&#8230;"},"content":{"rendered":"<p>A quick check of the front-page statement &#8220;Our corpus currently includes only computer science papers&#8221; on Paul Allen&#8217;s <a href=\"https:\/\/www.semanticscholar.org\/\">Semantic Scholar<\/a> shows that it&#8217;s no longer quite true. &#8220;Our corpus is mostly computer science papers and&#8230; a whole lot of other stuff that the A.I. dragged in&#8221; might be a more apt statement.  <\/p>\n<p>Semantic Scholar is definitely now ranging more widely in science, looking for fulltext PDFs.  I&#8217;d guess that its A.I. is working outward from highly-cited papers and ferreting among their citations to try to dig up the fulltext for each.  That would explain what appears to be the eclectic nature of Semantic Scholar&#8217;s spread away from computer science.  On searches for ecology and other not-computer-sci stuff I very easily found a Powerpoint in PDF, a workshop presentation, even a saved print-to-PDF of a book reviews page in <em>Science<\/em>&#8230;<\/p>\n<p><a href=\"https:\/\/jurn.link\/jurnsearch\/2016\/03\/science.jpg\" rel=\"attachment wp-att-16764\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/jurn.link\/jurnsearch\/2016\/03\/science.jpg?w=231\" alt=\"science\" width=\"231\" height=\"300\" class=\"alignnone size-medium wp-image-16764\" \/><\/a><\/p>\n<p>&#8230; as well as PDF papers from MDPI and ResearchGate, plus really obscurely self-archived and departmental archived PDFs. That kind of scattergun approach and lack of judicious curation seems to me to be the sign of a self-learning baby A.I. in action.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>A quick check of the front-page statement &#8220;Our corpus currently includes only computer science papers&#8221; on Paul Allen&#8217;s Semantic Scholar &hellip;<\/p>\n<p><a href=\"https:\/\/jurn.link\/jurnsearch\/index.php\/2016\/03\/12\/a-i-uh-oh\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[10,14],"tags":[],"class_list":["post-16763","post","type-post","status-publish","format-standard","hentry","category-my-general-observations","category-ooops"],"_links":{"self":[{"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/posts\/16763","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/comments?post=16763"}],"version-history":[{"count":0,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/posts\/16763\/revisions"}],"wp:attachment":[{"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/media?parent=16763"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/categories?post=16763"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/tags?post=16763"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}