{"id":1624,"date":"2009-06-13T00:41:22","date_gmt":"2009-06-13T00:41:22","guid":{"rendered":"http:\/\/jurnsearch.wordpress.com\/?p=1624"},"modified":"2009-06-13T00:41:22","modified_gmt":"2009-06-13T00:41:22","slug":"getting-only-the-free-articles-into-jurn","status":"publish","type":"post","link":"https:\/\/jurn.link\/jurnsearch\/index.php\/2009\/06\/13\/getting-only-the-free-articles-into-jurn\/","title":{"rendered":"Getting only the free articles into JURN"},"content":{"rendered":"<p>Someone asked about what comes into the JURN index, when a title is indexed but only offers a limited amount of free full-text or &#8220;free-sample&#8221; articles.  Does the rest of the online material (link-less tables-of-contents, abstracts with no full-text links, etc.) from the journal also enter JURN?  The answer is: no, not usually. It&#8217;s usually possible to filter at the URL level so that <em>only<\/em> the free content enters JURN.  For example, by only indexing URLs such as:<\/p>\n<blockquote><p>http:\/\/www.journal.com\/journal\/sample\/*.pdf<\/p>\n<p>http:\/\/www.journal.edu\/journalABC\/documents\/*.pdf<\/p><\/blockquote>\n<p>A real-world example is:<\/p>\n<blockquote><p>http:\/\/www.egyptpro.sci.waseda.ac.jp\/pdf*\/*\/*.pdf<\/p><\/blockquote>\n<p>Here &#8220;*&#8221; is the Google CSE wildcard. Of course, if some dimwit IT techie then decides to juggle the directory structure, that will erase the journal from JURN.  But that&#8217;s a risk any directory or search engine takes.<\/p>\n<p>Sometimes a few PDFs to do with society or journal administration matters can be called into search along with the articles, if all the PDFs sit indiscriminately in a single URL path.  A search for:<\/p>\n<blockquote><p>site:http:\/\/www.scholarly-society-journal.info\/ filetype:pdf<\/p><\/blockquote>\n<p>&#8230; will usually show if there are too many of these.  Google tends to bunch that sort of material at the top of site: search results.  
Usually there are only a dozen or so.<\/p>\n<p>It&#8217;s different with the few ejournals that cheekily use standard &#8216;open access&#8217; publishing software, but which actually keep recent articles locked away behind a one-year or even three-year rolling paywall. The software is not intelligent enough to place the abstract pages of paywalled articles on a different and distinctive URL path, and then to automatically transfer and bounce these when the article becomes free.  But indexing only the .pdf path in such cases will usually call only full-text articles into JURN.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Someone asked about what comes into the JURN index, when a title is indexed but only offers a limited amount &hellip;<\/p>\n<p><a href=\"https:\/\/jurn.link\/jurnsearch\/index.php\/2009\/06\/13\/getting-only-the-free-articles-into-jurn\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[7],"tags":[],"class_list":["post-1624","post","type-post","status-publish","format-standard","hentry","category-jurn-metrics"],"_links":{"self":[{"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/posts\/1624","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/comments?post=1624"}],"version-history":[{"count":0,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/posts\/1624\/revisions"}],"wp:attachment":[{"href":"https:\/\/jurn.link\/jurnsearch\/index.php\
/wp-json\/wp\/v2\/media?parent=1624"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/categories?post=1624"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/tags?post=1624"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}