{"id":3995,"date":"2009-11-30T02:07:08","date_gmt":"2009-11-30T02:07:08","guid":{"rendered":"http:\/\/jurnsearch.wordpress.com\/2009\/11\/30\/do-we-need-a-new-cse-for-repositories\/"},"modified":"2009-11-30T02:07:08","modified_gmt":"2009-11-30T02:07:08","slug":"do-we-need-a-new-cse-for-repositories","status":"publish","type":"post","link":"https:\/\/jurn.link\/jurnsearch\/index.php\/2009\/11\/30\/do-we-need-a-new-cse-for-repositories\/","title":{"rendered":"Do we need a new CSE for repositories?"},"content":{"rendered":"<p>Do we need a new Google CSE for academic repositories?  The old ones are looking rather long in the tooth, and their link-rot must be getting pretty bad by now.<\/p>\n<p><a href=\"http:\/\/www.opendoar.org\/\">Open DOAR<\/a> search, according to the date on the foot of the search page, <a href=\"http:\/\/www.opendoar.org\/search.php\">has not updated since Nov  2006<\/a>.  Similarly, <a href=\"http:\/\/roar.eprints.org\/\">ROAR<\/a>&#8216;s own Google Custom Search Engine has <a href=\"http:\/\/www.google.com\/cse\/home?cx=008790892318713498856%3Abr_k-l0-cny\">not been updated since Nov 2006<\/a>.<\/p>\n<p>I think it&#8217;s time for a new and up-to-date one. It shouldn&#8217;t be difficult to extract the URLs from a downloaded set of <a href=\"http:\/\/www.opendoar.org\/countrylist.php\">OpenDOAR country pages<\/a>, which are still actively maintained.  It&#8217;s even easier to download <a href=\"http:\/\/roar.eprints.org\/index.php?action=csv\">the .csv<\/a> of all the URLs from ROAR and to extract them with Excel. As with OpenDOAR, it seems that the ROAR repository list is up-to-date, even if the CSE isn&#8217;t. One would then combine the lists and de-duplicate, clean the list, and then upload the cleaned list to a sparkly new Google Custom Search Engine.  If I had the space to add another 2,000 URLs to my Google CSEs, I&#8217;d do it myself.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Do we need a new Google CSE for academic repositories? The old ones are looking rather long in the tooth, &hellip;<\/p>\n<p><a href=\"https:\/\/jurn.link\/jurnsearch\/index.php\/2009\/11\/30\/do-we-need-a-new-cse-for-repositories\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[2,5,10],"tags":[],"class_list":["post-3995","post","type-post","status-publish","format-standard","hentry","category-academic-search","category-how-to-improve-academic-search","category-my-general-observations"],"_links":{"self":[{"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/posts\/3995","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/comments?post=3995"}],"version-history":[{"count":0,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/posts\/3995\/revisions"}],"wp:attachment":[{"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/media?parent=3995"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/categories?post=3995"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/tags?post=3995"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}