Amazon creepers

13 Friday Nov 2015

Posted by futurilla in Spotted in the news

It looks like Amazon is about to have a major problem, in terms of the findability of bona fide academic books. Caused by get-rich-quick marketeers setting up bots that shovel millions of individual CC-BY open access journal articles onto Amazon, for sale as $5 “e-books”.

Early additions have been cleaned by hand

11 Wednesday Nov 2015

Posted by futurilla in My general observations

≈ Leave a comment

All the early additions (2009-10) to JURN have now been checked by hand for viability, and corrected/deleted if needed. This work was in addition to the summer 2015 annual check of all indexed URLs, checking for their continuing presence in the Google Search index.

Extreme right

11 Wednesday Nov 2015

Posted by futurilla in Ooops!, Spotted in the news

≈ Leave a comment

The EU’s “right to be forgotten” ruling is now blocking access to historical Holocaust archives, reports the Jerusalem Post…

Researchers across the continent – especially in Sweden, France and Germany – have claimed that archivists have begun restricting access to data, citing the GDPR as their rationale for not complying with requests for documents. Because the legislation does not stipulate how long after a person’s death his or her private information can be revealed, or when access to such information can be granted, some archivists “have begun reading into what they understand the law will be,” and are “barring access to materials, including materials [related to] the history of the Holocaust,” Dr. Robert Williams said.”

For the birds….

07 Saturday Nov 2015

Posted by futurilla in Ecology additions, My general observations

≈ Leave a comment

My openECO A-Z listing of journals has now had all known free bird titles added to it, with help from the open ejournals in the Ornithology Exchange list and the British Trust for Ornithology list of open ejournals. All the new bird journal URLs were closely checked before being added to the A-Z, since those older lists have a lot of linkrot and even some mis-attribution of OA status.

Some figures on OA in Web of Science

07 Saturday Nov 2015

Posted by futurilla in Spotted in the news

≈ 3 Comments

Thomson Reuters on OA in Web of Science…

The Web of Science [has] more than 12% of its core collection database in Open Access journals, many with direct, full-text links to open content (see Figure 1).”

oa-titles-wos-2015 Fig. 1: “Open Access Titles in the Web of Science by Discipline & Geography” (SOURCE: Thomson Reuters Web of Science)

The total of 72 OA titles in arts & humanities (all in English?) is comparable to Scopus. Scopus had 60 OA arts & humanities titles in English at June 2015, a fact discoverable via their new OA tagging. Though, after sorting, that Scopus category also included such tres arty titles as Canadian Journal of Speech-Language Pathology, Journal of Biomedical Discovery and Collaboration and Asian Social Science.

Hansard Corpus

06 Friday Nov 2015

Posted by futurilla in Spotted in the news

≈ Leave a comment

Hansard Corpus: British Parliament, 1803-2005…

“contains nearly every speech given in the British Parliament from 1803-2005, and it allows you to search these speeches (including semantically-based searches) in ways that are not possible with any other resource.”

200 new titles

05 Thursday Nov 2015

Posted by futurilla in JURN metrics

≈ Leave a comment

Over the last six weeks JURN has added indexing for 200+ new open/free titles published in English, plus about 15 caches of open books and monographs.

In defense of the lecture

04 Wednesday Nov 2015

Posted by futurilla in Spotted in the news

≈ Leave a comment

From The New York Times today, a defense of the value to the humanities of the classic hour-long lecture / following discussion format, augmented by concise hand-written note-taking by students. Of course a lot will depend on one’s ability to deliver the lecture without reading verbatim or killing it with Powerpoint, plus the level of one’s physical dynamism, clarity of diction and strength of voice. And one needs students who know how to take good selective paper notes and sketch mini-diagrams, and who are not either hungover/shivering in the morning or exhausted at the end of the day. So it’s not an option for everyone, nor at all times of the day.

Semantic Scholar

03 Tuesday Nov 2015

Posted by futurilla in Academic search, Spotted in the news

≈ Leave a comment

Another month, another search-engine for the well-thumbed corpus of academic articles in Computer Science. Semantic Scholar is a touch different though, as it’s been developed at the Paul Allen Institute for Artificial Intelligence and it just searches 3 million open access papers. As such I guess that most Computer Science students may come to think of it as just a much more elegantly designed and somewhat faster equivalent of Microsoft Academic, minus the pesky records with no PDF links.

Semantic Scholar reportedly plans to expand to the neurosciences and biomedical by 2016-18. And, of course, one should never underestimate the Microsoft tortoise/hare growth method (Allen is a Microsoft founder) — what looks like a lackluster tortoise at first slowly builds and redefines, and re-builds and expands again over the years, until suddenly it’s out in front of the race. That process stalled with the reported ceasing of further development on Microsoft Academic, but it may be that Semantic Scholar is effectively Microsoft’s arms-length second try at that? Just my guess.

As with most such ventures, it seems to be cloaking the allegedly A.I. / semantics-assisted development of something far more commercial and widely applicable: accurate automatic full-text detection (CORE could only get to around 27% with that on academic repositories, last I heard), then document structure evaluation, extraction, segmentation and re-formatting. Which is nice, if one only has to organise an interface for a very well-behaved corpus of Computer Science papers. Semantic Scholar certainly looks like it can do that, and elegantly too, though I’m not qualified to comment on its relevancy ranking or the alleged semantics aspects. But I suspect we’re still many decades from having an autobot that can tame the messy Wild West of open publishing in that manner.

Museum fur Kunst und Gewerbe Hamburg

02 Monday Nov 2015

Posted by futurilla in Spotted in the news

≈ Leave a comment

Museum fur Kunst und Gewerbe Hamburg has placed online images of all items in its collections, flagged as Public Domain. The site’s image server seems to have crashed over the weekend on all but the front page, but a Google Images search suggests that the largest size is 1200px on the longest side. The site search is also unreliable at picking out hyphenated or apostrophed words: for instance, a search for Orme doesn’t bring up the photo-chromolithograph “Llandudno from the Great Orme’s Head” (which, incidentally, is available at 28Mb from the Library of Congress).

News from JURN

~ search tool for open access content

Amazon creepers

Early additions have been cleaned by hand

Extreme right

For the birds….

Some figures on OA in Web of Science

Hansard Corpus

200 new titles

In defense of the lecture

Semantic Scholar

Museum fur Kunst und Gewerbe Hamburg