Extreme right

The EU’s “right to be forgotten” ruling is now blocking access to historical Holocaust archives, reports the Jerusalem Post

Researchers across the continent – especially in Sweden, France and Germany – have claimed that archivists have begun restricting access to data, citing the GDPR as their rationale for not complying with requests for documents. Because the legislation does not stipulate how long after a person’s death his or her private information can be revealed, or when access to such information can be granted, some archivists “have begun reading into what they understand the law will be,” and are “barring access to materials, including materials [related to] the history of the Holocaust,” Dr. Robert Williams said.”

For the birds….

My openECO A-Z listing of journals has now had all known free bird titles added to it, with help from the open ejournals in the Ornithology Exchange list and the British Trust for Ornithology list of open ejournals. All the new bird journal URLs were closely checked before being added to the A-Z, since those older lists have a lot of linkrot and even some mis-attribution of OA status.

Some figures on OA in Web of Science

Thomson Reuters on OA in Web of Science

The Web of Science [has] more than 12% of its core collection database in Open Access journals, many with direct, full-text links to open content (see Figure 1).”

oa-titles-wos-2015Fig. 1: “Open Access Titles in the Web of Science by Discipline & Geography” (SOURCE: Thomson Reuters Web of Science)

The total of 72 OA titles in arts & humanities (all in English?) is comparable to Scopus. Scopus had 60 OA arts & humanities titles in English at June 2015, a fact discoverable via their new OA tagging. Though, after sorting, that Scopus category also included such tres arty titles as Canadian Journal of Speech-Language Pathology, Journal of Biomedical Discovery and Collaboration and Asian Social Science.

In defense of the lecture

From The New York Times today, a defense of the value to the humanities of the classic hour-long lecture / following discussion format, augmented by concise hand-written note-taking by students. Of course a lot will depend on one’s ability to deliver the lecture without reading verbatim or killing it with Powerpoint, plus the level of one’s physical dynamism, clarity of diction and strength of voice. And one needs students who know how to take good selective paper notes and sketch mini-diagrams, and who are not either hungover/shivering in the morning or exhausted at the end of the day. So it’s not an option for everyone, nor at all times of the day.

Semantic Scholar

Another month, another search-engine for the well-thumbed corpus of academic articles in Computer Science. Semantic Scholar is a touch different though, as it’s been developed at the Paul Allen Institute for Artificial Intelligence and it just searches 3 million open access papers. As such I guess that most Computer Science students may come to think of it as just a much more elegantly designed and somewhat faster equivalent of Microsoft Academic, minus the pesky records with no PDF links.

Semantic Scholar reportedly plans to expand to the neurosciences and biomedical by 2016-18. And, of course, one should never underestimate the Microsoft tortoise/hare growth method (Allen is a Microsoft founder) — what looks like a lackluster tortoise at first slowly builds and redefines, and re-builds and expands again over the years, until suddenly it’s out in front of the race. That process stalled with the reported ceasing of further development on Microsoft Academic, but it may be that Semantic Scholar is effectively Microsoft’s arms-length second try at that? Just my guess.

As with most such ventures, it seems to be cloaking the allegedly A.I. / semantics-assisted development of something far more commercial and widely applicable: accurate automatic full-text detection (CORE could only get to around 27% with that on academic repositories, last I heard), then document structure evaluation, extraction, segmentation and re-formatting. Which is nice, if one only has to organise an interface for a very well-behaved corpus of Computer Science papers. Semantic Scholar certainly looks like it can do that, and elegantly too, though I’m not qualified to comment on its relevancy ranking or the alleged semantics aspects. But I suspect we’re still many decades from having an autobot that can tame the messy Wild West of open publishing in that manner.

Museum fur Kunst und Gewerbe Hamburg

Museum fur Kunst und Gewerbe Hamburg has placed online images of all items in its collections, flagged as Public Domain. The site’s image server seems to have crashed over the weekend on all but the front page, but a Google Images search suggests that the largest size is 1200px on the longest side. The site search is also unreliable at picking out hyphenated or apostrophed words: for instance, a search for Orme doesn’t bring up the photo-chromolithograph “Llandudno from the Great Orme’s Head” (which, incidentally, is available at 28Mb from the Library of Congress).

mg