The COAR of the issue

29 Friday Jan 2021

Posted by futurilla in Academic search, Economics of Open Access, Open Access publishing, Spotted in the news

A useful new analysis today from COAR, “Don’t believe the hype: repositories are critical for ensuring equity, inclusion and sustainability in the transition to open access”. Recent…

publishers’ comments portray gold open access as the only ‘legitimate’ route for open access, and attempt to diminish the repository (or green) route.

According to the author, some publishers are even implying that repositories have no aggregators, or are not present in Google Search or in specialist search-engines such as Scholar and GRAFT. Laughably, they apparently suggest that poor over-worked researchers will instead…

need to search through individual repositories to find the articles.

The publishers are also said to be trying to stop all but a sub-set of elite repositories from being used for data deposit, via…

proposing to define the repository selection criteria for where their authors’ should deposit research data. These criteria, which are very narrowly conceived, threaten to exclude thousands of national and institutional repositories as options for deposit.

Again, this sounds like it is designed to make researchers feel it’s more convenient to publish their article + data via a big publisher.

Report: Equitable access to research in a changing world

28 Monday Sep 2020

Posted by futurilla in Academic search, Economics of Open Access, Official and think-tank reports, Spotted in the news

≈ Leave a comment

Released in June 2020, a new consultancy report titled “Equitable access to research in a changing world: Research4Life Landscape and Situation Analysis”. This surveys the pressures on the Research4Life aid programmes. Established 20 years ago, Research4Life gives developing countries “free or low-cost” online access to journals and books from some 175 publishers. Along with other aid initiatives, this means that African universities often have better free access to journal databases than do some academics in advanced nations. The new report makes no recommendations, but a key point to note is that…

… some of the most relevant and influential research undertaken in low-and-middle income countries happens outside academia: in specialised research institutes, think tanks, or government-backed research agencies. In some countries, research agencies and institutes conduct research in national priority areas and have direct access to and influence on decision-makers” [yet] “these non-governmental organisations have in the past been excluded from open access debates, and may be unable to take advantage of initiatives such as Research4Life.

It could be useful to quantify that “may”, through further research. Do developing nations find roundabout ways to include their research agencies in Research4Life, such as giving off-campus agency researchers special log-ins to access the national university system? Or are such arrangements rather moot, in the age of open-access and Sci-hub? If not, would there be a real benefit if Research4Life were to be extended to bona fide government research agencies and suitable NGOs? How much would such an expansion actually cost, and what could the returns be in such nations?

Well endowed

18 Tuesday Jul 2017

Posted by futurilla in Economics of Open Access

≈ Leave a comment

“Financing Open Access” at Cultural Anthropology gives a figure on production costs. Apparently it costs them $50,000 a year to run a polished fees-free open access journal in the form of Cultural Anthropology. I’m not sure if that $50k figure is a notional “if people were actually paid” or a more grounded “people are actually paid, now”. A Google site: search of /culanth.org/articles/ suggests around 1,000 articles and short notes since 1986, and a current production rate of approx. six articles per quarter, plus another half-dozen short notes. One of the best long-term options, according to the article, seems to be…

“Establishing an endowment. The experts with whom FoCA has consulted have unanimously advised that an endowment is, by far, the best way to stabilize Cultural Anthropology’s financial situation in the long term.”

At a guess a $1.5m endowment donation would presumably perform at around 5% income, or $75,000 per year. Meaning $50k income per year, plus a $25k buffer for re-investment / management fees / mismanagement insurance. $1.5m is ambitious, but a slow crowdfunder + some chunky legacies in wills might do it. Once it’s in, it’s there forever.

Lulu.com launches academic service suite – Glasstree

01 Thursday Dec 2016

Posted by futurilla in Economics of Open Access, Open Access publishing, Spotted in the news

≈ Leave a comment

The leaders in affordable print-on-demand, Lulu.com, have just launched a book publishing service for academics. Glasstree offers the…

“tools and services needed by academic authors, and will leverage technology, such as print-on-demand, to distribute their works more cost-effectively. [aims to boost the] commercial academic publishing market, such as accelerating time to market, more transparent pricing, and reversing the revenue model to allow academics and scholars to realize 70% of the profit from sales of their work. Among Glasstree’s advertised services: support for open access, including the deposit of works in institutional repositories; Tools for bibliometric tracking, so academic authors can monitor Impact Factors, and other relevant measurements; More control over licensing options, through a partnership with Creative Commons; and access to traditional peer review.”

Note that…

“Glasstree is currently in a limited free trial period until 31st December 2016. During this time, authors can publish as many titles as desired, free of charge, receiving a range of complimentary services.”

Somehow I doubt that includes the related Glassleaf services where book production… “Packages start at as low as $2,625”. Ouch.

The Glasstree signup doesn’t port over your existing Lulu details, and thus presumably can’t port your academic book files over from Lulu either. Looks like it’s a wholly separate system.

Digital Monograph Costing Tool

14 Monday Nov 2016

Posted by futurilla in Economics of Open Access, Spotted in the news

≈ Leave a comment

A new Digital Monograph Costing Tool from American University Presses.

Africa is coming online

28 Tuesday Jun 2016

Posted by futurilla in Economics of Open Access, Spotted in the news

≈ Leave a comment

While African research universities usually have better commercial journal database access than their counterparts in the West (thanks to aid deals), what of public access to African-focused research? Great to hear an African voice on this, as Africa starts to buckle up for growth and international access. Chukwuemeka Fred Agbata Jnr. of Nigeria says that there is an…

“overwhelming call for the accessibility of African research [about Africa, but that this] has stretched traditional archiving methods.”

With a substantial increase in population and wealth now happening on the continent, he asks if there is now an opportunity…

“for archiving and digitising African-focused research [in order to] make African research accessible on a global scale.”

Let’s hope so. Although the author also suggests a commercial option, seemingly more in terms of access to contemporary and commercial data…

“monetising the whole process through a subscription model for online hosting of knowledge resources – books, research papers, journals, dissertations, and reports to investors, product and policy developers. [With African researchers getting] “a revenue share for each download”.

That might work for useful locally-created data — one might get the article or substantial data summary for free, anywhere in the world. But if you’re outside Africa then you’d buy the data download direct from the researcher, and in affluent nations your university would require you do that as part of your ethics code as a researcher. Though I’m not sure a commercial pay-per-download model would be useful for things like folklore, the arts, oral history and natural history, which might be better funded by a big pan-African consortium of nations, philanthropists and donors. And thus kept freely available.

OAPEN-UK final report

28 Thursday Jan 2016

Posted by futurilla in Economics of Open Access, Official and think-tank reports, Spotted in the news

≈ Leave a comment

OAPEN-UK’s final report on open access monographs, OAPEN-UK final report: A five-year study into open access monograph publishing in the humanities and social sciences.

“Many libraries will […] be providing links to the open access copies of monographs through their discovery systems, but librarians are not always aware of this. A minority are also reluctant to include open access content within their catalogues.”

“30% of respondents currently identify open access monographs for inclusion within their library collections – 49% do not, while 21% were unsure.” — Librarian survey for the report.

Unsure about including OA at all, or unsure if anyone on staff was identifying OA items?

“There are also large numbers of researchers – especially early career and retired academics – who do extremely valuable research which deserves publication but who work outside academic institutions. Changing publishing culture in a way that affected these researchers negatively would damage the overall discipline.”

“SFX Miscellaneous Free Ejournals Target”

19 Wednesday Aug 2015

Posted by futurilla in Academic search, Economics of Open Access, Spotted in the news

≈ 1 Comment

“SFX Miscellaneous Free Ejournals Target: Usage Survey Among the SFX Community“, Serials Review (2015), 41(2), pp. 58-68.

SFX is an OpenURL link resolver product for university libraries, focussed on the output of traditional publishers — of which 16-20% is apparently so dodgy in terms of quality that it breaks the system. Yet, rather amazingly, it appears that much of this 16-20% is still allowed to get to the point-of-use.

The article briefly surveys recent findings on how SFX copes with open access articles, and then the rest of the paper gives the results of a survey of librarians who integrate a specific ‘free’ section of SFX with their library discovery tools. It appears that scholars looking for open free full-text via SFX can expect way over 20% dead link errors on URLs…

… one category [of failure] (incorrect parse params) alone leads to 20% false positives (dead links) for MFE [the largest ‘free’ target in SFX]. Besides incorrect parse params, there are numerous other reasons for the occurrence of false positives (dead links), such as resolver translation error, inaccurate embargo data, provider target URL translation error, incomplete provider content, wrong coverage dates, indexed-only titles mistakenly considered as fulltext titles, and other reasons listed in the literature review section.”

So that might mean… perhaps 40% of links to open access full-text are dead? Or even more, like… 60%? The article doesn’t hazard a guess.

The DOAJ ‘targets’ are apparently not much better…

It’s an irony that I find discovery services generally have much poorer coverage of Open Access than Google Scholar. … Most discovery services have indexed DOAJ (Directory of Open Access Journals), but many libraries experience such a bad linking experience they just turn it off” — Aaron Tay, July 2015.

I’m pleased to say that JURN should have close to zero dead links on standalone journals, due to the way it is set up. JURN may lead to a few fleeting “server maintainance” / “timeout” errors here and there, but if the journal’s base URL for articles moves then its articles effectively get auto-removed from JURN’s results. But they get found again within a year at most, through an effective two-pronged method.

AWOL releases cleaned A-Z URL list.

18 Tuesday Aug 2015

Posted by futurilla in Economics of Open Access, My general observations, Spotted in the news

≈ Leave a comment

AWOL has a fascinating post today. It’s on the attempts to identify which AWOL linked resources have already been ingested into major long-term Web archives, and which haven’t. As part of that experiment Charles and his helpmate Ryan have offered their readers a nice big cleaned A-Z list of the “52,020 unique URLs” linked from AWOL, which is very good of them. I might clip these URLs back and de-duplicate, and then do a side-by-side sheet with JURN’s own indexing URLs and thus see what’s missing from JURN. Very little in terms of post-1945 journal articles, I suspect, though there may be some I’ve missed.

Of course a JURN Search already runs across the AWOL pages, as well as a great many of the post-war full-text originals (via Google). But if I were an Ancient History scholar I might now be tempted to get together with others to crowdfund a mass download of AWOL’s full-text, so that I could search across the full-text locally and minutely, without having to rely on Google etc. I reckon the entire set of AWOL full-text would fit on a 1.5Tb external drive and would cost around $10,000 to harvest by hand/eye. Why would that be needed? I’m assuming that many long-term Web archives are ‘dark’ or that license complications mean no single archive can ingest the entirety of what AWOL points to.

My calculations for the $10k figure start with the fact that a little over 10,000 of AWOL’s 52,020 URLs are straight-to-PDF links, and so very easily downloaded by a harvesting bot. Assuming an average of 5Mb per PDF, that means about 260Gb of disk storage space for those PDFs.

If one then assumes that perhaps 10,000 of the URLs are not going to articles (rather to such things as sites that show scans of original source manuscripts and old books that display in zoomable and frame-nested forms etc, huge datasets, that are difficult to extract and archive), then that might leave 32,000 URLs that are mostly likely to be links to either journal TOCs pages or individual articles.

Let’s assume that each of the 32,000 TOC page URLs lead to an average of 16 articles and reviews (though some 2,000 may be home-page links sitting above links to issue TOCs). So 32,000 = 512,000 articles of some kind, in PDF or HTML, on average weighing 1.5Mb each. So that’s 768Gb in total. In that case one might easily store all the AWOL-discovered full-text on an $80 1.5Tb external disk, and have space to spare for the desktop indexing software‘s own index, which would be fairly big. That is a product that I might find very useful, if I were an Ancient History student, specialist, or independent scholar without access to university databases.

But how to harvest those 512,000 articles? The brute force way would be to parcel up the 32,000 URLs into parcels of 150 each. That’s 230 parcels x 150 URLs. If one were paying 20 cents per URL to Indian freelancers, to go in and spend 3 minutes grabbing whatever articles are hanging off each of those 150 page URLs, plus the page, then that would cost $37 per parcel. Let’s say $40, with a small quality bonus. Let’s say it takes four hours to do the 150 URLs and not miss anything. So that’s $10 U.S. a hour — pretty good for an Indian freelancer with broadband, I don’t think anyone would be being exploited on that deal. So the whole 32,000 URL set would cost $9,200 to harvest by hand and eye, which seems well within the range of a small crowdfunding campaign.

Of course, it might be that the articles could be wholly or partly harvested by bot. But I suspect that a simple “page + anything it links to” harvest would bring in a lot of chaff alongside the articles, given the very varied and non-standard nature of what AWOL links to. Perhaps that wouldn’t matter in practice, when keyword searching across the entire harvest. Or one might be able to use a more intelligent bot, one using Google Scholar-like article-detection algorithms.

Open Access and the Humanities: Contexts, Controversies and the Future

29 Saturday Nov 2014

Posted by futurilla in Economics of Open Access, Spotted in the news

≈ Leave a comment

The new book Open Access and the Humanities: Contexts, Controversies and the Future is now available from Cambridge University Press, including a free online version as PDF chapters. It seems a usefully comprehensive and dense primer on the subject. But fairly short, at 150 pages for the chapters. Chapters open in a PDF viewer in the browser, but if you use Internet Explorer it should ignore the javascript obfuscation and offer to let you download as a PDF file.

However it doesn’t seem to be a book to go to for an in-depth discussion of public discoverability and search. There is some slight discussion of discoverability on page 53, briefly suggesting that if the academy wishes to make a believable claim to act as an agent of social change, then it must pass its public-funded knowledge to all rather than allow it to be hoarded by a tiny elite. Page 101 discusses the adoption (or not) of text mining, briefly mentioning the discoverability experiments that text mining might enable.

Page 118 suggests that a curated monograph range at a publisher inherently contains a discoverability aspect (so long as the publisher’s publicist is doing their job assiduously, I’d add). If such a publisher also offers a ‘digital-first’ work-flow for monographs then an easy conversion to a mainstream .ePub or .mobi ebook is enabled, again adding discoverability potential (when the book pops on the Amazon Kindle store and suchlike, and/or in Open Access aggregators). In the Kindle Store discoverability shades over into readability, via the convenience of reading on dedicated ereaders rather than struggling with reading a PDF on a small tablet.

News from JURN

~ search tool for open access content

Category Archives: Economics of Open Access

The COAR of the issue

Report: Equitable access to research in a changing world

Well endowed

Lulu.com launches academic service suite – Glasstree

Digital Monograph Costing Tool

Africa is coming online

OAPEN-UK final report

“SFX Miscellaneous Free Ejournals Target”

AWOL releases cleaned A-Z URL list.

Open Access and the Humanities: Contexts, Controversies and the Future