• Directory
  • FAQ: about JURN
  • Group tests
  • Guide to academic search
  • JURN’s donationware
  • Links
  • openEco: titles indexed

News from JURN

~ search tool for open access content

News from JURN

Category Archives: JURN tips and tricks

“Automated book index making” post – updated

03 Thursday Aug 2017

Posted by futurilla in JURN tips and tricks, My general observations

≈ Leave a comment

I’ve updated and expanded my Christmas 2016 post “A survey of automated book index making software”. New bits…

PDF Index Generator 2.4 tested, specifically its very useful new “capitalized phrases only” automated query-filter which allows you to grab only personal names and longer place names, with a short tutorial on finding and using this feature…

More tips on Java security, re: the security nightmare that is Java being needed to run PDF Index Generator.

Also, the genuine freeware Index Generator 5.5 has just updated to 5.8, adding new features such as a “word list import and export feature” and “index support for alphanumerical words”.

Freeware: JP2 thumbnail viewer for Windows

10 Monday Jul 2017

Posted by futurilla in JURN tips and tricks

≈ Leave a comment

Archive.org offers books as unprocessed hi-res .jp2 image scan files, but once unpacked these don’t preview as thumbnails natively in Windows. Have you ever played “spot the page”, as a result, trying to find a picture among the pages? The quick solution to that problem is the freeware IrfanView and its JP2 (JPG-2000) Plugin pack.

Note that there are 32-bit and 64-bit versions of both, and it’s easy to mix them up. I only had success with 64-bit IrfanView + 64-bit plugins.

You may also find you need to enable the plugin via the Help | Plugins menu.

Another great addition to IrfanView. In fact, IrfanView would also be absolutely perfect as a general picture manager, if only there was some way to make a simple persistent bookmark of a folder. There isn’t. It has everything else, but it even with all its plugins it lacks the most obvious and basic feature — folder bookmarking!

Filemare

19 Monday Jun 2017

Posted by futurilla in JURN tips and tricks

≈ Leave a comment

A useful FTP search engine called Filemare. It helped me find the tar.gz archive of the .blend production file assets for the ‘open movie’ Elephants Dream, on a fast FTP. There’s a limit on the number of searches you can do per hour, but apart from that it’s a useful service.

Update: filemare.com live link removed, as site is now defunct. Mamont is the one to use now.

Another Google CSE dashboard glitch?

26 Friday May 2017

Posted by futurilla in JURN tips and tricks, JURN's Google watch

≈ Leave a comment

The recent changes to the Google CSE services appear to have introduced another glitch. The problem happens when adding new URL entries into your Google CSE. For instance, you can no longer add…

http://www.nnns.org.uk/sites/nnns.org.uk/files/

… and reliably select “Include all pages whose address contains this URL”. Oh yes, the Dashboard will let you save it that way… but then go back and open the URL up again. You’ll see that the CSE dashboard has refused to accept the setting you gave the URL, and has instead defaulted the URL to: “Include just this specific page or URL pattern I have entered”.

The problem with this is that you didn’t explicitly enter http://www.nnns.org.uk/sites/nnns.org.uk/files/* With the * wildcard making the “Include just this specific page or URL pattern I have entered” functional. Without the wildcard, the http://www.nnns.org.uk/sites/nnns.org.uk/files/ URL is null and void on that setting, and may as well have not been added to your CSE.

This has only just started happening, and the “Include all pages whose address contains this URL” setting is sticky on entries made prior to about 24 hours ago. Which makes me think it’s probably a temporary glitch, inadvertently introduced during yesterday’s switch from three-options to two-options for settings on individual URLs.

If you’re working on a CSE over the weekend / Bank Holiday (UK), you should be aware of this problem, as it probably won’t be fixed by Google until early next week. You’ll probably want to keep a .txt file of all the URLs you add which you have to use a /* for, because you may need to manually change them back once the problem gets fixed.

Facebook Image Size Guide

08 Monday May 2017

Posted by futurilla in JURN tips and tricks

≈ Leave a comment

Facebook Photos & Images Size Guide – 2017. “1640px wide and 624px” is suggested as optimal for headers.

DuckDuckGo Multi-Columns – script update

07 Sunday May 2017

Posted by futurilla in JURN tips and tricks

≈ Leave a comment

DuckDuckGo – Multi-Columns UserScript has updated to version 8. If you’ve customised v7 beware of simply upgrading, because your tweaks will be over-written.

Internet Archive starts to ignore robots.txt files

26 Wednesday Apr 2017

Posted by futurilla in JURN tips and tricks, Spotted in the news

≈ Leave a comment

The Internet Archive (and its Wayback Machine) has recently announced it has stopped honouring robots.txt files for some sites. A robots.txt is a simple text file which tells visiting crawler and harvesting bots that the site owners don’t want their content accessed, copied and (potentially irrevocably) made public somewhere else without their permission.

The Internet Archive is currently… “ignoring [robots.txt warnings at] U.S. government and military web sites”, and state that in future… “We are now looking to do this more broadly.”

This would seem to have a number of implications for repositories and journals. Especially in terms of things like retractions, ‘heavy harvesting’ of large numbers of large files, and also the practical implementation of the emerging legal concept of ‘the right to be forgotten’.

To anticipate this impending policy change at Internet Archive and to block their crawlers, you reportedly need to set up a way to “limit access by IP addresses” from the IA, and/or configure your site to block visiting clients named “ia_archiver”.

If you can’t do that — at first glance it looks a lot more complex than simply uploading a plain robots.txt file — then note that they say they will… “respond to removal requests sent to info@archive.org”. The latter option may be of special interest to hosted wordpress.com blogs and similar sites, which have no means of blocking the IA’s crawlers.

u’ve had it, uTorrent

25 Tuesday Apr 2017

Posted by futurilla in JURN tips and tricks

≈ Leave a comment

If you use torrents and your torrent software uTorrent has suddenly been overrun with slow-loading banner adverts and nags, I can recommend the free open source qBittorrent. Almost the same as uTorrent in terms of the interface, and it takes about half an hour to swop over if you’ve been seeding a half-dozen or so torrents. Note qBittorrent’s ability to set time-of-day on upload/download speeds, so as to automatically increase them across all torrents at times when you will be away from your PC.

Keep in mind that, after uninstalling uTorrent, it leaves behind a big backup cache of all your downloaded .torrent files in C:\Users\USERNAME\AppData\Roaming\uTorrent

Trello inspiration

21 Friday Apr 2017

Posted by futurilla in JURN tips and tricks, Spotted in the news

≈ Leave a comment

Trello Inspiration, a fine survey of all the different ways in which one can use the excellent and free Trello service. They missed out magazine and journal production, though.

Anti click-jacking code

13 Thursday Apr 2017

Posted by futurilla in JURN tips and tricks

≈ Leave a comment

For your Web page, here’s strong anti-framejacking and anti-clickjacking code, which has been tested and currently busts nasty frame-jackers such as In.is (aka Linkis). As such these snippets may be useful for journals and other academic services, to prevent legitimate content from being hijacked and surrounded by frames advertising ‘essay-writing services’ or predatory publisher services or worse.

Source: Stanford Security Lab via a recent blog post by Zipline Interactive, where there’s also additional defensive code to add to your website’s root .htaccess file (if you have FTP access and your host will allow upload of a changed .htaccess)…

Header set X-Frame-Options SAMEORIGIN

The .htaccess code is ‘as well as’, serving as a second line of deeper defence, and is not required for the first code suggestion to work in your Web page. Most modern Web browsers understand the self-explanatory SAMEORIGIN command when they hear it from a website.

Those with a hosted WordPress blog or journal may also want to consider the Frame Buster plugin. So far as I know there’s nothing similar for the Open Journal System (OJS) or Omeka or similar academic content plug-and-play systems. But perhaps there should be, if they don’t already have such counter-measures baked in?

← Older posts
Newer posts →
RSS Feed: Subscribe

 

Please become my patron at www.patreon.com/davehaden to help JURN survive and thrive.

JURN

  • JURN : directory of ejournals
  • JURN : main search-engine
  • JURN : openEco directory
  • JURN : repository search
  • Categories

    • Academic search
    • Ecology additions
    • Economics of Open Access
    • How to improve academic search
    • JURN blogged
    • JURN metrics
    • JURN tips and tricks
    • JURN's Google watch
    • My general observations
    • New media journal articles
    • New titles added to JURN
    • Official and think-tank reports
    • Ooops!
    • Open Access publishing
    • Spotted in the news
    • Uncategorized

    Archives

    • February 2026
    • January 2026
    • October 2025
    • May 2025
    • April 2025
    • September 2024
    • June 2024
    • May 2024
    • April 2024
    • March 2024
    • February 2024
    • January 2024
    • December 2023
    • November 2023
    • October 2023
    • September 2023
    • June 2023
    • May 2023
    • January 2023
    • December 2022
    • November 2022
    • October 2021
    • September 2021
    • August 2021
    • July 2021
    • June 2021
    • May 2021
    • April 2021
    • March 2021
    • February 2021
    • January 2021
    • December 2020
    • November 2020
    • October 2020
    • September 2020
    • August 2020
    • July 2020
    • June 2020
    • May 2020
    • April 2020
    • March 2020
    • February 2020
    • January 2020
    • December 2019
    • November 2019
    • October 2019
    • September 2019
    • August 2019
    • July 2019
    • June 2019
    • May 2019
    • April 2019
    • March 2019
    • February 2019
    • January 2019
    • December 2018
    • November 2018
    • October 2018
    • September 2018
    • August 2018
    • July 2018
    • June 2018
    • May 2018
    • April 2018
    • March 2018
    • February 2018
    • January 2018
    • December 2017
    • November 2017
    • October 2017
    • September 2017
    • August 2017
    • July 2017
    • June 2017
    • May 2017
    • April 2017
    • March 2017
    • February 2017
    • January 2017
    • December 2016
    • November 2016
    • October 2016
    • September 2016
    • August 2016
    • July 2016
    • June 2016
    • May 2016
    • April 2016
    • March 2016
    • February 2016
    • January 2016
    • December 2015
    • November 2015
    • October 2015
    • September 2015
    • August 2015
    • July 2015
    • June 2015
    • May 2015
    • April 2015
    • March 2015
    • February 2015
    • January 2015
    • December 2014
    • November 2014
    • October 2014
    • September 2014
    • August 2014
    • July 2014
    • June 2014
    • May 2014
    • April 2014
    • March 2014
    • February 2014
    • January 2014
    • December 2013
    • November 2013
    • October 2013
    • September 2013
    • August 2013
    • July 2013
    • June 2013
    • May 2013
    • April 2013
    • March 2013
    • February 2013
    • January 2013
    • December 2012
    • November 2012
    • October 2012
    • September 2012
    • August 2012
    • June 2012
    • May 2012
    • April 2012
    • March 2012
    • February 2012
    • January 2012
    • December 2011
    • November 2011
    • October 2011
    • September 2011
    • August 2011
    • July 2011
    • June 2011
    • May 2011
    • April 2011
    • March 2011
    • February 2011
    • January 2011
    • December 2010
    • November 2010
    • October 2010
    • September 2010
    • August 2010
    • July 2010
    • June 2010
    • May 2010
    • April 2010
    • March 2010
    • February 2010
    • January 2010
    • December 2009
    • November 2009
    • October 2009
    • September 2009
    • August 2009
    • July 2009
    • June 2009
    • May 2009
    • April 2009
    • March 2009
    • February 2009

    Proudly powered by WordPress Theme: Chateau by Ignacio Ricci.