The new RSS change at Google News makes their existing keyword-based RSS feeds defunct. It affects the RSS feeds that collect all Google News items with a headline/snippet containing the words ‘bunny’ + ‘fluffy’, for instance. I don’t know if the generic catch-all ‘Science’, ‘Health’ etc RSS feeds are affected, as I don’t use those.
Those keyword-based feeds will now need to be changed. Changed slowly and manually and individually by slogging down the list in one’s RSS feedreader. It’s a big task to do, for some, and journalists and editors and bloggers will have hundreds (if not thousands) of these feeds set up.
So far as I can see there’s no way to export the OPML from one’s desktop RSS feedreader and then simply do a global search-replace of the Google News URL paths in Notepad++, then bring the OPML back in. The URLs are too complex and varied in their structures to allow that.
One way of tackling the change is as follows:
Aim: Open our list of feeds in Excel and extract only the Google News ones, thus making it relatively easy for a worker to run through them all and discover the new ones.
Software required: the free Notepad++ and MS Office Excel with Sobolsoft’s Excel Remove Text addin.
1. Export your OPML master file from your RSS feedreader / newsreader.
2. Right-click on this and open the OPML in Notepad++. Search/replace "/>
with "/>;
and then manually go through and add a ; to the end of the remaining few lines which now lack them.
3. Search/replace all , (i.e.: all the commas) and change these to &&&&.
4. Save a backup of the changed OPML, then save another copy from Notepad++ — this time as “feeds.csv” which makes it a comma-separated Excel file. “But there are no commas left” you cry. That doesn’t matter, as Excel will treat the ; instances as if they were commas. And it won’t be terminally confused by commas sitting within the URLs, as we just changed them all to &&&&.
5. You can now load feeds.csv in MS Office’s Excel spreadsheet package. If you successfully put a ; at the end of each line of the OPML, Excel will happily load the file and it will display correctly, meaning in a similar way to the clear structured view you saw in Notepad++.
6. You’re now able to extract all the lines containing the phrase “Google News” and then do the same for “news.google”. There are a number of complex ways to do this, involving fiendish formulas, but a very easy way is with Sobolsoft’s Excel Remove Text, Spaces & Characters From Cells add-in. This gives Excel a number of very useful functions, including “Clear all cells not containing X”. Select all lines. Then clear everything not containing Google News. You can then ‘sort A-Z’, to get a neat list of all your defunct Google News feeds, one per line.
7. Select all lines with content in them. Then use the same add-in to “Remove all text before…” xmlUrl="
(which is the query command in the URL). Then “Remove all text after…” &output=
You can continue doing this sort of search/replace, and thus end up with a fairly clean set of the keywords and phrases and knockout -keywords which you were using for each Google News URL. For instance, you can search/replace %22
with ” to get recognisable search phrases again, inside the URL.
If you have hundreds or thousands of these, they can now be passed to a gig worker at Fivver.com etc, tasked with working down your nicely cleaned one-per-line list to discover the new working RSS URLs from Google News. While they’re at it, you may as well pay them to discover the Bing News equivalents.
You may also want them to use a VPN in order to also snag the Google News USA equivalent URLs, if you’re in the UK etc. Although it appears possible that simply changing the end of the new URLs from ?hl=en-GB&gl=GB&ned=uk
to ?hl=en&gl=US&ned=us
does the trick and gets the USA version. Google News USA obviously has better coverage, and is perhaps updated more quickly. For instance, a UK-centric search for: newcastle-under-lyme -police in Google News UK has no search results. The same from the USA site has one valid result in a local freesheet two hours ago. Such timeliness may matter for journalists with deadlines to meet.
8. You don’t then need to create a new OPML without any Google News URLs, and try to import it back to your newsreader etc. That’s a hassle and the OPML will probably break. So it’s easier to just let the defunct Google News URLs sit there and do nothing, since they’re not doing any harm. Some newsreader software may eventually flag them as defunct, and may even offer the ability to mass-delete your defunct feeds after 1st December 2017. Apparently that’s the date Google has set for the current feeds to die altogether.
9. Once your Fiverr gig worker etc comes back with the new URLs, either add in your new working Google News URLs by hand, or (if you have lots of them set up) have your Fivver gig worker format them up as a valid OPML file for bulk import to your newsreader. That’s very simple to do, once you have a newly-working Google News sample line to show them, although I think there are website converters that will turn a one-per-line RSS URL list into a valid OPML with ease.
That’s the most efficient way I can think of for handling the changeover.