{"id":24167,"date":"2020-08-23T22:05:41","date_gmt":"2020-08-23T21:05:41","guid":{"rendered":"https:\/\/jurnsearch.wordpress.com\/?p=24167"},"modified":"2020-08-23T22:05:41","modified_gmt":"2020-08-23T21:05:41","slug":"download-a-site-from-archive-org","status":"publish","type":"post","link":"https:\/\/jurn.link\/jurnsearch\/index.php\/2020\/08\/23\/download-a-site-from-archive-org\/","title":{"rendered":"Download a site from Archive.org"},"content":{"rendered":"<p>I&#8217;m happy to report success with testing a gig on Fiverr that offers to <a href=\"https:\/\/www.fiverr.com\/josephraymund12\/restore-from-wayback-machine\">Download an entire website from the Internet Archive Wayback Machine<\/a>.  I put in a test order for a site archived in late 2015, a technical forum for some graphics production software. The forum had abruptly vanished on being sold to a larger business.<\/p>\n<p>While it is possible to do what Joseph is offering for free, the only options appear to be Linux, command-line, or a couple of subscription\/paid Cloud services.  For a mere $6 it thus seemed worth finding out what Joseph could do.<\/p>\n<p>He delivered a 310mb .zip containing 1.1Gb of archive from a given date. It was a script-driven .PHP forum site, but that caused no fuss. I didn&#8217;t expect him to re-work links to make a working site again, for that reason. Though apparently he can do that, on simpler HTML sites.<\/p>\n<p>On my desktop PC dtSearch then indexed all text in the extracted files, regardless of file-type, and thus enabled keyword-search across the archive. If you need freeware on that point, then DocFetcher is a good free equivalent to the paid dtSearch.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>I&#8217;m happy to report success with testing a gig on Fiverr that offers to Download an entire website from the &hellip;<\/p>\n<p><a href=\"https:\/\/jurn.link\/jurnsearch\/index.php\/2020\/08\/23\/download-a-site-from-archive-org\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[17],"tags":[],"class_list":["post-24167","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"_links":{"self":[{"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/posts\/24167","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/comments?post=24167"}],"version-history":[{"count":0,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/posts\/24167\/revisions"}],"wp:attachment":[{"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/media?parent=24167"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/categories?post=24167"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/tags?post=24167"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}