{"id":21388,"date":"2018-08-02T18:02:38","date_gmt":"2018-08-02T17:02:38","guid":{"rendered":"https:\/\/jurnsearch.wordpress.com\/?p=21388"},"modified":"2018-08-02T18:02:38","modified_gmt":"2018-08-02T17:02:38","slug":"open-semantic-desktop-search-free-desktop-search-for-windows","status":"publish","type":"post","link":"https:\/\/jurn.link\/jurnsearch\/index.php\/2018\/08\/02\/open-semantic-desktop-search-free-desktop-search-for-windows\/","title":{"rendered":"Open Semantic Desktop Search &#8211; free desktop search for Windows"},"content":{"rendered":"<p><a href=\"https:\/\/www.opensemanticsearch.org\/\">Open Semantic Desktop Search<\/a> is an &#8220;open source desktop search engine for full text search in documents&#8221; that runs on Solr on the Windows desktop through Oracle&#8217;s free <a href=\"https:\/\/www.virtualbox.org\/\">VM VirtualBox<\/a>.  It&#8217;s been around since late 2015 and is actively being developed, but its developers obviously don&#8217;t employ a publicist to promote it.<\/p>\n<p><a href=\"https:\/\/jurn.link\/jurnsearch\/2018\/08\/search.png\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/jurn.link\/jurnsearch\/2018\/08\/search.png\" alt=\"\" width=\"529\" height=\"279\" class=\"alignnone size-large wp-image-21389\" \/><\/a><\/p>\n<p>It has a clean Web-like interface and supports the indexing of a great many file-types, including .ePUB and .PDF files, even if they&#8217;re inside .ZIP files. It can&#8217;t yet index the Kindle&#8217;s .MOBI ebook files, though, so you&#8217;d need to do an overnight mass-conversion to .ePUB or .PDF using the free Calibre software, and your purchased encrypted Kindle files will still need to be searched using Amazon. 
<\/p>\n<p>Despite being run in a VM (often slow on older Windows PCs), Open Semantic Desktop Search can work on&#8230;<\/p>\n<blockquote><p>&#8220;old standard hardware&#8221; and &#8220;The search engine works even offline or unhosted on a single laptop without need of a intranet or internet connection or a server.&#8221;<\/p><\/blockquote>\n<p>Online comments suggest, though, that you&#8217;ll do best with a modern PC, and those with an over-stuffed hard-drive will need to clear 50GB of disk-space to accommodate both the software and its resulting index. The disk-space needed may be less if you&#8217;re only indexing the folder containing the .PDFs and .ePUBs needed for your PhD or book research.<\/p>\n<p>I haven&#8217;t installed and tested it yet, but it&#8217;s free and looks good. Apparently it can also auto-OCR PDFs that have no text layer, a new feature added in a December 2017 update.<\/p>\n<p>The search-engine software comes packaged in a 2.8GB .OVA file that you download. This .OVA is a ready-made virtual machine image (a virtual appliance) that you import into the free <a href=\"https:\/\/www.virtualbox.org\/\">VM VirtualBox<\/a> (a 110MB .EXE download), and the team&#8217;s <a href=\"https:\/\/www.opensemanticsearch.org\/doc\/desktop_search\">Desktop Search<\/a> page has instructions on how to import your .OVA into the installed VirtualBox.  
It seems fairly simple to get it up and running.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Open Semantic Desktop Search is an &#8220;open source desktop search engine for full text search in documents&#8221; that runs on Solr &hellip;<\/p>\n<p><a href=\"https:\/\/jurn.link\/jurnsearch\/index.php\/2018\/08\/02\/open-semantic-desktop-search-free-desktop-search-for-windows\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[2,8,16],"tags":[],"class_list":["post-21388","post","type-post","status-publish","format-standard","hentry","category-academic-search","category-jurn-tips-and-tricks","category-spotted-in-the-news"],"_links":{"self":[{"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/posts\/21388","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/comments?post=21388"}],"version-history":[{"count":0,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/posts\/21388\/revisions"}],"wp:attachment":[{"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/media?parent=21388"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/categories?post=21388"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/tags?post=21388"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}