{"id":16913,"date":"2016-03-30T22:40:12","date_gmt":"2016-03-30T21:40:12","guid":{"rendered":"https:\/\/jurnsearch.wordpress.com\/?p=16913"},"modified":"2016-03-30T22:40:12","modified_gmt":"2016-03-30T21:40:12","slug":"yandex-initial-testing","status":"publish","type":"post","link":"https:\/\/jurn.link\/jurnsearch\/index.php\/2016\/03\/30\/yandex-initial-testing\/","title":{"rendered":"Yandex &#8211; initial testing"},"content":{"rendered":"<p>I&#8217;ve been testing the English version of the major Russian search-engine <a href=\"https:\/\/www.yandex.com\/\">Yandex<\/a> for the last month.  I&#8217;ve found it to be a useful alternative search engine, one with a surprisingly wide range and depth.  In terms of the reach of its plain Web search I&#8217;d say it&#8217;s comparable with DuckDuckGo, though it can&#8217;t match the excellent relevancy-ranking of DuckDuckGo&#8217;s Image Search. <\/p>\n<p>Other notable points are:<\/p>\n<p>* it has &#8216;instant&#8217; speed, as fast as Google Search.<br \/>\n* filters and search modifiers that are similar to Google.<br \/>\n* the user interface is easy to use and doesn&#8217;t annoy.<br \/>\n* there&#8217;s a pleasing lack of <em>nag-nag-nag<\/em> about EU privacy and cookies laws.<br \/>\n* it doesn&#8217;t throw a tantrum if you make more than a half-dozen <strong>site:<\/strong> searches. <\/p>\n<p>For discovery of Web items from the &#8220;last 24 hours&#8221; \/ &#8220;last two weeks&#8221; Yandex is much less spammy than Google, making it much more useful for the few journalists and bloggers who actively go looking for timely new content.  That&#8217;s true even when one runs Google Search with an extremely well-populated installation of <a href=\"https:\/\/greasyfork.org\/en\/scripts\/1682-google-hit-hider-by-domain-search-filter-block-sites\">Google Hit Hider by Domain<\/a> (regrettably this Firefox\/Greasemonkey addon, which automatically blanks spammy URLs in your search results in Google\/Bing\/DuckDuckGo, doesn&#8217;t yet support Yandex).  A &#8220;last 24 hours&#8221; search in Yandex is also refreshingly free of Google&#8217;s &#8220;yeah, it&#8217;s actually from 2009, but we&#8217;re showing it because we crawled it again in the last 24 hours&#8230;&#8221; results.<\/p>\n<p>When a 24-hour search is run on a timely in-the-news topic name or keyword, Yandex effectively becomes a very useful alternative to Google News or Bing News.  It also accurately detects user-location (UK, city), without a sign-in, and correctly and deftly skews the news sources to the appropriate nation.  Although there&#8217;s a caveat&#8230; on many keywords the first 10 results will often seem to skew toward the more left-leaning news outlets. I&#8217;d suspect the algorithm is just being unduly influenced by the clicks of a minority of heavy Yandex users, who will probably skew strongly toward people inclined to click on anti-western news headlines?  I half-expected that the English version of <em>Russia Today<\/em> (<em>RT<\/em>) would rank highly, in that scenario, but it doesn&#8217;t seem to rank <em>at all<\/em> in such 24-hour search results.  Not in the UK, anyway.<\/p>\n<p>So Yandex&#8217;s Web search is impressive, but a little quirky in coverage.  The largest gaps in Yandex&#8217;s coverage are due to sniffy sites that only allow the Googlebot to index their files.  When Yandex <em>is<\/em> allowed to get in to such sites, the indexing and title identification of PDFs and articles seems directly comparable to Google Search.<\/p>\n<p>I looked hard for browser add-ons for Yandex, but there are hardly any in English.  A couple are worthy, though, and a search veteran working with Yandex would ideally want to&#8230;<\/p>\n<p>1. Stop Yandex&#8217;s URL obfuscation in search results, with the browser add-on <a href=\"https:\/\/github.com\/palant\/searchlinkfix\">Google Search Link Fix<\/a> (also works with Yandex).<\/p>\n<p>2. Get a double-column layout for search results, if working on a desktop PC. There&#8217;s no version of GoogleMonkeyR that works with Yandex, but the Firefox add-on Stylish has <a href=\"https:\/\/userstyles.org\/styles\/57878\/yandex-two-columns\">a double-column UserStyle for Yandex<\/a> that works fine.<\/p>\n<p>Interestingly Yandex offers <a href=\"https:\/\/yandex.ru\/support\/site\/?lang=en\">a sort-of Custom Search Engine<\/a>.  But your Yandex CSE and its topic have to pass moderation, and you have to admin the CSE from a single fixed IP.  That&#8217;s no good at all for those who have a western-style ISP which dynamically assigns an IP address each session.  I presume that Russian Internet users must be forced to have a fixed IP address. <\/p>\n<hr>\n<p>Update, April 2016: a new and utterly dim-witted spelling auto-correction has been introduced.  For instance &#8220;modal&#8221; is corrected to &#8220;model&#8221;, with no way of forcing the original term.  eg: &#8220;modal&#8221; &#8220;wind turbine tests&#8221;.  At a stroke, this makes Yandex unusable as an academic search engine.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>I&#8217;ve been testing the English version of the major Russian search-engine Yandex for the last month. I&#8217;ve found it to &hellip;<\/p>\n<p><a href=\"https:\/\/jurn.link\/jurnsearch\/index.php\/2016\/03\/30\/yandex-initial-testing\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[10],"tags":[],"class_list":["post-16913","post","type-post","status-publish","format-standard","hentry","category-my-general-observations"],"_links":{"self":[{"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/posts\/16913","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/comments?post=16913"}],"version-history":[{"count":0,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/posts\/16913\/revisions"}],"wp:attachment":[{"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/media?parent=16913"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/categories?post=16913"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/tags?post=16913"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}