{"id":24901,"date":"2021-03-19T23:05:05","date_gmt":"2021-03-19T22:05:05","guid":{"rendered":"https:\/\/jurnsearch.wordpress.com\/?p=24901"},"modified":"2021-03-19T23:05:05","modified_gmt":"2021-03-19T22:05:05","slug":"googles-live-caption-now-on-desktop-pcs","status":"publish","type":"post","link":"https:\/\/jurn.link\/jurnsearch\/index.php\/2021\/03\/19\/googles-live-caption-now-on-desktop-pcs\/","title":{"rendered":"Google&#8217;s Live Caption, now on desktop PCs"},"content":{"rendered":"<p>Isn&#8217;t the Internet wonderful. Just this morning I was searching and wondering <em>why is there no audio &#8220;automatic transcription&#8221; software for desktop PCs<\/em>? This evening&#8230; Google&#8217;s Live Caption feature is now available on the desktop PC, via the Chrome browser. For free, and running locally and offline and without a Google login. <\/p>\n<p>To enable real-time live subtitles (aka &#8216;closed captions&#8217; or &#8216;live captioning&#8217;) as your audio or video plays back, first get the latest Chrome then go&#8230;<\/p>\n<p>Advanced<\/p>\n<p>Accessibility<\/p>\n<p>Captions<\/p>\n<p>&gt; arrow icon<\/p>\n<p>Live Caption <\/p>\n<p>&#8230;and turn it on in Chrome. At this point a set of speech-definition files will be downloaded, to enable the real-time detection of what&#8217;s being said. While you&#8217;re waiting, set up the preferences for fonts and colours etc.<\/p>\n<p><a href=\"https:\/\/jurn.link\/jurnsearch\/2021\/03\/caption.jpg\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/jurn.link\/jurnsearch\/2021\/03\/caption.jpg?w=529\" alt=\"\" width=\"529\" height=\"346\" class=\"alignnone size-large wp-image-24905\" \/><\/a><\/p>\n<p>Those used to AI sets of 1Gb or more will find the Live Caption&#8217;s are downloaded in a few minutes, even on a slow connection. Other than the initial download of the definition files the services work locally on the PC and without a Cloud connection. So far as I&#8217;m aware this is the first time such a free service is available without a Cloud-upload being needed, still less in real-time.<\/p>\n<p>For this reason I would expect to see third-party UserScripts relatively soon, to enable the transcription to be easily captured into an editable text file as it plays. The playback \/ transcription continues to run, even when Chrome is not the focus of what you&#8217;re doing on the PC, which should help with scripted capture. Obviously if you want the whole thing you would have to let it play back first, to get a full transcription. <\/p>\n<p>Can a recorded .MP3 be loaded and work?  As well as a live stream? Yes, it works very well. A podcast with a 90 year-old guy on a smartphone, and kind of ok-ish voice quality&#8230; it handled that well. In real-time.<\/p>\n<p><a href=\"https:\/\/jurn.link\/jurnsearch\/2021\/03\/2021-03-19_213931.jpg\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/jurn.link\/jurnsearch\/2021\/03\/2021-03-19_213931.jpg?w=529\" alt=\"\" width=\"529\" height=\"467\" class=\"alignnone size-large wp-image-24903\" \/><\/a><\/p>\n<p>As you watch it, it occasionally goes back and auto-corrects and seems to be doing this based on word context. So I&#8217;m guessing it&#8217;s not just speech-to-text, but also text-to-text context tweaking. But it can&#8217;t work miracles: &#8220;gorilla campaign&#8221; rather than &#8220;guerrilla campaign&#8221; etc. And swearing does get f****** bleeped out with asterisks. It can&#8217;t detect different speakers. You can&#8217;t copy-paste. Still, it&#8217;s going to be very useful, especially if you just want a few paragraphs for a quote. Until we get a capture script, you can do things like screen grab with Microsoft OneNote, which handles small fonts fine and can make text from a screengrab very easily.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Isn&#8217;t the Internet wonderful. Just this morning I was searching and wondering why is there no audio &#8220;automatic transcription&#8221; software &hellip;<\/p>\n<p><a href=\"https:\/\/jurn.link\/jurnsearch\/index.php\/2021\/03\/19\/googles-live-caption-now-on-desktop-pcs\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[9,16],"tags":[],"class_list":["post-24901","post","type-post","status-publish","format-standard","hentry","category-jurns-google-watch","category-spotted-in-the-news"],"_links":{"self":[{"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/posts\/24901","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/comments?post=24901"}],"version-history":[{"count":0,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/posts\/24901\/revisions"}],"wp:attachment":[{"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/media?parent=24901"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/categories?post=24901"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/jurn.link\/jurnsearch\/index.php\/wp-json\/wp\/v2\/tags?post=24901"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}