{"id":73,"date":"2015-08-07T11:15:01","date_gmt":"2015-08-07T10:15:01","guid":{"rendered":"http:\/\/davidjohntaylor.co.uk\/?p=73"},"modified":"2015-08-25T17:42:20","modified_gmt":"2015-08-25T16:42:20","slug":"why-google-voice-recognition-continues-to-impress","status":"publish","type":"post","link":"https:\/\/davidjohntaylor.co.uk\/index.php\/2015\/08\/07\/why-google-voice-recognition-continues-to-impress\/","title":{"rendered":"Why Google voice recognition continues to impress"},"content":{"rendered":"<p>There are a couple of different routes to access\u00a0Google&#8217;s multi-lingual voice recognition function, including Google Now on your\u00a0Android phone, searching on a\u00a0browser, and Google Maps to name a few. \u00a0It&#8217;s generally the case that the environment you&#8217;re in at the time determines the software you use, meaning the quality of speech that&#8217;s captured from you may vary wildly.<\/p>\n<p>In the early days of speech recognition, I remember being seated in front of my PC, with a headset on, in a super-quiet room, carefully pronouncing paragraphs of text to allow some Dragon software to &#8220;learn&#8221; the intricacies of my pronunciation. \u00a0But even after significant training in what would be a perfect environment, I ended up with much less than perfect results!<\/p>\n<p>This morning, I&#8217;ve been stood in the middle of Euston train station &#8211; certainly not the quietest of locations, with the general bustle of the public, tannoy announcements, and beeping electric vehicles all contributing to a cacophony of noise which\u00a0means speech recognition needs to work much harder to pick out my voice against that of the background. \u00a0It also needs to then determine the words, identify what I&#8217;m requesting, and return the result. \u00a0Ideally this all needs to happen within a few seconds for it to be useful.<\/p>\n<p><a href=\"http:\/\/davidjohntaylor.co.uk\/wp\/wp-content\/uploads\/2015\/08\/google-voice-search1.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-88\" src=\"http:\/\/davidjohntaylor.co.uk\/wp\/wp-content\/uploads\/2015\/08\/google-voice-search1.png\" alt=\"Google Voice Search\" width=\"1010\" height=\"483\" srcset=\"https:\/\/davidjohntaylor.co.uk\/wp\/wp-content\/uploads\/2015\/08\/google-voice-search1.png 1010w, https:\/\/davidjohntaylor.co.uk\/wp\/wp-content\/uploads\/2015\/08\/google-voice-search1-300x143.png 300w\" sizes=\"auto, (max-width: 1010px) 100vw, 1010px\" \/><\/a><\/p>\n<p>I tried a number of voice requests today, such as:<\/p>\n<ul>\n<li>OK Google, remind me to order a prescription at\u00a0when I get home<\/li>\n<li>OK Google, whats the weather going to be like tomorrow<\/li>\n<li>OK G0ogle, set an alarm for 8am tomorrow<\/li>\n<\/ul>\n<p>With each request came success. \u00a0Whilst this is not the most scientific of tests, it&#8217;s becoming one of those technologies that &#8220;just works&#8221;. \u00a0When things &#8220;just work&#8221; you start to build a dependency which ultimately reinforces its value &#8211; not a bad thing in this case.<\/p>\n<p>I continue to be impressed with the accuracy of the recognition, and\u00a0<a href=\"http:\/\/venturebeat.com\/2015\/05\/28\/google-says-its-speech-recognition-technology-now-has-only-an-8-word-error-rate\/\">Google now claim that they process requests with 92% accuracy<\/a> &#8211; I find that to be an extremely good figure.<\/p>\n<p>Finally, some might say that these advancements come at a cost of personal privacy &#8211; our speech being captured, and the translations being stored for analysis. \u00a0The arguments for and against are really for a whole different article than this one! \u00a0But, I did decide to dip into the My Account feature that Google provide and see <a href=\"https:\/\/history.google.com\/history\/audio\">how much of my voice content they capture<\/a>, and why. \u00a0I was presented with a simple list of my voice searches, a link to play back the audio captured, plus a translation of what the analysis thought I&#8217;d said. \u00a0There&#8217;s an option to delete individual searches, or group them by day, or even all of them.<\/p>\n<p><a href=\"http:\/\/davidjohntaylor.co.uk\/wp\/wp-content\/uploads\/2015\/08\/google-voice-and-audio-activity.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-75\" src=\"http:\/\/davidjohntaylor.co.uk\/wp\/wp-content\/uploads\/2015\/08\/google-voice-and-audio-activity.jpg\" alt=\"Google voice and audio activity\" width=\"982\" height=\"226\" srcset=\"https:\/\/davidjohntaylor.co.uk\/wp\/wp-content\/uploads\/2015\/08\/google-voice-and-audio-activity.jpg 982w, https:\/\/davidjohntaylor.co.uk\/wp\/wp-content\/uploads\/2015\/08\/google-voice-and-audio-activity-300x69.jpg 300w\" sizes=\"auto, (max-width: 982px) 100vw, 982px\" \/><\/a><\/p>\n<p>On the <a href=\"https:\/\/support.google.com\/websearch\/answer\/6030020?hl=en\">Google Search Help pages<\/a> they provide some insight into why they store this data:<\/p>\n<p><em>To help you get better results using your voice, Google uses your Voice &amp; Audio Activity to:<\/em><\/p>\n<ul>\n<li><em>Learn the sound of your voice<\/em><\/li>\n<li><em>Learn how you pronounce words and phrases<\/em><\/li>\n<li><em>Recognize when you say &#8220;Ok Google&#8221;<\/em><\/li>\n<li><em>Improve speech recognition across Google products that use your voice<\/em><\/li>\n<\/ul>\n<p>To be honest, what I saw being held really doesn&#8217;t worry me that much &#8211; if it helps make my experience better, I can&#8217;t really complain about what is being stored, as long as I&#8217;m aware of it, and I have some control over it.<\/p>\n<p>In terms of the future, I can only expect that 92% accuracy figure to increase over time. \u00a0The question is, will it ever reach 100%? \u00a0As humans, we&#8217;re actually prone to misinterpret speech from time to time, so 100% is unlikely to be reached, but if it worked at human equivalent levels, I&#8217;d be more than happy.<\/p>\n<p>Useful links:<\/p>\n<ul>\n<li><a href=\"http:\/\/www.greenbot.com\/article\/2359684\/system-software\/a-list-of-all-the-ok-google-voice-commands.html\">A list of all the Google Now voice commands<\/a><\/li>\n<li><a href=\"http:\/\/recode.net\/2015\/03\/03\/google-voice-search-talks-me-through-a-house-of-cards-weekend\/\">Another less than scientific Google Voice Search test<\/a><\/li>\n<li><a href=\"https:\/\/history.google.com\/history\/audio\">Your Google voice and audio activity<\/a><\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>There are a couple of different routes to access\u00a0Google&#8217;s multi-lingual voice recognition function, including Google Now on your\u00a0Android phone, searching on a\u00a0browser, and Google Maps to name a few. \u00a0It&#8217;s generally the case that the environment you&#8217;re in at the&hellip; <a href=\"https:\/\/davidjohntaylor.co.uk\/index.php\/2015\/08\/07\/why-google-voice-recognition-continues-to-impress\/\" class=\"more-link\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":2,"featured_media":88,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"footnotes":""},"categories":[2],"tags":[],"class_list":["post-73","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-general"],"_links":{"self":[{"href":"https:\/\/davidjohntaylor.co.uk\/index.php\/wp-json\/wp\/v2\/posts\/73","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/davidjohntaylor.co.uk\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/davidjohntaylor.co.uk\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/davidjohntaylor.co.uk\/index.php\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/davidjohntaylor.co.uk\/index.php\/wp-json\/wp\/v2\/comments?post=73"}],"version-history":[{"count":5,"href":"https:\/\/davidjohntaylor.co.uk\/index.php\/wp-json\/wp\/v2\/posts\/73\/revisions"}],"predecessor-version":[{"id":89,"href":"https:\/\/davidjohntaylor.co.uk\/index.php\/wp-json\/wp\/v2\/posts\/73\/revisions\/89"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/davidjohntaylor.co.uk\/index.php\/wp-json\/wp\/v2\/media\/88"}],"wp:attachment":[{"href":"https:\/\/davidjohntaylor.co.uk\/index.php\/wp-json\/wp\/v2\/media?parent=73"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/davidjohntaylor.co.uk\/index.php\/wp-json\/wp\/v2\/categories?post=73"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/davidjohntaylor.co.uk\/index.php\/wp-json\/wp\/v2\/tags?post=73"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}