Speech Recognition Anywhere - Chrome Extension
Use speech recognition to fill out forms and documents on the Internet. Control the Internet with your voice.
speech, recognition, voice, chrome, extension, speech-to-text, talk, form, write, type

Speech Recognition Anywhere
  • Type emails with your voice
  • Write documents with your voice
  • Control the Internet with your voice
  • Chrome Extension

With "Speech Recognition Anywhere" you can control the Internet with your voice. Use speech recognition to fill out inputs, textareas, forms or documents on the Internet: what you say is typed automatically into the form on the website. "Speech Recognition Anywhere" can also be used as a virtual assistant. Simple voice commands take you to websites or click on buttons and links. It also works with Google Docs, Microsoft Word online, Gmail, outlook.com and more. Medical and other health professionals use Speech Recognition Anywhere to fill out web-based patient medical records using speech to text. Business professionals use it to dictate letters and emails. Users can also create custom commands: shortcut phrases (also known as macros or autotext) that print a large block of text into a form. Download the Speech Recognition Anywhere Chrome or Edge extension today.

Get Extension

Features

  • Browser extension for Chrome or Add-on for Edge browser for desktop and laptop computers.
  • Virtual Assistant Mode.
  • Choose between dozens of languages and dialects for speech recognition.
  • Dictate emails and online documents.
  • Fill in forms with your voice.
  • Go to the next or previous field with your voice.
  • Go to any website with your voice.
  • Switch tabs and navigate websites with your voice.
  • Scroll page up or down.
  • Click on links and buttons with your voice.
  • Cut, Copy, Paste, Clear, Highlight.
  • Say "Show labels" to see labels to buttons on a website.
  • Say "Show numbers" to see numbers next to links and buttons. Then say "Click on (number)".
  • Say "Youtube (name of artist or song) (in the background)" to play music instantly.
  • Create Custom Voice Commands.
  • Text To Speech.
  • Scripting.
  • Auto Punctuation.
  • Export/Import custom commands.

Custom Commands

Look in the comments below for "Custom Commands" that you can add to Speech Recognition Anywhere. If you have created some awesome commands for Speech Recognition Anywhere, please share them in the comments section below. (If the action contains URLs (http: https:), please surround them with <code> </code> tags so the comment box does not convert them to links.)

To create custom commands you do not need to use regular expressions but regular expressions will make your custom commands more powerful. For example, you could create this basic custom command:

Phrase: Display the weather satellite
Action: https://weather.weatherbug.com/maps/
Description: Display the weather satellite

With this command you would have to say the exact phrase "Display the weather satellite" for the satellite image to display. If you use a regular expression, as in the example below, you can say a number of similar sentences to activate the command:

Phrase: (?:Display|Show)(?:.*?)?(?:satellite|clouds)(?:.*?)?(?:for |of |in )?(.*?)?
Action: https://weather.weatherbug.com/maps/$1
Description: Display the weather satellite

With the above phrase you could say "Show me the clouds" or "Display the weather satellite for New York". Here is a breakdown of the phrase:

(?:Display|Show)        means to look for either "display" or "show". The | symbol means "or". Putting ?: at the beginning of the match inside the parentheses () means to look for the match but don't remember the match.

(?:.*?)?        means to look for any number of optional words like "the" or "the weather" and do not remember the match. The ? at the end outside of the parentheses () means these words are optional. For example, you could say "Show me the weather satellite" or you could just say "Show satellite".

(?:satellite|clouds)        means that the user has to say either "satellite" or "clouds" in the phrase for the phrase to be detected.

(?:.*?)?        means that we again look for any number of optional words.

(?:for |of |in )?        means that we look for "for " or "of " or "in " so that the user can say "Display the satellite for Colorado". Putting the ? at the end means this is optional.

(.*?)?        means that we look for one more optional word or group of words at the end. But this time we don't put ?: at the beginning inside the parentheses () because we want to remember the match to use it later on. We want to remember the last word for a spoken command like "Show me the weather satellite for London". Then the remembered match can be used in the action: https://weather.weatherbug.com/maps/$1 . The $1 will be replaced with London in the url. $1 is used for the first match, $2 for the second, and so on.

If you want to put the whole spoken command in the action, use $0 . For example, to let Google decide how to play music for you, you could use this phrase: Play (.*?) . The spoken command could be "Play Coldplay", and the action could be: http://www.google.com/search?btnI&q=$0 . Because $0 matches the entire phrase "Play Coldplay", what is sent to Google is: http://www.google.com/search?btnI&q=Play Coldplay . btnI tells Google to use the I'm Feeling Lucky button right away, so Google would go to the first result, which would probably be a Youtube video.
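To see how the remembered matches feed the action, the weather phrase can be tried in plain JavaScript. This is only a sketch: the helper name expandAction, the whole-sentence anchoring (^ and $) and the case-insensitive flag are assumptions made for illustration, not the extension's documented internals.

```javascript
// Hypothetical helper (not part of the extension): match a spoken sentence
// against a custom command phrase and fill $0, $1, $2... into the action.
// Assumption: the phrase must match the whole sentence, ignoring case.
function expandAction(phrase, action, spoken) {
  const re = new RegExp("^(?:" + phrase + ")$", "i");
  const match = re.exec(spoken);
  if (match === null) return null; // the phrase was not spoken
  // $0 is the whole spoken command; $1, $2... are the remembered groups.
  // Optional groups that matched nothing are replaced with an empty string.
  return action.replace(/\$(\d+)/g, (token, n) => {
    const value = match[Number(n)];
    return value === undefined ? "" : value;
  });
}

const phrase =
  "(?:Display|Show)(?:.*?)?(?:satellite|clouds)(?:.*?)?(?:for |of |in )?(.*?)?";
const action = "https://weather.weatherbug.com/maps/$1";

console.log(expandAction(phrase, action, "Show me the weather satellite for London"));
// https://weather.weatherbug.com/maps/London
console.log(expandAction(phrase, action, "Show me the clouds"));
// https://weather.weatherbug.com/maps/
console.log(expandAction("Play (.*?)", "http://www.google.com/search?btnI&q=$0", "Play Coldplay"));
// http://www.google.com/search?btnI&q=Play Coldplay
```

Without the anchors, the lazy (.*?) at the end would stop after a single character, which is why the sketch assumes the phrase is matched against the entire sentence.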

Medical Record Macros and Autotext

Here are some examples of how medical professionals and other health professionals can use Speech Recognition Anywhere to create custom commands for patient medical record macros or autotext.

Phrase: Rash negative|Negative rash
Action: Rash not worrisome for scabies, measles, meningococcemia, varicella, bullous disorder, Stevens-Johnson syndrome, Toxic epidermal necrolysis, staph scalded skin syndrome, or disseminated herpes. No evidence of erythema multiforme, Lyme, cellulitis, necrotizing fasciitis, meningococcemia, rocky mountain spotted fever.
Description: Say "Rash negative" or say "Negative rash"

Here is an example of a short phrase that you can say for a medical test and have it write a detailed description:

Phrase: Lyme disease test|Lyme antibody test
Action: LYME DISEASE IGG/IGM, As recommended by the Food and Drug Administration (FDA), all samples with positive or equivocal results in the Borrelia burgdorferi antibody EIA (Screening) will be tested by Western Blot.
Description: Say "Lyme disease test" or say "Lyme antibody test"

Text to Speech

The Speech Recognition Anywhere Chrome extension also has text-to-speech capabilities. Here is an example of a custom command that turns Wolfram Alpha into a talking virtual assistant with voice recognition.

Phrase: Wolfram\s*Alpha,? (.*?)
Action: http://www.wolframalpha.com/input/?i=$1;speak_element(img._3c8e[1])
Description: Wolfram Alpha Question

Note: As of 12/30/2018 the latest Chrome update blocks speechSynthesis on websites without any notification. To allow speechSynthesis on a website you will need to click on the lock symbol 🔒 or ⓘ to the left of the website address in the address bar and then scroll down and click on the down arrow next to Sound and select "Allow". You will need to do this on every website that you want to use the speak() command on.

The above phrase includes \s* between Wolfram and Alpha because Google's Web Speech API sometimes detects the phrase as "Wolfram Alpha" and other times as "WolframAlpha". This command accepts both. The Action is actually a script. Each script command is separated by ; (semi-colon). The first action in the script goes to the Wolfram Alpha website with the input string that was spoken. For example, say "Wolfram Alpha, when is the next moon rise?". The next action in the script tells Speech Recognition Anywhere to read aloud, with text-to-speech, an element on the web page. The element is an img tag with class="_3c8e" . Wolfram Alpha puts the result in an image instead of plain text, but that image has an alt attribute with a plain-text answer to the question. So img._3c8e[1] reads aloud the second img tag with class="_3c8e" (the index [1] counts from 0).
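Both points can be checked in a few lines of JavaScript. The sketch below anchors the phrase to the whole sentence, which is an assumption, and parseElementRef is a made-up helper that only shows how an argument such as img._3c8e[1] splits into a selector and an index. Defaulting to index 0 when no [n] is given is also an assumption.

```javascript
// The phrase from the command above, anchored to the whole sentence (assumption).
const wolfram = /^Wolfram\s*Alpha,? (.*?)$/i;

const heardVariants = [
  "Wolfram Alpha when is the next moon rise",
  "WolframAlpha, when is the next moon rise",
];
for (const heard of heardVariants) {
  // Both spellings match, and $1 holds the question either way.
  console.log(wolfram.exec(heard)[1]); // when is the next moon rise
}

// Hypothetical parser for an element reference such as img._3c8e[1].
function parseElementRef(ref) {
  const m = /^(.*?)(?:\[(\d+)\])?$/.exec(ref);
  return { selector: m[1], index: m[2] === undefined ? 0 : Number(m[2]) };
}

console.log(parseElementRef("img._3c8e[1]")); // { selector: 'img._3c8e', index: 1 }
```

In a page, the selector and index would then pick one element out of the list returned by document.querySelectorAll(selector), and its alt text would be what gets read aloud.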

Here is another example of a text-to-speech custom command that creates a Decision Maker:

Phrase: Should I (.*?)
Action: say(Yes|No|Definitely Yes|Absolutely Not|Probably Not)
Description: Decision Maker

The say command will read aloud whatever text you put there. The | or pipe (also called vertical bar) separates alternative texts, and the say command randomly chooses one of them to read aloud. Now ask any question that begins with "Should I...?"
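The random choice can be pictured with a short JavaScript sketch. pickAnswer is a hypothetical stand-in, not the extension's own code; the random source is passed in as a parameter only so the behaviour can be checked.

```javascript
// Hypothetical stand-in for how say(a|b|c) might choose its text.
// `random` defaults to Math.random and can be replaced for testing.
function pickAnswer(argument, random = Math.random) {
  const options = argument.split("|");
  return options[Math.floor(random() * options.length)];
}

const answers = "Yes|No|Definitely Yes|Absolutely Not|Probably Not";
console.log(pickAnswer(answers));          // one of the five answers, at random
console.log(pickAnswer(answers, () => 0)); // Yes
```

In a browser, the chosen text could then be spoken with the Web Speech API, for example speechSynthesis.speak(new SpeechSynthesisUtterance(text)).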

Scripting

In the Action field of custom commands you can create an action script. Each command is separated with a ; (semi-colon). Here is an example:

Action: http://example.com/; scroll_it(down); click_element(search); speak(answer)

The above action script will first go to example.com . Then it will scroll down the page, then click on an element with an id of search and then speak out loud the text in an element with an id of answer .
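As a rough mental model, an action script can be thought of as a list of steps split at each semi-colon. The sketch below is an assumption about the structure, not the extension's parser: parseActionScript and the step types are invented for illustration, and it assumes a literal ; never appears inside a step.

```javascript
// Hypothetical sketch: break an Action script into steps at each semi-colon.
function parseActionScript(action) {
  return action.split(";").map((raw) => {
    const step = raw.trim();
    if (step === "") return { type: "pause" }; // empty step, as produced by ;;
    if (/^(?:https?|ftp|file):\/\//i.test(step)) return { type: "url", url: step };
    const call = /^(\w+)\((.*)\)$/.exec(step); // name(arguments)
    if (call) return { type: "command", name: call[1], args: call[2] };
    return { type: "text", text: step }; // anything else is text to type
  });
}

const steps = parseActionScript(
  "http://example.com/; scroll_it(down); click_element(search); speak(answer)"
);
console.log(steps.map((s) => s.type + ":" + (s.url || s.name)).join(" "));
// url:http://example.com/ command:scroll_it command:click_element command:speak
```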

Scripting Commands

;        Separate each command in the Action field with ;
;;        Pause for 1 second. (Each command is separated by half a second, so to pause for 1 second use two semi-colons.)
%3B        To print a ; on the screen use the url encoded (percent encoded) version, which is %3B
%28 %29        To print parentheses () use %28 and %29 respectively.
%2C        To print a text comma , in an "Action" function use %2C or surround the text with double quotes.
\btext\b        If the phrase begins and ends with \b then text will be replaced with the text in the action field. For example, to replace open round bracket with ( use:
Phrase: \bOpen round bracket\b
Action: (
Description: Always replace: "Open round bracket" with "("
add_labels(option) Add labels to buttons that are only images. option can be show or hide .
add_numbers(option) Add numbers to links, buttons, and inputs. option can be show or hide .
backspace(x) Press the backspace key. x is the number of times to press it. Default is 1.
browse(option) Navigate the browser history. option can be back, forward, reload, refresh . Example to create a command to Go Back in browser history in Spanish:
Phrase: retroceder
Action: browse(back)
Description: Navigate Back in Spanish
capitalize_first_letter(text, all_words) Convert the first letter of text to a capital. If all_words is true then every first letter of every word in text will be converted to a capital. The default is false . For example:

Phrase: capitalize (.*?)
Action: capitalize_first_letter($1, true)
Description: Say "Capitalize hello world" to print "Hello World"
clear_text() Clear all text in the currently selected input or textarea.
click_element(el) el can be the id of an element to click on or if the element does not have an id then it can be a tag under an element. For example, if el is results.img[0] then the click_element command will click on the first (0) img under the element with id of "results" .
click_keyword(el) el can be the id of an element to click or the name, text, title, aria-label or alt of an element to click.
clipboard(option) option can be copy, cut, paste
dictation(option) Turn dictation (speech-to-text) on or off. option can be on, off, start, stop . (Only available in Full Version)
enter_key(x) Press the enter key. x is the number of times to press it. Default is 1.
escape_key(x) Press the escape key. x is the number of times to press it. Default is 1.
find_phrase(keywords) Find a word or phrase on the page and highlight or select it. For example, to create a custom command to search for a word or phrase using Spanish such as "Buscar restaurante":
Phrase: (?:búsqueda |buscar )(?:de |para )?(.*?)
Action: find_phrase($1)
Description: Search in Spanish
https:// Start a command in the Action field with http:// , https:// , ftp:// , file:/// to go to that website or file. For example: https://seabreezecomputers.com will change the current location to seabreezecomputers.com
insertHTML(html) Previously any html in the action field would print as HTML . Since version 1.4.4 (3/5/2021) it prints as text. To print html now, use the insertHTML() function in the action field:
Phrase: line break
Action: insertHTML(<br>)
Description: Say line break to insert html: <br>
insertText(text) To easily print text on the screen including < > ( ) ; put the text in the insertText() command. Normally html tags < > will not print and ( ) ; are special reserved characters in the Action field. But if you put the characters in insertText() then they will print fine to the screen. Example:
Action: insertText(<div onclick="alert('Hello');">)
keypress(n, ctrl, alt, shift) For webpages that listen for keypresses. n is the keyCode of the key to press. For example: keypress(49) will press the 1 key. See Table of keyCodes. Or n can be the letter to press such as keypress(z). Example to send ALT+Z : keypress(z, false, true) ; Example to send CTRL+Z : keypress(z, 1); If n is a string then the string will be converted to keypresses sent to the website. For example: keypress(zebra) ; will send keypress events for z,e,b,r,a .
moveCursor(direction, x, option) Move the cursor in direction of up, down, left, right, top, bottom, start, end. x is the number of times to move that direction. The default is 1. option can be character, word, sentence, paragraph, document, paper, box, text or field. The default is word.
open(website, name_of_tab) Open website in name_of_tab tab. This command is used to always open in the same tab. If the tab name_of_tab is already open then it will stay in the background if it is not the active tab. If name_of_tab is not supplied then this command will always open a new tab.
play(keywords, new_tab) Play keywords song or video using Google's I'm feeling lucky button. The default for new_tab is false . Here is an example of a custom command for playing music or a video using Spanish:
Phrase: (?:Toca|Juega) (.*?)(?: *?)?(?:en )?(?:una )?(nueva pesta[nñ]a)?
Action: play($1, $2)
Description: play() in Spanish. Say "Toca (música) (en una nueva pestaña)"
print_text(text) To easily print text on the screen including < > ( ) ; put the text in the print_text() command. Normally html tags < > will not print and ( ) ; are special reserved characters in the Action field. But if you put the characters in print_text() then they will print fine to the screen. Example:
Action: print_text(<div onclick="alert('Hello');">)
print_text() is synonymous with insertText() .
read(option) Read out loud with text to speech. option can be all, everything, page, webpage, paragraph, screen, article, website, site, main, content, main content, alert, notification, status, buttons, inputs, links, hyperlinks, web links, images, figures, headings, titles, headlines, menu, navigation, header, banner, footer, contentinfo, sidebar, aside, complementary, toolbar, selection, selected, highlighted.
redo(x) Redo the last text that was undone with undo. x is the number of times to redo. The default is 1.
replace_word(text) Replace a word in a sentence with text. For example: To replace "ok" with "okay" use:
Phrase: ok
Action: replace_word(okay)
Description: Always replace "ok" with "okay"

Another example: To replace "+" with "plus" use:

Phrase: (.*?)\+
Action: replace_word($1 plus)
Description: Always replace "+" with "plus"
say(text) Speak out loud text with text-to-speech.
script(code) You can put advanced scripting in code using the script command. Use return variable at the end to have the variable displayed in the textarea. For example, here is a custom command script for displaying today's date in en-US format:
Phrase: (Enter |type )?today's date
Action: script(var today = new Date().toLocaleDateString('en-US'); return today;)
Description: Say 'Enter today's date' or 'Type today's date' or 'Today's date'

Another example: When you spell out words like names, such as "g e o f f", the Web Speech API automatically adds spaces in between the letters. Here is a custom command to remove those spaces:

Phrase: (\S\s){1,}\S
Action: script(var text = " $0".replace(/\s/g, ""); return text;)
Description: Spell out g e o f f without spaces (Ex: geoff)

Another example: To spell out a word and have its first letter capitalized:

Phrase: Capital ((\S\s)+\S)
Action: script(var text = " $1".replace(/\s/g, ""); text = text.charAt(0).toUpperCase() + text.slice(1); return text;)
Description: Say "Capital g e o f f" to print "Geoff"

Note: At this time script() only works on some websites. Some websites block external scripts from running. It also will NOT work on the Speech Recognition Anywhere tab at this time.
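The bodies used in the script() examples above are ordinary JavaScript, so they can be tried in a browser console or Node.js before being pasted into a custom command. In the sketch below the spoken text is passed in as a plain string where the extension would substitute $0 or $1. The function names and the ^ and $ anchors on the phrase are assumptions added for illustration.

```javascript
// Join spelled-out letters into one word: "g e o f f" becomes "geoff".
function joinLetters(spoken) {
  return spoken.replace(/\s/g, "");
}

// Same as above, then capitalize the first letter.
function joinLettersCapitalized(spoken) {
  const text = joinLetters(spoken);
  return text.charAt(0).toUpperCase() + text.slice(1);
}

// The phrase (\S\s){1,}\S only matches single characters separated by whitespace.
const spelledOut = /^(\S\s){1,}\S$/;

console.log(spelledOut.test("g e o f f"));              // true
console.log(spelledOut.test("hello world"));            // false
console.log(joinLetters(" g e o f f"));                 // geoff
console.log(joinLettersCapitalized(" g e o f f"));      // Geoff
```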

scroll_it(direction) Scroll the page. direction can be up, down, right, left, top or bottom. You can also include an element type with the direction separated by a comma. The element type can be body, page, menu, navigation, sidebar or box. For example: scroll_it(menu, down) will scroll a menu on the page down. scroll_it(box) will default to scrolling a textbox on the page down.
scrollToPosition(el) Scroll to HTML element if it is not in view. el should be the id of the element.
select(option) option can be all or none . Select all text or none of the text in the currently selected input or textarea or on page. Some textareas on some websites also allow option to be character, letter, word, sentence, paragraph. These textareas can also have multiple options separated by a comma that includes a direction or the amount. For example select(last, 3, words) will select the previous 3 words behind the cursor. Another example, select(next, 2, sentences) will select the next 2 sentences in front of the cursor. select(sentence) will select the previous sentence.
set_language(language) Set the speech recognition language to language temporarily. A drop down list of languages can be seen in Speech Recognition Anywhere "Settings" menu.
spacebar(x) Press the spacebar. x is the number of times to press it. Default is 1.
speak_element(el) Read or speak out loud the contents of an element. It will read the innerText, alt, aria-label or title attribute of the element. el can be a selector like using querySelectorAll in JavaScript with an optional [n] to specify which element in the array to read. For Example: To read the WolframAlpha result use speak_element(img._3c8e[1]); The element is an img tag with class="_3c8e" . Wolfram Alpha puts the result in an image instead of plain text. But that image has an alt attribute with a plain text answer to the question. So img._3c8e[1] reads out loud the second or [1] img tag with class="_3c8e" .
submit_form() Submit the current form on the website.
switch_fields(option) Switch fields in a form or switch links. option can be next, previous, keyword such as the name or id of a form element or the number in order of the form/link element on the page.
switch_tabs(option) Switch browser tabs. option can be next, previous, close, last, first, number of tab, title or url of tab. Here is an example of a custom command for Spanish to switch tabs:
Phrase: Cambiar (?:a )?(?:la )?(.*?)? ?pesta[nñ]as? ?(.*?)?
Action: switch_tabs($1$2)
Description: Switch tabs in Spanish. Say "Cambiar a la siguiente pestaña" or "Cambiar a la pestaña anterior".
text Type text onto the page in the currently selected input or textarea. If none is selected, the first available input on the page is used.
.toUpperCase() JavaScript string methods such as toUpperCase() can now be used. Here is an example of a custom command that converts spoken text to uppercase (all caps):
Phrase: (?:uppercase|all caps) (.*?)
Action: $1.toUpperCase()
Description: Say "Uppercase hello world" to print "HELLO WORLD"
undo(x) Undo the last text command. x is the number of times to undo. The default is 1. Here is an example of translating an undo command into Spanish:
Phrase: Deshacer(?: *?)(\S*?)?(?: *?)(veces)?$
Action: undo($1)
Description: Undo() in Spanish. Say "Deshacer" or "Deshacer (Número) veces"
url(website, new_tab, focused) Change location to website. The default for new_tab is false. The default for focused is true . If focused is set to false then the tab will stay in the background. Just put url() with no parameters to open a new tab. Here is an example of a custom command for visiting a website using Spanish:
Phrase: Visit[ea]r? (.*?\.\s?\S{2,6})(?: *?)?(?:en )?(?:una )?(nueva pesta[nñ]a)?
Action: url($1, $2)
Description: url() in Spanish. Say "Visita google punto com (en una nueva pestaña)"

new_tab can also be the name of a tab so that a tab is reused if the name is the same.

new_tab can also be a url. If new_tab is a url then it will look to see if there is a tab that starts with that url and use it, otherwise it will create a new tab. For example, here is a custom command that will look for a tab that starts with https://www.youtube.com/ when a user says "Youtube name of song" and use that tab if it exists:

Phrase: (Youtube) (.*?)
Action: url(https://www.google.com/search?btnI&q=youtube $2, https://www.youtube.com, true)
Description: "Play (title of song or video)" using Youtube
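The tab-reuse rule described for url() (use an existing tab whose address starts with the given url, otherwise open a new one) can be sketched as a simple lookup. Everything here is hypothetical: findReusableTab and the sample tab list are made up for illustration, and a real extension would get its tab list from the browser.

```javascript
// Hypothetical lookup: return the first open tab whose url starts with
// urlPrefix, or null when a new tab would have to be opened.
function findReusableTab(openTabs, urlPrefix) {
  return openTabs.find((tab) => tab.url.startsWith(urlPrefix)) || null;
}

// Made-up sample data standing in for the browser's list of open tabs.
const openTabs = [
  { id: 1, url: "https://mail.google.com/mail/" },
  { id: 2, url: "https://www.youtube.com/watch?v=abc" },
];

const tab = findReusableTab(openTabs, "https://www.youtube.com");
console.log(tab ? "reuse tab " + tab.id : "open a new tab"); // reuse tab 2
```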

Last updated on September 19, 2023
Created on December 11, 2016


User Comments

There are 862 comments.

Displaying first 50 comments.

1. Posted By: Jeff - - December 11, 2016, 1:44 pm
Description: Display the weather satellite (for New York)

Phrase: (?:Display|Show)(?:.*?)?(?:satellite|clouds)(?:.*?)?(?:for |of |in )?(.*?)?

Action: https://weather.weatherbug.com/maps/$1?layerId=GlobalSatellite

Jeff
www.seabreezecomputers.com/

2. Posted By: Jeff - - December 11, 2016, 2:25 pm
Phrase: (?:Display|Show)(?:.*?)?(?:moon)(?:.*?)?(phase)?

Action: http://api.usno.navy.mil/imagery/moon.png

Description: You can say: "Show me the moon" or "Display the current moon phase"

Jeff
www.seabreezecomputers.com/

3. Posted By: Jeff - - December 13, 2016, 2:26 pm
Phrase: (?:Display|Show)(?:.*?)?(?:rain|radar)(?:.*?)?(?:for |of |in )?(.*?)?
Action: https://weather.weatherbug.com/maps/$1?layerId=Radar.US

Description: Display the radar (for New York)

4. Posted By: Jeff - - December 13, 2016, 2:39 pm
Phrase: (?:Display|Show)(?:.*?)?(?:traffic)(?:.*?)?(?:for |of |in )?(.*?)?

Action: https://www.google.com/maps/place/$1/data=!5m1!1e1

Description: Show me the traffic (for Los Angeles)


5. Posted By: emin - - February 18, 2017, 3:38 pm
I need turkish language

6. Posted By: Jeff - - February 19, 2017, 5:28 pm
To change the speech recognition language to Turkish, In Speech Recognition Anywhere, click on "Settings" and then under "Language" you can select "Turkish".

Jeff
www.seabreezecomputers.com/

7. Posted By: Jeff - - February 25, 2017, 7:11 pm
Control Philips Hue Lights

1. Go to www.meethue.com/api/nupnp to get the IP address of your Hue Bridge.
2. Go to http://<bridge ip address>/debug/clip.html
3. For url enter: /api for message body enter: {"devicetype":"jeff"}
Where jeff is the username you want to create.
4. Press POST
5. You will get the message "link button not pressed". So press the big round link button on top of your Hue Bridge.
6. Press POST again.
7. This time you get a username hash similar to 1028d66426293e821ecfd9ef1a0731df . Save the username hash for future requests.

Then create a custom command in Speech Recognition Anywhere:

Phrase: Turn( on)?( the)? living room light(s)?( on)?
Action: http://<bridge ip address>/debug/clip.html;clear_text();/api/<username hash>/lights/1/state;click_keyword(messagebody);{"on":true};click_keyword(put)

Je

8. Posted By: Jeff - - March 6, 2017, 7:33 pm
Description: Youtube (any video)

Phrase: Youtube (.*?)

Action: https://www.youtube.com/results?search_query=$1;click_element(thumbnail)

Jeff
www.seabreezecomputers.com/

9. Posted By: Jeff - - March 15, 2017, 2:32 pm

Description: "Play (title of song or video)" with youtube

Phrase: ^Play (.*?)$

Action: http://www.google.com/search?q=youtube $0;click_element(res.a[0])


Jeff
www.seabreezecomputers.com/

10. Posted By: Jeff - - April 7, 2017, 9:23 pm
Description: What is the UV index in New York

Phrase: (?:What|Display|Show)(?:.*?)(?:UV index)(?:.*?)?(?:for |of |in )?(.*?)?

Action: http://sunburnmap.com/;;;;;clear_text();$1;click_keyword(find)

Jeff
www.seabreezecomputers.com

11. Posted By: Raymond - - June 12, 2017, 12:48 pm
I am not a programmer, so I need some help. I want to be able to switch Speech Recognition from English to Spanish and vice versa. Is there a script I can use for this? Any help will be greatly appreciated

June 20, 2017 - From the Editor:
Version 0.98.8 now has the voice command built in "Change the language to spanish or english"


12. Posted By: Paul LaZar - - July 4, 2017, 10:22 am
I just purchased and installed your speech software. I have installed it on a Windows Tablet running Windows 10 home (full version).

My only interest in your software is to enable OK Google which I have done and it is working. The problem is OK Google responds and takes me away from the web page an example:

"OK Google navigate to DC", the map page comes up with route shown and Google says "navigating to DC" and the page goes somewhere else.

I have tried this in various ways asking questions etc and everytime Google speaks, your software interprets it and types new pages.

Please advise.

13. Posted By: Jeff - - July 4, 2017, 2:49 pm
Hi Paul Lazar,

Sorry about the problem you are having. It is because the microphone is picking up the speech from your speakers. Try this, in Speech Recognition Anywhere click on Settings and then check Pause "Speech Recognition Anywhere" if audio is playing in a tab. I believe that should solve the problem.

Jeff
www.seabreezecomputers.com/


14. Posted By: Lola - - July 26, 2017, 4:09 am
Is there anyway to stop the text from appearing in the upper left hand corner before it is typed in the speech box? Also, is there a way to stop the yellow highlighting and scrolling when text is entered?

15. Posted By: Jeff - - July 26, 2017, 11:36 am
Hello Lola,

There is no way to stop the text from appearing before it is typed into the box. That is how users know that the speech recognition is hearing what they say. I'm not sure by what you mean about the yellow highlighting and scrolling. Do you have an example website and box where this is happening?

Jeff
www.seabreezecomputers.com/

16. Posted By: Nice software, but not quite working on one site - - August 1, 2017, 11:30 am
Nice software, just what I'm looking for, but unfortunately doesn't quite work for me on ankiweb.net. This site displays a link that I repeatedly click to hear voice prompts. When I try to do this using SRE, the recording starts to play but then I get a 404. I'm having trouble getting the source for the page, but I think what is going on is that the recording is played by a JavaScript snippet which is connected to the link as an onClick attribute. The location specified by the link doesn't really exist, so, e.g., if I right-click on the link and copy the destination to the clipboard and try to load it in another page, I do indeed get a 404. But if I manually click on the link, it works as designed -- i.e., it plays the voice recording and does not try to go to another page.

So I'm not sure of the disconnect here, and unfortunately you won't be able to repro easily without my credentials. If you want to pursue it, I'll give you my creds (and $150 if you make this work on ankiweb).

17. Posted By: Paul LaZar - - August 2, 2017, 9:44 am
Thanks for the info Jeff.

18. Posted By: Jeff - - August 2, 2017, 9:46 am
Hi Nelson,

I think I figured it out. ankiweb.net has another element overlaying the "Play" link with a javascript click event attached to it. So to use your voice and say "Click Play", add this custom command to Speech Recognition Anywhere:

Phrase: Click Play
Action: click_element(jp_container_1)

Or, if you like you can replace "Click Play" in the phrase above with "Press Play".

Jeff
www.seabreezecomputers.com/

19. Posted By: Nice software, but not quite working on one site - - August 2, 2017, 9:48 am
Nice work, Jeff -- this works for me.

20. Posted By: Kim V - - August 16, 2017, 1:31 pm
Hi. You should make an option to hide the labels when speaking. They are highly annoying when not using it, and it keeps trying to detect what I'm saying.

Another thing that would be great is to disable speech to text and only have it work with commands.

21. Posted By: Jeff - - August 17, 2017, 11:26 am
Hi Kim,

Thanks for the suggestions! I like the idea of being able to disable speech to text and only have it work with commands. But I'm not so sure about hiding the labels when speaking. I think most people would be confused and think that the Speech Recognition is not working because it would show no sign of it working until after they finish talking. Let me think about it and see what I can do.

Jeff
www.seabreezecomputers.com/

22. Posted By: Jeff - - August 21, 2017, 7:00 pm
Hi Kim V,

In version 0.98.9 of Speech Recognition Anywhere I added the following three settings:
*Disable yellow speech bubble (Only final speech will display)
*Disable Speech-To-Text (Only Voice Commands will work)
*Disable Voice Commands (Only Speech-To-Text will work)

I hope that works for you.

Jeff
www.seabreezecomputers.com/

23. Posted By: Kim V - - August 22, 2017, 12:35 pm
Hi Jeff.

You are awesome! Nice to see a suggestions getting into final product :)

24. Posted By: Jeff - - September 4, 2017, 9:32 am
Hi Samuel Cartaxo,

Thank you for notifying me of the error. Some people get the error and others don't. But Google seems to be doing nothing about it. How often do you get the error?

Jeff
www.seabreezecomputers.com/

25. Posted By: thomas - - September 28, 2017, 11:18 am
Hello!
I am curious what you are using internally for ASR and TTS.
Are you using Google's speech API, or built-in browser-specific tools from Chrome?
Thanks!
-thomas

26. Posted By: Jeff - - September 28, 2017, 12:39 pm
Hi thomas,

Speech recognition in Speech Recognition Anywhere is accomplished with the JavaScript Web Speech API, which at this time only works well in Chrome.
See: developer.mozilla.org/en-US/docs/Web/API/Web_Speech_API

TTS is accomplished with the Web Speech API SpeechSynthesis. It seems to work with Chrome, Firefox, Edge and Safari but not IE.
See: developer.mozilla.org/en-US/docs/Web/API/SpeechSynthesis
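
For readers curious what those two APIs look like in code, here is a minimal sketch. It is not the extension's source. The helper names createRecognizer and speak, and the scope parameter (normally window), are made up for this illustration. The property names (continuous, interimResults, onresult, isFinal, transcript) and the webkitSpeechRecognition prefix are from the Web Speech API documentation linked above.

```javascript
// Sketch only: hypothetical helpers, not code from Speech Recognition Anywhere.
// `scope` is the global object that may expose the Web Speech API (window in a browser).

function createRecognizer(scope, onFinal) {
  // Chrome exposes the constructor with a webkit prefix.
  const Recognition = scope.SpeechRecognition || scope.webkitSpeechRecognition;
  if (!Recognition) return null; // API not available in this environment

  const recognizer = new Recognition();
  recognizer.continuous = true;     // keep listening after each phrase
  recognizer.interimResults = true; // also deliver partial (non-final) results
  recognizer.onresult = (event) => {
    // Only pass along results the engine has marked as final.
    for (let i = event.resultIndex; i < event.results.length; i++) {
      const result = event.results[i];
      if (result.isFinal) onFinal(result[0].transcript);
    }
  };
  return recognizer;
}

function speak(scope, text) {
  // Text-to-speech through SpeechSynthesis; returns false when unsupported.
  if (!scope.speechSynthesis || !scope.SpeechSynthesisUtterance) return false;
  scope.speechSynthesis.speak(new scope.SpeechSynthesisUtterance(text));
  return true;
}

// Browser usage (skipped automatically outside a browser):
if (typeof window !== 'undefined') {
  const recognizer = createRecognizer(window, (text) => console.log('Final:', text));
  if (recognizer) recognizer.start();
}
```

Passing the global object in as a parameter keeps the feature check in one place and lets the helpers return null or false instead of throwing where the API is missing.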

Jeff
www.seabreezecomputers.com/

27. Posted By: whelb5050 - - November 10, 2017, 7:50 am
Will this work with the Chromium browser? I have a project involving a Raspberry Pi and I would like to open web pages and navigate them with voice.

28. Posted By: Jeff - - November 10, 2017, 10:22 am
Hi whelb5050,

I think it will work with the Chromium browser. But I have no idea if it will work on a Raspberry Pi. Can you test it and let us know?

Jeff
www.seabreezecomputers.com/

29. Posted By: bodyalive - - November 11, 2017, 9:27 am
Is there a way to disable the profanity filter? Thank you.

30. Posted By: Markey - - November 11, 2017, 9:55 am
For some reason this extension does not work on Facebook chat (the chat window closes itself for no reason; speech to text works fine though). Any help?


31. Posted By: whelb5050 - - November 11, 2017, 12:57 pm
Successful test on the Raspberry Pi. This saved our school project from failure.

32. Posted By: Jeff - - November 11, 2017, 12:58 pm
Hi whelb5050,

That is good to hear! I'm glad it worked. Thank you for letting us know.

Jeff
www.seabreezecomputers.com/

33. Posted By: Jeff - - November 11, 2017, 1:13 pm
Hi bodyalive,

You can try to filter profanity with a custom command like the following. I'm not sure how well it will work though:

Phrase: (.*?)(bad|word|list)(.*?)
Action: $1$3
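
To see what a Phrase/Action pair like this does, here is a rough sketch in plain JavaScript. It assumes the Phrase is treated as a case-insensitive regular expression matched against the whole spoken text, and that $1 and $3 in the Action refer to the captured groups. That is an assumption for illustration, and applyCustomCommand is a hypothetical helper, not the extension's actual code.

```javascript
// Sketch only: applyCustomCommand is a made-up helper.
// Assumption: the Phrase must match the entire transcript, ignoring case.
function applyCustomCommand(phrase, action, transcript) {
  const pattern = new RegExp('^(?:' + phrase + ')$', 'i');
  // If the phrase does not match, leave the transcript untouched.
  return pattern.test(transcript) ? transcript.replace(pattern, action) : transcript;
}

// "bad", "word" and "list" are placeholders for the words to remove.
const cleaned = applyCustomCommand(
  '(.*?)(bad|word|list)(.*?)',
  '$1$3',
  'this is a bad example'
);
console.log(JSON.stringify(cleaned)); // "this is a  example" (note the double space)
```

Under this reading, only the first listed word found in the sentence is removed, the space next to it is left behind, and a listed word would also match inside a longer word (for example "word" inside "password").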

Jeff
www.seabreezecomputers.com/

34. Posted By: Jeff - - November 11, 2017, 1:25 pm
Hi Markey,

I have duplicated the problem. It appears that Facebook might have some blocking in place to prevent malware or viruses from sending messages without user permission. I noticed that if I type a space first in the Facebook chat box then it will accept Speech Recognition Anywhere. But saying "Press enter" or "Submit" does not send the message. You have to manually press the enter key.

Jeff
www.seabreezecomputers.com/

35. Posted By: bodyalive - - November 12, 2017, 4:07 am
QUOTE "

You can try to filter profanity with a custom command like the following. I'm not sure how well it will work though:

Phrase: (.*?)(bad|word|list)(.*?)
Action: $1$3"

I'm sorry, this did not work.
I know how to stop Android's Voice-to-Text from blocking swear words. I wonder if there's a way to disable censorship in Chrome too.






36. Posted By: Jeff - - November 13, 2017, 11:22 am
Hi bodyalive,

Sorry, I misread your message. I thought you wanted to block profanity.

The Web Speech API in Chrome used to not filter out profanity, but now it does, and I can't find any setting to stop it from filtering profanity. You can try the following custom command, but you would have to make a separate command for each swear word:

Phrase: (.*?)(s\*+)(.*?)
Action: $1ship$3

Replace the s in the phrase with the first letter of the swear word for each custom command.

Jeff
www.seabreezecomputers.com/

37. Posted By: bodyalive - - November 15, 2017, 9:02 am
Jeff this did the trick!

Since some "bad words" share the same first letter, I had to use (.*?)(1stletter\*{n})(.*?) to tell one from another.
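
A small sketch of why counting the asterisks helps. It assumes Chrome masks a word as its first letter followed by one asterisk per remaining letter. The unmask helper, the rule list, and the second word "sugar" are invented for this example. The (?!\*) lookahead is an extra safeguard that is not part of the pattern above; it keeps a three-asterisk rule from matching the start of a longer mask.

```javascript
// Sketch only: unmask and its rules are hypothetical, not the extension's code.
// Assumption: a masked word looks like its first letter plus one "*" per hidden letter.
function unmask(transcript, rules) {
  return rules.reduce((text, { letter, stars, word }) => {
    // Match exactly `stars` asterisks after the letter, and no more.
    const pattern = new RegExp(letter + '\\*{' + stars + '}(?!\\*)', 'gi');
    return text.replace(pattern, word);
  }, transcript);
}

const rules = [
  { letter: 's', stars: 3, word: 'ship' },  // s***
  { letter: 's', stars: 4, word: 'sugar' }  // s****
];

const masked = 'the s*** carried s****';
console.log(masked.replace(/s\*+/g, 'ship')); // the ship carried ship
console.log(unmask(masked, rules));           // the ship carried sugar
```

With \*+ every masked word starting with "s" becomes the same replacement, while \*{n} ties each replacement to the length of the mask.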

thank you.

38. Posted By: whelb5050 - - November 15, 2017, 12:56 pm
How well does this extension handle people's names? For instance, I am building a directory for our college professors. I would like to say "show Dr. John Doe" and have it pull up his contact information. However, some of the professors' names are quite hard to pronounce.

39. Posted By: Jeff - - November 15, 2017, 12:59 pm
Hi bodyalive,

Thanks for letting us know! I'm glad it worked. It looks like you know some regular expressions. Good job!

Jeff
www.seabreezecomputers.com/

40. Posted By: Jeff - - November 15, 2017, 2:00 pm
Hi whelb5050,

The speech recognition seems to do really well for common names. Dr. John Doe works well, and so do many other common names. For uncommon names, if it gets one wrong then you could try creating a custom command. For example, here is an uncommon spelling of a name: "Alexus Rodrigues". If I say that name then the speech recognition prints out the common spelling, "Alexis Rodriguez". So assuming I do not have an Alexis Rodriguez in the database, I could create this custom command:

Phrase: Alexis Rodriguez
Action: Alexus Rodrigues
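
For a directory with several such names, the same idea can be pictured as a lookup table that is applied to the recognized text before searching. This is only an illustration in plain JavaScript. The corrections table and the correctNames helper are hypothetical, and the match is a simple case-sensitive text substitution.

```javascript
// Sketch only: hypothetical table mapping what the recognizer prints
// to the spelling actually used in the directory.
const corrections = new Map([
  ['Alexis Rodriguez', 'Alexus Rodrigues']
]);

function correctNames(transcript) {
  let text = transcript;
  for (const [heard, wanted] of corrections) {
    // split/join replaces every occurrence without needing regex escaping.
    text = text.split(heard).join(wanted);
  }
  return text;
}

console.log(correctNames('show Dr. Alexis Rodriguez')); // show Dr. Alexus Rodrigues
```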

I hope that helps,

Jeff
www.seabreezecomputers.com/

41. Posted By: Maicon - maiconpinheiro@zoho.com - November 18, 2017, 9:40 pm
The buy option in the Chrome Web Store is grayed out, and I can't buy it. Can you guys help me out?

42. Posted By: Jeff - - November 19, 2017, 11:11 am
Hi Maicon,

I don't know why, but it appears that the Chrome Web Store blocks some countries from buying. Some users have had success by clicking on the Settings button in the Chrome Web Store and changing their country to one of the supported countries. See: developer.chrome.com/webstore/pricing

Other users have had success by using a VPN service like HotSpot Shield (also a Chrome extension) to change their country temporarily.

Jeff
www.seabreezecomputers.com/


43. Posted By: Maicon - maiconpinheiro@zoho.com - November 20, 2017, 6:44 pm
Thank you! I've managed to purchase successfully. BTW, do you guys know a custom command to play music on YouTube that works better than using the Google search engine? Sometimes it doesn't play the song, but a cover.

44. Posted By: JohnS - - November 20, 2017, 8:10 pm
How do we control capital letters? In Windows Voice Recognition, "UPPERCASE" results in WORD and "Capitalize" results in Word; the same goes for lowercase and NoCaps. I can't get SR Anywhere to do this.
Also, how do I control the numeral 1 vs. the word "one", or thirteen vs. 13, or one thousand eighty seven vs. 1087?
It may be the way I speak, but if I say "and" alone, the software takes me to the end of the line, whereas if I say "they went and" it works fine.
I've processed about 1000 lines so far and these are the challenges for me. It works very well otherwise; names are fantastic.

45. Posted By: JohnS - - November 20, 2017, 8:54 pm
One other thing I've noticed: the word "O'CLOCK", when spoken, pops up the word "OK" but prints nothing.

46. Posted By: Jeff - - November 21, 2017, 11:39 am
Hi Maicon,

Have you tried the custom commands in comments 8 and 9 for playing music? You might be able to come up with a variation of one of those commands.

Jeff
www.seabreezecomputers.com/

47. Posted By: Jeff - - November 21, 2017, 11:55 am
Hi JohnS,

Thank you for your feedback. Speech Recognition Anywhere uses the Web Speech API built into Chrome, so unfortunately we are somewhat limited by the capabilities that Google has programmed into the speech recognition.

Google's Web Speech API doesn't seem to have an "all caps" or "uppercase" command, so I would have to program it myself to have that capability.

As for numbers, Google seems to like to turn most numbers into numerals instead of words, probably because they are assuming people are filling out forms or setting calendar events or reminders.

As for saying "and" taking you to the end of the line: I programmed that command because sometimes when I said "end" by itself the speech recognition thought I was saying "and", and I didn't think anyone would be saying "and" by itself. But I will remove that in the next version so that "and" will print.

Jeff
www.seabreezecomputers.com/

48. Posted By: Jeff - - November 21, 2017, 11:59 am
Hi JohnS,

You are right. I didn't notice that before. Google's Web Speech API seems to not understand "o'clock". I tried it over and over. I would say "seven o'clock" and it would just print "7".

But I found a very strange workaround. If I say "seven clock" without the "o" then it prints "7 o'clock".

Jeff
www.seabreezecomputers.com/

49. Posted By: Maicon - - November 21, 2017, 1:43 pm
I need to create a command to stop a YouTube video, so I tried using the command to close the tab or to press the spacebar, but I can't make it work. Sorry, I don't know much about those things.

50. Posted By: Jeff - - November 21, 2017, 1:58 pm
Hi Maicon,

If you say "Click on pause" or "press pause" on the YouTube page then it should pause the video. But if your speakers are too loud or the microphone is too close to the speakers then it may not understand you properly. You can also say "Click on play" or "press play" to start the video again.

Also make sure that the setting 'Pause "Speech Recognition Anywhere" if audio is playing in a tab' is not checked. Otherwise speech recognition will not work while audio is playing.

Jeff
www.seabreezecomputers.com/