Hello, thank you for the feedback! It is great to hear that you are finding it useful. Making this application powerful enough to use at work is definitely something we're focused on. Responses to your feedback below.
Page down triggered for failed recognition
I'm assuming this is happening to you within the browser. Most browsers will trigger a page down when you press the space key. This means that when you say something like "open new tub", each of the two spaces in that text trigger a page down. We are considering a "commands-only" interpreter state that you could easily turn on and off while dictating. In the meantime, if there is something in particular that is misrecognized frequently for you, you can update your settings to include command alternates. See the "Command" section of this help file:
https://utterlyvoice.com/help/settings-files.
Editing text
The basic and windows modes provide basic text editing commands that work in most applications. For example, "go left word four", "clear right word two", "go left fifteen". However, if you use a particular text editor frequently, and the text editor supports many keyboard shortcuts for editing, you can get much more efficient with your voice commands.You can create a custom mode file with any voice commands that you like. Many good editors support commands for jumping up/down paragraphs, sentences, words, etc. To learn how to create a custom mode, see the "Customize" group of help documents starting with YAML:
https://utterlyvoice.com/help/yaml
Triggering keyboard shortcuts
Yes, you can absolutely create any voice commands of your choosing to trigger any keyboard shortcuts. As mentioned above, check out the "Customize" group of help documents. These documents take you step by step towards learning how to customize everything. Here is a high level description:
- You can define new mode files in your config/modes directory. You can activate and deactivate these modes while you are dictating. Each mode is a collection of commands.
- Each command has a name that triggers it's execution, and as mentioned above, you can also provide alternates that will trigger execution.
- Each command lists a sequence of function calls. For a simple keyboard shortcut, this will just be one function call to the keyPress function. However, you can define complex commands that trigger a sequence of actions.
The documentation goes into complete detail for describing how to customize. However, if you take a look at the existing files in config\modes, you can see how all of the default commands are defined as an example to quickly grasp how it works. You will see many commands like the following:
- name: "go left word"
description: >-
Moves the cursor one or more words to the left.
The optional utterance argument is the number of times the key should be pressed.
If the argument is not provided,
the key is pressed once.
biasFactor: 1.2
alternates:
- "go leftward"
functions:
- name: "keyRepeat"
fixedArguments:
- "control"
- "left"
utteranceArguments: 1
You will find details for each of the fields for this command in the documentation, but here is a summary:
- Either "go left word" or "go leftward" will trigger the command.
- It triggers a single function called keyRepeat. This function will execute the provided keyboard shortcut a certain number of times according to the single utterance argument provided to the command. See all function descriptions at https://utterlyvoice.com/help/functions.
Many number labels for "show"
Yes, you are noticing some of the tradeoffs for the algorithm we chose. The algorithm is looking for any isolated elements on your screen. When they are very close to each other, they are not recognized as unique. When images are on the screen, many isolated elements will be found. We do believe this is the best algorithm we have seen in any dictation program, but there is definitely room for improvement. We are always looking for ways to improve it. You should definitely use "show" along with "show links" when browsing the web. "Show links" does a better job at finding each clickable link on a webpage.
If you're having trouble reading the labels, it would probably be good for us to add some way to adjust label size in the settings. We added this to our task list for a future version.
Open tab five
I believe you're just using the wrong command. Try "go to tab five". Note that this command only works for values up to 8, because that is the limit of the keyboard shortcut. When you have a lot of tabs open, you might find the "go right/left tab" commands useful. For example, "go right tab twenty".