Controlling the Computer with Your Voice

AbilityNet Factsheet – August 2015

Controlling the computer with your voice

This factsheet provides an overview of how you can control computers (and tablets and smartphones) with your voice.

Communication technology continues to evolve at a rapid pace. Using voice recognition software to dictate to and control your computer has become an increasingly attractive and practical proposition for many different users.

For some disabled people who might struggle, or find it impossible, to work with a mouse or keyboard, speech recognition opens up a world of productive possibilities. The same goes for many people with dyslexia who would otherwise find it very difficult, and much more time consuming, to communicate clearly and professionally using the written word.

For employees generally, voice recognition opens up many possible new ways of working – helping to reduce, for example, the risk of repetitive strain injury and also making mobile working far easier and more efficient.

For employers, investing in voice control can be a very ‘reasonable adjustment’ in preventing discrimination against, and maximising the productivity of, disabled staff.

Although the technology is now widely available, different options for speech recognition do require careful consideration for all sorts of practical reasons. Most staff will also require additional training and support to best use of it.

Section 9 gives details of the services that AbilityNet provides to assist businesses, charities and individuals with identifying and realising the best possible solution for their particular situation and needs. Links to other helpful resources are given in Section 10.

Page 1 of 1 Page 1 of 1

AbilityNet Factsheet: Controlling the computer with your voice,

August 2015. www.abilitynet.org.uk/factsheets

Contents

1 How can you control a computer with your voice? 3

2 Who can voice recognition software help? 3

3 What are the benefits to employers? 3

4 How does voice recognition software work? 4

5 What can you do with voice recognition software? 5

6 What voice recognition software is available? 6

7 What about tablets and smartphones? 8

8 How important is training? 9

9 How can AbilityNet help you find the best solution? 9

10 Other useful links and resources 10

11 About AbilityNet 12

1 How can you control a computer with your voice?

Voice recognition software provides an alternative to typing on a keyboard .At its simplest, it provides a fast method of writing on a computer – you talk into a microphone and your words appear in a text file on screen.

However, many voice recognition programmes are far more sophisticated – while still easy to use. You can use them to start and control your computer, using simple spoken commands, for example, to:

§ open and run different programmes

§ produce and format a document

§ write and send email

§ browse the internet and specific websites.

Very often, it isn’t even necessary to purchase specialist software. Quite powerful voice recognition programmes are now built into new computers, tablets and smartphones.

2 Who can voice recognition software help?

Voice recognition offers significant benefits to a wide variety of potential users. Most obviously, it is extremely useful for anyone with a physical disability who finds typing difficult, painful or impossible. Additionally, it can help to reduce the risk of getting a repetitive strain injury (RSI) or to manage any such upper limb disorder more effectively.

Voice recognition programmes can also greatly benefit people with dyslexia who would otherwise struggle with spellings and/or structuring sentences correctly.

More generally, voice recognition can help to make mobile working easier, as well as offering potential productivity benefits to anyone who might not be very proficient at typing. In fact, most people can talk much faster than they can type accurately – while ‘hands-free’ computing also offers additional scope for multi-tasking.

3 What are the benefits to employers?

Making appropriate use of voice recognition software can help employers to meet their legal ‘duty of care’ to ensure the health, safety and welfare at work of their staff. Employers must take action to minimise the risk of illness or injury to their employees including through, for example, RSI. Voice recognition software helps prevent fatigue and injury by offering a comfortable, ergonomic alternative to the keyboard and mouse.

Employers who do not meet their statutory responsibilities for health and safety may be taken to an employment tribunal. They could also be vulnerable to claims of discrimination under the Equality Act, if they failed to make ‘reasonable adjustments’ for their disabled employees. Utilising voice recognition is a very straightforward and cost-effective adjustment that an employer could make to give a disabled member of staff equal access to everything involved in doing and keeping their job as a non-disabled employee.

Aside from concerns regarding health and safety or discrimination, employers should consider the potential efficiency and productivity gains that could be achieved by deploying voice recognition more widely, particularly for staff working ‘on the move.’

4 How does voice recognition software work?

Voice recognition programmes work by analysing sounds and converting these to text. The software draws on a vast vocabulary and a knowledge of how English is spoken to determine what the speaker most probably said. Some programmes allow you to personalise the vocabulary by adding additional words including, for example, technical terms and people’s surnames.

Recording your voice

Voice recognition software captures and converts your speech via a microphone. Some computers include built-in microphones, but most specialist voice recognition programmes also include a microphone headset. You connect this to your computer, either through its soundcard socket or via a USB (or similar) connection.

It is also possible to use a suitable hand-held digital recorder to dictate recordings – something that may be especially useful for mobile working. Voice recognition applications can transcribe recordings from a number of formats (including wav, mp3 and wma).

Enrolment

Everyone’s voice and phrasing sounds slightly different, so the most effective programmes use a simple, one-off process called ‘enrolment’ for the software to determine how you speak. This only takes a few minutes and simply involves you reading an on-screen article. Although some programmes say this voice training is not necessary, it is still worth doing to get the best results.

5 What can you do with voice recognition software?

Dictating and correcting

When talking, people often hesitate, mumble or slur their words. One of the key skills in using voice recognition software is learning how to talk clearly so that the computer can recognise what you are saying. This means planning what to say and then speaking in complete phrases or sentences. Your voice recognition software will misunderstand some of the words spoken, so it is necessary to proofread and correct any mistakes.

Corrections to your documents (letters, reports, emails etc.) can be made easily using the mouse and keyboard or your voice. As you do this, the software will adapt and learn so that, hopefully, the same mistake will not occur again. Some programmes achieve 99% accuracy with careful dictation and correction.

Editing and formatting

Text can be edited very easily. You can highlight the text to be changed by using commands such as “Select line” or “Select paragraph” and then saying the changes you want to make to the selected text.

Applying formatting is just as straightforward. For example, if a document includes the phrase ‘introductory thoughts’, you can underline this phrase by saying “Select introductory thoughts” and then saying “Underline that.”

Controlling the computer

Most voice recognition software gives you the ability to start, navigate and control your computer programmes through spoken commands.

§ Commands vary, depending on which voice-recognition programme you are using. The programme Dragon NaturallySpeaking uses natural language commands so you could say, for example, “Start Microsoft Word,” and then “Open letter to John.”

§ Menus can be selected simply by pausing and then saying the menu item. For example, saying “File” would open the file menu.

§ On the Internet, you can dictate web addresses and browse websites simply by saying the text in the link.

§ For tasks that require a mouse, there are spoken commands to enable the mouse to be moved, dragged and clicked.

Some voice recognition programmes provide more advanced features for controlling how your computer works. These enable you to use text macros for dictation shortcuts, and scripts for automating certain tasks. You could use these features, for example, to add some standard signature text to all of your emails, and to open a number of specific programmes each time you start up your computer.

Text to speech

Some applications, including Dragon NaturallySpeaking provide a text-to-speech option (for listening back to the text file you have created) and, also, audio playback of your own dictation (so you can compare what you actually said with the software-generated text).

The text-to-speech facility is especially useful for people with a sight impairment (who would find it difficult or impossible to read any text file) and for anyone with dyslexia.

Details of some specialist text-to-speech programmes are given in Section 10.

6 What voice recognition software is available?

Dragon NaturallySpeaking

Produced by Nuance, Dragon NaturallySpeaking is the market-leading voice recognition software. Home, Premium and Professional editions are available for personal and business uses, with the Professional version offering the greatest functionality.

Nuance claim that Dragon NaturallySpeaking works three times faster than typing, and achieves 99% accuracy. The Home edition probably meets most users needs, enabling you to:

§ operate word processing programmes to dictate, edit and format reports, letters, memos and to-do lists etc.

§ search the web or specific websites, and browse online stores

§ produce, send and receive emails, and manage your calendar

§ stay connected through social media, posting to Facebook and Twitter

§ use the microphone built into your computer, to go both hands-free and headset-free

§ dictate wirelessly using a Bluetooth microphone (or the free Dragon Recorder App for iPads and iPhones, available from the App Store)

§ easily check all your outputs using natural sounding text-to-speech or audio playback of your own dictation.

Additional features offered by the more powerful editions include the ability to:

§ create spreadsheets and presentations using Excel and PowerPoint

§ use a digital recorder to capture your voice on the go

§ create custom commands and scripts to insert frequently used text and automate repeated tasks.

All versions of Dragon NaturallySpeaking run on Windows-based personal computers.

Nuance also produce an equivalent version for Apple computers (running OS X Mountain Lion 10.8.3 or later) called Dragon Dictate.

For more information and to purchase Dragon software, visit http://www.nuance.co.uk/dragon/index.htm

Windows Speech Recognition

All Microsoft Windows computers now have a built-in voice recognition programme called Speech Recognition. Although it does not boast as many features as Dragon NaturallySpeaking, it is free and easy to use with good recognition rates.

To improve its performance, having set up your microphone, you can train Windows Speech Recognition to better understand your voice and spoken commands by creating a voice profile. You can then use voice commands to run applications, dictate and edit text within documents and emails, browse the web, fill out forms, interact with other elements of the operating system, and control your mouse cursor.

The functionality of Windows Speech Recognition can be extended through a separate utility released by Microsoft that enables the creation and use of custom macros.

Dictation on a Mac

Apple Mac computers running OS X Mountain Lion, Mavericks or Yosemite also provide free built-in dictation software. This can be accessed through the ‘Dictation and Speech’ panel within System Preferences.

Although it is a basic programme offering more limited voice control, it does allow you to dictate text anywhere you could type it by keyboard.

In Mountain Lion, OS X dictation works by sending up to 30 second chunks of recorded speech to Apple’s servers for processing – which is fine as long as you have a stable and reliable broadband connection.

In Mavericks and Yosemite, turning on ‘Enhanced Dictation’ enables continuous speech recognition and also allows offline processing. The most recent version of OS X, Yosemite, also introduced many new editing and formatting commands and the ability to create additional dictation commands yourself.

7 What about tablets and smartphones?

Intelligent personal assistants are an important feature of all modern tablets and smartphones. They use voice recognition technology and a natural language user interface to provide a range of services. Some of the most popular personal assistants include:

§ Siri – for iOS devices (iPads and iPhones)

§ Google Now – for Android and iOs devices

§ Cortana – for Windows devices.

These personal assistants offer similar (but varying) features to help with everyday tasks – responding to your voice commands and requests to provide information and answer queries (through online sources), send messages and emails, make phone calls, take notes, schedule meetings, and play music.

Additionally, a wide range of personal assistant apps for Android and iOS devices is also available from Google Play and the App Store. Very useful options here include vokul (for iOS) and Evi (for Android, iOS and Kindle Fire).

vokul provides complete hands-free voice control app for dictating text messages and emails, posting to Facebook and Twitter, listening to music and podcasts, and calling contacts in your address book. It is tuned for noisy environments, including your car.

Evi is a mobile app that draws on a vast, and growing, database to answer your everyday questions about books, music, films, conversions, history, people, places and much more. You can ask your questions using text or speech, and get a vocal response.

8 How important is training?

Appropriate training is crucial for users to realise the full benefits of working with voice recognition programmes. To make the best out of any training, it is most helpful to spread it out over a period of weeks – giving the user sufficient opportunity to practice new skills and consolidate their learning between formal coaching sessions.

Training will be most effective when it geared towards the specific needs of the individual, focusing on their particular tasks and challenges. It is especially important for learning to talk to your computer because you also need to train the software to respond accurately to how you speak and to learn any more specialist vocabulary.