CA2570767A1 - Simultaneous support of isolated and connected phrase command recognition in automatic speech recognition systems - Google Patents

Simultaneous support of isolated and connected phrase command recognition in automatic speech recognition systems Download PDF

Info

Publication number
CA2570767A1
CA2570767A1 CA002570767A CA2570767A CA2570767A1 CA 2570767 A1 CA2570767 A1 CA 2570767A1 CA 002570767 A CA002570767 A CA 002570767A CA 2570767 A CA2570767 A CA 2570767A CA 2570767 A1 CA2570767 A1 CA 2570767A1
Authority
CA
Canada
Prior art keywords
command
active
commands
controller
identifying
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CA002570767A
Other languages
French (fr)
Other versions
CA2570767C (en
Inventor
Gang Wang
Matteo Contolini
Chengyi Zheng
David Chatenever
Heinz-Werner Stiller
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Storz Endoskop Produktions GmbH Germany
Original Assignee
Storz Endoskop Produktions GmbH Germany
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Storz Endoskop Produktions GmbH Germany filed Critical Storz Endoskop Produktions GmbH Germany
Publication of CA2570767A1 publication Critical patent/CA2570767A1/en
Application granted granted Critical
Publication of CA2570767C publication Critical patent/CA2570767C/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/226Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
    • G10L2015/228Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of application context

Abstract

A system for operating one or more devices using speech input including a receiver for receiving a speech input, a controller in communication with said receiver, software executing on said controller for converting the speech input into computer-readable data, software executing on said controller for generating a table of active commands, the table including a portion of all valid commands of the system, software executing on said controller for identifying at least one active command represented by the data, and software executing on said controller for transmitting the active command to at least one device operable by the active command.

Claims (29)

1. A system for operating one or more devices using speech input, comprising:
a receiver for receiving a speech input;
a controller in communication with said receiver;
software executing on said controller for converting the speech input into com-puter-readable data;

a command menu including a plurality of command menu levels;
software executing on said controller for identifying at least two commands in sequence represented by the data, each of the at least two commands from a different command menu level; and software executing on said controller for sending the at least two commands to one or more devices operable by the system.
2. The system according to claim 1, further comprising:
software executing on said controller for identifying an isolated command repre-sented by the data; and software executing on said controller for implementing the isolated command.
3. The system according to claim 1, wherein said command menu includes one or more global commands.
4. The system according to claim 3, further comprising:
software executing on said controller for identifying a global command repre-sented by the data; and software executing on said controller for implementing the global command.
5. The system according to claim 1, wherein the one or more devices operable by the system are medical devices.
6. A system for operating one or more devices using speech input, comprising:
a receiver for receiving a speech input;
a controller in communication with said receiver;
software executing on said controller for converting the speech input into com-puter-readable data;

software executing on said controller for generating a table of active commands, the table including active commands selected from at least two different levels of a command menu;
software executing on said controller for identifying at least one active command represented by the data; and software executing on said controller for sending the at least one active com-mand to one or more devices operable by the system.
7. The system according to claim 6, wherein the at least one active command is an isolated command.
8. The system according to claim 6, wherein said software for identifying at least one active command identifies at least one other active command in sequence.
9. The system according to claim 8, wherein each of the at least one active com-mand and at least one other active command are sequential commands each from a different level of the command menu.
10. The system according to claim 6, wherein the table of active commands includes at least one global command.
11. The system according to claim 10, further comprising:
software executing on said controller for identifying a global command repre-sented by the data; and software executing on said controller for implementing the global command.
12. The system according to claim 6, wherein the one or more devices operable by the system are medical devices.
13. The system according to claim 6, wherein the speech input includes isolated speech.
14. The system according to claim 6, wherein the speech input includes continuous speech.
15. The system according to claim 6, wherein the speech input includes isolated speech and continuous speech.
16. The system according to claim 6, wherein the table of active commands includes at least one isolated command phrase and at least one concatenated command phrase.
17. The system according to claim 6, wherein the active commands are selected from the command menu based on a depth parameter.
18. The system according to claim 17, wherein the depth parameter is indicative of a deviation from a current menu position.
19. The system according to claim 18, wherein the depth parameter is indicative of a number of menu levels.
20. The system according to claim 6, wherein said software for identifying at least one active command parses the data into one or more potential commands.
21. The system according to claim 6, wherein said software for identifying at least one active command queries the table of active commands.
22. The system according to claim 6, wherein said software for identifying at least one active command queries a table of command equivalents.
23. A method of controlling a device using a speech input, comprising the steps of:
determining commands associated with each device of a system from a com-mand menu;
generating a table of active commands, wherein the table includes active com-mands selected from at least two different levels of the command menu;
receiving a speech input;
converting the speech input into computer-readable data;
identifying at least one active command represented by the data; and sending the active command to at least one device to which the active command pertains.
24. The method according to claim 23, wherein the step of generating the table of active commands includes determining the last active command identified.
25. The method according to claim 23, wherein the step of generating the table of active commands includes utilizing a depth parameter, the depth indicative of a number of menu levels.
26. The method according to claim 23, wherein the step of identifying at least one active command includes parsing the data into one or more potential commands.
27. The method according to claim 23, further comprising the step of:
displaying the identified at least one active command.
28. The method according to claim 23, wherein the step of identifying the at least one active command includes generating a prompt to a user of the system.
29. The method according to claim 23, wherein the table of active commands in-cludes at least one isolated command and at least one command sequence.
CA2570767A 2005-12-20 2006-12-11 Simultaneous support of isolated and connected phrase command recognition in automatic speech recognition systems Active CA2570767C (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US11/312,785 2005-12-20
US11/312,785 US7620553B2 (en) 2005-12-20 2005-12-20 Simultaneous support of isolated and connected phrase command recognition in automatic speech recognition systems

Publications (2)

Publication Number Publication Date
CA2570767A1 true CA2570767A1 (en) 2007-06-20
CA2570767C CA2570767C (en) 2010-10-19

Family

ID=37891749

Family Applications (1)

Application Number Title Priority Date Filing Date
CA2570767A Active CA2570767C (en) 2005-12-20 2006-12-11 Simultaneous support of isolated and connected phrase command recognition in automatic speech recognition systems

Country Status (4)

Country Link
US (1) US7620553B2 (en)
EP (1) EP1801780B1 (en)
JP (1) JP4842114B2 (en)
CA (1) CA2570767C (en)

Families Citing this family (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7624019B2 (en) * 2005-10-17 2009-11-24 Microsoft Corporation Raising the visibility of a voice-activated user interface
US7620553B2 (en) 2005-12-20 2009-11-17 Storz Endoskop Produktions Gmbh Simultaneous support of isolated and connected phrase command recognition in automatic speech recognition systems
US20080097176A1 (en) * 2006-09-29 2008-04-24 Doug Music User interface and identification in a medical device systems and methods
US8005237B2 (en) * 2007-05-17 2011-08-23 Microsoft Corp. Sensor array beamformer post-processor
US20090210233A1 (en) * 2008-02-15 2009-08-20 Microsoft Corporation Cognitive offloading: interface for storing and composing searches on and navigating unconstrained input patterns
US8515763B2 (en) * 2009-11-24 2013-08-20 Honeywell International Inc. Methods and systems for utilizing voice commands onboard an aircraft
US20130041662A1 (en) * 2011-08-08 2013-02-14 Sony Corporation System and method of controlling services on a device using voice data
KR20130133629A (en) 2012-05-29 2013-12-09 삼성전자주식회사 Method and apparatus for executing voice command in electronic device
US9189465B2 (en) * 2012-09-28 2015-11-17 International Business Machines Corporation Documentation of system monitoring and analysis procedures
US9584642B2 (en) * 2013-03-12 2017-02-28 Google Technology Holdings LLC Apparatus with adaptive acoustic echo control for speakerphone mode
US10373615B2 (en) * 2012-10-30 2019-08-06 Google Technology Holdings LLC Voice control user interface during low power mode
US10381002B2 (en) * 2012-10-30 2019-08-13 Google Technology Holdings LLC Voice control user interface during low-power mode
US10304465B2 (en) * 2012-10-30 2019-05-28 Google Technology Holdings LLC Voice control user interface for low power mode
TWI519122B (en) * 2012-11-12 2016-01-21 輝達公司 Mobile information device and method for controlling mobile information device with voice
US9264801B2 (en) 2012-12-04 2016-02-16 Storz Endoskop Produktions Gmbh System and method for pairing a command device incorporating a microphone to a remotely controlled medical system
KR101433506B1 (en) * 2013-01-29 2014-08-22 엘에스산전 주식회사 Operation method of energy management system using an isolated language voice recognition
US9414004B2 (en) 2013-02-22 2016-08-09 The Directv Group, Inc. Method for combining voice signals to form a continuous conversation in performing a voice search
JP2015011170A (en) * 2013-06-28 2015-01-19 株式会社ATR−Trek Voice recognition client device performing local voice recognition
US10186262B2 (en) * 2013-07-31 2019-01-22 Microsoft Technology Licensing, Llc System with multiple simultaneous speech recognizers
US8768712B1 (en) 2013-12-04 2014-07-01 Google Inc. Initiating actions based on partial hotwords
US20160078864A1 (en) * 2014-09-15 2016-03-17 Honeywell International Inc. Identifying un-stored voice commands
US11062707B2 (en) * 2018-06-28 2021-07-13 Hill-Rom Services, Inc. Voice recognition for patient care environment

Family Cites Families (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2688413B2 (en) * 1987-10-06 1997-12-10 株式会社日立製作所 Plant operation monitoring device
WO1991013431A1 (en) * 1990-02-26 1991-09-05 Motorola, Inc Method and apparatus for recognizing string of word commands in a hierarchical command structure
US7053752B2 (en) 1996-08-06 2006-05-30 Intuitive Surgical General purpose distributed operating room control system
US6463361B1 (en) 1994-09-22 2002-10-08 Computer Motion, Inc. Speech interface for an automated endoscopic system
US6646541B1 (en) 1996-06-24 2003-11-11 Computer Motion, Inc. General purpose distributed operating room control system
US5794196A (en) * 1995-06-30 1998-08-11 Kurzweil Applied Intelligence, Inc. Speech recognition system distinguishing dictation from commands by arbitration between continuous speech and isolated word modules
US5970457A (en) 1995-10-25 1999-10-19 Johns Hopkins University Voice command and control medical care system
US6496099B2 (en) 1996-06-24 2002-12-17 Computer Motion, Inc. General purpose distributed operating room control system
US6642836B1 (en) 1996-08-06 2003-11-04 Computer Motion, Inc. General purpose distributed operating room control system
US6301560B1 (en) * 1998-01-05 2001-10-09 Microsoft Corporation Discrete speech recognition system with ballooning active grammar
US6182046B1 (en) * 1998-03-26 2001-01-30 International Business Machines Corp. Managing voice commands in speech applications
US6456972B1 (en) * 1998-09-30 2002-09-24 Scansoft, Inc. User interface for speech recognition system grammars
JP2000194391A (en) * 1998-12-25 2000-07-14 Kojima Press Co Ltd Voice recognition controller
US6266635B1 (en) 1999-07-08 2001-07-24 Contec Medical Ltd. Multitasking interactive voice user interface
US6601026B2 (en) 1999-09-17 2003-07-29 Discern Communications, Inc. Information retrieval by natural language querying
US6587818B2 (en) 1999-10-28 2003-07-01 International Business Machines Corporation System and method for resolving decoding ambiguity via dialog
US6591239B1 (en) 1999-12-09 2003-07-08 Steris Inc. Voice controlled surgical suite
EP1346344A1 (en) * 2000-12-18 2003-09-24 Koninklijke Philips Electronics N.V. Store speech, select vocabulary to recognize word
JP3997459B2 (en) * 2001-10-02 2007-10-24 株式会社日立製作所 Voice input system, voice portal server, and voice input terminal
JP2003241784A (en) * 2002-02-21 2003-08-29 Nissan Motor Co Ltd Speech input and output device
US7149983B1 (en) * 2002-05-08 2006-12-12 Microsoft Corporation User interface and method to facilitate hierarchical specification of queries using an information taxonomy
JP4107093B2 (en) * 2003-01-30 2008-06-25 株式会社日立製作所 Interactive terminal device and interactive application providing method
US7620553B2 (en) 2005-12-20 2009-11-17 Storz Endoskop Produktions Gmbh Simultaneous support of isolated and connected phrase command recognition in automatic speech recognition systems

Also Published As

Publication number Publication date
JP4842114B2 (en) 2011-12-21
US20070150288A1 (en) 2007-06-28
EP1801780A1 (en) 2007-06-27
JP2007171963A (en) 2007-07-05
EP1801780B1 (en) 2011-09-14
US7620553B2 (en) 2009-11-17
CA2570767C (en) 2010-10-19

Similar Documents

Publication Publication Date Title
CA2570767A1 (en) Simultaneous support of isolated and connected phrase command recognition in automatic speech recognition systems
CN1764945B (en) Distributed speech recognition system
JP6115941B2 (en) Dialog program, server and method for reflecting user operation in dialog scenario
CN101272418B (en) Communication terminal and method for long-range controlling communication terminal
EP2557565A1 (en) Voice recognition method and apparatus
EP4064713A1 (en) Voice control method and apparatus, server, terminal device, and storage medium
EP1054388A3 (en) Method and apparatus for determining the state of voice controlled devices
RU2014134443A (en) METHOD AND DEVICE FOR CONTROLING THE STATUS OF LOCKING / UNLOCKING THE TERMINAL THROUGH SPEECH RECOGNITION
TW200942784A (en) Navigation device, system & method
CN104978964B (en) Phonetic control command error correction method and system
EP1933287A3 (en) Remote control system and method for portable terminals
EP2663064A3 (en) Method and system for operating communication service
EP2209333A3 (en) Communication system, communication device, program and communication control method
WO2008144638A3 (en) Systems and methods of a structured grammar for a speech recognition command system
CA2351701A1 (en) System and method for implementing a natural language user interface
WO2008005897A3 (en) System and method for operating a mobile device, such as providing an out of box connection system for uma type mobile devices
WO2007035385A3 (en) Remote control system
WO2009063565A1 (en) Control system, control method, master device, and controller
WO2004068251A3 (en) Integrated control system to control addressable remote devices
WO2002030097A1 (en) Telephone device and translation telephone device
JP2004007297A5 (en)
CN103474068A (en) Method, equipment and system for implementing voice command control
CN106373566A (en) Data transmission control method and device
WO2004057551A3 (en) System with macrocommands
US20170053645A1 (en) Speech recognition system with abbreviated training

Legal Events

Date Code Title Description
EEER Examination request