USRE41002E1 - Telephone for the deaf and method of using same - Google Patents

Telephone for the deaf and method of using same Download PDF

Info

Publication number
USRE41002E1
USRE41002E1 US09/603,247 US60324700A USRE41002E US RE41002 E1 USRE41002 E1 US RE41002E1 US 60324700 A US60324700 A US 60324700A US RE41002 E USRE41002 E US RE41002E
Authority
US
United States
Prior art keywords
person
phrases
signing
motions
words
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
US09/603,247
Inventor
Raanan Liebermann
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
ALEXANDER TRUST
Original Assignee
Raanan Liebermann
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Raanan Liebermann filed Critical Raanan Liebermann
Priority to US09/603,247 priority Critical patent/USRE41002E1/en
Application granted granted Critical
Publication of USRE41002E1 publication Critical patent/USRE41002E1/en
Assigned to ALEXANDER TRUST reassignment ALEXANDER TRUST UNCONDITIONAL ASSIGNMENT Assignors: LIEBERMANN, RAANAN
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72403User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/247Telephone sets including user guidance or feature selection means facilitating their use
    • H04M1/2474Telephone terminals specially adapted for disabled people
    • H04M1/2475Telephone terminals specially adapted for disabled people for a hearing impaired user
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/57Arrangements for indicating or recording the number of the calling subscriber at the called subscriber's set

Landscapes

  • Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Signal Processing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Acoustics & Sound (AREA)
  • Otolaryngology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Processing Or Creating Images (AREA)

Abstract

An electronic communications system for the deaf includes a video apparatus for observing and digitizing the facial, body and hand and finger signing motions of a deaf person, an electronic translator for translating the digitized signing motions into words and phrases, and an electronic output for the words and phrases. The video apparatus desirably includes both a video camera and a video display which will display signing motions provided by translating spoken words of a hearing person into digitized images. The system may function as a translator by outputting the translated words and phrases as synthetic speech at the deaf person's location for another person at that location, and that person's speech may be picked up, translated, and displayed as signing motions on a display in the video apparatus.

Description

CROSS REFERENCE TO RELATED APPLICATION
The present application is a continuation-in-part of our application Ser. No. 08/396,554 filed Mar. 1, 1995, now abandoned now U.S. Pat. No. 5,592,801.
BACKGROUND OF THE INVENTION
The present invention relates to electronic apparatus for communication by the deaf, and, more particularly, to such apparatus which enables the deaf person to communicate through use of sign language.
Deaf people are employed in almost every occupational field. They drive cars, get married, buy homes, and have children, much like everyone else. Because of many inherent communication difficulties, most deaf people are more comfortable when associating with other deaf people. They tend to marry deaf people whom they have met at schools for the deaf or through deaf clubs. Most deaf couples have hearing children who learn sign language early in life to communicate with their parents. Many deaf people tend to have special electronics and telecommunications equipment in their homes. Captioning decoders may be on their television, and electrical hook-ups may flash lights to indicate when the baby is crying, the doorbell is ringing, or the alarm clock is going off.
However, deaf persons have substantial difficulties in communicating with persons at remote locations. One technique which is employed utilizes a teletype machine for use by the deaf person to transmit his message and also to receive messages, and the person with whom the deaf person is communicating also has such teletype machine so that there is an effective connection directly between them. In another method, the deaf person utilizes a teletype machine, but the person who is communicating with the deaf person is in contact with a communications center where a person reads the transmission to the hearing person over the telephone and receives the telephone message from the hearing person and transmits that information on the teletype machine to the deaf person. Obviously, this teletype based system is limited and requires the deaf person to be able to manipulate a teletype machine and to understand effectively the written information which he or she receives on the teletype machine. Processing rapidly received written information is not always effective with those who have been profoundly deaf for extended periods of time. Moreover, a system based upon such teletype transmissions is generally relatively slow.
The widespread availability of personal computers and modems, has enabled direct communication with and between deaf persons having such computers. However, it is still required that the deaf person be able to type effectively and to readily comprehend the written message being received.
Deaf persons generally are well schooled in the use of finger and hand signing to express themselves, and this signing may be coupled with facial expression and/or body motion to modify the words and phrases which are being signed by the hands and to convey emotion. As used herein, “signing motions” include finger and hand motions, body motions, and facial motions and expressions to convey emotions or to modify expressions generated by finger and hand motions. A written message being received on a teletype machine or computer may not convey any emotional content that may have been present in the voice of the person conveying the message.
Profoundly deaf people communicate among themselves by this sign language on a face to face basis, and utilize a Tele-Typewriter (TTY) for telephone communication. The TTY itself leaves much to be desired, since their sign language is a modified syntax of the spoken language, resulting in a smaller vocabulary and lessened ease of reading printed text as a whole (e.g. definite and indefinite articles [“the”, “a”, “an”] are omitted most of the time and possessives and plurals are not usually distinguished.
When it comes to communication of profoundly deaf persons and normally hearing persons, the problem intensifies. Only a negligible percentage of the non-deaf population is versed in sign language. Thus, some deaf people read lips and utter words similar enough in their vocal resemblance to enable them to be understood. Beyond this tedious and taxing effort, there is virtually no form for such communication except exchanging some written notes or having an interpreter involved.
A number of methods as to how to achieve sign recognition have been proposed in the literature. However, in spite of the apparent detail of such articles, they do not go beyond general suggestions, which fail when tested against the development of enabling technology. Major problems have been impeding the success of such enabling technology.
The Kurokawa et al article entitled “Bi-Directional Transmission Between Sign Language And Japanese For Communication With Deaf-Mute People” Proceedings of the 5th International Conference on Human Computer Interaction, 2, 1109 (1993) described how limited recognition can be achieved of static gestures utilizing electromechanical gloves which are sensor based and Kurokawa digitizes the electromechanical output of sensors. Capturing images with a camera is a well known art, but interpreting such images in a consistent way without relying on the human brain for direct interpretation (i.e., machine interpreted images) has alluded researches. The Rogers article entitled “Proceedings SPIE-The International Society For Optical Engineering: Applications of Artificial Neural Networks”, IV, 589 (1993), suggests various approaches which cannot work when tested in a real life situation, such as utilizing infrared for signal interpretation. Unfortunately, one cannot combine the technology of Rogers and Kurokawa to solve the problem because the technologies employed are mutually exclusive. If one uses images as Rogers proposes, one cannot obtain from them the information provided by the sensors of the data gloves of Kurokawa; if one uses Kurokawa's gloves, one cannot utilize the camera images to provide any intelligence, knowledge or information beyond what the sensors in the DataGloves provide. Therefore, a fresh approach to the problem is necessary.
Displaying signed motions presents another challenge. A simple database of all possible signed motions which is an intuitive approach is rather problematic. To create a lucid signing stream, one needs a smooth movement from one word or phrase to another. Otherwise, the signing is jerky at best if not totally unintelligible. Although there may have been suggestions for such a database of signing images, this is not a realistic resolution due to the fact that, for every signed image in the database, one will need to have an enormous amount of connecting movements to other potential gestures, increasing dramatically the size of the database. To select a signing stream, inclusive of all the proper intermediary connecting gestures between previous and current images needed for lucid signing presentation, from such an enormous database puts search algorithms to an unrealistic challenge.
Attempts have also been made to transmit digitized signing motions to a central station as disclosed in Jean-Francois Abramatic et al, U.S. Pat. No. 4,546,383. Even when images are transmitted as proposed by Abramatic et al, the edge detection performed fails to enunciate detail of overlapping hands, or to differentiate between finger spelling and signed motions. All such attempts are restricted by available bandwidth which curtails wide use of such methods.
It is an object of the present invention to provide a novel electronic communication system for use by deaf persons to enable them to communicate by signing.
It is also an object to provide such an electronic communication system wherein the deaf person and the person communicating with the deaf person do so through a central facility containing a translating means for processing elements of digitized image data.
Another object is to provide such a system in which a hearing person may have his speech converted into digitized signing motions which are displayed to the deaf person.
A further object is to provide a unique method utilizing such an electronic communication system to enable communication by and to deaf persons.
SUMMARY OF THE INVENTION
It has now been found that the foregoing and related objects may be readily attained in an electronic communications system for the deaf comprising a video apparatus for observing and digitizing the signing motions, and means for translating the digitized motions into words and phrases. Also included are means for outputting the words and phrases in a comprehensible form to another hearing person, generally as artificial speech.
In a telephone type system, the other person is at a remote location, although the system may also be used as a translator for communication with a person in the immediate vicinity. Generally, the video apparatus is a video camera.
From cost and portability standpoints, the translating means is at a remote location or central station and there is included transmission means for transmitting the digitized signing motions or their digital identifiers to the translating means.
In addition to use of a database of words and phrases corresponding to digitized motions, the translating means also includes artificial intelligence for interpreting and converting the translated motions into words and phrases and into coherent sentences.
The outputting means may convert the coherent sentences into synthetic speech or present the words and phrases in written form.
To enable communication of the deaf person, the system includes means for the other or hearing person to transmit words and phrases. The translating means is effective to translate said words and phrases into digitized signing motions, and the video apparatus includes a display screen which provides an output of the digitized signing motion on the display screen for viewing by the deaf person.
There is included means for translating speech into digital data representing words and phrases and such digital data into digitized signing motions. Desirably, the video apparatus includes a display screen to provide an output of the digitized motions as signing motions on the display screen for viewing by the deaf person. The video apparatus also includes a microphone and speaker whereby a deaf person may communicate with another person in the immediate vicinity.
BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 is a schematic presentation of the steps performed in an electronic communication system embodying the present invention;
FIG. 2 is a schematic representation of a method for connecting an incoming call on the deaf person's telephone to a processing center providing the computer software for the translating functions of the present invention;
FIG. 3 is a schematic representation of the functions when utilizing such a processing center;
FIG. 4 is a schematic presentation of the several steps in the intervention and operation of the processing center when a call is received by the deaf person's telephone;
FIGS. 5a-5c are perspective views of a deaf person's receiver/transmitter installation embodying the present invention in three different forms using a personal computer and video camera, using a television set with a video camera, and as a public telephone kiosk;
FIG. 6 is a perspective view of the present invention in the form of a cellular telephone;
FIG. 7 is a schematic representation of artificial intelligence used to determine and translate the emotional content in the speech of a hearing person communicating with a deaf person;
FIG. 8 is a diagrammatic representation of the manner in which the screen of a display unit may be divided into sections presenting elements of information in addition to signing motions;
FIG. 9 is a schematic representation of the modules of the artificial intelligence for converting signing into speech;
FIG. 10 is a schematic representation of the modules for creating multiple neural networks and collecting the necessary examples for training these networks;
FIG. 11 is a schematic representation of the modules for controlling the conversion of text to signing animation;
FIG. 12 is a schematic representation of the modules for capturing and compressing the images to be used during the playback of sign language animation;
FIGS. 13 illustrates a user of the device wearing special gloves to enhance the ability of the system to identify the signing of the deaf person;
FIGS. 14a-14d illustrate the manner in which the unique shape of the glove makes it possible to recognize the differences between two very similar signs;
FIG. 15 is a schematic representation of the steps to effect translation of English text to American Sign Language (ASL); and
FIG. 16 is a schematic representation of the steps to effect translation of American Sign Language to English text.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT
Turning first to FIG. 1 of the attached drawings, therein illustrated schematically is an electronic communications system embodying the present invention.
Generally, the deaf person uses sign language in front of a device containing a video camera. The images captured by the camera at 20-30 frames/second are processed by a digital device which does initial and extended image processing. In the processing, each of the frames containing a captured image undergoes a process whereby the image is transformed into manageable identifiers. It is the set of identifiers, in the form of tables of numbers, that travels the normal telephone lines to the central processing facility (i.e., the Center). These identifiers, and not the images themselves, are then correlated with a database of vocabulary and grammar by using artificial intelligence at the Center. Subsequently, syntax rebuilding occurs, again utilizing artificial intelligence, resulting in a complete verbal text which is equivalent to the signed language content. The text then undergoes a text-to-synthesized-speech transformation and the speech is sent as an analog signal to any ordinary telephone utilized by a hearing person by existing copper or fiberoptic telephone lines. Part of the artificial intelligence referred to above consists of neural networks which are trained for these specific applications.
On the other end of the telephone line, the normally hearing person talks on his or her conventional telephone in the normal and regular way of spoken language. His or her voice is carried on line (in whatever method of transport is utilized by the telephone carrier) to the Center where speech recognition algorithms convert the spoken word to text. The Center will accommodate appropriate speech recognition (i.e., automatic, continuous and speaker independent). The recognized speech is then transformed into its equivalent signing content vocabulary and then into text. The text is sent via the telephone lines to the device used by the deaf person and converted to signing animation. Depending upon the transmission line and computer capability of the deaf person's location, the text may be sent as reduced identifiers which are converted into animated images by the deaf person's computer or as completely formatted animated images. The sign images then appear on the screen of a monitor viewed by the deaf person, resulting in a continuous dynamic set of animated sign language motions which portray the content of the spoken language uttered as speech by the normally hearing person.
In view of the computer processing requirements, a preferred form of the present invention includes a processing center containing the sophisticated computer equipment, databases and neural networks to effect the signing/verbal translations, and the communications are conducted through this center. As seen in FIG. 2, a caller (or receiver) and deaf person are actually communicating through such a center. The method of employment of the center is illustrated in FIG. 3 wherein the center receives the input from the video device of the deaf person and provides an audible output to the hearing person who is using a telephone. The hearing person speaks into the telephone and the center provides a video output to the video device of the deaf person.
To avoid excessive costs for a hearing caller, the telephone installation of the deaf person receiving a call may automatically call the center and switch the incoming call to a routing through the center as is illustrated in FIG. 4.
In FIG. 5a, the deaf person's station comprises a personal computer 30 including the monitor 32 and a video camera 34. In FIG. 5b, a computer unit 36 and a video camera 38 is utilized on top of a standard television set 40 so as to be at hand level. In FIG. 5c, a public kiosk 42 has built into it, a video camera 44, a video monitor 46, and lamps 48 to ensure adequate lighting of the user's hands, face and body. To place the call, there is a keypad 50, and a credit card reader may be combined therewith.
A portable transmitter/receiver generally designated by the numeral 8 for use by a deaf person is shown in FIG. 6 and it contains a video camera, the lens 10 of which is disposed in the upright portion 12. In the base portion 13 are an LCD display panel 14 and a key pad 16 for dialing and other functions. Also seen is an antenna 18 for the device so that it may be transported and communicate as a wireless remote or through a cellular telephone network. The device is supported in a stable position and the deaf person is positioned so that the camera lens 10 will record the signing movement of the hands and fingers and body and facial motions and expressions. The signing motions captured by the camera are converted into digital data for processing by the translation software, (i.e., artificial intelligence) to produce data representing numbers, words and phrases which are then combined into coherent sentences. As previously indicated, such translation is most economically effected in a dedicated central computer facility. The translated message is then conveyed to the “listener” in either verbal or written form.
The other party may speak into a telephone receiver (not shown) and the verbal expressions are translated by the artificial intelligence into digital data for signs. These signs are displayed on the LCD panel 14.
Since the emotional content of the speech of the other party is not conveyed by signs, the artificial intelligence in the system may provide an analysis of the emotional content of the speech and convey this to the LCD display panel as a separate output. Indicative of the functions of the artificial intelligence software for doing so is the diagrammatic presentation in FIG. 7.
This is portrayed to the deaf either as a separate image in a corner of the screen which he or she is watching or incorporated into facial expressions of animated signing figures.
Turning next to FIG. 8, therein illustrated is a layout for the visual display to present multiple information to the deaf person such as touchless function buttons, system status indicators, alarms, a printed translation, and a playback of the image being recorded, as well as the signing images and text of the hearing person's responses.
FIGS. 9-12 are schematics of the system software modules for converting signing to speech and speech to animation, including system training methods.
The overall operation of a preferred electronic communications system is set forth hereinafter.
The deaf person uses sign language in front of the transmitter/receiver device containing the camera. The images captured by the camera are of the finger and hand motions and of body motions and of facial expressions and motions captured by a digital device which does initial processing. In the initial processing, each of the frames containing a captured image undergoes a process whereby the image is collapsed into a small set of fixed identifiers. At the end of the initial processing, the resulting information is sent as data on a regular and designated phone line using an internal modem in the device to the data processing center.
The rest of the processing is completed at the center. This includes identification of the letters, numbers and words, conversion to standard sign language, and the conversion to spoken language which results in the equivalent text of the signed content. The text then undergoes a text to synthesized speech transformation and the speech is sent as an analog content to the normally hearing person. The voice content may leave the center as data if packet switching (64 kb or 56 Kb service) is utilized directly from the center. Processing in the center utilizes artificial intelligence such as neural networks trained for the specific applications of the device.
The normally hearing person who calls a deaf person dials the deaf person's phone number. However, at the deaf person's station, his or her call is connected to the center on a single line which is the deaf person's designated line to the center. The deaf person's device arranges for switching and enables both the caller and his or her station to be on line as a “party call”. The deaf person's station also arranges for the simultaneous transmission of both voice and data on the dedicated line. Thus, the line between the normally hearing person and the deaf person is analog for voice content only, while the line between the deaf person (and now the normally hearing person too) is analog but transfers both voice and data.
The normally hearing person's voice undergoes speech recognition in the center and is transformed into the equivalent signing content and then into textual material. The text is sent from the center to the deaf person's device via telephone lines. Software in the device converts the text into reduced identifying pointers for each gesture, which are then converted into animated images which portray in sign language the content of the speech processed in the center.
In a cellular phone, the operation is much the same in its operation as the hard wired telephone. The camera in the cellular phone transmits the image for initial processing in the cellular phone. From there the reduced data is transmitted to the center for processing. The same switching occurs here as well, and voice/data is sent to the center on the dedicated line assigned for the deaf person. However, in this case the cellular phone maintains two cellular connections on line, one to the center (voice/data) and one to the caller. The deaf person sees the content of the call to him by viewing the display LCD on his cellular phone unit.
When the phone for the deaf is equipped with a microphone and a speaker instead of, or in addition to a second telephone channel, it may be turned into a communicator. Obviously, one can opt to have both of these options to double the usefulness of the device. The communicator enables the deaf person to conduct a “conversation” with any normally hearing person in the close proximity. The signing motion of the deaf person are processed by the center and is transmitted back to the device as a normal voice transmission which the speaker renders as speech to the normally hearing person. His or her speech in turn, is picked up by the microphone and sent to the center for processing. The result is an animated content on the LCD of the communicator which portrays in sign language the spoken content of the normally hearing person.
The modules for the software effect translation of the signing into and from digital text are set forth in FIGS. 9 and 10 and those to recognize animation are set forth in FIGS. 11 and 12. Software presently used for this purpose is appended hereto and is utilized with Borland C++.
A person engaging in the development of other software should consider the following with respect to figure tracking:
  • A. The groups listed below are captured in their separate forms, then added to integrated forms. The integrated forms are then integrated into a single observable signing (i.e. our normalized signing with a single camera), while location information are kept in a separate log. The separate log can have various usages which may not be in their entirety related to signing on the phone. Such can be the case of activating an ATM machine or food billboard in a drive-in situation.
  • a. Definitions:
    • L(h):=Left hand
    • L(a):=Left arm
    • R(h):=Right hand
    • R(a):=Right arm
    • L(H):=Left side of the head
    • R(H):=Right side of the head
    • L(T):=Left side of torso
    • R(T):=Right side of torso
    • L(T):=Left side of torso
    • R(f):=Right femur
    • L(f):=Left femur
    • R(t):=Right tibia
    • L(t):=Left tibia
  • B. Section addition with recognition takes place:
  • b.1.A=L(h)+L(a)
    • B=R(h)+R(a)
    • C=L(H)+R(H)
    • D=L(T)+R(T)
    • E=L(t)+L(f)
    • G=R(t)+R(f)
  • c. Signing content (Sc):
    • S=A+B
  • d. Emotional content (Ec):
    • Ec=C+D
  • e. Pointing and activation (PA):
    • PA=A+B
  • f. Location in space (Ls):
    • Ls=E+G+(C+D+A+B)
In seeking to have the software recognize emotional content in the signing or in the speech, the following should be considered:
Our emotional content is divided into two separate segments:
  • A. The hearing person segment
  • B. The hearing challenged segment
    • A. The hearing person segment.
In this segment we analyze in the speech four distinct elements.
  • A.1. Changes in various speech output elements.
  • A.2. Duration of changes recognized in A.1.
  • A.3. Frequency of the changes appearing in A.1.
  • A.4. Frequency of the duration of changes appearing in A.2.
The elements that are analyzed by A.1., through A.4. are:
    • a. Pitch
    • b. Volume
    • c. Non words elements for which the system is trained (g.g., gasps of air, emitting the word “ah, chuckle, crying, etc.)
    • d. Repetitions of words and/or word parts (indicating stuttering).
  • B. The hearing challenged person segment.
This segment analyzes combination of intrafacial positions, where the system utilizes the training similar to signing, but with different attributes and meanings.
  • a. Definitions and variables status;
    • U(I):=Upper lip [showing=1, not showing=0]
    • LL(1):=Lower lip [showing=1, not showing=0] (m):=Left part of mouth [compressed=1, uncompressed=0]
    • R(m):=Right part of mouth [compressed=1, uncompressed=0]
    • M( ):=Complete mouth as a unit [Opened wide=1, closed=0;
      • compressed and drawn in=4;
      • compressed and downward=5;
      • stretched flat=6;
      • opened with teeth showing=7]
    • U(t):=Upper front teeth [showing=1; not showing=0]
    • LL(t):=Lower front teeth [showing=1; not showing=0]
    • t():=Frontal teeth as a whole [shown=1; not shown=0]
    • R(n):=Right nostril [expanded=1; unexpanded=0]
    • L(n):=Left nostril [expanded=1; unexpanded=0]
    • L(cb):=Left cheek bone [raised=1; unraised=0]
    • R(cb):=Right cheek bone [raised=1; unraised=0]
    • LO(e):=Left Open eye as a whole [distance above pupil=1; no distance above pupil=0]
    • RO(e):=Right Open eye as a whole [distance above pupil=1; no distance above pupil=0]
    • LC(e):=Left closed eye
    • RC(e):=Right closed eye
    • LN(e):=Left eye narrowed
    • RN(e):=Right eye narrowed
    • R(b):=Right eye brow [raised=1; not raised=0]
    • L(b):=Left eye brow [raised=1; not raised=0]
    • N(b):=Nose bridge [two states: compressed=1; uncompressed=0]
    • F(f):=Frontal forehead [compressed=1; uncompressed=0]
In addition to the emotional content variable Ec, we analyze various combinations as they pertain to emotional expressions of a cultural group. For example:
    • The state of (i.e., showing of) to=1 and n(b)=1
    • means “anger”.
Computer software for speech recognition and conversion to digital data presently exists and may be modified and enhanced for use in the communications system. Exemplary of such software is that of International Business Machines designated “IBM Continuous Speech Recognition Program”. Similarly, commercial software may be used to convert digital data into artificial speech.
Because commercial speech recognition software is not completely accurate, it may be desirable to develop a corrective addon to increase the accuracy as set forth hereinafter:
Algorithmic Steps
  • a. Duplicate each incoming analog stream to provide two segments:
    • 1. An untouched segment (Segment A).
    • 2. A processed segment (Segment B).
  • b. Tag each segment with respect to position in the incoming stream.
  • c. Each segment (Segment A) can have variable length.
  • d. Digitize incoming analog stream.
  • e. operate speech recognition kernel on Segment B.
    • e.1. Speech recognition kernel.
    • e.2. Spell checker for word.
    • e.3. Grammatic checks.
    • e.4. If recognized and proper tag as Ra
    • If unrecognized or improper tag as Ua
  • f. Tag each fully (i.e., 100%) recognized word as to its position in Segment B.
  • g. Deduct the recognized words of Segment B in their appropriate position in Segment B from Segment A. The result is Segment C.
    • g.1. Segment C is tagged to identify its position in Segment A (Position 1).
  • h. Segment C is inserted into a prepared digitized speech section (which contains a message to the speech originator)
  • i. Digital to Analog conversion takes place.
  • j. The resulting analog speech segment is sent to the speech originator.
  • k. Return from speech originator is digitized (Segment D).
  • l. Segment D is inserted in position 1 in Segment A.
  • m. Segment A is declared 100% recognized segment and is moved to signing dispatch.
    Corrective Measures
Corrective measures fall into the following.
  • A. Topic Assisted/using Trap words
  • B. Intermediary Agent Assisted
  • C. Speaker Assisted.
  • D. Spell Checker assistance.
  • E. Grammatic Assistance.
    • A. Topic Assisted
  • 1. Invoking the most common nine words to decide:
    • 1.a. Accent/Country/Location
    • 1.b. Channel to subgroup section [divided into geographic and demographic (cultural) groups
  • 2. Invoke Trap words to locate area of discussion
  • 3. Utilize B-tree [C++,V4+] for list of words possibly matching word in question.
    First Level of Assistance
  • 1. This level utilizes trap words in order to determine personal speech patterns.
  • 2. Big Nine words are evaluated in 4 tiers: Word [i.j.k.l] i=1, . . . ,n; n=n(a)+n(b) where n(a)=6, and n(b)=6.
Values of n(a) or n(b) can be modified per specific situation.
    • i determines the group most appropriate to determine any of the nine words.
    • S=Total number of words S = 9 i = 1 Word [ i ] = 9
      Second Level of Assistance
  • 1. This level traps words to determine area of discussion.
    • j=1, . . . ,10 i.e. Ten words for each area of concentration
    • k=1, . . . ,12 i.e. Twelve areas of concentration S ( j , k ) = 10 j = 1 12 K = 1 Word [ j , k ] = 120
      Third Level of Assistance
  • 1. This level compares unrecognized words against groups of 20 words describing each of the 12 areas. S ( i , j , k , l ) = 9 i = 1 10 j = 1 12 K = 1 20 L = 1 Word [ i , j , k , l ] = 9 × 10 · 12 · 20 = 20.600 words
If the signer uses American Sign Language, there is a need to effect linguistic analysis beyond what was recognized by William Stokoe in Semantics and Human Sign Language, Mouton (1971), and Sign Language Structure, Linstok Press (1978).
ASL is a visual-spatial language requiring simultaneous, multiple, dynamic articulations. At any particular instant, one has to combine information about the handshape (Stokoe's dez), the motion (Stokoe's sig) and the spatial location of the hands relative to the rest of the body (Stokoe's tab). Supplementing such information and by dynamically articulating a word or a meaning, are grammatical cues provided in context and requiring attention to detail.
Repetition of words indicates plurality, vibrations signify intensity, and relative spatial distance between cooperating hands specifies magnitude. Further grammatical delineation is contributed by facial expressions. Some of the facial cues are intuitive to human emotions and simplify such correlation. For example, the eyebrows when raised indicate surprise but when drawn down in a frown like manner signify negation or suspicion. Other facial expressions have no such immediate and intuitive affect. Such as the case of utilizing tongue position. A protruding tongue synchronized with the sign “late” turns the meaning into “not yet”.
Isolated grammatical similarities exist between the two languages, although their utilization in translation differs. Utilizing a number system with its siblings of ordinal numbers, age, or time as well as compounds are examples of such similarities.
Translation of compound words in a spoken language is benefited by its written presentation as a single unit, or when spoken, presentation in a continuous utterance, guarantees a unique interpretation which begets a correct translation. “Homework”, “businessman”, “classroom”, “babysitter” are all in daily usage as independent words.
Compounds in ASL are no different than their spoken counterparts, albeit the fact that no manual dexterity is required in rapid concatenation of the components. However, in the absence of external cues accorded the spoken compound in its rapid utterance, a machine translation of ASL compound word requires a resolving algorithm.
Other routines are mandatory for quality translation involving ASL. For example, word order in the context of a spoken language should be observed. It is set by rules which are consistently applied as a way to achieve unambiguous meaning. Such a strict rule set does not exist in ASL. However, the appearance that ASL is more lax and forgiving in its scrutiny for order and thus leading to ambiguity in the resulting meaning is misleading. There are rules in ASL for breaking the rules. In fact, a particular word order rule is a corollary of a prevailing situation conveyed by the signer. Hence, there is a rule for selecting the rule of a particular word order, which together employ supplemental meaning to the sentence, while enabling a shorter exposition. The economy of exposition achieved contributes to a more efficient communication for the signing parties. Subtle but clear message is conveyed by such order. Sentences with classifiers indicating locations appear with the order of Object, Subject, Verb, while Subject preceding Object which precedes Verb singularly indicates inflected verbs. Translation algorithms which treat even the most subtle of ASL idiosyncrasies as rules, emanated from and borne out of a need to improve efficient and economic communication will attain a higher level of comprehensive quality.
The software in FIGS. 15 and 16 handles various translation issues which need resolution before an acceptable translation can follow. Issues or word order in ASL, such as the word order just discussed, are germane to the language itself.
Cultural issues require attention right from the outset. The ASL finger spelled letter “T” viewed in Europe, or ASL signs spatially located relative to the person's midsection viewed in China, will be locally construed a pejorative. Hence, identification of the expression in the context of the intended recipient, may cause the format of delivery to undergo an appropriate substitution. Therefore, the algorithms as related to telephone communication, try to identify the recipient's cultural base or geography prior to dispatch, so that the algorithmic routines for appropriate adjustments can be invoked.
Notwithstanding such efforts, the advanced group of algorithms is far from being comprehensive, and represents only the first step in a much deserving subject. FIG. 15 shows the essential components of an English to ASL translation algorithm, while FIG. 16 shown the ASL to English translation algorithm.
As will be appreciated, there is a substantial problem in effectuating real time transmission of the data as to images because of the need for compression even after discarding superfluous information. If we consider a video camera with 640 horizontal pixels and 480 lines, this means that a single frame amounts to 307,200 Bytes or 2.4576 Mbits. When considering a real time operation of 30-frames/sec, this would require 73.728 Mbits/Sec. Obviously, a bottleneck will result in the transfer to and from any acceptable storage media. Furthermore, to utilize telephone lines in a meaningful way, such as at 56 kilobits/second or even at 64 kilobits/second, it would take close to 20 minutes to transfer one second of video data. Using compression would mean a compression rate of over 1,000:1. Even resorting to compressing the data by utilizing wavelets, the level of resulting quality would be questionable. The other alternative is typically to transmit fewer frames per second, but this is an unacceptable method as it results in jerky motions and becomes difficult to interpret visual signing gestures.
In the present invention, the preferred approach is to avoid the conventional approach of trying to force some compression scheme on the data, and instead bring the data down from the frame level to a Reduced Data Set (RDS).
It will be appreciated that another significant aspect of the invention is the requirement that finger spelling be captured by the camera, undergo the RDS process, and still be recognized once artificial intelligence procedures are invoked. This task can be difficult because the frame grabber has to capture the signed gesture against the ambient surroundings, other body parts of the signing person, and clothes. Preferably, the system uses special gloves which allow discrimination of the hands from the background for the image processing system.
Turning now to FIGS. 13 and 14, therein illustrated is the benefit in using special gloves to enhance the ability of the system to recognize important detail of the hand shapes during the actual gesturing of sign language. Many times the hands are overlapping or touching each other. Video separation of left from right is accomplished by color separation using different saturated colors for each hand. For example, the fingers of the right hand can be distinctly green and the fingers of the left hand are distinctly blue. In addition, each glove has a third color (typically red) for left and right palm areas. This allows hand shape and finger details to be seen whenever the hand is closed vs. opened and when palm is disposed toward the camera vs. palm away.
The same type of RDS is utilized in recreating images, frame by frame, in real time, which will be displayed on the deaf person's monitor. These images will appear as smooth, continuous animation which will be easy to recognize. This is because the recreation of this animation is a result of actual frame by frame information which has been captured from a live subject and put into memory. The RDS takes up minimal memory and yet is completely on demand, interactive, and operates at real time speed.
At the end of the speech recognition, from the hearing persons' voice and text building procedure, the various words will be assembled into their counterpart animated signing gestures, starting with the table of data generated from the text that was transmitted from the center doing the frame by frame recreation for each gesture, employing special algorithms for transitional frames between gestures and then displaying them in sequence on the deaf persons' monitor.
The illustrated embodiments all utilize a single video cameras. It may be desirable to utilize more than one camera to allow the signing person “free” movement in his or her environment to track down spatial positions in that environment.
In such a case, the installation should follow the following criteria:
    • 1. Each camera is covering a separate angle.
    • 2. Each camera operates independently of the other(s).
    • 3. Angle overlap may or may not be permitted according to the pre-signing calibration.
    • 4. Integration of input from multiple camera is performed
    • 5. A defined figure with signing motions (where applicable) is rendered in conformity with allowable images (for persons). The same technique is useful in defining any objects or, alive, stationary or moving entities, such as animals.
    • 6. Movements without signing are classified as null figures (coordinates are preserved).
    • 7. The animated form of the signing figure can be shown in an “abbreviated” form when the person is not signing. That is, a figure not well defined with specific locations of fingers, etc. Such animated figures an occur for all null figures.
Recently, three dimensional video cameras have been developed. The use of such devices may facilitate recognition of signing motions by enhancing spatial differences.
Thus, it can be seen that the electronic communications system of the present invention provides an effective means for translating signing motions to speech or text for a hearing party using only a normal telephone at the hearing party's end of the line, and for translating speech to signing motions which are conveyed to the deaf party. The system may function as a telephone for the deaf, or as an on-site translator.

Claims (43)

1. An electronic communications system for the deaf comprising:
(a) a video apparatus for visually observing the images of facial and hand and finger signing motions of a deaf person and converting the observed signing motions into digital identifiers;
(b) means for translating said digital identifiers of said observed signing motions into words and phrases;
(c) means for outputting said words and phrases generated by the visual observation of said signing motions in a comprehensible form to another person;
(d) a receiver for receiving spoken words and phrases of another person and transmitting them;
(e) means for translating said spoken words and phrases into a visual form which may be observed by the a deaf person; and
(f) means for outputting said visual form of said spoken words and phrases on said video apparatus for viewing by the deaf person.
2. The electronic communications system in accordance with claim 1 wherein said another person is at a remote location.
3. The electronic communications system in accordance with claim 1 wherein said video apparatus includes a video camera and image capture and processing hardware and software.
4. The electronic communications system in accordance with claim 1 wherein said translating means is located at a central station with which said video apparatus and said receiver and outputting means are in communication.
5. The electronic communications system in accordance with claim 1 wherein said translating means also includes artificial intelligence for interpreting and converting the translated signaling motions into words and phrases and into coherent sentences.
6. The electronic communications system in accordance with claim 5 wherein said outputting means converts said coherent sentences into synthetic speech.
7. The electronic communications system in accordance with claim 1 wherein said outputting means converts said spoken words and phrases into written form.
8. The electronic communications system in accordance with claim 1 wherein said video apparatus includes a display screen.
9. The electronic communications system in accordance with claim 8 wherein said video apparatus provides an output of said spoken words and phrases as signing motions on said display screen for viewing by the deaf person.
10. The electronic communications system in accordance with claim 1 wherein said video apparatus includes a display screen to provide an output of said spoken words and phrases as signing motions on said display screen for viewing by the deaf person, and wherein said video apparatus includes a microphone and speaker whereby a deaf person may communicate with another person in the immediate vicinity.
11. The electronic communications system in accordance with claim 10 wherein said translating means is located at a central station with which said video apparatus and said receiver and outputting means are in communication.
12. In a method for electronic communication for the deaf comprising:
(a) visually observing the images of facial and hand and finger signing motions of a deaf person and converting the observed signing motions into digital identifiers;
(b) translating said digital identifiers of said observed signing motions into words and phrases;
(c) outputting said words and phrases in a comprehensible form to another person;
(d) receiving speech from said another person;
(e) translating said speech of said another person into signing motions; and
(f) displaying said signing motions representing said speech to said a deaf person.
13. The electronic communications method in accordance with claim 12 wherein said another person is at a remote location.
14. The electronic communication method in accordance with claim 13 wherein said step of outputting at a remote location is effected by transmission of said translated words and phrases to a communications device receiver at said remote location.
15. The electronic communication method in accordance with claim 12 wherein said step of observing and converting the signing motions is effected by a video camera.
16. The electronic communication method in accordance with claim 12 including the step of transmitting said digital identifiers of said motions and said speech electronically to a central station where said translating steps are performed.
17. The electronic communication method in accordance with claim 12 wherein said outputting step provides such words and phrases as synthetic speech.
18. The electronic communication method in accordance with claim 12 wherein said outputting step provides said words and phrases in written form to said another person.
19. The electronic communication method in accordance with claim 12 wherein said displaying step provides said words and phrases in written form.
20. The electronic communication method in accordance with claim 12 wherein said translating step utilizes artificial intelligence.
21. The electronic communication method and software in accordance with claim 20 wherein said intelligence is developed with the use of multiple neural networks automatically created and assigned by gesture type.
22. The electronic communication method in accordance with claim 12 wherein said another person and said displaying step are at the same location as said deaf person and wherein said visually observing and converting step utilizes a video apparatus.
23. The electronic communication method in accordance with claim 22 wherein said receiver and outputting steps are conducted by components of an installation including said video apparatus.
24. The electronic communication method in accordance with claim 22 wherein said translating steps are conducted at a remote center.
25. The electronic communication method in accordance with claim 12 wherein said translating steps are conducted at a remote center.
26. An electronic communications communication system for the deaf comprising:
(a) a video apparatus for visually observing the images of facial and hand and finger signing motions of a deaf person and converting the observed signing motions into digital identifiers;
(b) means for translating said digital identifiers of said observed signing motions into words and phrases;
(c) means for outputting said words and phrases generated by the visual observations of said signing motions in a comprehensible form to another person;
(d) a receiver for receiving spoken words and phrases of another person and transmitting them;
(e) means for translating said spoken words and phrases into signing motions which may be observed by the a deaf person; and
(f) means for outputting said signing motions on said video apparatus for viewing by the deaf person, said translating means being located at a central station with which said video apparatus and receiver are in communication.
27. An electronic communications communication system for the deaf in accordance with claim 26 wherein said another person is at a remote location.
28. An electronic communications communication system for the deaf in accordance with claim 26 wherein said video apparatus includes a video camera and image capture and processing hardware and software.
29. An electronic communications communication system for the deaf in accordance with claim 26 wherein said translating means also includes artificial intelligence for interpreting and converting the translated motions into words and phrases into coherent sentences.
30. An electronic communications communication system for the deaf in accordance with claim 28 wherein said outputting means converts said coherent sentences into synthetic speech.
31. An electronic communications communication system for the deaf in accordance with claim 26 wherein said video apparatus includes a display screen.
32. An electronic communications communication system for the deaf in accordance with claim 26 wherein said video apparatus includes a display screen to provide an output of said spoken words and phrases as signing motions on said display screen for viewing by the deaf person, and wherein said video apparatus includes a microphone and speaker whereby a deaf person may communicate with another person in the immediate vicinity.
33. An electronic communications systems for the hearing impaired comprising:
a receiver for receiving spoken words and phrases;
means for translating said spoken words and phrases into a visual form which may be observed by a hearing impaired person;
said translating means including means for transforming said spoken words into equivalent signing content and then into textual material;
means for outputting said textual material for display on a device utilized by said hearing impaired person;
said device utilized by said hearing impaired person including means for receiving words and phrases from the hearing impaired person;
said transforming means converting said words and phrases from the hearing impaired person into a form which may be presented to a hearing person;
means for outputting said converted words and phrases from said hearing impaired person; and
said device utilized by said hearing impaired person comprising a personal computer which includes a monitor and which further includes a video camera for capturing facial, hand, and finger signing motions generated by said hearing impaired person.
34. An electronic communications system according to claim 33, wherein said translating means are located in a station remote from said hearing impaired person and said hearing person.
35. An electronic communications system according to claim 33, further comprising means for converting said captured signing motions into a plurality of identifiers and means for transmitting said plurality of identifiers to said translating means.
36. An electronic communications system according to claim 35, wherein said transmitting means comprises at least one telephone line.
37. An electronic communications system according to claim 35, wherein said translating means includes means for correlating said identifiers with a vocabulary and grammar database.
38. An electronic communications system according to claim 33, wherein said translating means includes artificial intelligence means for providing an analysis of the emotional content of said spoken words and wherein said system further comprises means for separately conveying said emotional content to said device utilized by said hearing impaired person.
39. An electronic communications system according to claim 33, wherein said device has means for converting textual material received from said translating means into reduced identifying pointers and for converting said reduced identifying pointers into animated images which portray in sign language the content of the spoken words and phrases.
40. An electronic communications system according to claim 33, wherein said device utilized by said hearing impaired person is located in a kiosk.
41. An electronic communications system according to claim 33, wherein said device utilized by said hearing impaired person comprises a portable transmitter/receiver.
42. An electronic communications system according to claim 33, wherein said output means comprises means for transmitting said text via telephone lines and said device used by said hearing impaired person includes means for converting said transmitted text to animated images.
43. An electronic communication system for the hearing impaired comprising:
a receiver for receiving spoken words and phrases;
means for translating said spoken words and phrases into a visual form which may be observed by a hearing impaired person;
said translating means including means for transforming said spoken words into equivalent signing content and then into textual material;
means for outputting said textual material for display on a device utilized by said hearing impaired person;
said device utilized by said hearing impaired person including means for receiving words and phrases from the hearing impaired person;
said system including a video apparatus for visually observing any images of facial and hand and finger signing motions of the hearing impaired person and converting any observed signing motions into digital identifiers;
said transforming means converting said words and phrases from the hearing impaired person into a form which may be presented to a hearing person;
said transforming means including means for translating said digital identifiers of said observed signing motions into words and phrases;
means for outputting said translated words and phrases from said hearing impaired person; and
said outputting means including means for outputting said words and phrases generated by the visual observation of said signing motions in a comprehensible form to another person.
US09/603,247 1995-03-01 2000-06-23 Telephone for the deaf and method of using same Expired - Lifetime USRE41002E1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US09/603,247 USRE41002E1 (en) 1995-03-01 2000-06-23 Telephone for the deaf and method of using same

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US39655495A 1995-03-01 1995-03-01
US08/653,732 US5982853A (en) 1995-03-01 1996-05-23 Telephone for the deaf and method of using same
US09/603,247 USRE41002E1 (en) 1995-03-01 2000-06-23 Telephone for the deaf and method of using same

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US08/653,732 Reissue US5982853A (en) 1995-03-01 1996-05-23 Telephone for the deaf and method of using same

Publications (1)

Publication Number Publication Date
USRE41002E1 true USRE41002E1 (en) 2009-11-24

Family

ID=23567693

Family Applications (2)

Application Number Title Priority Date Filing Date
US08/653,732 Ceased US5982853A (en) 1995-03-01 1996-05-23 Telephone for the deaf and method of using same
US09/603,247 Expired - Lifetime USRE41002E1 (en) 1995-03-01 2000-06-23 Telephone for the deaf and method of using same

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US08/653,732 Ceased US5982853A (en) 1995-03-01 1996-05-23 Telephone for the deaf and method of using same

Country Status (1)

Country Link
US (2) US5982853A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110116608A1 (en) * 2009-11-18 2011-05-19 Gwendolyn Simmons Method of providing two-way communication between a deaf person and a hearing person
US8489397B2 (en) * 2002-01-22 2013-07-16 At&T Intellectual Property Ii, L.P. Method and device for providing speech-to-text encoding and telephony service
US10008128B1 (en) 2016-12-02 2018-06-26 Imam Abdulrahman Bin Faisal University Systems and methodologies for assisting communications
US11817126B2 (en) 2021-04-20 2023-11-14 Micron Technology, Inc. Converting sign language

Families Citing this family (148)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8352400B2 (en) 1991-12-23 2013-01-08 Hoffberg Steven M Adaptive pattern recognition based controller apparatus and method and human-factored interface therefore
WO1997008895A1 (en) * 1995-08-30 1997-03-06 Hitachi, Ltd. Chirological telephone system
US6477239B1 (en) * 1995-08-30 2002-11-05 Hitachi, Ltd. Sign language telephone device
US6786420B1 (en) 1997-07-15 2004-09-07 Silverbrook Research Pty. Ltd. Data distribution mechanism in the form of ink dots on cards
US8209184B1 (en) * 1997-04-14 2012-06-26 At&T Intellectual Property Ii, L.P. System and method of providing generated speech via a network
US6618117B2 (en) 1997-07-12 2003-09-09 Silverbrook Research Pty Ltd Image sensing apparatus including a microcontroller
US6690419B1 (en) 1997-07-15 2004-02-10 Silverbrook Research Pty Ltd Utilising eye detection methods for image processing in a digital image camera
US6879341B1 (en) 1997-07-15 2005-04-12 Silverbrook Research Pty Ltd Digital camera system containing a VLIW vector processor
US7551201B2 (en) 1997-07-15 2009-06-23 Silverbrook Research Pty Ltd Image capture and processing device for a print on demand digital camera system
US7110024B1 (en) 1997-07-15 2006-09-19 Silverbrook Research Pty Ltd Digital camera system having motion deblurring means
US6624848B1 (en) 1997-07-15 2003-09-23 Silverbrook Research Pty Ltd Cascading image modification using multiple digital cameras incorporating image processing
GB9715516D0 (en) * 1997-07-22 1997-10-01 Orange Personal Comm Serv Ltd Data communications
US6603835B2 (en) 1997-09-08 2003-08-05 Ultratec, Inc. System for text assisted telephony
DE69830295T2 (en) * 1997-11-27 2005-10-13 Matsushita Electric Industrial Co., Ltd., Kadoma control method
US6301370B1 (en) 1998-04-13 2001-10-09 Eyematic Interfaces, Inc. Face recognition from video images
US6272231B1 (en) * 1998-11-06 2001-08-07 Eyematic Interfaces, Inc. Wavelet-based facial motion capture for avatar animation
DE69910757T2 (en) * 1998-04-13 2004-06-17 Eyematic Interfaces, Inc., Santa Monica WAVELET-BASED FACIAL MOTION DETECTION FOR AVATAR ANIMATION
US6434198B1 (en) * 1998-08-28 2002-08-13 Lucent Technologies Inc. Method for conveying TTY signals over wireless communication systems
AUPP702098A0 (en) 1998-11-09 1998-12-03 Silverbrook Research Pty Ltd Image creation method and apparatus (ART73)
EP0991011B1 (en) * 1998-09-28 2007-07-25 Matsushita Electric Industrial Co., Ltd. Method and device for segmenting hand gestures
US6714661B2 (en) 1998-11-06 2004-03-30 Nevengineering, Inc. Method and system for customizing facial feature tracking using precise landmark finding on a neutral face image
US7050655B2 (en) * 1998-11-06 2006-05-23 Nevengineering, Inc. Method for generating an animated three-dimensional video head
AUPP702198A0 (en) * 1998-11-09 1998-12-03 Silverbrook Research Pty Ltd Image creation method and apparatus (ART79)
US7050624B2 (en) * 1998-12-04 2006-05-23 Nevengineering, Inc. System and method for feature location and tracking in multiple dimensions including depth
US6453170B1 (en) * 1998-12-31 2002-09-17 Nokia Corporation Mobile station user interface, and an associated method, facilitating usage by a physically-disabled user
JP3624733B2 (en) * 1999-01-22 2005-03-02 株式会社日立製作所 Sign language mail device and sign language information processing device
US7966078B2 (en) 1999-02-01 2011-06-21 Steven Hoffberg Network media appliance system and method
AUPQ056099A0 (en) 1999-05-25 1999-06-17 Silverbrook Research Pty Ltd A method and apparatus (pprint01)
US6611804B1 (en) * 1999-06-15 2003-08-26 Telefonaktiebolaget Lm Ericsson (Publ) Universal TTY/TDD devices for robust text and data transmission via PSTN and cellular phone networks
ES2174708B1 (en) * 2000-06-08 2004-08-16 Monica Socias Gili TELECOMMUNICATIONS MACHINE SPECIALLY FOR INVIDENTS, SORDOMUDOS AND PHYSICAL DECREASES.
WO2001008393A1 (en) * 1999-07-26 2001-02-01 Socias Gili Monica Multifunction telecommunication system for public and/or private use
AU1601501A (en) * 1999-11-12 2001-06-06 William E Kirksey Method and apparatus for displaying writing and utterance of word symbols
US6377925B1 (en) 1999-12-16 2002-04-23 Interactive Solutions, Inc. Electronic translator for assisting communications
IL133797A (en) * 1999-12-29 2004-07-25 Speechview Ltd Apparatus and method for visible indication of speech
US7054819B1 (en) * 2000-02-11 2006-05-30 Microsoft Corporation Voice print access to computer resources
US7231367B1 (en) 2000-06-29 2007-06-12 Eastman Kodak Company Electronic imaging capture and billing distribution system
US7287009B1 (en) * 2000-09-14 2007-10-23 Raanan Liebermann System and a method for carrying out personal and business transactions
WO2002023389A1 (en) 2000-09-15 2002-03-21 Robert Fish Systems and methods for translating an item of information using a distal computer
US6570963B1 (en) * 2000-10-30 2003-05-27 Sprint Communications Company L.P. Call center for handling video calls from the hearing impaired
US6963839B1 (en) 2000-11-03 2005-11-08 At&T Corp. System and method of controlling sound in a multi-media communication application
US7203648B1 (en) 2000-11-03 2007-04-10 At&T Corp. Method for sending multi-media messages with customized audio
US7091976B1 (en) 2000-11-03 2006-08-15 At&T Corp. System and method of customizing animated entities for use in a multi-media communication application
US20080040227A1 (en) 2000-11-03 2008-02-14 At&T Corp. System and method of marketing using a multi-media communication system
US6990452B1 (en) 2000-11-03 2006-01-24 At&T Corp. Method for sending multi-media messages using emoticons
US6976082B1 (en) 2000-11-03 2005-12-13 At&T Corp. System and method for receiving multi-media messages
US7035803B1 (en) 2000-11-03 2006-04-25 At&T Corp. Method for sending multi-media messages using customizable background images
ES2180396B1 (en) * 2000-11-14 2004-04-16 Protocolo Omega, S.L. "DATA-VOICE" COMMUNICATION SYSTEM FOR DISABLED AUDIO AND SONO PEOPLE.
DE60133928D1 (en) * 2000-11-17 2008-06-19 Tate & Lyle Technology Ltd MELTABLE SUCRALOSE COMPOSITION
US7031922B1 (en) * 2000-11-20 2006-04-18 East Carolina University Methods and devices for enhancing fluency in persons who stutter employing visual speech gestures
US6917703B1 (en) 2001-02-28 2005-07-12 Nevengineering, Inc. Method and apparatus for image analysis of a gabor-wavelet transformed image using a neural network
US7392287B2 (en) 2001-03-27 2008-06-24 Hemisphere Ii Investment Lp Method and apparatus for sharing information using a handheld device
US20020198716A1 (en) * 2001-06-25 2002-12-26 Kurt Zimmerman System and method of improved communication
US6853379B2 (en) * 2001-08-13 2005-02-08 Vidiator Enterprises Inc. Method for mapping facial animation values to head mesh positions
US6834115B2 (en) 2001-08-13 2004-12-21 Nevengineering, Inc. Method for optimizing off-line facial feature tracking
US6876364B2 (en) 2001-08-13 2005-04-05 Vidiator Enterprises Inc. Method for mapping facial animation values to head mesh positions
US8416925B2 (en) * 2005-06-29 2013-04-09 Ultratec, Inc. Device independent text captioned telephone service
AU2002332805A1 (en) * 2001-08-31 2003-03-10 Communication Service For The Deaf Enhanced communications services for the deaf and hard of hearing
US7333507B2 (en) * 2001-08-31 2008-02-19 Philip Bravin Multi modal communications system
US7671861B1 (en) 2001-11-02 2010-03-02 At&T Intellectual Property Ii, L.P. Apparatus and method of customizing animated entities for use in a multi-media communication application
US20030119518A1 (en) * 2001-12-26 2003-06-26 Samusung Electroincs Co. Ltd. Apparatus and method for selecting TTY/TDD baudot code-capable vocoders in a wireless mobile network
US20040012643A1 (en) * 2002-07-18 2004-01-22 August Katherine G. Systems and methods for visually communicating the meaning of information to the hearing impaired
US7774194B2 (en) * 2002-08-14 2010-08-10 Raanan Liebermann Method and apparatus for seamless transition of voice and/or text into sign language
TW200405988A (en) * 2002-09-17 2004-04-16 Ginganet Corp System and method for sign language translation
TWI276357B (en) * 2002-09-17 2007-03-11 Ginganet Corp Image input apparatus for sign language talk, image input/output apparatus for sign language talk, and system for sign language translation
TW200417228A (en) * 2002-09-17 2004-09-01 Ginganet Corp Sign language image presentation apparatus, sign language image input/output apparatus, and system for sign language translation
US6760408B2 (en) * 2002-10-03 2004-07-06 Cingular Wireless, Llc Systems and methods for providing a user-friendly computing environment for the hearing impaired
GB0306875D0 (en) * 2003-03-25 2003-04-30 British Telecomm Apparatus and method for generating behavior in an object
US6975273B1 (en) * 2003-06-17 2005-12-13 Samsung Electronics Co., Ltd. Antenna with camera lens assembly for portable radiotelephone
US20130289970A1 (en) * 2003-11-19 2013-10-31 Raanan Liebermann Global Touch Language as Cross Translation Between Languages
US7707039B2 (en) 2004-02-15 2010-04-27 Exbiblio B.V. Automatic modification of web pages
US8442331B2 (en) 2004-02-15 2013-05-14 Google Inc. Capturing text from rendered documents using supplemental information
US7812860B2 (en) 2004-04-01 2010-10-12 Exbiblio B.V. Handheld device for capturing text from both a document printed on paper and a document displayed on a dynamic display device
US10635723B2 (en) 2004-02-15 2020-04-28 Google Llc Search engines and systems with handheld document data capture devices
US8515024B2 (en) 2010-01-13 2013-08-20 Ultratec, Inc. Captioned telephone service
US9143638B2 (en) 2004-04-01 2015-09-22 Google Inc. Data capture from rendered documents using handheld device
US20060098900A1 (en) 2004-09-27 2006-05-11 King Martin T Secure data gathering from rendered documents
US20060081714A1 (en) 2004-08-23 2006-04-20 King Martin T Portable scanning device
US9008447B2 (en) 2004-04-01 2015-04-14 Google Inc. Method and system for character recognition
US7894670B2 (en) 2004-04-01 2011-02-22 Exbiblio B.V. Triggering actions in response to optically or acoustically capturing keywords from a rendered document
US9116890B2 (en) 2004-04-01 2015-08-25 Google Inc. Triggering actions in response to optically or acoustically capturing keywords from a rendered document
US7990556B2 (en) 2004-12-03 2011-08-02 Google Inc. Association of a portable scanner with input/output and storage devices
US8081849B2 (en) 2004-12-03 2011-12-20 Google Inc. Portable scanning and memory device
US8146156B2 (en) 2004-04-01 2012-03-27 Google Inc. Archive of text captures from rendered documents
US8713418B2 (en) 2004-04-12 2014-04-29 Google Inc. Adding value to a rendered document
US8874504B2 (en) 2004-12-03 2014-10-28 Google Inc. Processing techniques for visual capture data from a rendered document
US8489624B2 (en) 2004-05-17 2013-07-16 Google, Inc. Processing techniques for text capture from a rendered document
US8620083B2 (en) 2004-12-03 2013-12-31 Google Inc. Method and system for character recognition
US7206386B2 (en) * 2004-04-23 2007-04-17 Sorenson Communications, Inc. Method and system for electronic communication with the hearing impaired
US7016479B2 (en) * 2004-04-23 2006-03-21 Sorenson Communications, Inc. Method and system for call restoration in a video relay service
US7583286B2 (en) * 2004-04-23 2009-09-01 Sorenson Media, Inc. System and method for collection and redistribution of video conferences
US7702506B2 (en) * 2004-05-12 2010-04-20 Takashi Yoshimine Conversation assisting device and conversation assisting method
US20050281395A1 (en) * 2004-06-16 2005-12-22 Brainoxygen, Inc. Methods and apparatus for an interactive audio learning system
US8229082B2 (en) * 2004-06-17 2012-07-24 International Business Machines Corporation Awareness and negotiation of preferences for improved messaging
US8346620B2 (en) 2004-07-19 2013-01-01 Google Inc. Automatic modification of web pages
US7519670B2 (en) * 2004-08-12 2009-04-14 International Business Machines Corporation Method for disappearing ink for text messaging
BRPI0502931A (en) * 2005-06-24 2007-03-06 Inst Ct De Pesquisa E Desenvol rybena: method and communication system that uses text, voice and pounds to enable accessibility for people with disabilities
US11258900B2 (en) 2005-06-29 2022-02-22 Ultratec, Inc. Device independent text captioned telephone service
US8577593B2 (en) * 2005-08-18 2013-11-05 General Motors Llc Navigation system for hearing-impaired operators
US7746984B2 (en) * 2005-09-14 2010-06-29 Sorenson Communications, Inc. Method and system for call initiation in a video relay service
US20070057912A1 (en) * 2005-09-14 2007-03-15 Romriell Joseph N Method and system for controlling an interface of a device through motion gestures
US7742068B2 (en) * 2005-09-14 2010-06-22 Sorenson Communications, Inc. Method and system for auto configuration in a video phone system
US7746985B2 (en) * 2005-09-14 2010-06-29 Sorenson Communications, Inc. Method, system and device for relay call transfer service
JP4677938B2 (en) * 2006-03-23 2011-04-27 富士通株式会社 Information processing apparatus, universal communication method, and universal communication program
KR20080002081A (en) * 2006-06-30 2008-01-04 삼성전자주식회사 Image communications apparatus using voip and operating method thereof
EP2067119A2 (en) 2006-09-08 2009-06-10 Exbiblio B.V. Optical scanners, such as hand-held optical scanners
US20080114602A1 (en) * 2006-11-10 2008-05-15 Herman Norris Noritek, voice type
US8874445B2 (en) * 2007-03-30 2014-10-28 Verizon Patent And Licensing Inc. Apparatus and method for controlling output format of information
US20080267361A1 (en) * 2007-04-25 2008-10-30 Dileo Walter R Public phone device with multiple functionalities including services for the hearing impaired
US8638363B2 (en) 2009-02-18 2014-01-28 Google Inc. Automatically capturing information, such as capturing information using a document-aware device
US8478578B2 (en) 2008-01-09 2013-07-02 Fluential, Llc Mobile speech-to-speech interpretation system
US7957717B2 (en) * 2008-02-29 2011-06-07 Research In Motion Limited System and method for differentiating between incoming and outgoing messages and identifying correspondents in a TTY communication
US8190183B2 (en) * 2008-02-29 2012-05-29 Research In Motion Limited System and method for differentiating between incoming and outgoing messages and identifying correspondents in a TTY communication
WO2009118183A2 (en) * 2008-03-26 2009-10-01 Ident Technology Ag System and method for the multidimensional evaluation of gestures
CN101605158A (en) * 2008-06-13 2009-12-16 鸿富锦精密工业(深圳)有限公司 Mobile phone dedicated for deaf-mutes
US9300796B2 (en) * 2009-02-16 2016-03-29 Microsoft Technology Licensing, Llc Telecommunications device for the deaf (TDD) interface for interactive voice response (IVR) systems
US8280434B2 (en) * 2009-02-27 2012-10-02 Research In Motion Limited Mobile wireless communications device for hearing and/or speech impaired user
EP2224427B1 (en) * 2009-02-27 2018-11-14 BlackBerry Limited Mobile wireless communications device for hearing and/or speech impaired user
WO2010105246A2 (en) 2009-03-12 2010-09-16 Exbiblio B.V. Accessing resources based on capturing information from a rendered document
US8447066B2 (en) 2009-03-12 2013-05-21 Google Inc. Performing actions based on capturing information from rendered documents, such as documents under copyright
US20100299150A1 (en) * 2009-05-22 2010-11-25 Fein Gene S Language Translation System
CN102044128A (en) * 2009-10-23 2011-05-04 鸿富锦精密工业(深圳)有限公司 Emergency alarm system and method
US9081799B2 (en) 2009-12-04 2015-07-14 Google Inc. Using gestalt information to identify locations in printed information
US9323784B2 (en) 2009-12-09 2016-04-26 Google Inc. Image search using text-based elements within the contents of images
CN102104670B (en) * 2009-12-17 2014-03-05 深圳富泰宏精密工业有限公司 Sign language identification system and method
KR20150008840A (en) 2010-02-24 2015-01-23 아이피플렉 홀딩스 코포레이션 Augmented reality panorama supporting visually imparired individuals
DE102010009738A1 (en) * 2010-03-01 2011-09-01 Institut für Rundfunktechnik GmbH Arrangement for translating spoken language into a sign language for the deaf
US20130332952A1 (en) * 2010-04-12 2013-12-12 Atul Anandpura Method and Apparatus for Adding User Preferred Information To Video on TV
US20110294099A1 (en) * 2010-05-26 2011-12-01 Brady Patrick K System and method for automated analysis and diagnosis of psychological health
US20110295597A1 (en) * 2010-05-26 2011-12-01 Brady Patrick K System and method for automated analysis of emotional content of speech
CN202014321U (en) 2011-03-21 2011-10-19 海尔集团公司 Remote controller and television system
US9026441B2 (en) 2012-02-29 2015-05-05 Nant Holdings Ip, Llc Spoken control for user construction of complex behaviors
US9230560B2 (en) 2012-10-08 2016-01-05 Nant Holdings Ip, Llc Smart home automation systems and methods
CN103226692B (en) * 2012-11-22 2016-01-20 广东科学中心 A kind of recognition system of video stream image frame and method thereof
CN105612554B (en) * 2013-10-11 2019-05-10 冒纳凯阿技术公司 Method for characterizing the image obtained by video-medical equipment
US9558756B2 (en) 2013-10-29 2017-01-31 At&T Intellectual Property I, L.P. Method and system for adjusting user speech in a communication session
US9549060B2 (en) 2013-10-29 2017-01-17 At&T Intellectual Property I, L.P. Method and system for managing multimedia accessiblity
WO2015116014A1 (en) * 2014-02-03 2015-08-06 IPEKKAN, Ahmet Ziyaeddin A method of managing the presentation of sign language by an animated character
US10389876B2 (en) 2014-02-28 2019-08-20 Ultratec, Inc. Semiautomated relay method and apparatus
US20180034961A1 (en) 2014-02-28 2018-02-01 Ultratec, Inc. Semiautomated Relay Method and Apparatus
US10878721B2 (en) 2014-02-28 2020-12-29 Ultratec, Inc. Semiautomated relay method and apparatus
US10748523B2 (en) 2014-02-28 2020-08-18 Ultratec, Inc. Semiautomated relay method and apparatus
US20180270350A1 (en) 2014-02-28 2018-09-20 Ultratec, Inc. Semiautomated relay method and apparatus
CN107566863A (en) * 2016-06-30 2018-01-09 中兴通讯股份有限公司 A kind of exchange of information methods of exhibiting, device and equipment, set top box
IT201800002791A1 (en) * 2018-02-19 2019-08-19 Xeos It S R L System and automatic method of translation from an expressive form of language by means of signs and / or gestures into a different expressive form of the written, vocal and / or other type of language and / or vice versa
RU2691864C1 (en) * 2018-06-13 2019-06-18 Общество с ограниченной ответственностью "РостРесурс-Инклюзия" Telecommunication complex
US10776617B2 (en) * 2019-02-15 2020-09-15 Bank Of America Corporation Sign-language automated teller machine
CN110348420B (en) 2019-07-18 2022-03-18 腾讯科技(深圳)有限公司 Sign language recognition method and device, computer readable storage medium and computer equipment
US11539900B2 (en) 2020-02-21 2022-12-27 Ultratec, Inc. Caption modification and augmentation systems and methods for use by hearing assisted user

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4546383A (en) 1982-06-18 1985-10-08 Inria Institute National De Recherche En Informatique Et En Automatique Method and apparatus for visual telecommunications, in particular for use by the deaf
US5163081A (en) 1990-11-05 1992-11-10 At&T Bell Laboratories Automated dual-party-relay telephone system
US5283833A (en) 1991-09-19 1994-02-01 At&T Bell Laboratories Method and apparatus for speech processing using morphology and rhyming
US5313522A (en) 1991-08-23 1994-05-17 Slager Robert P Apparatus for generating from an audio signal a moving visual lip image from which a speech content of the signal can be comprehended by a lipreader
US5473705A (en) 1992-03-10 1995-12-05 Hitachi, Ltd. Sign language translation system and method that includes analysis of dependence relationships between successive words
US5481454A (en) 1992-10-29 1996-01-02 Hitachi, Ltd. Sign language/word translation system
US5544050A (en) 1992-09-03 1996-08-06 Hitachi, Ltd. Sign language learning system and method
US5659764A (en) 1993-02-25 1997-08-19 Hitachi, Ltd. Sign language generation apparatus and sign language translation apparatus
US5689575A (en) 1993-11-22 1997-11-18 Hitachi, Ltd. Method and apparatus for processing images of facial expressions
US5734794A (en) 1995-06-22 1998-03-31 White; Tom H. Method and system for voice-activated cell animation

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4546383A (en) 1982-06-18 1985-10-08 Inria Institute National De Recherche En Informatique Et En Automatique Method and apparatus for visual telecommunications, in particular for use by the deaf
US5163081A (en) 1990-11-05 1992-11-10 At&T Bell Laboratories Automated dual-party-relay telephone system
US5313522A (en) 1991-08-23 1994-05-17 Slager Robert P Apparatus for generating from an audio signal a moving visual lip image from which a speech content of the signal can be comprehended by a lipreader
US5283833A (en) 1991-09-19 1994-02-01 At&T Bell Laboratories Method and apparatus for speech processing using morphology and rhyming
US5473705A (en) 1992-03-10 1995-12-05 Hitachi, Ltd. Sign language translation system and method that includes analysis of dependence relationships between successive words
US5544050A (en) 1992-09-03 1996-08-06 Hitachi, Ltd. Sign language learning system and method
US5481454A (en) 1992-10-29 1996-01-02 Hitachi, Ltd. Sign language/word translation system
US5659764A (en) 1993-02-25 1997-08-19 Hitachi, Ltd. Sign language generation apparatus and sign language translation apparatus
US5689575A (en) 1993-11-22 1997-11-18 Hitachi, Ltd. Method and apparatus for processing images of facial expressions
US5734794A (en) 1995-06-22 1998-03-31 White; Tom H. Method and system for voice-activated cell animation

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
"Applications of Artificial Neural Networks IV", SPIE vol. 1965, By Steven K. Rogers, 1993, pp. 589-599.
"Bidirectional Translation Between Sign Language and Japanese for Communication with Deaf-Mute People", By Takao Kuwokawa et al., 1993, pp. 1109-1114.

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8489397B2 (en) * 2002-01-22 2013-07-16 At&T Intellectual Property Ii, L.P. Method and device for providing speech-to-text encoding and telephony service
US9361888B2 (en) 2002-01-22 2016-06-07 At&T Intellectual Property Ii, L.P. Method and device for providing speech-to-text encoding and telephony service
US20110116608A1 (en) * 2009-11-18 2011-05-19 Gwendolyn Simmons Method of providing two-way communication between a deaf person and a hearing person
US10008128B1 (en) 2016-12-02 2018-06-26 Imam Abdulrahman Bin Faisal University Systems and methodologies for assisting communications
US11817126B2 (en) 2021-04-20 2023-11-14 Micron Technology, Inc. Converting sign language

Also Published As

Publication number Publication date
US5982853A (en) 1999-11-09

Similar Documents

Publication Publication Date Title
USRE41002E1 (en) Telephone for the deaf and method of using same
CN108000526B (en) Dialogue interaction method and system for intelligent robot
Brashear et al. Using multiple sensors for mobile sign language recognition
KR101777807B1 (en) Sign language translator, system and method
US8494859B2 (en) Universal processing system and methods for production of outputs accessible by people with disabilities
US20090012788A1 (en) Sign language translation system
US11482134B2 (en) Method, apparatus, and terminal for providing sign language video reflecting appearance of conversation partner
CN109409255A (en) A kind of sign language scene generating method and device
Abdulla et al. Design and implementation of a sign-to-speech/text system for deaf and dumb people
CN110825164A (en) Interaction method and system based on wearable intelligent equipment special for children
KR102293743B1 (en) AI Chatbot based Care System
Seebun et al. Let’s talk: An assistive mobile technology for hearing and speech impaired persons
Bangham et al. Signing for the deaf using virtual humans
Solina et al. Multimedia dictionary and synthesis of sign language
KR20000017756A (en) Apparatus for Translating of Finger Language
US20040012643A1 (en) Systems and methods for visually communicating the meaning of information to the hearing impaired
Martin et al. An Indian Sign Language (ISL) corpus of the domain disaster message using Avatar
Papatsimouli et al. Real Time Sign Language Translation Systems: A review study
CN213634995U (en) Bidirectional sign language translator based on augmented reality and artificial intelligence
Suman et al. Sign Language Interpreter
KR102644927B1 (en) Multi-directional online communication system providing sign language interpretation services
RU2660600C2 (en) Method of communication between deaf (hard-of-hearing) and hearing
Kumar et al. Real time detection and conversion of gestures to text and speech to sign system
TWI795209B (en) Various sign language translation system
Dayana et al. Recommendations for Developing a Sign Language Recognition Application for Malaysia

Legal Events

Date Code Title Description
FPAY Fee payment

Year of fee payment: 12

AS Assignment

Owner name: ALEXANDER TRUST, CONNECTICUT

Free format text: UNCONDITIONAL ASSIGNMENT;ASSIGNOR:LIEBERMANN, RAANAN;REEL/FRAME:032149/0142

Effective date: 20140105