WO2016060642A1 - System and method for controlling photography through acoustic signals - Google Patents
System and method for controlling photography through acoustic signals Download PDFInfo
- Publication number
- WO2016060642A1 WO2016060642A1 PCT/US2014/060291 US2014060291W WO2016060642A1 WO 2016060642 A1 WO2016060642 A1 WO 2016060642A1 US 2014060291 W US2014060291 W US 2014060291W WO 2016060642 A1 WO2016060642 A1 WO 2016060642A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- trigger signal
- module
- processing
- surrounding sound
- sound
- Prior art date
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N23/00—Cameras or camera modules comprising electronic image sensors; Control thereof
- H04N23/60—Control of cameras or camera modules
- H04N23/66—Remote control of cameras or camera parts, e.g. by remote control devices
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/15—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being formant information
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/21—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
Definitions
- the present disclosure generally relates to method for controlling photography, and in particular relates to system and method for controlling photography through acoustic signals.
- Bluetooth extendable selfie stick helps the user to control the photography with a Bluetooth remote shutter.
- the Bluetooth extendable selfie stick requires a Bluetooth connection between the terminal (e.g., the mobile phone) and the selfie stick, which is power consuming and easy to get power off.
- Embodiments described herein relate to method, electronic device and system for controlling photography through acoustic signals.
- a system for controlling photography through acoustic signal includes a sound generator, an acquisition module, a storage module, a processing module, a determining module and a control module.
- the sound generator is configured for generating a trigger signal according to user's operation.
- the acquisition module is configure for obtain a surrounding sound which comprises the trigger signal and/or an environment noise.
- the storage module is configured for storing characteristic information of the trigger signal and processing-related parameters.
- the processing module is configured for processing the surrounding sound according to the processing-related parameters and for outputting processed surrounding sound.
- the determining module is configured for determining whether the trigger signal is comprised in the surrounding sound according to the processed and the characteristic information of the trigger signal.
- the controlling module is configured for controlling the terminal to take a picture if the trigger signal is comprised in the surrounding sound.
- the trigger signal has a constant frequency
- a method for controlling photography in a terminal through acoustic signals includes: setting processing-related parameter; generating a trigger signal in quiet environment by a sound generator; identifying characteristic information of the trigger signal by processing the trigger signal; obtaining surrounding sound by an acquisition module; processing on obtained surrounding sound by a processing module; determining whether the trigger signal is comprised in the surrounding sound; and controlling the terminal to take a picture by a control module is the trigger signal is detected in the surrounding sound.
- the trigger signal has a constant frequency.
- an electronic device for controlling photography in a terminal includes an acquisition module, a storage module, a processing module, a determining module and a control module.
- the sound generator is configured for generating a trigger signal according to user's operation.
- the acquisition module is configure for obtain a surrounding sound which comprises a trigger signal and/or an environment noise.
- the trigger signal is generated by a sound generator and has a constant frequency.
- the storage module is configured for storing characteristic information of the trigger signal and processing-related parameters.
- the processing module is configured for processing the surrounding sound according to the processing-related parameters and for outputting processed surrounding sound.
- the determining module is configured for determining whether the trigger signal is comprised in the surrounding sound according to the processed and the characteristic information of the trigger signal.
- the controlling module is configured for controlling the terminal to take a picture if the trigger signal is comprised in the surrounding sound.
- FIG. 1 illustrates an exemplary block diagram of a system for controlling a camera arranged in a terminal, in accordance with an embodiment of the present disclosure.
- FIG. 2 is a flowchart illustrating a method for controlling
- FIG. 3 is a flowchart illustrating a method for using a buzzer to control photography in the terminal, in accordance with an embodiment of the present disclosure.
- FIG. 4 illustrates an example of the system of FIG.1 , in accordance with an embodiment of the present disclosure.
- FIG. 5 illustrates an example of a system for controlling
- FIG. 6 illustrates another example of a system for controlling photography in a terminal, in accordance with embodiment of the present disclosure.
- FIG. 1 illustrates an exemplary block diagram of a system 100 for controlling a camera arranged in a terminal, in accordance with an embodiment of the present disclosure.
- the system 100 includes a sound generator 102, and an acquisition module 104, a processing module 106, a storage module 108, a determining module 1 10 and a controlling module 1 12.
- the terminal herein may be a mobile terminal, a tablet computer or any portable electronic device, which is not defined herein.
- the sound generator 102 is configured for generating a trigger signal according to user's operation.
- the sound generator 102 can be various kinds of products which can produce stable acoustic signals.
- stable means that the acoustic signals generated by the sound generator 102 every time are substantially the same.
- the frequency related to power peak is a fixed value.
- the sound generator 102 can generate sound in electronic means.
- the sound generator 102 can be a buzzer.
- the sound generator 102 may generate sound in mechanical means without electric energy supply.
- the sound generator 102 can be a training clicker.
- the trigger signal is configured for instructing the camera to take a picture.
- the trigger signal can be used for other purposes.
- the sound generator 102 can generate more than one kind of stable acoustic signals. Each of the acoustic signals can be related to one kind of controlling method.
- the acquisition module 104 coupled with the sound generator 102 is configured to obtain surrounding sound.
- the surrounding sound can include the trigger signal and/or an environmental noise.
- the storage module 108 is configured to store characteristic information of the trigger signal and processing-related parameters.
- the storage module 108 is coupled with the processing module 106 and the determining module 1 10.
- the characteristic information of the trigger signal includes power and frequency.
- the processing-related parameters include sampling frequency Fs, sampling period T, FFT Length Fft_Length, signal-to noise ration threshold SNR_Th, power threshold N_Th, noise detecting period Noise_F and so on.
- the processing module 106 is coupled with the acquisition module 104 and the storage module 108.
- the processing module 106 is configured to process the surrounding sound obtained by the acquisition module 104 according to the processing-related parameters stored in the storage module 108.
- the processing module 106 outputs the processed surrounding sound to the determining module 1 10.
- the determining module 1 10 is coupled with the processing module 106 and the storage module 108.
- the determining module 1 10 is configured to determine whether the trigger signal is included in the surrounding sound according to the processed surrounding sound and the characteristic information of the trigger signal.
- the controlling module 1 12 is coupled with the determining module 1 10 and is configured to control the terminal to take a picture according to the determining result. For example, if the determining module 1 10 determines that the trigger signal is detected in the surrounding sound.
- the terminal herein may be a mobile terminal, a tablet computer or any portable electronic device, which is not defined herein.
- the sound generator 102 can be any suitable device which can produce stable acoustic signals. There is no limitation on whether it has electric energy supply or not. For example, it can adopt electronic means (e.g., a buzzer) or mechanical means (e.g., a training clicker).
- the acquisition module 104, the processing module 106, the storage module 108, the determining module 1 10 and the controlling module 1 12 can be implemented using software, hardware, or a combination of software and hardware. In one exemplary
- the acquisition module 104, the processing module 106, the storage module 108, the determining module 1 10 and the controlling module 1 12 can be implemented using any programming language (e.g., Java), in the form of the terminal software (e.g., APP).
- the acquisition module 104, the processing module 106, the storage module 108, the determining module 1 10 and the controlling module 1 12 can be implemented outside of the terminal, in the form of hardware.
- FIG. 2 is a flowchart illustrating a method for controlling
- FIG. 2 is described in combination with FIG. 1 .
- processing-related parameters are set and stored in the storage module 108.
- the sound generator 102 generates a trigger signal in quiet environment which contain little environmental noise.
- the acquisition module 104 obtains surrounding sound and transfers it to the processing module 106 for further processing and analysis.
- the processing module 106 identifies characteristic information of the trigger signal and outputs it to the storage module 108 for storing.
- the acquisition module 104 keeps acquiring surrounding sound.
- the processing module 106 processes the acquired surrounding sound.
- the determining module 1 10 determines whether the trigger signal is included in the surrounding sound. If the trigger signal is detected in the surrounding sound, the process goes to step S214. Otherwise, the process goes back to step S208.
- the control module 1 12 controls the terminal to take a picture.
- FIG. 3 is a flowchart illustrating a method for using a buzzer to control photography in the terminal, in accordance with an embodiment of the present disclosure.
- the process can be divided into learning period and operating period.
- characteristic information of a trigger signal is aimed to obtain.
- the characteristic information may include a frequency point which relates to power peak of the trigger signal and a detecting range based on the frequency.
- the buzzer after determining the processing-related parameters, the buzzer generates a trigger signal in quiet environment.
- the processing-related parameters include sampling frequency Fs, FFT Length
- the acquisition module 102 obtains the trigger signal and sends it to processing module 104 for further analysis.
- the processing module 104 identifies the frequency point F which relates to power peak the trigger signals and calculates detecting frequency range F ran ge based on the frequency point F according to an equation (1 ):
- the F_res represents frequency resolution of FFT.
- F_res is calculated according to an equation (2):
- F_res Fs/Fft_Length (2)
- the Fs represents sampling frequency.
- the Fft_Length represents FFT length
- the power peak appears twice in the trigger signals.
- two frequency points, F1 and F2 are identified by the processing module 104 and outputs to storage module 108.
- the first detecting frequency range F ran g e i is determined as (F1 ⁇ Noise_F)/F_res.
- the second detecting frequency range F ran ge2 is determined as (F2 ⁇ Noise_F)/F_res.
- the process goes into operating period.
- the acquisition module 104 obtains surrounding sound.
- the processing module 106 performs FFT on the surrounding sound.
- the processing module 106 calculates power P at the frequency point F of the processed surrounding sound and average noise power P_Noise during the detecting frequency range F ran ge.
- the processing module 106 calculates Snr of the processed surrounding sound according to an equation (3):
- power P1 related to the frequency point F1 and power P2 related to the frequency point F2 are calculated.
- average noise power P1_Noise during frequency range F ra ngei and average noise power P2_Noise during frequency range F ra nge2 are both calculated.
- SNR1 which related to F1 is calculated according to equation (4)
- the determining module 1 10 determines whether the trigger signals are included in the acquired surrounding sound. If the SNR related to the frequency point F is greater than a SNR threshold SNR_Th, and the power P at the frequency point F is greater than the power threshold N_Th, the determining module 1 10 determines that the trigger signal is detected in the surrounding sound. Otherwise, the determining module 1 10 determines that the trigger signal is not included in the surrounding sound.
- the processing module may process a cepstrum transform for more complex surrounding sound.
- FIG. 4 illustrates an example of the system of FIG.1 , in accordance with an embodiment of the present disclosure.
- the acquisition module 104, the processing module 106, the storage module 108, the determining module 1 10 and the controlling module 1 12 are implemented in the form of software (e.g., APP) in a terminal 402.
- software e.g., APP
- FIG.5 and FIG. 6 illustrates examples of a system for controlling photography in a terminal which further includes an extendable stick, in accordance with embodiment of the present disclosure.
- the stick is configured for mounting the terminal 402.
- the extendable stick 502 and the sound generator 102 are two separate elements as shown in FIG. 5.
- the sound generator 102 can be integrated in the extendable stick 502 as show in FIG. 6.
- the embodiments of the present disclosure provide method for control photography in terminal through acoustic signals. After learning characteristic information of the acoustic signals generated by common products, such as a buzzer, the camera of the terminal can be controlled by the product through acoustic signals.
- the control of the present invention is achieved through acoustic signals, no Bluetooth or Wifi needed which avoids the shortcoming of such wireless connection.
- the acoustic signals can be generated by products in electronic means or mechanical means which requires less power supply than Bluetooth controller.
Abstract
Description
Claims
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/US2014/060291 WO2016060642A1 (en) | 2014-10-13 | 2014-10-13 | System and method for controlling photography through acoustic signals |
KR1020147037153A KR20170070864A (en) | 2014-10-13 | 2014-10-13 | System and method for controlling photography through acoustic signals |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/US2014/060291 WO2016060642A1 (en) | 2014-10-13 | 2014-10-13 | System and method for controlling photography through acoustic signals |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2016060642A1 true WO2016060642A1 (en) | 2016-04-21 |
Family
ID=55747038
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2014/060291 WO2016060642A1 (en) | 2014-10-13 | 2014-10-13 | System and method for controlling photography through acoustic signals |
Country Status (2)
Country | Link |
---|---|
KR (1) | KR20170070864A (en) |
WO (1) | WO2016060642A1 (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR102009593B1 (en) | 2018-01-22 | 2019-08-09 | 숭실대학교산학협력단 | Personalized Voice Recognition Device Using Speaker Analysis and Voice Recognition Method thereof, Recording Medium for Performing the Method |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4389109A (en) * | 1979-12-31 | 1983-06-21 | Minolta Camera Co., Ltd. | Camera with a voice command responsive system |
US5027149A (en) * | 1988-01-28 | 1991-06-25 | Konica Corporation | Voice-recognition camera |
US5162832A (en) * | 1989-06-20 | 1992-11-10 | Konica Corporation | Automatic release camera |
US5959667A (en) * | 1996-05-09 | 1999-09-28 | Vtel Corporation | Voice activated camera preset selection system and method of operation |
US7525575B2 (en) * | 2003-09-12 | 2009-04-28 | Canon Europa N.V. | Voice activated image capture device and method |
US20100066485A1 (en) * | 2008-09-15 | 2010-03-18 | Sony Ericsson Mobile Communications Ab | Remote control of mobile phone camera |
-
2014
- 2014-10-13 KR KR1020147037153A patent/KR20170070864A/en not_active Application Discontinuation
- 2014-10-13 WO PCT/US2014/060291 patent/WO2016060642A1/en active Application Filing
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4389109A (en) * | 1979-12-31 | 1983-06-21 | Minolta Camera Co., Ltd. | Camera with a voice command responsive system |
US5027149A (en) * | 1988-01-28 | 1991-06-25 | Konica Corporation | Voice-recognition camera |
US5162832A (en) * | 1989-06-20 | 1992-11-10 | Konica Corporation | Automatic release camera |
US5959667A (en) * | 1996-05-09 | 1999-09-28 | Vtel Corporation | Voice activated camera preset selection system and method of operation |
US7525575B2 (en) * | 2003-09-12 | 2009-04-28 | Canon Europa N.V. | Voice activated image capture device and method |
US20100066485A1 (en) * | 2008-09-15 | 2010-03-18 | Sony Ericsson Mobile Communications Ab | Remote control of mobile phone camera |
Also Published As
Publication number | Publication date |
---|---|
KR20170070864A (en) | 2017-06-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106462240B (en) | For providing touch feedback with the system and method for aided capture image | |
CN103702029B (en) | The method and device of focusing is pointed out during shooting | |
US20170126192A1 (en) | Method, device, and computer-readable medium for adjusting volume | |
CN109256146B (en) | Audio detection method, device and storage medium | |
WO2015154445A1 (en) | Voice recognition method and mobile terminal | |
KR20200055983A (en) | Method for estimating electromagnatic signal radiated from device and an electronic device thereof | |
CN106961514B (en) | Terminal device and information output method | |
US11909340B2 (en) | Motor vibration method, motor vibration device, computer device, and computer-readable storage medium | |
EP2720224B1 (en) | Voice Converting Apparatus and Method for Converting User Voice Thereof | |
RU2011154335A (en) | ASSESSMENT OF THE SPEAKERS | |
EP3932088B1 (en) | Feedback control for display as sound emitter | |
KR20200086476A (en) | Method for estimating electromagnatic signal and an electronic device thereof | |
CN107908144B (en) | Method and device for controlling smoke extractor and storage medium | |
KR102512614B1 (en) | Electronic device audio enhancement and method thereof | |
CN113744750B (en) | Audio processing method and electronic equipment | |
JP2021509963A (en) | Multi-beam selection method and equipment | |
US20140015744A1 (en) | Control system and method for a display | |
CN115132224A (en) | Abnormal sound processing method, device, terminal and storage medium | |
CN105451011B (en) | The method and apparatus of regulation power | |
CN105848061B (en) | Control method and electronic equipment | |
WO2016060642A1 (en) | System and method for controlling photography through acoustic signals | |
CN105554663B (en) | Hearing system for estimating a feedback path of a hearing device | |
CN107911564B (en) | Terminal reminding method and device and terminal | |
CN109120864A (en) | Light filling processing method, device, storage medium and mobile terminal | |
JP5267982B2 (en) | Voice input device, noise removal method, and computer program |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
ENP | Entry into the national phase |
Ref document number: 20147037153 Country of ref document: KR Kind code of ref document: A |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 14904027 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 28/07/2017) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 14904027 Country of ref document: EP Kind code of ref document: A1 |