WO2016060642A1 - System and method for controlling photography through acoustic signals - Google Patents


Info

Publication number
WO2016060642A1
WO2016060642A1 PCT/US2014/060291 US2014060291W WO2016060642A1 WO 2016060642 A1 WO2016060642 A1 WO 2016060642A1 US 2014060291 W US2014060291 W US 2014060291W WO 2016060642 A1 WO2016060642 A1 WO 2016060642A1
Authority
WO
WIPO (PCT)
Prior art keywords
trigger signal
module
processing
surrounding sound
sound
Prior art date
Application number
PCT/US2014/060291
Other languages
French (fr)
Inventor
Woody Yu
Original Assignee
Looq System Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Looq System Inc.
Priority to PCT/US2014/060291 (WO2016060642A1)
Priority to KR1020147037153A (KR20170070864A)
Publication of WO2016060642A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00 Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60 Control of cameras or camera modules
    • H04N23/66 Remote control of cameras or camera parts, e.g. by remote control devices
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/15 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being formant information
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/21 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information

Definitions

  • the present disclosure generally relates to a method for controlling photography, and in particular relates to a system and method for controlling photography through acoustic signals.
  • a Bluetooth extendable selfie stick helps the user to control the photography with a Bluetooth remote shutter.
  • the Bluetooth extendable selfie stick requires a Bluetooth connection between the terminal (e.g., the mobile phone) and the selfie stick, which is power consuming and prone to losing power.
  • Embodiments described herein relate to a method, an electronic device and a system for controlling photography through acoustic signals.
  • a system for controlling photography through acoustic signals includes a sound generator, an acquisition module, a storage module, a processing module, a determining module and a control module.
  • the sound generator is configured for generating a trigger signal according to user's operation.
  • the acquisition module is configured for obtaining a surrounding sound which comprises the trigger signal and/or an environment noise.
  • the storage module is configured for storing characteristic information of the trigger signal and processing-related parameters.
  • the processing module is configured for processing the surrounding sound according to the processing-related parameters and for outputting processed surrounding sound.
  • the determining module is configured for determining whether the trigger signal is comprised in the surrounding sound according to the processed surrounding sound and the characteristic information of the trigger signal.
  • the controlling module is configured for controlling the terminal to take a picture if the trigger signal is comprised in the surrounding sound.
  • the trigger signal has a constant frequency.
  • a method for controlling photography in a terminal through acoustic signals includes: setting processing-related parameters; generating a trigger signal in a quiet environment by a sound generator; identifying characteristic information of the trigger signal by processing the trigger signal; obtaining surrounding sound by an acquisition module; processing the obtained surrounding sound by a processing module; determining whether the trigger signal is comprised in the surrounding sound; and controlling the terminal to take a picture by a control module if the trigger signal is detected in the surrounding sound.
  • the trigger signal has a constant frequency.
  • an electronic device for controlling photography in a terminal includes an acquisition module, a storage module, a processing module, a determining module and a control module.
  • the sound generator is configured for generating a trigger signal according to user's operation.
  • the acquisition module is configured for obtaining a surrounding sound which comprises a trigger signal and/or an environment noise.
  • the trigger signal is generated by a sound generator and has a constant frequency.
  • the storage module is configured for storing characteristic information of the trigger signal and processing-related parameters.
  • the processing module is configured for processing the surrounding sound according to the processing-related parameters and for outputting processed surrounding sound.
  • the determining module is configured for determining whether the trigger signal is comprised in the surrounding sound according to the processed surrounding sound and the characteristic information of the trigger signal.
  • the controlling module is configured for controlling the terminal to take a picture if the trigger signal is comprised in the surrounding sound.
  • FIG. 1 illustrates an exemplary block diagram of a system for controlling a camera arranged in a terminal, in accordance with an embodiment of the present disclosure.
  • FIG. 2 is a flowchart illustrating a method for controlling photography through acoustic signals, in accordance with an embodiment of the present disclosure.
  • FIG. 3 is a flowchart illustrating a method for using a buzzer to control photography in the terminal, in accordance with an embodiment of the present disclosure.
  • FIG. 4 illustrates an example of the system of FIG. 1, in accordance with an embodiment of the present disclosure.
  • FIG. 5 illustrates an example of a system for controlling photography in a terminal, in accordance with an embodiment of the present disclosure.
  • FIG. 6 illustrates another example of a system for controlling photography in a terminal, in accordance with an embodiment of the present disclosure.
  • FIG. 1 illustrates an exemplary block diagram of a system 100 for controlling a camera arranged in a terminal, in accordance with an embodiment of the present disclosure.
  • the system 100 includes a sound generator 102, an acquisition module 104, a processing module 106, a storage module 108, a determining module 110 and a controlling module 112.
  • the terminal herein may be a mobile terminal, a tablet computer or any portable electronic device, which is not limited herein.
  • the sound generator 102 is configured for generating a trigger signal according to user's operation.
  • the sound generator 102 can be various kinds of products which can produce stable acoustic signals.
  • "stable" means that the acoustic signals generated by the sound generator 102 every time are substantially the same.
  • the frequency at which the power peaks is a fixed value.
  • the sound generator 102 can generate sound by electronic means.
  • the sound generator 102 can be a buzzer.
  • the sound generator 102 may generate sound by mechanical means without an electric energy supply.
  • the sound generator 102 can be a training clicker.
  • the trigger signal is configured for instructing the camera to take a picture.
  • the trigger signal can be used for other purposes.
  • the sound generator 102 can generate more than one kind of stable acoustic signal. Each of the acoustic signals can be related to one kind of control operation.
  • the acquisition module 104 coupled with the sound generator 102 is configured to obtain surrounding sound.
  • the surrounding sound can include the trigger signal and/or an environmental noise.
  • the storage module 108 is configured to store characteristic information of the trigger signal and processing-related parameters.
  • the storage module 108 is coupled with the processing module 106 and the determining module 110.
  • the characteristic information of the trigger signal includes power and frequency.
  • the processing-related parameters include sampling frequency Fs, sampling period T, FFT length Fft_Length, signal-to-noise ratio threshold SNR_Th, power threshold N_Th, noise detecting period Noise_F and so on.
  • the processing module 106 is coupled with the acquisition module 104 and the storage module 108.
  • the processing module 106 is configured to process the surrounding sound obtained by the acquisition module 104 according to the processing-related parameters stored in the storage module 108.
  • the processing module 106 outputs the processed surrounding sound to the determining module 110.
  • the determining module 110 is coupled with the processing module 106 and the storage module 108.
  • the determining module 110 is configured to determine whether the trigger signal is included in the surrounding sound according to the processed surrounding sound and the characteristic information of the trigger signal.
  • the controlling module 112 is coupled with the determining module 110 and is configured to control the terminal to take a picture according to the determining result, for example when the determining module 110 determines that the trigger signal is detected in the surrounding sound.
  • the terminal herein may be a mobile terminal, a tablet computer or any portable electronic device, which is not limited herein.
  • the sound generator 102 can be any suitable device which can produce stable acoustic signals. There is no limitation on whether it has electric energy supply or not. For example, it can adopt electronic means (e.g., a buzzer) or mechanical means (e.g., a training clicker).
  • the acquisition module 104, the processing module 106, the storage module 108, the determining module 110 and the controlling module 112 can be implemented using software, hardware, or a combination of software and hardware.
  • the acquisition module 104, the processing module 106, the storage module 108, the determining module 110 and the controlling module 112 can be implemented using any programming language (e.g., Java), in the form of terminal software (e.g., APP).
  • the acquisition module 104, the processing module 106, the storage module 108, the determining module 110 and the controlling module 112 can be implemented outside of the terminal, in the form of hardware.
  • FIG. 2 is a flowchart illustrating a method for controlling photography with acoustic signals, in accordance with an embodiment of the present disclosure.
  • FIG. 2 is described in combination with FIG. 1.
  • processing-related parameters are set and stored in the storage module 108.
  • the sound generator 102 generates a trigger signal in a quiet environment which contains little environmental noise.
  • the acquisition module 104 obtains surrounding sound and transfers it to the processing module 106 for further processing and analysis.
  • the processing module 106 identifies characteristic information of the trigger signal and outputs it to the storage module 108 for storing.
  • the acquisition module 104 keeps acquiring surrounding sound.
  • the processing module 106 processes the acquired surrounding sound.
  • the determining module 110 determines whether the trigger signal is included in the surrounding sound. If the trigger signal is detected in the surrounding sound, the process goes to step S214. Otherwise, the process goes back to step S208.
  • the control module 112 controls the terminal to take a picture.
  • FIG. 3 is a flowchart illustrating a method for using a buzzer to control photography in the terminal, in accordance with an embodiment of the present disclosure.
  • the process can be divided into learning period and operating period.
  • the aim is to obtain characteristic information of a trigger signal.
  • the characteristic information may include a frequency point which relates to the power peak of the trigger signal and a detecting range based on that frequency point.
  • after the processing-related parameters are determined, the buzzer generates a trigger signal in a quiet environment.
  • the processing-related parameters include sampling frequency Fs, FFT length Fft_Length, sampling period T, signal-to-noise ratio threshold SNR_Th, power threshold N_Th, and noise detecting period Noise_F.
  • the acquisition module 104 obtains the trigger signal and sends it to the processing module 106 for further analysis.
  • the processing module 106 identifies the frequency point F which relates to the power peak of the trigger signal and calculates the detecting frequency range Frange based on the frequency point F according to equation (1): Frange = (F ± Noise_F)/F_res (1)
  • F_res represents the frequency resolution of the FFT.
  • F_res is calculated according to equation (2):
  • F_res = Fs/Fft_Length (2)
  • Fs represents the sampling frequency.
  • Fft_Length represents the FFT length.
  • the power peak appears twice in the trigger signal.
  • two frequency points, F1 and F2, are identified by the processing module 106 and output to the storage module 108.
  • the first detecting frequency range Frange1 is determined as (F1 ± Noise_F)/F_res.
  • the second detecting frequency range Frange2 is determined as (F2 ± Noise_F)/F_res.
  • the process goes into operating period.
  • the acquisition module 104 obtains surrounding sound.
  • the processing module 106 performs FFT on the surrounding sound.
  • the processing module 106 calculates the power P at the frequency point F of the processed surrounding sound and the average noise power P_Noise within the detecting frequency range Frange.
  • the processing module 106 calculates Snr of the processed surrounding sound according to equation (3): Snr = P/P_Noise (3)
  • power P1 related to the frequency point F1 and power P2 related to the frequency point F2 are calculated.
  • average noise power P1_Noise within frequency range Frange1 and average noise power P2_Noise within frequency range Frange2 are both calculated.
  • Snr1, which relates to F1, is calculated according to equation (4): Snr1 = P1/P1_Noise (4)
  • the determining module 110 determines whether the trigger signal is included in the acquired surrounding sound. If the SNR related to the frequency point F is greater than the SNR threshold SNR_Th, and the power P at the frequency point F is greater than the power threshold N_Th, the determining module 110 determines that the trigger signal is detected in the surrounding sound. Otherwise, the determining module 110 determines that the trigger signal is not included in the surrounding sound.
  • the processing module may perform a cepstrum transform for more complex surrounding sound.
  • FIG. 4 illustrates an example of the system of FIG. 1, in accordance with an embodiment of the present disclosure.
  • the acquisition module 104, the processing module 106, the storage module 108, the determining module 110 and the controlling module 112 are implemented in the form of software (e.g., APP) in a terminal 402.
  • FIG. 5 and FIG. 6 illustrate examples of a system for controlling photography in a terminal which further includes an extendable stick, in accordance with an embodiment of the present disclosure.
  • the stick is configured for mounting the terminal 402.
  • the extendable stick 502 and the sound generator 102 are two separate elements as shown in FIG. 5.
  • the sound generator 102 can be integrated in the extendable stick 502 as shown in FIG. 6.
  • the embodiments of the present disclosure provide a method for controlling photography in a terminal through acoustic signals. After the characteristic information of the acoustic signals generated by a common product, such as a buzzer, has been learned, the camera of the terminal can be controlled by that product through acoustic signals.
  • the control of the present invention is achieved through acoustic signals, with no Bluetooth or Wi-Fi connection needed, which avoids the shortcomings of such wireless connections.
  • the acoustic signals can be generated by products using electronic means or mechanical means, which require less power than a Bluetooth controller.

Abstract

A system for controlling photography through acoustic signal is disclosed. The system includes a sound generator, for generating a trigger signal according to user's operation; an acquisition module, configured to obtain a surrounding sound comprising the trigger signal and/or an environmental noise; a storage module, configured to store characteristic information of the trigger signal and processing-related parameters; a processing module, configured to process the surrounding sound according to the processing-related parameters and to output the processed surrounding sound; a determining module, configured to determine whether the trigger signal is comprised in the surrounding sound according to the processed surrounding sound and the characteristic information of the trigger signal; and a controlling module, configured to control the terminal to take a picture according to the determining result.

Description

SYSTEM AND METHOD FOR CONTROLLING PHOTOGRAPHY THROUGH ACOUSTIC SIGNALS
FIELD OF THE PRESENT DISCLOSURE
[0001] The present disclosure generally relates to a method for controlling photography, and in particular relates to a system and method for controlling photography through acoustic signals.
BACKGROUND
[0002] With the popularity of selfie photography, many peripheral products for selfie photography have become popular on the market. The most common product is the Bluetooth extendable selfie stick, which helps the user to control the photography with a Bluetooth remote shutter. However, the Bluetooth extendable selfie stick requires a Bluetooth connection between the terminal (e.g., the mobile phone) and the selfie stick, which is power consuming and prone to losing power.
[0003] Therefore, there exists a need to provide an improved method for controlling photography of a terminal in a power-saving manner.
SUMMARY
[0004] Embodiments described herein relate to a method, an electronic device and a system for controlling photography through acoustic signals.
[0005] In an embodiment, a system for controlling photography through acoustic signals is disclosed. The system includes a sound generator, an acquisition module, a storage module, a processing module, a determining module and a control module. The sound generator is configured for generating a trigger signal according to user's operation. The acquisition module is configured for obtaining a surrounding sound which comprises the trigger signal and/or an environment noise. The storage module is configured for storing characteristic information of the trigger signal and processing-related parameters. The processing module is configured for processing the surrounding sound according to the processing-related parameters and for outputting processed surrounding sound. The determining module is configured for determining whether the trigger signal is comprised in the surrounding sound according to the processed surrounding sound and the characteristic information of the trigger signal. The controlling module is configured for controlling the terminal to take a picture if the trigger signal is comprised in the surrounding sound. The trigger signal has a constant frequency.
[0006] In another embodiment, a method for controlling photography in a terminal through acoustic signals is disclosed. The method includes: setting processing-related parameters; generating a trigger signal in a quiet environment by a sound generator; identifying characteristic information of the trigger signal by processing the trigger signal; obtaining surrounding sound by an acquisition module; processing the obtained surrounding sound by a processing module; determining whether the trigger signal is comprised in the surrounding sound; and controlling the terminal to take a picture by a control module if the trigger signal is detected in the surrounding sound. The trigger signal has a constant frequency.
[0007] In yet another embodiment, an electronic device for controlling photography in a terminal is disclosed. The electronic device includes an acquisition module, a storage module, a processing module, a determining module and a control module. The sound generator is configured for generating a trigger signal according to user's operation. The acquisition module is configured for obtaining a surrounding sound which comprises a trigger signal and/or an environment noise. The trigger signal is generated by a sound generator and has a constant frequency. The storage module is configured for storing characteristic information of the trigger signal and processing-related parameters. The processing module is configured for processing the surrounding sound according to the processing-related parameters and for outputting processed surrounding sound. The determining module is configured for determining whether the trigger signal is comprised in the surrounding sound according to the processed surrounding sound and the characteristic information of the trigger signal. The controlling module is configured for controlling the terminal to take a picture if the trigger signal is comprised in the surrounding sound.
[0008] Additional benefits and novel features will be set forth in part in the description which follows, and in part will become apparent to those skilled in the art upon examination of the following and the accompanying drawings or may be learned by production or operation of the disclosed embodiments. The benefits of the present embodiments may be realized and attained by practice or use of various aspects of the methodologies, instrumentations and combinations set forth in the detailed description set forth below.
BRIEF DESCRIPTION OF THE DRAWINGS
[0009] Features and benefits of embodiments of the claimed subject matter will become apparent as the following detailed description proceeds, and upon reference to the drawings, wherein like numerals depict like parts. These exemplary embodiments are described in detail with reference to the drawings. These embodiments are non-limiting exemplary embodiments.
[0010] FIG. 1 illustrates an exemplary block diagram of a system for controlling a camera arranged in a terminal, in accordance with an embodiment of the present disclosure.
[0011] FIG. 2 is a flowchart illustrating a method for controlling photography through acoustic signals, in accordance with an embodiment of the present disclosure.
[0012] FIG. 3 is a flowchart illustrating a method for using a buzzer to control photography in the terminal, in accordance with an embodiment of the present disclosure.
[0013] FIG. 4 illustrates an example of the system of FIG. 1, in accordance with an embodiment of the present disclosure.
[0014] FIG. 5 illustrates an example of a system for controlling photography in a terminal, in accordance with an embodiment of the present disclosure.
[0015] FIG. 6 illustrates another example of a system for controlling photography in a terminal, in accordance with an embodiment of the present disclosure.
DETAILED DESCRIPTION
[0016] Reference will now be made in detail to the embodiments of the present disclosure. While the present disclosure will be described in conjunction with these embodiments, it will be understood that they are not intended to limit the present disclosure to these embodiments. On the contrary, the present disclosure is intended to cover alternatives, modifications and equivalents, which may be included within the spirit and scope of the present disclosure as defined by the appended claims.
[0017] Furthermore, in the following detailed description of the present disclosure, numerous specific details are set forth in order to provide a thorough understanding of the present disclosure. However, it will be recognized by one of ordinary skill in the art that the present disclosure may be practiced without these specific details. In other instances, well known methods, procedures, components, and circuits have not been described in detail so as not to unnecessarily obscure aspects of the present disclosure.
[0018] FIG. 1 illustrates an exemplary block diagram of a system 100 for controlling a camera arranged in a terminal, in accordance with an embodiment of the present disclosure. The system 100 includes a sound generator 102, an acquisition module 104, a processing module 106, a storage module 108, a determining module 110 and a controlling module 112. The terminal herein may be a mobile terminal, a tablet computer or any portable electronic device, which is not limited herein.
[0019] The sound generator 102 is configured for generating a trigger signal according to user's operation. The sound generator 102 can be various kinds of products which can produce stable acoustic signals. Here the term "stable" means that the acoustic signals generated by the sound generator 102 every time are substantially the same. The frequency at which the power peaks is a fixed value. In one embodiment, the sound generator 102 can generate sound by electronic means. For example, the sound generator 102 can be a buzzer. In other embodiments, the sound generator 102 may generate sound by mechanical means without an electric energy supply. For example, the sound generator 102 can be a training clicker.
[0020] In one embodiment, the trigger signal is configured for instructing the camera to take a picture. In other embodiments, the trigger signal can be used for other purposes. Alternatively, the sound generator 102 can generate more than one kind of stable acoustic signal. Each of the acoustic signals can be related to one kind of control operation.
[0021] The acquisition module 104 coupled with the sound generator 102 is configured to obtain surrounding sound. The surrounding sound can include the trigger signal and/or an environmental noise.
[0022] The storage module 108 is configured to store characteristic information of the trigger signal and processing-related parameters. The storage module 108 is coupled with the processing module 106 and the determining module 110. The characteristic information of the trigger signal includes power and frequency. The processing-related parameters include sampling frequency Fs, sampling period T, FFT length Fft_Length, signal-to-noise ratio threshold SNR_Th, power threshold N_Th, noise detecting period Noise_F and so on.
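As a rough illustration of what the storage module 108 holds, the following sketch groups the processing-related parameters and the characteristic information into two records. The class names, field names, units and example values are assumptions made for illustration only; the disclosure names the parameters but does not prescribe a data layout.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ProcessingParams:
    """Processing-related parameters named in paragraph [0022]."""
    fs: float         # sampling frequency Fs, in Hz
    t: float          # sampling period T, in seconds (unit is an assumption)
    fft_length: int   # FFT length Fft_Length, in samples
    snr_th: float     # signal-to-noise ratio threshold SNR_Th
    n_th: float       # power threshold N_Th
    noise_f: float    # noise detecting period Noise_F, in Hz (unit is an assumption)


@dataclass(frozen=True)
class TriggerInfo:
    """Characteristic information of the trigger signal: frequency and power."""
    frequency: float  # frequency of the power peak, in Hz
    power: float      # power measured at that frequency


# Hypothetical values, chosen only to make the sketch concrete.
params = ProcessingParams(fs=44100.0, t=0.1, fft_length=1024,
                          snr_th=10.0, n_th=1.0, noise_f=500.0)
trigger = TriggerInfo(frequency=3000.0, power=25.0)
```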
[0023] The processing module 106 is coupled with the acquisition module 104 and the storage module 108. The processing module 106 is configured to process the surrounding sound obtained by the acquisition module 104 according to the processing-related parameters stored in the storage module 108. The processing module 106 outputs the processed surrounding sound to the determining module 110.
[0024] The determining module 110 is coupled with the processing module 106 and the storage module 108. The determining module 110 is configured to determine whether the trigger signal is included in the surrounding sound according to the processed surrounding sound and the characteristic information of the trigger signal.
[0025] The controlling module 112 is coupled with the determining module 110 and is configured to control the terminal to take a picture according to the determining result, for example when the determining module 110 determines that the trigger signal is detected in the surrounding sound. The terminal herein may be a mobile terminal, a tablet computer or any portable electronic device, which is not limited herein.
[0026] As mentioned above, the sound generator 102 can be any suitable device which can produce stable acoustic signals. There is no limitation on whether it has an electric energy supply or not. For example, it can adopt electronic means (e.g., a buzzer) or mechanical means (e.g., a training clicker). Furthermore, the acquisition module 104, the processing module 106, the storage module 108, the determining module 110 and the controlling module 112 can be implemented using software, hardware, or a combination of software and hardware. In one exemplary embodiment, the acquisition module 104, the processing module 106, the storage module 108, the determining module 110 and the controlling module 112 can be implemented using any programming language (e.g., Java), in the form of terminal software (e.g., APP). In another embodiment, the acquisition module 104, the processing module 106, the storage module 108, the determining module 110 and the controlling module 112 can be implemented outside of the terminal, in the form of hardware.
[0027] FIG. 2 is a flowchart illustrating a method for controlling photography with acoustic signals, in accordance with an embodiment of the present disclosure. FIG. 2 is described in combination with FIG. 1.
[0028] At S202, processing-related parameters are set and stored in the storage module 108. At S204, the sound generator 102 generates a trigger signal in a quiet environment which contains little environmental noise. The acquisition module 104 obtains surrounding sound and transfers it to the processing module 106 for further processing and analysis. At S206, the processing module 106 identifies characteristic information of the trigger signal and outputs it to the storage module 108 for storing. At S208, the acquisition module 104 keeps acquiring surrounding sound. At S210, the processing module 106 processes the acquired surrounding sound.
[0029] At S212, the determining module 110 determines whether the trigger signal is included in the surrounding sound. If the trigger signal is detected in the surrounding sound, the process goes to step S214. Otherwise, the process goes back to step S208. At S214, when the trigger signal is detected, the control module 112 controls the terminal to take a picture.
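The loop formed by steps S208 to S214 can be sketched as below. This is a minimal illustration, not the disclosed implementation: the function name and the two callables (contains_trigger, take_picture) are hypothetical stand-ins for the processing/determining modules and the controlling module, and the choice to keep listening after a picture is taken is an assumption, since the flow after S214 is not stated.

```python
from typing import Callable, Iterable, Sequence


def run_photo_control(frames: Iterable[Sequence[float]],
                      contains_trigger: Callable[[Sequence[float]], bool],
                      take_picture: Callable[[], None]) -> int:
    """Consume frames of surrounding sound and fire the shutter on a trigger.

    Returns the number of pictures taken.
    """
    shots = 0
    for frame in frames:                # S208: keep acquiring surrounding sound
        if contains_trigger(frame):     # S210 and S212: process, then decide
            take_picture()              # S214: control the terminal
            shots += 1
        # otherwise fall through and acquire the next frame (back to S208)
    return shots


# Toy usage: a frame "contains the trigger" when its first sample is loud.
taken = []
count = run_photo_control(
    frames=[[0.0], [1.0], [0.0], [1.0]],
    contains_trigger=lambda frame: frame[0] > 0.5,
    take_picture=lambda: taken.append("click"),
)
```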
[0030] More specifically, the following takes a buzzer as the sound generator 102 as an example, which does not limit the present invention. FIG. 3 is a flowchart illustrating a method for using a buzzer to control photography in the terminal, in accordance with an embodiment of the present disclosure. Generally, the process can be divided into a learning period and an operating period.
[0031] During the learning period, the aim is to obtain characteristic information of a trigger signal. In one embodiment, the characteristic information may include a frequency point which relates to the power peak of the trigger signal and a detecting range based on that frequency point.
[0032] More specifically, at S302, after the processing-related parameters are determined, the buzzer generates a trigger signal in a quiet environment. The processing-related parameters include the sampling frequency Fs, the FFT length Fft_Length, the sampling period T, the signal-to-noise ratio threshold SNR_Th, the power threshold N_Th, and the noise detecting period Noise_F. The acquisition module 104 obtains the trigger signal and sends it to the processing module 106 for further analysis. At S304, the processing module 106 identifies the frequency point F that corresponds to the power peak of the trigger signal and calculates the detecting frequency range Frange based on the frequency point F according to equation (1):
Frange = (F ± Noise_F) / F_res (1)
F_res represents the frequency resolution of the FFT and is calculated according to equation (2):
F_res = Fs / Fft_Length (2)
Fs represents the sampling frequency, and Fft_Length represents the FFT length.
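As a worked illustration of equations (1) and (2), the short sketch below computes F_res and the two ends of Frange. Dividing a frequency in hertz by F_res yields a position in units of FFT bins, so the result is read here as a bin range. The numeric values (Fs = 8000 Hz, Fft_Length = 256, F = 2000 Hz, Noise_F = 250 Hz) are assumptions chosen for easy arithmetic, not values from the disclosure.

```python
from typing import Tuple


def frequency_resolution(fs: float, fft_length: int) -> float:
    """Equation (2): F_res = Fs / Fft_Length, in hertz per FFT bin."""
    return fs / fft_length


def detecting_range(
    f: float, noise_f: float, fs: float, fft_length: int
) -> Tuple[float, float]:
    """Equation (1): both ends of Frange = (F +/- Noise_F) / F_res."""
    f_res = frequency_resolution(fs, fft_length)
    return (f - noise_f) / f_res, (f + noise_f) / f_res


# Assumed example values, not taken from the disclosure.
f_res = frequency_resolution(8000.0, 256)               # 31.25 Hz per bin
frange = detecting_range(2000.0, 250.0, 8000.0, 256)    # (56.0, 72.0)
```

With these assumed values the detecting range spans bins 56 to 72, centred on bin 64, which is where a 2000 Hz tone falls.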
[0033] For example, in the case of the buzzer, the trigger signal exhibits two power peaks. Thus, two frequency points, F1 and F2, are identified by the processing module 106 and output to the storage module 108. The first detecting frequency range Frange1 is determined as (F1 ± Noise_F) / F_res. The second detecting frequency range Frange2 is determined as (F2 ± Noise_F) / F_res. After the learning period, the characteristic information of the trigger signal has been acquired.
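One possible way to identify the frequency points during the learning period is to compute a power spectrum and keep the strongest local maxima. The sketch below does this with a naive DFT so that it needs only the standard library; a real implementation would use an FFT. The function names, the peak-picking rule, and the synthetic two-tone "buzzer" (2000 Hz and 3000 Hz at Fs = 8000 Hz) are all assumptions for illustration; the disclosure does not give the buzzer's actual frequencies.

```python
import cmath
import math
from typing import List


def power_spectrum(samples: List[float]) -> List[float]:
    """Power of DFT bins 0..N/2, computed naively in O(N^2)."""
    n = len(samples)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(
            x * cmath.exp(-2j * math.pi * k * i / n)
            for i, x in enumerate(samples)
        )
        spectrum.append(abs(acc) ** 2)
    return spectrum


def learn_frequency_points(samples: List[float], fs: float, count: int) -> List[float]:
    """Return the `count` strongest spectral peaks, in hertz, sorted ascending."""
    power = power_spectrum(samples)
    n = len(samples)
    # A bin is a candidate peak if it is a local maximum of the spectrum.
    candidates = [
        k for k in range(1, len(power) - 1)
        if power[k] > power[k - 1] and power[k] >= power[k + 1]
    ]
    strongest = sorted(candidates, key=lambda k: power[k], reverse=True)[:count]
    return sorted(k * fs / n for k in strongest)


# Synthetic trigger recorded in a "quiet environment": two pure tones.
FS = 8000.0
N = 256
trigger = [
    math.sin(2 * math.pi * 2000.0 * i / FS)
    + 0.8 * math.sin(2 * math.pi * 3000.0 * i / FS)
    for i in range(N)
]
points = learn_frequency_points(trigger, FS, count=2)  # F1 and F2
```

The returned values play the role of F1 and F2 and would be stored together with the ranges Frange1 and Frange2 derived from them.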
[0034] The process then enters the operating period. At S306, the acquisition module 104 obtains the surrounding sound. At S308, the processing module 106 performs an FFT on the surrounding sound. At S310, the processing module 106 calculates the power P at the frequency point F of the processed surrounding sound and the average noise power P_Noise over the detecting frequency range Frange.
Moreover, the processing module 106 calculates the signal-to-noise ratio Snr of the processed surrounding sound according to equation (3):
Snr = P / P_Noise (3)
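Equation (3) can be illustrated with a small helper that takes a power spectrum, the bin of the frequency point F, and the bin limits of Frange. The disclosure does not say whether the bin at F itself is included in the noise average; this sketch assumes it is excluded. The function name and the toy spectrum are hypothetical.

```python
import math
from typing import Sequence, Tuple


def peak_power_and_snr(
    power: Sequence[float], peak_bin: int, low_bin: int, high_bin: int
) -> Tuple[float, float]:
    """Return (P, Snr), where Snr = P / P_Noise as in equation (3).

    P_Noise is the mean power over bins low_bin..high_bin, with the peak
    bin left out (an assumption, not stated in the disclosure).
    """
    noise = [power[k] for k in range(low_bin, high_bin + 1) if k != peak_bin]
    if not noise:
        raise ValueError("detecting range contains no noise bins")
    p = power[peak_bin]
    p_noise = sum(noise) / len(noise)
    if p_noise == 0.0:
        return p, math.inf  # no measurable noise inside the range
    return p, p / p_noise


# Toy spectrum: flat noise floor of 1.0 with a peak of 50.0 at bin 5.
toy_power = [1.0] * 10
toy_power[5] = 50.0
p, snr = peak_power_and_snr(toy_power, peak_bin=5, low_bin=3, high_bin=7)
```

For the two-frequency buzzer case, the same helper would be called once per frequency point to obtain (P1, Snr1) and (P2, Snr2).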
[0035] In the present case, the power P1 at the frequency point F1 and the power P2 at the frequency point F2 are calculated. Similarly, the average noise power P1_Noise over the frequency range Frange1 and the average noise power P2_Noise over the frequency range Frange2 are both calculated. Snr1, which relates to F1, is calculated according to equation (4):
Snr1 = P1 / P1_Noise (4)
Snr2, which relates to F2, is calculated according to equation (5):
Snr2 = P2 / P2_Noise (5)
[0036] At S312, the determining module 110 determines whether the trigger signal is included in the acquired surrounding sound. If the SNR related to the frequency point F is greater than the SNR threshold SNR_Th, and the power P at the frequency point F is greater than the power threshold N_Th, the determining module 110 determines that the trigger signal is detected in the surrounding sound. Otherwise, the determining module 110 determines that the trigger signal is not included in the surrounding sound.
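The decision rule of S312 reduces to two threshold comparisons per frequency point. The sketch below shows it for a single point and, as an assumption, extends it to the two-point buzzer case by requiring every stored frequency point to pass; the disclosure does not state how the results for F1 and F2 are combined. Function names and threshold values are hypothetical.

```python
from typing import List, Tuple


def trigger_detected(p: float, snr: float, snr_th: float, n_th: float) -> bool:
    """S312 for one frequency point: Snr > SNR_Th and P > N_Th."""
    return snr > snr_th and p > n_th


def all_points_detected(
    measurements: List[Tuple[float, float]], snr_th: float, n_th: float
) -> bool:
    """Assumed combination rule: every (P, Snr) pair must pass both tests."""
    return bool(measurements) and all(
        trigger_detected(p, snr, snr_th, n_th) for p, snr in measurements
    )


# Hypothetical thresholds and measurements for F1 and F2.
SNR_TH = 10.0
N_TH = 20.0
both_present = all_points_detected([(50.0, 50.0), (40.0, 30.0)], SNR_TH, N_TH)
one_missing = all_points_detected([(50.0, 50.0), (40.0, 5.0)], SNR_TH, N_TH)
```

Requiring both conditions guards against two failure modes: a loud broadband noise raises P but not Snr, while a faint tone in a silent room raises Snr but not P.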
[0037] Moreover, besides the FFT, the processing module 106 may apply a cepstrum transform to handle more complex surrounding sound.
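The disclosure does not specify which cepstrum variant is meant. As a point of reference only, the sketch below computes the commonly used real cepstrum, that is, the inverse DFT of the logarithm of the magnitude spectrum, again with a naive DFT and the standard library. The test signal (an impulse followed by a half-amplitude echo eight samples later) is an assumption chosen because its cepstrum shows a clear peak at the echo delay.

```python
import cmath
import math
from typing import List


def real_cepstrum(samples: List[float]) -> List[float]:
    """Real cepstrum: inverse DFT of log|DFT(samples)| (naive O(N^2))."""
    n = len(samples)
    log_magnitude = []
    for k in range(n):
        acc = sum(
            x * cmath.exp(-2j * math.pi * k * i / n)
            for i, x in enumerate(samples)
        )
        # A small offset avoids log(0) for bins with no energy.
        log_magnitude.append(math.log(abs(acc) + 1e-12))
    cepstrum = []
    for q in range(n):
        acc = sum(
            v * cmath.exp(2j * math.pi * k * q / n)
            for k, v in enumerate(log_magnitude)
        )
        cepstrum.append(acc.real / n)
    return cepstrum


# Impulse plus an echo of half the amplitude, delayed by 8 samples.
signal = [0.0] * 64
signal[0] = 1.0
signal[8] = 0.5
cep = real_cepstrum(signal)
echo_delay = max(range(1, 33), key=lambda q: cep[q])
```

Periodic structure in the spectrum shows up as a peak in the cepstrum, which is why it can help separate a repetitive trigger from more complex background sound.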
[0038] FIG. 4 illustrates an example of the system of FIG. 1, in accordance with an embodiment of the present disclosure. As shown in FIG. 4, the acquisition module 104, the processing module 106, the storage module 108, the determining module 110 and the controlling module 112 are implemented in the form of software (e.g., an app) in a terminal 402.
[0039] FIG. 5 and FIG. 6 illustrate examples of a system for controlling photography in a terminal that further includes an extendable stick, in accordance with embodiments of the present disclosure. The extendable stick 502 is configured for mounting the terminal 402. In one embodiment, the extendable stick 502 and the sound generator 102 are two separate elements, as shown in FIG. 5. Alternatively, the sound generator 102 can be integrated in the extendable stick 502, as shown in FIG. 6.
[0040] The embodiments of the present disclosure provide a method for controlling photography in a terminal through acoustic signals. After the characteristic information of the acoustic signals generated by a common product, such as a buzzer, has been learned, the camera of the terminal can be controlled by that product through acoustic signals. Advantageously, because control is achieved through acoustic signals, no Bluetooth or Wi-Fi connection is needed, which avoids the shortcomings of such wireless connections. Moreover, the acoustic signals can be generated by products through electronic or mechanical means, which require less power than a Bluetooth controller.
[0041] While the foregoing description and drawings represent embodiments of the present disclosure, it will be understood that various additions, modifications and substitutions may be made therein without departing from the spirit and scope of the principles of the present disclosure as defined in the accompanying claims. One skilled in the art will appreciate that the disclosure may be used with many modifications of form, structure, arrangement, proportions, materials, elements, and components and otherwise, used in the practice of the disclosure, which are particularly adapted to specific environments and operative requirements without departing from the principles of the present disclosure. The presently disclosed embodiments are therefore to be considered in all respects as illustrative and not restrictive, the scope of the disclosure being indicated by the appended claims and their legal equivalents, and not limited to the foregoing description.

Claims

WE CLAIM:
1. A system for controlling photography through acoustic signal, comprising:
a sound generator, for generating a trigger signal according to user's operation;
an acquisition module, coupled with the sound generator and configured for obtaining a surrounding sound which comprises the trigger signal and/or an environmental noise;
a storage module, configured for storing characteristic information of the trigger signal and processing-related parameters;
a processing module, coupled with the acquisition module and the storage module, and configured for processing the surrounding sound according to the processing-related parameters and to output the processed surrounding sound;
a determining module, coupled with the processing module and the storage module, and configured for determining whether the trigger signal is comprised in the surrounding sound according to the processed surrounding sound and the characteristic information of the trigger signal; and
a controlling module, coupled with the determining module, and configured for controlling the terminal to take a picture according to the determining result,
wherein the trigger signal has a constant frequency.
2. The system of claim 1, wherein the sound generator is a buzzer whose frequency related to power peak is a fixed value.
3. The system of claim 1, wherein the sound generator is a training clicker.
4. The system of claim 1, wherein the characteristic information of the trigger signal comprises a frequency point which relates to power peak of the trigger signal.
5. The system of claim 1, wherein the processing module is configured for identifying a frequency point which relates to power peak of the trigger signal, and for calculating power and SNR related to the frequency point.
6. The system of claim 5, wherein the determining module is configured for determining whether the power of the frequency point is greater than a power threshold and whether the SNR related to the frequency point is greater than a SNR threshold.
7. An electronic device for controlling photography through acoustic signal, comprising:
an acquisition module, coupled with a sound generator and configured for obtaining a surrounding sound comprising a trigger signal and/or an environmental noise;
a storage module configured to store characteristic information of the trigger signal and processing-related parameters;
a processing module, coupled with the acquisition module and the storage module and configured for processing the surrounding sound according to the processing-related parameters and for outputting the processed surrounding sound;
a determining module, coupled with the processing module and the storage module, and configured for determining whether the trigger signal is comprised in the surrounding sound according to the processed surrounding sound and the characteristic information of the trigger signal; and
a controlling module, coupled with the determining module, and configured for controlling the terminal to take a picture according to the determining result,
wherein the trigger signal has a constant frequency.
8. The system of claim 7, wherein the sound generator is a buzzer whose frequency related to power peak is a fixed value.
9. The system of claim 7, wherein the sound generator is a training clicker.
10. The system of claim 7, wherein the characteristic information of the trigger signal comprises a frequency point which relates to power peak of the trigger signal.
11. The system of claim 7, wherein the processing module is configured for identifying a frequency point which relates to power peak of the trigger signal, and for calculating power and SNR related to the frequency point.
12. The system of claim 11, wherein the determining module is configured for determining whether the power of the frequency point is greater than a power threshold and whether the SNR related to the frequency point is greater than a SNR threshold.
13. A method for controlling photography of a terminal, comprising:
setting processing-related parameters;
generating a trigger signal in a quiet environment by a sound generator;
identifying characteristic information of the trigger signal by processing the trigger signal;
obtaining surrounding sound by an acquisition module;
processing the obtained surrounding sound by a processing module;
determining whether the trigger signal is comprised in the surrounding sound; and
controlling the terminal to take a picture by a control module if the trigger signal is detected in the surrounding sound, wherein the trigger signal has a constant frequency.
14. The method of claim 13, wherein the sound generator is a buzzer whose frequency related to power peak is a fixed value.
15. The method of claim 13, wherein the sound generator is a training clicker.
16. The method of claim 13, wherein the characteristic information of the trigger signal comprises a frequency point which relates to power peak of the trigger signal.
17. The method of claim 16, wherein the step that processing on obtained surrounding sound by a processing module comprises:
calculating power and SNR related to the frequency point.
18. The method of claim 17, wherein the step that determining whether the trigger signal is comprised in the surrounding sound comprises:
determining whether the power of the frequency point is greater than a power threshold and whether the SNR related to the frequency point is greater than a SNR threshold.
PCT/US2014/060291 2014-10-13 2014-10-13 System and method for controlling photography through acoustic signals WO2016060642A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
PCT/US2014/060291 WO2016060642A1 (en) 2014-10-13 2014-10-13 System and method for controlling photography through acoustic signals
KR1020147037153A KR20170070864A (en) 2014-10-13 2014-10-13 System and method for controlling photography through acoustic signals

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/US2014/060291 WO2016060642A1 (en) 2014-10-13 2014-10-13 System and method for controlling photography through acoustic signals

Publications (1)

Publication Number Publication Date
WO2016060642A1 true WO2016060642A1 (en) 2016-04-21

Family

ID=55747038

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2014/060291 WO2016060642A1 (en) 2014-10-13 2014-10-13 System and method for controlling photography through acoustic signals

Country Status (2)

Country Link
KR (1) KR20170070864A (en)
WO (1) WO2016060642A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR102009593B1 (en) 2018-01-22 2019-08-09 숭실대학교산학협력단 Personalized Voice Recognition Device Using Speaker Analysis and Voice Recognition Method thereof, Recording Medium for Performing the Method

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4389109A (en) * 1979-12-31 1983-06-21 Minolta Camera Co., Ltd. Camera with a voice command responsive system
US5027149A (en) * 1988-01-28 1991-06-25 Konica Corporation Voice-recognition camera
US5162832A (en) * 1989-06-20 1992-11-10 Konica Corporation Automatic release camera
US5959667A (en) * 1996-05-09 1999-09-28 Vtel Corporation Voice activated camera preset selection system and method of operation
US7525575B2 (en) * 2003-09-12 2009-04-28 Canon Europa N.V. Voice activated image capture device and method
US20100066485A1 (en) * 2008-09-15 2010-03-18 Sony Ericsson Mobile Communications Ab Remote control of mobile phone camera

Also Published As

Publication number Publication date
KR20170070864A (en) 2017-06-23

Legal Events

Date Code Title Description
ENP Entry into the national phase

Ref document number: 20147037153

Country of ref document: KR

Kind code of ref document: A

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 14904027

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 28/07/2017)

122 Ep: pct application non-entry in european phase

Ref document number: 14904027

Country of ref document: EP

Kind code of ref document: A1