Patent search cpc:"G10L25/87", page 6

51.

Patent application
METHOD AND APPARATUS FOR SOCIAL NETWORK COMMUNICATION OVER A MEDIA NETWORK (in force)

Publication No.: US20140337015A1

Publication date: 2014-11-13

Application No.: US14339536

Filing date: 2014-07-24

Applicant: AT&T Intellectual Property I, LP

Inventors: Hisao Chang, David Mornhineway

IPC classification: H04L12/58

CPC classification: H04L51/32, G06Q50/01, G10L15/00, G10L15/22, G10L15/26, G10L19/00, G10L25/87, H04L12/1818, H04L51/046, H04L67/141, H04L67/306, H04M3/53366, H04M2201/60, H04M2203/655

Abstract: A system that incorporates teachings of the present disclosure may initiate a communication session with a member device of a social network and may activate a speech capture element based on a pattern of prior speech messages. A speech message may be detected at the speech capture element and, in turn, transmitted to the member device.

52.

Granted patent
Method and apparatus for melody recognition (in force)

Publication No.: US08742243B2

Publication date: 2014-06-03

Application No.: US13160750

Filing date: 2011-06-15

Applicants: Wen-Nan Wang, Jyh-Shing Jang, Tzu-chun Yeh, Chung-Che Wang, Hsin-Wen Yu, Cheng-Yu Hsu

Inventors: Wen-Nan Wang, Jyh-Shing Jang, Tzu-chun Yeh, Chung-Che Wang, Hsin-Wen Yu, Cheng-Yu Hsu

IPC classification: A63H5/00, G04B13/00, G10H7/00

CPC classification: G06F17/30743, G10H1/0008, G10H2210/051, G10H2210/066, G10H2210/076, G10H2240/141, G10L25/54, G10L25/87, G10L2025/906

Abstract: The present disclosure provides a method for recognition. The method includes inputting a melody and obtaining pitch tracking information of the melody; obtaining beat information of the melody; determining a clarity value according to the pitch tracking information; implementing a first comparison process first to filter a first set of candidate songs from a database and then implementing a second comparison process to filter a second set of candidate songs from the first set of candidate songs if the clarity value is larger than a predetermined threshold; and determining at least one final candidate song from the second set of candidate songs.

53.

Patent application
AUTOMATED DETECTION AND FILTERING OF AUDIO ADVERTISEMENTS (in force)

Publication No.: US20130268103A1

Publication date: 2013-10-10

Application No.: US13867264

Filing date: 2013-04-22

Applicant: AT&T Intellectual Property I, L.P.

Inventors: Yeon-Jun KIM, I. Dan MELAMED, Steven Neil TISCHER, Bernard S. RENGER

IPC classification: G06F17/00

CPC classification: G06F17/30761, G06F17/00, G10L15/083, G10L25/48, G10L25/78, G10L25/87, G10L25/90

Abstract: Methods, apparatuses, and media for filtering a data stream are provided. The data stream is partitioned into a plurality of data stream segments. An acoustic parameter of each of the data stream segments is measured, and it is determined whether the acoustic parameter of each of the data stream segments satisfies a predetermined condition. Extraneous segments of the data stream segments are identified in which the predetermined condition is satisfied, and it is determined whether the extraneous segments have a predetermined relationship in the data stream. The extraneous segments are deleted from the data stream to produce a filtered data stream in response to the extraneous segments having the predetermined relationship.
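The filtering steps in this abstract can be illustrated with a minimal Python sketch. The abstract does not say which acoustic parameter, condition, or relationship is used, so everything concrete here is an assumption made for illustration: the parameter is RMS level, the condition is "RMS above `threshold`", and the relationship is "at least `min_run` flagged segments in a row". The names `rms` and `filter_stream` are hypothetical.

```python
from typing import List, Sequence


def rms(segment: Sequence[float]) -> float:
    """Root-mean-square level of one segment (assumed acoustic parameter)."""
    return (sum(x * x for x in segment) / len(segment)) ** 0.5


def filter_stream(samples: Sequence[float], seg_len: int,
                  threshold: float, min_run: int) -> List[float]:
    """Delete runs of flagged segments that are at least min_run long."""
    # Partition the stream into fixed-length segments.
    segments = [samples[i:i + seg_len] for i in range(0, len(samples), seg_len)]
    # Assumed condition: a segment is "extraneous" if its RMS exceeds threshold.
    flagged = [rms(s) > threshold for s in segments]

    keep = [True] * len(segments)
    i = 0
    while i < len(segments):
        if not flagged[i]:
            i += 1
            continue
        j = i
        while j < len(segments) and flagged[j]:
            j += 1
        # Assumed relationship: the flagged segments are contiguous
        # and the run is long enough.
        if j - i >= min_run:
            for k in range(i, j):
                keep[k] = False
        i = j

    return [x for seg, kept in zip(segments, keep) if kept for x in seg]


stream = [0.1] * 4 + [0.9] * 4 + [0.1] * 2 + [0.9] * 2 + [0.1] * 2
filtered = filter_stream(stream, seg_len=2, threshold=0.5, min_run=2)
```

In this toy input the four loud samples form a run of two flagged segments and are removed, while the isolated loud segment is kept because it does not satisfy the assumed relationship.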

54.

Patent application
Method and System For Endpoint Automatic Detection of Audio Record (in force)

Publication No.: US20130197911A1

Publication date: 2013-08-01

Application No.: US13878818

Filing date: 2010-10-29

Applicants: Si Wei, Guoping Hu, Yu Hu, Qingfeng Liu

Inventors: Si Wei, Guoping Hu, Yu Hu, Qingfeng Liu

IPC classification: G10L15/26

CPC classification: G10L15/265, G10L15/083, G10L25/87

Abstract: A method and system for endpoint automatic detection of audio record is provided. The method comprises the following steps: acquiring an audio record text and determining the text endpoint acoustic model for the audio record text; starting acquiring the audio record data of each frame in turn from the audio record start frame in the audio record data; determining the characteristics acoustic model of the decoding optimal path for the acquired current frame of the audio record data; comparing the characteristics acoustic model of the decoding optimal path acquired from the current frame of the audio record data with the endpoint acoustic model to determine if they are the same; if yes, updating a mute duration threshold with a second time threshold, wherein the second time threshold is less than a first time threshold. This method can improve the recognizing efficiency of the audio record endpoint.
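The threshold update described in this abstract might look like the Python sketch below. It rests on several assumptions: decoding is not implemented, so each frame simply carries a hypothetical `decoded_model` label standing in for the acoustic model on the optimal decoding path; thresholds are counted in frames; and the rule "stop once the silent run reaches the mute duration threshold" is an inferred use of that threshold that the abstract does not spell out.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass
class Frame:
    decoded_model: str  # hypothetical label for the model on the best decoding path
    is_silent: bool     # hypothetical per-frame silence decision


def find_endpoint(frames: Iterable[Frame], endpoint_model: str,
                  first_threshold: int, second_threshold: int) -> Optional[int]:
    """Return the index of the frame where recording would stop, or None."""
    if second_threshold >= first_threshold:
        raise ValueError("second_threshold must be less than first_threshold")

    mute_threshold = first_threshold
    silent_run = 0
    for index, frame in enumerate(frames):
        # Once the decoder reaches the text's endpoint model, the speaker has
        # probably finished the text, so a shorter silence is enough.
        if frame.decoded_model == endpoint_model:
            mute_threshold = second_threshold
        silent_run = silent_run + 1 if frame.is_silent else 0
        if silent_run >= mute_threshold:
            return index
    return None


speech = [Frame("a", False)] * 3
silence = [Frame("sil", True)] * 5
finished = speech + [Frame("end", False)] + silence
unfinished = speech + [Frame("a", False)] + silence

early = find_endpoint(finished, "end", first_threshold=5, second_threshold=2)
late = find_endpoint(unfinished, "end", first_threshold=5, second_threshold=2)
```

With the endpoint model reached, the sketch stops after two silent frames; without it, the full first threshold of five silent frames is needed.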

55.

Granted patent
System for detecting speech with background voice estimates and noise estimates (in force)

Publication No.: US08457961B2

Publication date: 2013-06-04

Application No.: US13566603

Filing date: 2012-08-03

Applicants: Phillip Alan Hetherington, Mark Ryan Fallat

Inventors: Phillip Alan Hetherington, Mark Ryan Fallat

IPC classification: G10L15/20, G10L15/04

CPC classification: G10L25/87

Abstract: A system detects a speech segment that may include unvoiced, fully voiced, or mixed voice content. The system includes a window function that passes signals within a programmed aural frequency range while substantially blocking signals above and below the programmed aural frequency range. A frequency converter converts the signals passing within the programmed aural frequency range into a plurality of frequency bins. A background voice detector estimates the strength of a background speech segment relative to the noise of selected portions of the aural spectrum. A noise estimator estimates a maximum distribution of noise to an average of an acoustic noise power of some of the plurality of frequency bins. A voice detector compares the strength of a desired speech segment to a maximum of an output of the background voice detector and an output of the noise estimator.

56.

Patent application
REGION OF INTEREST IDENTIFICATION DEVICE, REGION OF INTEREST IDENTIFICATION METHOD, REGION OF INTEREST IDENTIFICATION PROGRAM, AND REGION OF INTEREST IDENTIFICATION INTEGRATED CIRCUIT (in force)

Publication No.: US20130108244A1

Publication date: 2013-05-02

Application No.: US13809480

Filing date: 2012-04-24

Applicants: Tomohiro Konuma, Ryouichi Kawanishi, Tomoyuki Karibe, Tsutomu Uenoyama

Inventors: Tomohiro Konuma, Ryouichi Kawanishi, Tomoyuki Karibe, Tsutomu Uenoyama

IPC classification: G11B27/031

CPC classification: G11B27/031, G10L25/87, G11B27/28

Abstract: An interesting section identifying device for identifying an interesting section of a video file based on an audio signal included in the video file, the interesting section being a section in which a user is estimated to express interest, includes an interesting section candidate extracting unit that extracts an interesting section candidate from the video file, the interesting section candidate being a candidate for the interesting section, a detailed structure determining unit that determines whether the interesting section candidate includes a specific detailed structure, and an interesting section identifying unit that identifies the interesting section by analyzing a specific section when the detailed structure determining unit determines that the interesting section candidate includes the detailed structure, the specific section including the detailed structure and being shorter than the interesting section candidate.

57.

Patent application
Speech Recognition Method of and System for Determining the Status of an Answered Telephone During the Course of an Outbound Telephone Call (in force)

Publication No.: US20130108031A1

Publication date: 2013-05-02

Application No.: US13717082

Filing date: 2012-12-17

Applicant: Eliza Corporation

Inventors: Lucas Merrow, Alexandra Drane, John Kroeker, Oleg Boulanov, Nasreen R. Quibria, Michael R. Robinson

IPC classification: H04M1/64

CPC classification: H04M3/5158, G10L15/26, G10L17/06, G10L25/51, G10L25/87, H04M1/271, H04M1/64, H04M1/82, H04M3/46, H04M3/487, H04M2201/40, H04M2203/2016, H04M2203/2027

Abstract: A system for determining the status of an answered telephone during the course of an outbound telephone call includes an automated telephone calling device for placing a telephone call to a location having a telephone number at which a target person is listed, upon the telephone call being answered, initiating a prerecorded greeting which asks for the target person and receiving a spoken response from an answering person and a speech recognition device for performing a speech recognition analysis on the spoken response to determine a status of the spoken response. If the speech recognition device determines that the answering person is the target person, the speech recognition device initiates a speech recognition application with the target person.

58.

Granted patent
Structure and method for echo reduction without loss of information (in force)

Publication No.: US08422663B1

Publication date: 2013-04-16

Application No.: US13168762

Filing date: 2011-06-24

Applicant: James H. Parry

Inventor: James H. Parry

IPC classification: H04M9/08, H04M1/00, H04B1/38, H04B3/20, H04L5/16

CPC classification: H04M9/082, G10L25/87, H04M1/6016

Abstract: An echo reduction method stores a received audio information stream. A sound detection flag is activated following detection of locally generated sound. Output based on the received audio information stream is muted in response to the activating the sound detection flag. Rendering status of the received audio information stream is saved, in response to the activating the sound detection flag, to reduce loss of audio information. At least a portion of the stored received audio information stream is rendered following inactivation of the sound detection flag.
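The store, mute, and resume behavior in this abstract can be sketched in Python as below. This is an illustrative model only: the class name `EchoReducer`, the chunk-per-step interface, and representing the saved rendering status as a single buffer index are all assumptions, and local sound detection is supplied by the caller instead of being computed.

```python
from typing import List, Optional


class EchoReducer:
    """Buffers received audio and pauses playback while local sound is active."""

    def __init__(self) -> None:
        self.stored: List[str] = []  # every received chunk, in arrival order
        self.render_pos = 0          # saved rendering status: next chunk to play

    def step(self, received: Optional[str],
             local_sound_detected: bool) -> Optional[str]:
        """Process one time step and return the chunk to render, if any."""
        if received is not None:
            self.stored.append(received)
        if local_sound_detected:
            # Flag active: mute output. render_pos is left untouched, so the
            # buffered audio is not lost.
            return None
        if self.render_pos < len(self.stored):
            chunk = self.stored[self.render_pos]
            self.render_pos += 1
            return chunk
        return None


reducer = EchoReducer()
rendered = [
    reducer.step("a", False),
    reducer.step("b", True),   # local talker starts: output muted
    reducer.step("c", True),
    reducer.step(None, False),  # flag cleared: playback resumes at "b"
    reducer.step(None, False),
    reducer.step(None, False),
]
```

Chunks "b" and "c" arrive while the flag is active and are rendered afterwards, which is the "without loss of information" property the title refers to.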

59.

Patent application
AUDIO ANALYSIS APPARATUS AND AUDIO ANALYSIS SYSTEM (in force)

Publication No.: US20130080170A1

Publication date: 2013-03-28

Application No.: US13412214

Filing date: 2012-03-05

Applicants: Haruo HARADA, Hirohito YONEYAMA, Kei SHIMOTANI, Yohei NISHINO, Kiyoshi IIDA, Takao NAITO

Inventors: Haruo HARADA, Hirohito YONEYAMA, Kei SHIMOTANI, Yohei NISHINO, Kiyoshi IIDA, Takao NAITO

IPC classification: G10L17/00

CPC classification: G10L17/00, G10L21/0208, G10L25/51, G10L25/87

Abstract: An audio analysis apparatus includes the following components. A main body includes a discrimination unit and a transmission unit. A strap is used for hanging the main body from a user's neck. A first audio acquisition device is provided to the strap or the main body. A second audio acquisition device is provided to the strap at a position where a distance between the second audio acquisition device and the user's mouth is smaller than the distance between the first audio acquisition device and the user's mouth in a state where the strap is worn around the user's neck. The discrimination unit discriminates whether an acquired sound is an uttered voice of the user or of another person by comparing audio signals of the sound acquired by the first and second audio acquisition devices. The transmission unit transmits information including the discrimination result to an external apparatus.
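One way to picture the two-microphone comparison is the Python sketch below. The abstract only says the audio signals are compared; using a mean-power ratio, the default `ratio_threshold` of 2.0, and the function names are assumptions for illustration. The idea is that the wearer's voice is noticeably stronger at the microphone nearer the mouth, while a distant talker reaches both microphones at similar levels.

```python
from typing import Sequence


def mean_power(samples: Sequence[float]) -> float:
    """Average squared amplitude of a block of samples."""
    return sum(x * x for x in samples) / len(samples)


def classify_speaker(far_mic: Sequence[float], near_mic: Sequence[float],
                     ratio_threshold: float = 2.0) -> str:
    """Return "wearer", "other", or "unknown" for one block of audio.

    far_mic is the first audio acquisition device and near_mic is the second
    one (closer to the mouth). ratio_threshold is an assumed tuning value.
    """
    p_far = mean_power(far_mic)
    p_near = mean_power(near_mic)
    if p_far == 0.0 and p_near == 0.0:
        return "unknown"  # silence on both microphones
    if p_far == 0.0:
        return "wearer"   # sound only at the near microphone
    return "wearer" if p_near / p_far >= ratio_threshold else "other"


own_voice = classify_speaker([0.1, -0.1], [0.4, -0.4])
other_voice = classify_speaker([0.3, -0.3], [0.3, -0.3])
```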

60.

Patent application
SYSTEM FOR DETECTING SPEECH WITH BACKGROUND VOICE ESTIMATES AND NOISE ESTIMATES (in force)

Publication No.: US20120303366A1

Publication date: 2012-11-29

Application No.: US13566603

Filing date: 2012-08-03

Applicants: Phillip Alan Hetherington, Mark Ryan Fallat

Inventors: Phillip Alan Hetherington, Mark Ryan Fallat

IPC classification: G10L15/20

CPC classification: G10L25/87

Abstract: A system detects a speech segment that may include unvoiced, fully voiced, or mixed voice content. The system includes a window function that passes signals within a programmed aural frequency range while substantially blocking signals above and below the programmed aural frequency range. A frequency converter converts the signals passing within the programmed aural frequency range into a plurality of frequency bins. A background voice detector estimates the strength of a background speech segment relative to the noise of selected portions of the aural spectrum. A noise estimator estimates a maximum distribution of noise to an average of an acoustic noise power of some of the plurality of frequency bins. A voice detector compares the strength of a desired speech segment to a maximum of an output of the background voice detector and an output of the noise estimator.
