Patent search: ap:("Shay Ben-David" OR "Ron Hoory" OR "Alexey Roytman" OR "Zohar Sivan" OR "James Jude Sliwa") AND inv:"Ron Hoory", page 1

1.

Granted patent
Distributed off-line voice services (legal status: in force)

Publication No.: US08451823B2

Publication date: 2013-05-28

Application No.: US11301434

Filing date: 2005-12-13

Applicants: Shay Ben-David, Ron Hoory, Alexey Roytman, Zohar Sivan, James Jude Sliwa

Inventors: Shay Ben-David, Ron Hoory, Alexey Roytman, Zohar Sivan, James Jude Sliwa

IPC classification: H04L12/66, H04M1/64, G10L15/00

CPC classification: H04L29/06027, G10L15/30, H04L65/103, H04L65/104, H04L65/1069, H04L65/608, H04M3/4938, H04M2201/39, H04M2201/40, H04M2201/60

Abstract: A voice processing system includes a real-time voice server, which is arranged to process real-time voice processing tasks for clients of the system. A gateway processor is arranged to accept from a client a request to perform an off-line voice processing task, to convert the off-line voice processing task into an equivalent real-time voice processing task, to invoke the voice server to process the equivalent real-time voice processing task, and to output a result of the equivalent real-time voice processing task.


2.

Patent application
Distributed off-line voice services (legal status: in force)

Publication No.: US20070133518A1

Publication date: 2007-06-14

Application No.: US11301434

Filing date: 2005-12-13

Applicants: Shay Ben-David, Ron Hoory, Alexey Roytman, Zohar Sivan, James Sliwa

Inventors: Shay Ben-David, Ron Hoory, Alexey Roytman, Zohar Sivan, James Sliwa

IPC classification: H04L12/66

CPC classification: H04L29/06027, G10L15/30, H04L65/103, H04L65/104, H04L65/1069, H04L65/608, H04M3/4938, H04M2201/39, H04M2201/40, H04M2201/60

Abstract: A voice processing system includes a real-time voice server, which is arranged to process real-time voice processing tasks for clients of the system. A gateway processor is arranged to accept from a client a request to perform an off-line voice processing task, to convert the off-line voice processing task into an equivalent real-time voice processing task, to invoke the voice server to process the equivalent real-time voice processing task, and to output a result of the equivalent real-time voice processing task.


3.

Patent application
Dictionary lookup for mobile devices using spelling recognition (legal status: under examination, published)

Publication No.: US20070016420A1

Publication date: 2007-01-18

Application No.: US11176154

Filing date: 2005-07-07

Applicants: Ophir Azulai, Ron Hoory, Zohar Sivan

Inventors: Ophir Azulai, Ron Hoory, Zohar Sivan

IPC classification: G10L15/04, G10L15/00

CPC classification: G10L15/19

Abstract: A method for querying an electronic dictionary using letters of an alphabet enunciated by a user includes accepting a speech input from the user. The speech input includes a sequence of spelled letters enunciated by the user that spell a query word. The speech input is analyzed to determine one or more sequences of the letters that approximate the sequence of spelled letters. The one or more sequences of the letters are post-processed so as to produce a plurality of recognized words approximating the query word. The electronic dictionary is queried with the plurality of recognized words so as to retrieve a respective plurality of dictionary entries. A list of results including the plurality of recognized words and the respective plurality of dictionary entries is presented to the user.


4.

Granted patent
Voice transformation with encoded information (legal status: in force)

Publication No.: US08930182B2

Publication date: 2015-01-06

Application No.: US13049924

Filing date: 2011-03-17

Applicants: Shay Ben-David, Ron Hoory, Zvi Kons, David Nahamoo

Inventors: Shay Ben-David, Ron Hoory, Zvi Kons, David Nahamoo

IPC classification: G10L21/00, G10L25/90, G10L25/93, G10L21/003, G10L19/018

CPC classification: G10L21/003, G10L19/018

Abstract: Method, system, and computer program product for voice transformation are provided. The method includes transforming a source speech using transformation parameters, and encoding information on the transformation parameters in an output speech using steganography, wherein the source speech can be reconstructed using the output speech and the information on the transformation parameters. A method for reconstructing voice transformation is also provided including: receiving an output speech of a voice transformation system wherein the output speech is transformed speech which has encoded information on the transformation parameters using steganography; extracting the information on the transformation parameters; and carrying out an inverse transformation of the output speech to obtain an approximation of an original source speech.


5.

Patent application
SPEECH OUTPUT WITH CONFIDENCE INDICATION (legal status: under examination, published)

Publication No.: US20110313762A1

Publication date: 2011-12-22

Application No.: US12819203

Filing date: 2010-06-20

Applicants: Shay Ben-David, Ron Hoory

Inventors: Shay Ben-David, Ron Hoory

IPC classification: G10L13/08, G10L21/00, G10L15/00

CPC classification: G10L13/08

Abstract: A method, system, and computer program product are provided for speech output with confidence indication. The method includes receiving a confidence score for segments of speech or text to be synthesized to speech. The method includes modifying a speech segment by altering one or more parameters of the speech proportionally to the confidence score.
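
Illustrative note: the idea in this abstract can be sketched in a few lines of Python. The abstract does not say which speech parameter is altered or how the confidence score is scaled, so everything below is an assumption made for illustration: the parameter is taken to be amplitude, the score is taken to lie in [0, 1], and the names `apply_confidence` and `min_gain` are hypothetical. This is not the method claimed in the application.

```python
def apply_confidence(samples, confidence, min_gain=0.3):
    """Scale a segment's amplitude linearly with its confidence score.

    Assumptions (not from the abstract): confidence lies in [0, 1], the
    altered parameter is amplitude, and min_gain is the gain applied to a
    segment with zero confidence so that it stays audible.
    """
    if not 0.0 <= confidence <= 1.0:
        raise ValueError("confidence must lie in [0, 1]")
    # Linear interpolation between min_gain (confidence 0) and 1.0 (confidence 1).
    gain = confidence + min_gain * (1.0 - confidence)
    return [sample * gain for sample in samples]


segment = [0.5, -0.25, 1.0]
confident = apply_confidence(segment, confidence=1.0)  # left unchanged
uncertain = apply_confidence(segment, confidence=0.0)  # attenuated to min_gain
```

A listener would then hear low-confidence segments as quieter; pitch or speaking rate could be substituted for amplitude under the same linear mapping.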


6.

Patent application
VOICE TRANSFORMATION WITH ENCODED INFORMATION (legal status: in force)

Publication No.: US20120239387A1

Publication date: 2012-09-20

Application No.: US13049924

Filing date: 2011-03-17

Applicants: Shay Ben-David, Ron Hoory, Zvi Kons, David Nahamoo

Inventors: Shay Ben-David, Ron Hoory, Zvi Kons, David Nahamoo

IPC classification: G10L19/02

CPC classification: G10L21/003, G10L19/018

Abstract: Method, system, and computer program product for voice transformation are provided. The method includes transforming a source speech using transformation parameters, and encoding information on the transformation parameters in an output speech using steganography, wherein the source speech can be reconstructed using the output speech and the information on the transformation parameters. A method for reconstructing voice transformation is also provided including: receiving an output speech of a voice transformation system wherein the output speech is transformed speech which has encoded information on the transformation parameters using steganography; extracting the information on the transformation parameters; and carrying out an inverse transformation of the output speech to obtain an approximation of an original source speech.


7.

Granted patent
Accuracy improvement of spoken queries transcription using co-occurrence information (legal status: in force)

Publication No.: US08650031B1

Publication date: 2014-02-11

Application No.: US13194972

Filing date: 2011-07-31

Applicants: Jonathan Mamou, Abhinav Sethy, Bhuvana Ramabhadran, Ron Hoory, Paul Joseph Vozila, Nathan Bodenstab

Inventors: Jonathan Mamou, Abhinav Sethy, Bhuvana Ramabhadran, Ron Hoory, Paul Joseph Vozila, Nathan Bodenstab

IPC classification: G10L15/00, G10L15/26, G06F17/27, G10L21/00, G10L25/00, G10L21/06, G06F17/28, G10L13/00, G10L13/06, G10L19/12, G06F7/00, G06F17/30

CPC classification: G10L15/08, G06F7/00, G06F17/30, G10L15/1815, G10L15/265

Abstract: Techniques disclosed herein include systems and methods for voice-enabled searching. Techniques include a co-occurrence based approach to improve accuracy of the 1-best hypothesis for non-phrase voice queries, as well as for phrased voice queries. A co-occurrence model is used in addition to a statistical natural language model and acoustic model to recognize spoken queries, such as spoken queries for searching a search engine. Given an utterance and an associated list of automated speech recognition n-best hypotheses, the system rescores the different hypotheses using co-occurrence information. For each hypothesis, the system estimates a frequency of co-occurrence within web documents. Combined scores from a speech recognizer and a co-occurrence engine can be combined to select a best hypothesis with a lower word error rate.
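
Illustrative note: the n-best rescoring described in this abstract can be sketched roughly as follows. The abstract gives no formulas, so the details here are assumptions made for illustration: co-occurrence is approximated as the add-one-smoothed fraction of documents that contain every word of a hypothesis, the two scores are combined as a weighted sum in the log domain, and the names `Hypothesis`, `cooccurrence_log_freq`, `rescore`, and `weight` are hypothetical. A tiny in-memory list stands in for the web documents.

```python
from dataclasses import dataclass
from math import log


@dataclass
class Hypothesis:
    text: str
    asr_score: float  # recognizer score in the log domain; higher is better


def cooccurrence_log_freq(text, documents):
    """Log of the add-one-smoothed fraction of documents containing every word."""
    words = set(text.lower().split())
    hits = sum(1 for doc in documents if words <= set(doc.lower().split()))
    return log((hits + 1) / (len(documents) + 1))


def rescore(hypotheses, documents, weight=1.0):
    """Return the hypothesis with the highest combined score."""
    return max(
        hypotheses,
        key=lambda h: h.asr_score + weight * cooccurrence_log_freq(h.text, documents),
    )


# Toy stand-in for a web document collection.
documents = [
    "cheap flights to boston",
    "boston red sox tickets",
    "flights from boston to new york",
]
n_best = [
    Hypothesis("flights to bostin", asr_score=-1.0),  # recognizer's 1-best
    Hypothesis("flights to boston", asr_score=-1.2),
]
best = rescore(n_best, documents)
```

In this toy example the recognizer alone prefers the misspelled hypothesis, while the combined score selects "flights to boston" because its words co-occur in two of the three documents.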


8.

Patent application
VOCAL SOURCE EXTRACTION BY MAXIMUM PHASE DETECTION (legal status: in force)

Publication No.: US20130325455A1

Publication date: 2013-12-05

Application No.: US13487275

Filing date: 2012-06-04

Applicants: Aharon Satt, Zvi Kons, Ron Hoory

Inventors: Aharon Satt, Zvi Kons, Ron Hoory

IPC classification: G10L11/04

CPC classification: G10L25/75, G10L25/03, G10L25/45

Abstract: Methods, apparatus and computer program products implement embodiments of the present invention that include receiving a time domain voice signal, and extracting a single pitch cycle from the received signal. The extracted single pitch cycle is transformed to a frequency domain, and the misclassified roots of the frequency domain are identified and corrected. Using the corrected roots, an indication of a maximum phase of the frequency domain is generated.


9.

Granted patent
Feature-domain concatenative speech synthesis (legal status: in force)

Publication No.: US07035791B2

Publication date: 2006-04-25

Application No.: US09901031

Filing date: 2001-07-10

Applicants: Dan Chazan, Ron Hoory

Inventors: Dan Chazan, Ron Hoory

IPC classification: G10L11/04

CPC classification: G10L13/07, G10L25/18

Abstract: A method for speech synthesis includes receiving an input speech signal containing a set of speech segments, and estimating spectral envelopes of the input speech signal in a succession of time intervals during each of the speech segments. The spectral envelopes are integrated over a plurality of window functions in a frequency domain so as to determine elements of feature vectors corresponding to the speech segments. An output speech signal is reconstructed by concatenating the feature vectors corresponding to a sequence of the speech segments.


10.

Granted patent
Fast frequency-domain pitch estimation (legal status: in force)

Publication No.: US06587816B1

Publication date: 2003-07-01

Application No.: US09617582

Filing date: 2000-07-14

Applicants: Dan Chazan, Meir Zibulski, Ron Hoory

Inventors: Dan Chazan, Meir Zibulski, Ron Hoory

IPC classification: G10L11/04

CPC classification: G10L25/90

Abstract: A method for estimating a pitch frequency of an audio signal includes computing a first transform of the signal to a frequency domain over a first time interval, and computing a second transform of the signal to the frequency domain over a second time interval, which contains the first time interval. A line spectrum of the signal is found, based on the first and second transforms, the spectrum including spectral lines having respective line amplitudes and line frequencies. A utility function that is periodic in the frequencies of the lines in the spectrum is then computed. This function is indicative, for each candidate pitch frequency in a given pitch frequency range, of a compatibility of the spectrum with the candidate pitch frequency. The pitch frequency of the speech signal is estimated responsive to the utility function.

