Patent search: ap:("Werner Kriechbaum" OR "Gerhard Stenzel") AND inv:"Werner Kriechbaum" (page 1)

1.

Granted patent
Method and system for the automatic amendment of speech recognition vocabularies (lapsed)

Publication number: US06975985B2

Publication date: 2005-12-13

Application number: US09994396

Filing date: 2001-11-26

Applicants: Werner Kriechbaum, Gerhard Stenzel

Inventors: Werner Kriechbaum, Gerhard Stenzel

IPC classification: G10L15/06, G10L15/187, G10L15/00

CPC classification: G10L15/187, G10L15/06

Abstract: The present invention provides a method and system to improve speech recognition using an existing audio realization of a spoken text and a true textual representation of the spoken text. The audio realization and the true textual representation can be aligned to reveal time stamps. A speech recognition can be performed on the audio realization to provide a hypothesis textual representation for the audio realization. The aligned true textual representation can be compared with the hypothesis textual representation. Single word pairs from the true and the hypothesis textual representations can be selected where the representations are different. Similarly, single word pairs can be selected from each representation where the representations are identical. A word or pronunciation database can be updated using the selected single word pairs together with the corresponding aligned audio realization.
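The comparison step in this abstract can be illustrated with a minimal sketch. It assumes the true transcript has already been aligned to the audio as `(word, start_s, end_s)` tuples and that the recognizer output is a plain word list; the function name `select_word_pairs` and the use of Python's `difflib` are illustrative choices, not the method claimed in the patent.

```python
from difflib import SequenceMatcher


def select_word_pairs(true_aligned, hypothesis):
    """Pair words of an aligned true transcript with recognizer output.

    true_aligned: list of (word, start_s, end_s) tuples (assumed input format).
    hypothesis:   list of recognized words.
    Returns (differing, identical); each entry is
    (true_word, hypothesis_word, start_s, end_s).
    """
    true_words = [word for word, _, _ in true_aligned]
    matcher = SequenceMatcher(a=true_words, b=hypothesis, autojunk=False)
    differing, identical = [], []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            # Words on which both transcripts agree.
            for k in range(i2 - i1):
                word, start, end = true_aligned[i1 + k]
                identical.append((word, hypothesis[j1 + k], start, end))
        elif tag == "replace" and i2 - i1 == 1 and j2 - j1 == 1:
            # Exactly one true word was recognized as one different word.
            word, start, end = true_aligned[i1]
            differing.append((word, hypothesis[j1], start, end))
    return differing, identical


true_aligned = [("the", 0.0, 0.2), ("quick", 0.2, 0.5), ("fox", 0.5, 0.9)]
hypothesis = ["the", "quit", "fox"]
differing, identical = select_word_pairs(true_aligned, hypothesis)
```

Each differing pair carries the time span of the true word, so the matching audio slice could be used to update a pronunciation entry. Multi-word substitutions, insertions and deletions are skipped here because the abstract speaks only of single word pairs.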


2.

Granted patent
Method and apparatus for the automatic separating and indexing of multi-speaker conversations (in force)

Publication number: US07496510B2

Publication date: 2009-02-24

Application number: US09997957

Filing date: 2001-11-30

Applicants: Joachim Frank, Werner Kriechbaum, Gerhard Stenzel

Inventors: Joachim Frank, Werner Kriechbaum, Gerhard Stenzel

IPC classification: G10L17/00

CPC classification: G10L21/028, G10L17/00

Abstract: Disclosed are a method and apparatus for processing a continuous audio stream containing human speech in order to locate a particular speech-based transaction in the audio stream, applying both known speaker recognition and speech recognition techniques. Only the utterances of a particular predetermined speaker are transcribed thus providing an index and a summary of the underlying dialogue(s). In a first scenario, an incoming audio stream, e.g. a speech call from outside, is scanned in order to detect audio segments of the predetermined speaker. These audio segments are then indexed and only the indexed segments are transcribed into spoken or written language. In a second scenario, two or more speakers located in one room are using a multi-user speech recognition system (SRS). For each user there exists a different speaker model and optionally a different dictionary or vocabulary of words already known or trained by the speech or voice recognition system.


3.

Granted patent
Method and system for the automatic segmentation of an audio stream into semantic or syntactic units (in force)

Publication number: US07120575B2

Publication date: 2006-10-10

Application number: US09920983

Filing date: 2001-08-02

Applicants: Martin Haase, Werner Kriechbaum, Gerhard Stenzel

Inventors: Martin Haase, Werner Kriechbaum, Gerhard Stenzel

IPC classification: G10L11/04

CPC classification: G10L25/87, G10L15/1807

Abstract: A digitized speech signal (600) is input to an F0 (fundamental frequency) processor that computes (610) a continuous F0 data from the speech signal. By the criterion voicing state transition (voiced/unvoiced transitions) the speech signal is presegmented (620) into segments. For each segment (630) it is evaluated (640) whether F0 is defined or not defined i.e. whether F0 is ON or OFF. In case of F0=OFF a candidate segment boundary is assumed as described above and, starting from that boundary, prosodic features are computed (650). The feature values are input into a classification tree and each candidate segment is classified thereby revealing, as a result, the existence or non-existence of a semantic or syntactic speech unit.
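The presegmentation step can be sketched as follows, assuming the F0 track arrives as one value per frame with `None` marking frames where F0 is undefined (F0=OFF). The 10 ms frame length, the function names and the single "pause length" feature are assumptions for illustration; the classification tree mentioned in the abstract is not reproduced.

```python
def presegment(f0):
    """Split a frame-wise F0 track at voiced/unvoiced transitions.

    f0: list with a float per voiced frame and None per unvoiced frame.
    Returns a list of (start_frame, end_frame, is_voiced), end exclusive.
    """
    segments = []
    start = 0
    for i in range(1, len(f0) + 1):
        # Close the current run at the end of the track or when the
        # voicing state changes.
        if i == len(f0) or (f0[i] is None) != (f0[start] is None):
            segments.append((start, i, f0[start] is not None))
            start = i
    return segments


def candidate_boundaries(f0, frame_s=0.01):
    """Return one candidate boundary per unvoiced (F0=OFF) segment."""
    candidates = []
    for start, end, voiced in presegment(f0):
        if not voiced:
            candidates.append({
                "start_frame": start,
                "n_frames": end - start,       # simple stand-in prosodic feature
                "time_s": start * frame_s,     # frame_s is an assumed frame length
            })
    return candidates


f0_track = [120.0, 118.0, None, None, None, 110.0]
segments = presegment(f0_track)
boundaries = candidate_boundaries(f0_track)
```

In a fuller pipeline, each candidate's feature values would be passed to a trained classifier that accepts or rejects it as the edge of a semantic or syntactic unit.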


4.

Patent application
METHOD AND APPARATUS FOR LINKING REPRESENTATION AND REALIZATION DATA (in force)

Publication number: US20080228490A1

Publication date: 2008-09-18

Application number: US12126507

Filing date: 2008-05-23

Applicants: Uwe Fischer, Stefan Hoffmann, Werner Kriechbaum, Gerhard Stenzel

Inventors: Uwe Fischer, Stefan Hoffmann, Werner Kriechbaum, Gerhard Stenzel

IPC classification: G10L21/06

CPC classification: G06F17/30056, G06F17/30746, G06F17/30855, Y10S707/912

Abstract: A method and apparatus for creating links between a representation, (e.g. text data,) and a realization, (e.g. corresponding audio data,) is provided. According to the invention the realization is structured by combining a time-stamped version of the representation generated from the realization with structural information from the representation. Thereby so called hyper links between representation and realization are created. These hyper links are used for performing search operations in realization data equivalent to those which are possible in representation data, enabling an improved access to the realization (e.g. via audio databases).
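One way to picture the linking idea is the sketch below. It assumes the time-stamped representation is a list of `(word, start_s, end_s)` tuples and that the structural information maps each unit (for example a paragraph) to an inclusive range of word indices; both formats and the name `build_links` are hypothetical and not taken from the patent.

```python
def build_links(timed_words, structure):
    """Map each structural unit of the text to a time span in the audio.

    timed_words: list of (word, start_s, end_s) tuples (assumed format).
    structure:   dict of unit_id -> (first_word_index, last_word_index),
                 both indices inclusive (assumed format).
    Returns a dict of unit_id -> (start_s, end_s).
    """
    links = {}
    for unit_id, (first, last) in structure.items():
        # A unit starts where its first word starts and ends where its
        # last word ends.
        links[unit_id] = (timed_words[first][1], timed_words[last][2])
    return links


timed_words = [
    ("hello", 0.0, 0.4),
    ("world", 0.4, 0.9),
    ("good", 1.5, 1.8),
    ("bye", 1.8, 2.2),
]
structure = {"para1": (0, 1), "para2": (2, 3)}
links = build_links(timed_words, structure)
```

With such a table, a text search that lands in `para2` can be answered by seeking the audio to 1.5 s, which is the kind of access to the realization the abstract describes.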


5.

Granted patent
Method and apparatus for linking representation and realization data (in force)

Publication number: US07412643B1

Publication date: 2008-08-12

Application number: US09447871

Filing date: 1999-11-23

Applicants: Uwe Fischer, Stefan Hoffmann, Werner Kriechbaum, Gerhard Stenzel

Inventors: Uwe Fischer, Stefan Hoffmann, Werner Kriechbaum, Gerhard Stenzel

IPC classification: G06F17/00, G06F3/00, G10L15/00

CPC classification: G06F17/30056, G06F17/30746, G06F17/30855, Y10S707/912

Abstract: A method and apparatus for creating links between a representation, (e.g. text data,) and a realization, (e.g. corresponding audio data,) is provided. According to the invention the realization is structured by combining a time-stamped version of the representation generated from the realization with structural information from the representation. Thereby so called hyper links between representation and realization are created. These hyper links are used for performing search operations in realization data equivalent to those which are possible in representation data, enabling an improved access to the realization (e.g. via audio databases).


6.

Granted patent
Method and system for generating a characteristic identifier for digital data and for detecting identical digital data (in force)

Publication number: US06799158B2

Publication date: 2004-09-28

Application number: US09739130

Filing date: 2000-12-18

Applicants: Uwe Fischer, Stefan Hoffmann, Werner Kriechbaum, Gerhard Stenzel

Inventors: Uwe Fischer, Stefan Hoffmann, Werner Kriechbaum, Gerhard Stenzel

IPC classification: G10L19/14

CPC classification: G11B20/00123, G10L17/26, G11B20/00086, G11B20/10

Abstract: A characteristic identifier for digital data is generated. Thereby, the information contained in a digital data set is reduced such that the resulting identifier is made comparable to another identifier made in the same manner. The generated identifiers are used for detecting identical digital data or to determine inexact copies of digital data. In one embodiment of the invention, the digital data is a digital audio signal and the characteristic identifier is called an audio signature. The comparison of identical audio data according to the invention can be carried out without a person actually listening to the audio data. The present invention can be used to establish automated processes to find potential unauthorized copies of audio data, e.g., music recordings, and therefore enables a better enforcement of copyrights in the audio industry.
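The abstract does not say how the audio signature is computed, so the sketch below swaps in a deliberately simple stand-in: a normalized energy profile over a fixed number of blocks, compared by summed absolute difference. The functions `signature` and `distance` and the block count are assumptions chosen only to show how a reduced identifier can expose inexact copies (here, a quieter copy of the same signal).

```python
def signature(samples, n_blocks=8):
    """Reduce a sample sequence to a short, volume-independent profile.

    Stand-in technique (not the patented one): sum of absolute amplitudes
    per block, normalized so the profile sums to 1.
    """
    block = max(1, len(samples) // n_blocks)
    energies = [
        sum(abs(x) for x in samples[i * block:(i + 1) * block])
        for i in range(n_blocks)
    ]
    total = sum(energies) or 1.0  # avoid dividing by zero on silence
    return [e / total for e in energies]


def distance(sig_a, sig_b):
    """Summed absolute difference between two signatures of equal length."""
    return sum(abs(a - b) for a, b in zip(sig_a, sig_b))


original = [float(i % 7) for i in range(64)]
quieter_copy = [0.5 * x for x in original]      # same content, lower volume
unrelated = [float(i) for i in range(64)]       # different content

d_copy = distance(signature(original), signature(quieter_copy))
d_other = distance(signature(original), signature(unrelated))
```

A small distance flags a likely copy without anyone listening to the audio; the threshold separating "copy" from "different" would have to be tuned on real data.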


7.

Granted patent
Method and apparatus for linking representation and realization data (in force)

Publication number: US07954044B2

Publication date: 2011-05-31

Application number: US12126507

Filing date: 2008-05-23

Applicants: Uwe Fischer, Stefan Hoffmann, Werner Kriechbaum, Gerhard Stenzel

Inventors: Uwe Fischer, Stefan Hoffmann, Werner Kriechbaum, Gerhard Stenzel

IPC classification: G06F17/30, G06F3/00, G01L15/00

CPC classification: G06F17/30056, G06F17/30746, G06F17/30855, Y10S707/912

Abstract: A method and apparatus for creating links between a representation, (e.g. text data,) and a realization, (e.g. corresponding audio data,) is provided. According to the invention the realization is structured by combining a time-stamped version of the representation generated from the realization with structural information from the representation. Thereby so called hyper links between representation and realization are created. These hyper links are used for performing search operations in realization data equivalent to those which are possible in representation data, enabling an improved access to the realization (e.g. via audio databases).


8.

Granted patent
Method and system for the automatic generation of multi-lingual synchronized sub-titles for audiovisual data (lapsed)

Publication number: US07117231B2

Publication date: 2006-10-03

Application number: US09994544

Filing date: 2001-11-27

Applicants: Uwe Fischer, Stefan Hoffmann, Werner Kriechbaum, Gerhard Stenzel

Inventors: Uwe Fischer, Stefan Hoffmann, Werner Kriechbaum, Gerhard Stenzel

IPC classification: G06F17/30

CPC classification: H04N21/235, H04N21/435, Y10S707/99942, Y10S707/99943, Y10S707/99944, Y10S707/99945, Y10S707/99954

Abstract: The present invention provides a method and system for computerized synchronization of an audio stream having a first synchronized textual representation usable for subtitling of the audio stream in the original language, with a second synchronized textual representation which can be used as an alternative subtitle including a transcription of the original language into another language. Time synchronous links can be built between the audio stream and, for instance, the textual representations of the words spoken in the audio stream. More particularly, the second representation can inherit the synchronization between the audio stream and the first representation using structure association information determined between the first and the second representation.

9.

Granted patent
Method of establishing a communication channel to intelligent support for ebusiness applications (in force)

Publication number: US07003090B2

Publication date: 2006-02-21

Application number: US10138435

Filing date: 2002-05-03

Applicants: Werner Kriechbaum, Ronald Pfeifer, Gerhard Stenzel

Inventors: Werner Kriechbaum, Ronald Pfeifer, Gerhard Stenzel

IPC classification: G06F15/16, H04L12/28, H04L12/66, H04M3/523

CPC classification: H04M7/0036, H04M3/493, H04M3/51, Y10S379/90

Abstract: The present invention relates to method and system for providing online information in a networked user environment in which an end-user runs an application program and transmits data to an online server while running the application program. It is proposed to provide a request-button at the end-user application program dedicated to requesting information, and in particular help-information. When a help request is received at the communication server, a communication channel is promptly established between end-user and an agent. Information about the user activities sent in one or more transaction parts of an end-user intended business process and performed in the current application program session is read from the storage in the application server and is provided to the terminal of said agent in the help center. Advantageously, the same communication channel as used for performing the transactions is used for voice transmission for providing help or other information to the end-user.

10.

Granted patent
Method of generating a link between a note of a digital score and a realization of the score (lapsed)

Publication number: US06768046B2

Publication date: 2004-07-27

Application number: US10295058

Filing date: 2002-11-14

Applicants: Werner Kriechbaum, Gerhard Stenzel

Inventors: Werner Kriechbaum, Gerhard Stenzel

IPC classification: A63H5/00

CPC classification: G10H1/0008, G10H2220/015, G10H2240/056

Abstract: A system and method of generating a link between a note of a digital score and a realization of the score are provided. To do so, a digital score is processed to generate an onset curve. The onset curve is then filtered to generate a first series of first time intervals, which each have a significant number of onsets. A realization of the digital score is also processed to generate a second series of second time intervals, which each have a significant dynamic change of the realization. The first and the second series of time intervals are then correlated to produce the link.
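A toy version of the correlation step is sketched below. It assumes both curves are already sampled on the same time grid, that "significant" can be approximated by a fixed threshold, and that the recording differs from the score only by a constant delay; the names `significant_bins` and `best_lag` are hypothetical, and the patent's actual filtering and correlation may differ.

```python
def significant_bins(curve, threshold):
    """Mark the time bins whose value reaches the threshold (1) or not (0)."""
    return [1 if value >= threshold else 0 for value in curve]


def best_lag(score_bins, audio_bins, max_lag):
    """Return the delay (in bins) of the audio that best matches the score.

    Tries lags 0..max_lag and picks the one with the largest overlap
    between marked score bins and marked audio bins.
    """
    def overlap(lag):
        return sum(
            s * audio_bins[i + lag]
            for i, s in enumerate(score_bins)
            if i + lag < len(audio_bins)
        )

    return max(range(max_lag + 1), key=overlap)


# Onsets per bin derived from the score, and dynamic change per bin
# measured in the recording (which starts two bins late in this example).
onset_curve = [3, 0, 1, 2, 0, 4]
dynamics_curve = [0.1, 0.0, 0.9, 0.2, 0.1, 0.8, 0.0, 0.7]

score_bins = significant_bins(onset_curve, threshold=2)
audio_bins = significant_bins(dynamics_curve, threshold=0.5)
lag = best_lag(score_bins, audio_bins, max_lag=3)
```

Once the lag is known, a note in score bin `i` can be linked to bin `i + lag` of the recording. Real performances vary in tempo, so a constant lag is only the simplest case.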

