Patent search ap:"ERIC COSATTO" Page 5

41.

发明申请
SYSTEM AND METHOD FOR GENERATING CODED VIDEO SEQUENCES FROM STILL MEDIA 有权
Title translation: 用于从静态媒体生成编码视频序列的系统和方法

公开(公告)号：US20120033743A1

公开(公告)日：2012-02-09

申请号：US13205321

申请日：2011-08-08

Applicant: Andrea Basso , Eric Cosatto , Steven Lloyd Greenspan , David M. Weimer

Inventor： Andrea Basso , Eric Cosatto , Steven Lloyd Greenspan , David M. Weimer

IPC: H04N7/26 , G06K9/46

CPC classification number: H04N21/23412 , H04N21/8153

Abstract: The invention provides a system and method that transforms a set of still/motion media (i.e., a series of related or unrelated still frames, web-pages rendered as images, or video clips) or other multimedia, into a video stream that is suitable for delivery over a display medium, such as TV, cable TV, computer displays, wireless display devices, etc. The video data stream may be presented and displayed in real time or stored and later presented through a set-top box, for example. Because these media are transformed into coded video streams (e.g. MPEG-2, MPEG-4, etc.), a user can watch them on a display screen without the need to connect to the Internet through a service provider. The user may request and interact with the desired media through a simple telephone interface, for example. Moreover, several wireless and cable-based services can be developed on the top of this system. In one possible embodiment, the system for generating a coded video sequence may include an input unit that receives the multimedia input and extracts image data, and derives the virtual camera scripts and coding hints from the image data, a video sequence generator that generates a video sequence based on the extracted image data and the derived virtual camera scripts and coding hints, and a video encoder that encodes the generated video sequence using the coding hints and outputs the coded video sequence to an output device. The system may also provide customized video sequence generation services to subscribers.

Abstract translation: 本发明提供一种系统和方法，其将一组静止/运动媒体（即，一系列相关或不相关的静止帧，呈现为图像的网页或视频剪辑）或其他多媒体转换为适合的视频流用于通过诸如电视，有线电视，计算机显示器，无线显示设备等的显示介质的传送。例如，可以实时地呈现和显示视频数据流，或者通过机顶盒存储和显示视频数据流。由于这些媒体被转换为编码视频流（例如，MPEG-2，MPEG-4等），用户可以在显示屏上观看它们，而不需要通过服务提供商连接到因特网。例如，用户可以通过简单的电话接口来请求和与期望的媒体交互。此外，可以在该系统的顶部开发几种基于无线和有线的服务。在一个可能的实施例中，用于生成编码视频序列的系统可以包括接收多媒体输入并提取图像数据并从图像数据导出虚拟相机脚本和编码提示的输入单元，产生视频的视频序列生成器基于提取的图像数据和导出的虚拟相机脚本和编码提示的序列，以及使用编码提示对所生成的视频序列进行编码并将编码的视频序列输出到输出装置的视频编码器。该系统还可以向订户提供定制的视频序列生成服务。

42.

发明授权
Coarticulation method for audio-visual text-to-speech synthesis 有权
Title translation: 音视频文本到语音合成的协方法

公开(公告)号：US08078466B2

公开(公告)日：2011-12-13

申请号：US12627373

申请日：2009-11-30

Applicant: Eric Cosatto , Hans Peter Graf , Juergen Schroeter

Inventor： Eric Cosatto , Hans Peter Graf , Juergen Schroeter

IPC: G10L13/00

CPC classification number: G10L13/00 , G10L2021/105

Abstract: A method for generating animated sequences of talking heads in text-to-speech applications wherein a processor samples a plurality of frames comprising image samples. The processor reads first data comprising one or more parameters associated with noise-producing orifice images of sequences of at least three concatenated phonemes which correspond to an input stimulus. The processor reads, based on the first data, second data comprising images of a noise-producing entity. The processor generates an animated sequence of the noise-producing entity.

Abstract translation: 一种在文本到语音应用中产生通话头的动画序列的方法，其中处理器对包括图像样本的多个帧进行采样。处理器读取包括与对应于输入刺激的至少三个级联音素的序列的产生噪声的孔图像相关联的一个或多个参数的第一数据。处理器基于第一数据读取包括噪声产生实体的图像的第二数据。处理器产生产生噪声的实体的动画序列。

43.

发明申请
Coarticulation Method for Audio-Visual Text-to-Speech Synthesis 有权
Title translation: 视听文本到语音合成的协方法

公开(公告)号：US20100076762A1

公开(公告)日：2010-03-25

申请号：US12627373

申请日：2009-11-30

Applicant: Eric Cosatto , Hans Peter Graf , Juergen Schroeter

Inventor： Eric Cosatto , Hans Peter Graf , Juergen Schroeter

IPC: G10L15/26 , G10L13/00 , G10L13/08

CPC classification number: G10L13/00 , G10L2021/105

Abstract: A method for generating animated sequences of talking heads in text-to-speech applications wherein a processor samples a plurality of frames comprising image samples. The processor reads first data comprising one or more parameters associated with noise-producing orifice images of sequences of at least three concatenated phonemes which correspond to an input stimulus. The processor reads, based on the first data. second data comprising images of a noise-producing entity. The processor generates an animated sequence of the noise-producing entity.

Abstract translation: 一种在文本到语音应用中产生通话头的动画序列的方法，其中处理器对包括图像样本的多个帧进行采样。处理器读取包括与对应于输入刺激的至少三个级联音素的序列的产生噪声的孔图像相关联的一个或多个参数的第一数据。处理器基于第一个数据读取。第二数据包括产生噪声的实体的图像。处理器产生产生噪声的实体的动画序列。

44.

发明申请
System for Low-Latency Animation of Talking Heads 有权
Title translation: 谈话头的低延迟动画系统

公开(公告)号：US20100076750A1

公开(公告)日：2010-03-25

申请号：US12627573

申请日：2009-11-30

Applicant: Eric Cosatto , Hans Peter Graf , Joern Ostermann

Inventor： Eric Cosatto , Hans Peter Graf , Joern Ostermann

IPC: G06F17/27

CPC classification number: G06F17/30905

Abstract: Methods and apparatus for rendering a talking head on a client device are disclosed. The client device has a client cache capable of storing audio/visual data associated with rendering the talking head. The method comprises storing sentences in a client cache of a client device that relate to bridging delays in a dialog, storing sentence templates to be used in dialogs, generating a talking head response to a user inquiry from the client device, and determining whether sentences or stored templates stored in the client cache relate to the talking head response. If the stored sentences or stored templates relate to the talking head response, the method comprises instructing the client device to use the appropriate stored sentence or template from the client cache to render at least a part of the talking head response and transmitting a portion of the talking head response not stored in the client cache, if any, to the client device to render a complete talking head response. If the client cache has no stored data associated with the talking head response, the method comprises transmitting the talking head response to be rendered on the client device.

Abstract translation: 公开了一种用于在客户端设备上呈现通话头的方法和设备。客户端设备具有能够存储与呈现话音头相关联的音频/视频数据的客户端高速缓存。该方法包括将客户端设备的客户端缓存中的句子存储在与对话中的桥接延迟相关联，存储要在对话中使用的语句模板，从客户端设备生成对用户的询问头响应，以及确定句子或存储在客户端缓存中的存储模板涉及到通话头响应。如果存储的句子或存储的模板与谈话头响应相关，则该方法包括指示客户端设备使用来自客户端高速缓存的适当存储的句子或模板来呈现至少一部分通话头响应并且传送一部分说话头响应没有存储在客户端缓存中（如果有的话）给客户端设备呈现完整的通话头响应。如果客户端缓存没有与通话头响应相关联的存储数据，则该方法包括传送要在客户端设备上呈现的通话头响应。

45.

发明申请
Automated Method and System for Nuclear Analysis of Biopsy Images 有权
Title translation: 活检图像核分析的自动化方法和系统

公开(公告)号：US20090297007A1

公开(公告)日：2009-12-03

申请号：US12131346

申请日：2008-06-02

Applicant: Eric Cosatto , Hans-Peter Graf , Matthew L. Miller

Inventor： Eric Cosatto , Hans-Peter Graf , Matthew L. Miller

IPC: A61B5/00

CPC classification number: G06T7/0012 , G06K9/00147 , G06T2207/10056 , G06T2207/20081 , G06T2207/20116 , G06T2207/30024

Abstract: An automated method and system for analyzing a digital image of a biopsy to determine whether the biopsy is normal or abnormal, i.e., exhibits some type of disease such as, but not limited to, cancer. In the method and system, a classifier is trained to recognize well formed nuclei outlines from imperfect nuclei outlines in digital biopsy images. The trained classifier may then be used to filter nuclei outlines from one or more digital biopsy images to be analyzed, to obtain the well formed nuclei outlines. The well formed nuclei outlines may then be used to obtain statistics on the size or area of the nuclei for use in determining whether the biopsy is normal or abnormal.

Abstract translation: 一种用于分析活组织检查的数字图像以确定活组织检查是正常还是异常的自动化方法和系统，即呈现某种类型的疾病，例如但不限于癌症。在该方法和系统中，训练分类器以识别数字活检图像中不完美核子轮廓的良好形成的细胞核轮廓。然后训练的分类器可用于从要分析的一个或多个数字活检图像中过滤细胞核轮廓，以获得良好形成的细胞核轮廓。然后可以使用良好形成的核轮廓来获得关于核的大小或面积的统计数据，以用于确定活检是否正常或异常。

46.

发明授权
Digitally-generated lighting for video conferencing applications 有权
Title translation: 用于视频会议应用的数字生成照明

公开(公告)号：US07231099B1

公开(公告)日：2007-06-12

申请号：US11216997

申请日：2005-08-31

Applicant: Andrea Basso , Eric Cosatto , David Crawford Gibbon , Hans Peter Graf , Shan Liu

Inventor： Andrea Basso , Eric Cosatto , David Crawford Gibbon , Hans Peter Graf , Shan Liu

IPC: G06K9/40 , H04N7/14

CPC classification number: G06K9/00281 , G06F3/012 , G06K9/00228 , G06K9/2027 , G06T7/246 , G06T15/50 , G06T17/00 , G06T2200/08 , G06T2207/10152 , G06T2207/30201 , H04N7/147 , H04N7/15

Abstract: A method of improving the lighting conditions of a real scene or video sequence. Digitally generated light is added to a scene for video conferencing over telecommunication networks. A virtual illumination equation takes into account light attenuation, lambertian and specular reflection. An image of an object is captured, a virtual light source illuminates the object within the image. In addition, the object can be the head of the user. The position of the head of the user is dynamically tracked so that an three-dimensional model is generated which is representative of the head of the user. Synthetic light is applied to a position on the model to form an illuminated model.

Abstract translation: 一种改善真实场景或视频序列的照明条件的方法。数字生成的光被添加到通过电信网络进行视频会议的场景中。虚拟照明方程考虑了光衰减，朗伯和镜面反射。捕获对象的图像，虚拟光源照亮图像内的对象。另外，对象可以是用户的头。动态跟踪用户头部的位置，从而生成代表用户头部的三维模型。将合成光应用于模型上的位置以形成照明模型。

47.

发明授权
Coarticulation method for audio-visual text-to-speech synthesis 有权

公开(公告)号：US07117155B2

公开(公告)日：2006-10-03

申请号：US10676630

申请日：2003-10-01

Applicant: Eric Cosatto , Hans Peter Graf , Juergen Schroeter

Inventor： Eric Cosatto , Hans Peter Graf , Juergen Schroeter

IPC: G10L19/00

CPC classification number: G10L13/00 , G10L2021/105

Abstract: A method for generating animated sequences of talking heads in text-to-speech applications wherein a processor samples a plurality of frames comprising image samples. Representative parameters are extracted from the image samples and stored in an animation library. The processor also samples a plurality of multiphones comprising images together with their associated sounds. The processor extracts parameters from these images comprising data characterizing mouth shapes, maps, rules, or equations, and stores the resulting parameters and sound information in a coarticulation library. The animated sequence begins with the processor considering an input phoneme sequence, recalling from the coarticulation library parameters associated with that sequence, and selecting appropriate image samples from the animation library based on that sequence. The image samples are concatenated together, and the corresponding sound is output, to form the animated synthesis.

48.

发明申请
System and method of providing conversational visual prosody for talking heads 有权

公开(公告)号：US20060074689A1

公开(公告)日：2006-04-06

申请号：US11237561

申请日：2005-09-28

Applicant: Eric Cosatto , Hans Graf , Thomas Isaacson , Volker Storm

Inventor： Eric Cosatto , Hans Graf , Thomas Isaacson , Volker Storm

IPC: G10L21/00

CPC classification number: G06T13/40 , G10L21/06

Abstract: A system and method of controlling the movement of a virtual agent while the agent is listening to a human user during a conversation is disclosed. The method comprises receiving speech data from the user, performing a prosodic analysis of the speech data and controlling the virtual agent movement according to the prosodic analysis.

49.

发明授权
Method for sending multi-media messages using emoticons 失效
Title translation: 使用表情发送多媒体消息的方法

公开(公告)号：US06990452B1

公开(公告)日：2006-01-24

申请号：US10003350

申请日：2001-11-02

Applicant: Joern Ostermann , Mehmet Reha Civanlar , Eric Cosatto , Hans Peter Graf , Yann Andre LeCun

Inventor： Joern Ostermann , Mehmet Reha Civanlar , Eric Cosatto , Hans Peter Graf , Yann Andre LeCun

IPC: G10L13/08 , G10L21/00 , G06T13/00

CPC classification number: G06Q10/107 , G06F17/241 , G10L13/00

Abstract: A system and method of providing sender-customization of multi-media messages through the use of emoticons is disclosed. The sender inserts the emoticons into a text message. As an animated face audibly delivers the text, emoticons associated with the message are started a predetermined period of time or number of words prior to the position of the emoticon in the message text and completed a predetermined length of time or number of words following the location of the emoticon. The sender may insert emoticons through the use of emoticon buttons that are icons available for choosing. Upon sender selections of an emoticon, an icon representing the emoticon is inserted into the text at the position of the cursor. Once an emoticon is chosen, the sender may also choose the amplitude for the emoticon and increased or decreased amplitude will be displayed in the icon inserted into the message text.

Abstract translation: 公开了通过使用表情符号来提供多媒体消息的发送者定制的系统和方法。发件人将表情符号插入文本消息。作为动画面部听觉地传送文本，与消息相关联的表情符号在消息文本中表情符号的位置之前的预定时间段或字数开始，并且完成预定时间长度或位置之后的字数的表情符号。发件人可以通过使用可用于选择的图标的表情符号按钮来插入表情符号。在表情符号的发送者选择之后，表示表情符号的图标被插入到光标位置的文本中。一旦选择了表情符号，发送者也可以选择表情符号的幅度，增加或减小的幅度将显示在插入到消息文本中的图标中。

50.

发明授权
System and method for generating coded video sequences from still media 有权
Title translation: 用于从静止媒体生成编码视频序列的系统和方法

公开(公告)号：US08955031B2

公开(公告)日：2015-02-10

申请号：US13205321

申请日：2011-08-08

Applicant: Andrea Basso , Eric Cosatto , Steven Lloyd Greenspan , David M. Weimer

Inventor： Andrea Basso , Eric Cosatto , Steven Lloyd Greenspan , David M. Weimer

IPC: H04N7/10 , H04N7/16 , H04N21/234 , H04N21/81

CPC classification number: H04N21/23412 , H04N21/8153

Abstract: The invention provides a system and method that transforms a set of still/motion media (i.e., a series of related or unrelated still frames, web-pages rendered as images, or video clips) or other multimedia, into a video stream that is suitable for delivery over a display medium, such as TV, cable TV, computer displays, wireless display devices, etc. The video data stream may be presented and displayed in real time or stored and later presented through a set-top box, for example. Because these media are transformed into coded video streams (e.g. MPEG-2, MPEG-4, etc.), a user can watch them on a display screen without the need to connect to the Internet through a service provider. The user may request and interact with the desired media through a simple telephone interface, for example. Moreover, several wireless and cable-based services can be developed on the top of this system. In one possible embodiment, the system for generating a coded video sequence may include an input unit that receives the multimedia input and extracts image data, and derives the virtual camera scripts and coding hints from the image data, a video sequence generator that generates a video sequence based on the extracted image data and the derived virtual camera scripts and coding hints, and a video encoder that encodes the generated video sequence using the coding hints and outputs the coded video sequence to an output device. The system may also provide customized video sequence generation services to subscribers.

Abstract translation: 本发明提供一种系统和方法，其将一组静止/运动媒体（即，一系列相关或不相关的静止帧，呈现为图像的网页或视频剪辑）或其他多媒体转换为适合的视频流用于通过诸如电视，有线电视，计算机显示器，无线显示设备等的显示介质的传送。例如，可以实时地呈现和显示视频数据流，或者通过机顶盒存储和显示视频数据流。由于这些媒体被转换为编码视频流（例如，MPEG-2，MPEG-4等），用户可以在显示屏上观看它们，而不需要通过服务提供商连接到因特网。例如，用户可以通过简单的电话接口来请求和与期望的媒体交互。此外，可以在该系统的顶部开发几种基于无线和有线的服务。在一个可能的实施例中，用于生成编码视频序列的系统可以包括接收多媒体输入并提取图像数据并从图像数据导出虚拟相机脚本和编码提示的输入单元，产生视频的视频序列生成器基于提取的图像数据和导出的虚拟相机脚本和编码提示的序列，以及使用编码提示对所生成的视频序列进行编码并将编码的视频序列输出到输出装置的视频编码器。该系统还可以向订户提供定制的视频序列生成服务。

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification