-
公开(公告)号:US20170287486A1
公开(公告)日:2017-10-05
申请号:US15625685
申请日:2017-06-16
Applicant: Google Inc.
Inventor: Jay Pierre Civelli , Mikhal Shemer , Turaj Zakizadeh Shabestary , David Tapuska
CPC classification number: G10L15/30 , G10L15/02 , G10L15/22 , G10L15/32 , G10L2015/088 , G10L2015/223
Abstract: Provided are methods, systems, and apparatuses for detecting, processing, and responding to audio signals, including speech signals, within a designated area or space. A platform for multiple media devices connected via a network is configured to process speech, such as voice commands, detected at the media devices, and respond to the detected speech by causing the media devices to simultaneously perform one or more requested actions. The platform is capable of scoring the quality of a speech request, handling speech requests from multiple end points of the platform using a centralized processing approach, a de-centralized processing approach, or a combination thereof, and also manipulating partial processing of speech requests from multiple end points into a coherent whole when necessary.
-
公开(公告)号:US09779735B2
公开(公告)日:2017-10-03
申请号:US15052426
申请日:2016-02-24
Applicant: GOOGLE INC.
Inventor: Jay Pierre Civelli , Mikhal Shemer , Turaj Zakizadeh Shabestary , David Tapuska
CPC classification number: G10L15/30 , G10L15/02 , G10L15/22 , G10L15/32 , G10L2015/088 , G10L2015/223
Abstract: Provided are methods, systems, and apparatuses for detecting, processing, and responding to audio signals, including speech signals, within a designated area or space. A platform for multiple media devices connected via a network is configured to process speech, such as voice commands, detected at the media devices, and respond to the detected speech by causing the media devices to simultaneously perform one or more requested actions. The platform is capable of scoring the quality of a speech request, handling speech requests from multiple end points of the platform using a centralized processing approach, a de-centralized processing approach, or a combination thereof, and also manipulating partial processing of speech requests from multiple end points into a coherent whole when necessary.
-
公开(公告)号:US10516718B2
公开(公告)日:2019-12-24
申请号:US14735489
申请日:2015-06-10
Applicant: GOOGLE INC.
Inventor: Mikhal Shemer , Patrik Göran Westin
Abstract: Provided is a platform for data devices in which the architecture and runtime parameters of the platform are adaptively updated based on real-time data collected about a network on which the platform operates, the source type (e.g., codec selection) for data being communicated between devices, the grouping/architecture of the devices, or any combination thereof. The platform is thus able to support multiple different types and configurations of data devices under varied, constantly-changing conditions. The platform offers a flexible architecture for a content management and rendering system in which multiple data devices connected via the network each play a unique role in the operation of the system. The data devices are capable of dynamically switching between different roles while the system is in active operation. The platform also includes adaptive delay capabilities as well as adaptive codec selection capabilities.
-
公开(公告)号:US20170287485A1
公开(公告)日:2017-10-05
申请号:US15624935
申请日:2017-06-16
Applicant: Google Inc.
Inventor: Jay Pierre Civelli , Mikhal Shemer , Turaj Zakizadeh Shabestary , David Tapuska
IPC: G10L15/30
CPC classification number: G10L15/30 , G10L15/02 , G10L15/22 , G10L15/32 , G10L2015/088 , G10L2015/223
Abstract: Provided are methods, systems, and apparatuses for detecting, processing, and responding to audio signals, including speech signals, within a designated area or space. A platform for multiple media devices connected via a network is configured to process speech, such as voice commands, detected at the media devices, and respond to the detected speech by causing the media devices to simultaneously perform one or more requested actions. The platform is capable of scoring the quality of a speech request, handling speech requests from multiple end points of the platform using a centralized processing approach, a de-centralized processing approach, or a combination thereof, and also manipulating partial processing of speech requests from multiple end points into a coherent whole when necessary.
-
公开(公告)号:US20170063497A1
公开(公告)日:2017-03-02
申请号:US15344629
申请日:2016-11-07
Applicant: GOOGLE INC.
Inventor: Marco Paniconi , Mikhal Shemer
IPC: H04L1/08 , H04L12/801 , H04L29/06
CPC classification number: H04L1/08 , H03M13/353 , H03M13/356 , H03M13/373 , H03M13/6306 , H04L1/18 , H04L47/34 , H04L69/22
Abstract: A method for decoding a packetized video signal including at least one encoded frame. In one case, the method includes receiving at least one FEC packet at a receiving station. The receiving station uses embedded data associated with the FEC packet to obtain more accurate knowledge of the packet loss state of the media packets. This improved knowledge can allow the receiver to make better use of packet retransmission requests. The embedded data associated with the FEC packet can include in some cases a base sequence number and a packet mask.
Abstract translation: 一种用于对包括至少一个编码帧的分组化视频信号进行解码的方法。 在一种情况下,该方法包括在接收站处接收至少一个FEC分组。 接收站使用与FEC分组相关联的嵌入数据来获得关于媒体分组的分组丢失状态的更准确的知识。 这种改进的知识可以允许接收机更好地利用分组重传请求。 与FEC分组相关联的嵌入数据在一些情况下可以包括基本序列号和分组掩码。
-
公开(公告)号:US10163442B2
公开(公告)日:2018-12-25
申请号:US15597249
申请日:2017-05-17
Applicant: Google Inc.
Inventor: Jay Pierre Civelli , Mikhal Shemer , Turaj Zakizadeh Shabestary , David Tapuska
Abstract: Provided are methods, systems, and apparatuses for detecting, processing, and responding to audio signals, including speech signals, within a designated area or space. A platform for multiple media devices connected via a network is configured to process speech, such as voice commands, detected at the media devices, and respond to the detected speech by causing the media devices to simultaneously perform one or more requested actions. The platform is capable of scoring the quality of a speech request, handling speech requests from multiple end points of the platform using a centralized processing approach, a de-centralized processing approach, or a combination thereof, and also manipulating partial processing of speech requests from multiple end points into a coherent whole when necessary.
-
公开(公告)号:US20170287484A1
公开(公告)日:2017-10-05
申请号:US15622170
申请日:2017-06-14
Applicant: Google Inc.
Inventor: Jay Pierre Civelli , Mikhal Shemer , Turaj Zakizadeh Shabestary , David Tapuska
IPC: G10L15/30
CPC classification number: G10L15/30 , G10L15/02 , G10L15/22 , G10L15/32 , G10L2015/088 , G10L2015/223
Abstract: Provided are methods, systems, and apparatuses for detecting, processing, and responding to audio signals, including speech signals, within a designated area or space. A platform for multiple media devices connected via a network is configured to process speech, such as voice commands, detected at the media devices, and respond to the detected speech by causing the media devices to simultaneously perform one or more requested actions. The platform is capable of scoring the quality of a speech request, handling, speech requests from multiple end points of the platform using a centralized processing approach, a de-centralized processing approach, or a combination thereof, and also manipulating partial processing of speech requests from multiple end points into a coherent whole when necessary.
-
公开(公告)号:US20170249943A1
公开(公告)日:2017-08-31
申请号:US15597249
申请日:2017-05-17
Applicant: Google Inc.
Inventor: Jay Pierre Civelli , Mikhal Shemer , Turaj Zakizadeh Shabestary , David Tapuska
IPC: G10L15/30
CPC classification number: G10L15/30 , G10L15/02 , G10L15/22 , G10L15/32 , G10L2015/088 , G10L2015/223
Abstract: Provided are methods, systems, and apparatuses for detecting, processing, and responding to audio signals, including speech signals, within a designated area or space. A platform for multiple media devices connected via a network is configured to process speech, such as voice commands, detected at the media devices, and respond to the detected speech by causing the media devices to simultaneously perform one or more requested actions. The platform is capable of scoring the quality of a speech request, handling speech requests from multiple end points of the platform using a centralized processing approach, a de-centralized processing approach, or a combination thereof, and also manipulating partial processing of speech requests from multiple end points into a coherent whole when necessary.
-
公开(公告)号:US09190061B1
公开(公告)日:2015-11-17
申请号:US13839655
申请日:2013-03-15
Applicant: GOOGLE INC.
Inventor: Mikhal Shemer
Abstract: A data processing apparatus for detecting a probability of speech based on video data is disclosed. The data processing apparatus may include at least one processor, and a non-transitory computer-readable storage medium including instructions executable by the at least one processor, where execution of the instructions by the at least one processor causes the data processing apparatus to execute a visual speech detector. The visual speech detector may be configured to receive a coordinate-based signal. The coordinate-based signal may represent movement or lack of movement of at least one facial landmark of a person in a video signal. The visual speech detector may be configured to compute a probability of speech of the person based on the coordinate-based signal.
Abstract translation: 公开了一种用于基于视频数据检测语音概率的数据处理装置。 数据处理装置可以包括至少一个处理器和包括由至少一个处理器可执行的指令的非暂时计算机可读存储介质,其中由至少一个处理器执行指令使数据处理装置执行 视觉语音检测器。 视觉语音检测器可以被配置为接收基于坐标的信号。 基于坐标的信号可以表示视频信号中的人的至少一个面部地标的移动或缺乏移动。 视觉语音检测器可以被配置为基于基于坐标的信号来计算人的语音概率。
-
-
-
-
-
-
-
-