Methods And Systems For Detecting And Processing Speech Signals

    Publication No.: US20170287486A1

    Publication date: 2017-10-05

    Application No.: US15625685

    Filing date: 2017-06-16

    Applicant: Google Inc.

    Abstract: Provided are methods, systems, and apparatuses for detecting, processing, and responding to audio signals, including speech signals, within a designated area or space. A platform for multiple media devices connected via a network is configured to process speech, such as voice commands, detected at the media devices, and respond to the detected speech by causing the media devices to simultaneously perform one or more requested actions. The platform is capable of scoring the quality of a speech request, handling speech requests from multiple end points of the platform using a centralized processing approach, a de-centralized processing approach, or a combination thereof, and also manipulating partial processing of speech requests from multiple end points into a coherent whole when necessary.

    Platform for multiple device playout

    Publication No.: US10516718B2

    Publication date: 2019-12-24

    Application No.: US14735489

    Filing date: 2015-06-10

    Applicant: GOOGLE INC.

    Abstract: Provided is a platform for data devices in which the architecture and runtime parameters of the platform are adaptively updated based on real-time data collected about a network on which the platform operates, the source type (e.g., codec selection) for data being communicated between devices, the grouping/architecture of the devices, or any combination thereof. The platform is thus able to support multiple different types and configurations of data devices under varied, constantly-changing conditions. The platform offers a flexible architecture for a content management and rendering system in which multiple data devices connected via the network each play a unique role in the operation of the system. The data devices are capable of dynamically switching between different roles while the system is in active operation. The platform also includes adaptive delay capabilities as well as adaptive codec selection capabilities.

    Methods And Systems For Detecting And Processing Speech Signals

    Publication No.: US20170287485A1

    Publication date: 2017-10-05

    Application No.: US15624935

    Filing date: 2017-06-16

    Applicant: Google Inc.

    Abstract: Provided are methods, systems, and apparatuses for detecting, processing, and responding to audio signals, including speech signals, within a designated area or space. A platform for multiple media devices connected via a network is configured to process speech, such as voice commands, detected at the media devices, and respond to the detected speech by causing the media devices to simultaneously perform one or more requested actions. The platform is capable of scoring the quality of a speech request, handling speech requests from multiple end points of the platform using a centralized processing approach, a de-centralized processing approach, or a combination thereof, and also manipulating partial processing of speech requests from multiple end points into a coherent whole when necessary.

    METHOD AND APPARATUS FOR DECODING PACKETIZED DATA

    Type: Invention application; Status: Under examination (published)

    Publication No.: US20170063497A1

    Publication date: 2017-03-02

    Application No.: US15344629

    Filing date: 2016-11-07

    Applicant: GOOGLE INC.

    Abstract: A method for decoding a packetized video signal including at least one encoded frame. In one case, the method includes receiving at least one FEC packet at a receiving station. The receiving station uses embedded data associated with the FEC packet to obtain more accurate knowledge of the packet loss state of the media packets. This improved knowledge can allow the receiver to make better use of packet retransmission requests. The embedded data associated with the FEC packet can include in some cases a base sequence number and a packet mask.
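
    The abstract above says the data embedded in an FEC packet can include a base sequence number and a packet mask. A minimal Python sketch of how a receiver might use those two fields to infer which protected media packets are missing is shown here. The 16-bit sequence space, the 16-bit mask width, the most-significant-bit-first mask layout, and all function names are assumptions for illustration; the abstract does not specify them.

```python
from typing import Iterable, List

SEQ_MOD = 1 << 16  # assumption: 16-bit, wrapping sequence numbers


def protected_sequence_numbers(base_seq: int, packet_mask: int,
                               mask_bits: int = 16) -> List[int]:
    """List the media sequence numbers covered by one FEC packet.

    Assumed layout: the most significant mask bit maps to base_seq,
    the next bit to base_seq + 1, and so on.
    """
    return [
        (base_seq + i) % SEQ_MOD
        for i in range(mask_bits)
        if packet_mask & (1 << (mask_bits - 1 - i))
    ]


def missing_protected_packets(base_seq: int, packet_mask: int,
                              received: Iterable[int],
                              mask_bits: int = 16) -> List[int]:
    """Return covered sequence numbers that have not been received."""
    seen = set(received)
    covered = protected_sequence_numbers(base_seq, packet_mask, mask_bits)
    return [seq for seq in covered if seq not in seen]


if __name__ == "__main__":
    # The mask covers packets 100, 102 and 103; packet 102 never arrived.
    print(missing_protected_packets(100, 0b1011 << 12, {100, 101, 103}))
```

    Under these assumptions, the receiver could limit retransmission requests to the sequence numbers this function returns, rather than guessing the loss state from gaps alone.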

    Methods and systems for detecting and processing speech signals

    Publication No.: US10163442B2

    Publication date: 2018-12-25

    Application No.: US15597249

    Filing date: 2017-05-17

    Applicant: Google Inc.

    Abstract: Provided are methods, systems, and apparatuses for detecting, processing, and responding to audio signals, including speech signals, within a designated area or space. A platform for multiple media devices connected via a network is configured to process speech, such as voice commands, detected at the media devices, and respond to the detected speech by causing the media devices to simultaneously perform one or more requested actions. The platform is capable of scoring the quality of a speech request, handling speech requests from multiple end points of the platform using a centralized processing approach, a de-centralized processing approach, or a combination thereof, and also manipulating partial processing of speech requests from multiple end points into a coherent whole when necessary.

    Methods And Systems For Detecting And Processing Speech Signals

    Publication No.: US20170287484A1

    Publication date: 2017-10-05

    Application No.: US15622170

    Filing date: 2017-06-14

    Applicant: Google Inc.

    Abstract: Provided are methods, systems, and apparatuses for detecting, processing, and responding to audio signals, including speech signals, within a designated area or space. A platform for multiple media devices connected via a network is configured to process speech, such as voice commands, detected at the media devices, and respond to the detected speech by causing the media devices to simultaneously perform one or more requested actions. The platform is capable of scoring the quality of a speech request, handling speech requests from multiple end points of the platform using a centralized processing approach, a de-centralized processing approach, or a combination thereof, and also manipulating partial processing of speech requests from multiple end points into a coherent whole when necessary.

    Methods And Systems For Detecting And Processing Speech Signals

    Publication No.: US20170249943A1

    Publication date: 2017-08-31

    Application No.: US15597249

    Filing date: 2017-05-17

    Applicant: Google Inc.

    Abstract: Provided are methods, systems, and apparatuses for detecting, processing, and responding to audio signals, including speech signals, within a designated area or space. A platform for multiple media devices connected via a network is configured to process speech, such as voice commands, detected at the media devices, and respond to the detected speech by causing the media devices to simultaneously perform one or more requested actions. The platform is capable of scoring the quality of a speech request, handling speech requests from multiple end points of the platform using a centralized processing approach, a de-centralized processing approach, or a combination thereof, and also manipulating partial processing of speech requests from multiple end points into a coherent whole when necessary.

    Visual speech detection using facial landmarks

    Type: Granted invention patent; Status: In force

    Publication No.: US09190061B1

    Publication date: 2015-11-17

    Application No.: US13839655

    Filing date: 2013-03-15

    Applicant: GOOGLE INC.

    Inventor: Mikhal Shemer

    CPC classification number: G10L15/25 G10L25/78

    Abstract: A data processing apparatus for detecting a probability of speech based on video data is disclosed. The data processing apparatus may include at least one processor, and a non-transitory computer-readable storage medium including instructions executable by the at least one processor, where execution of the instructions by the at least one processor causes the data processing apparatus to execute a visual speech detector. The visual speech detector may be configured to receive a coordinate-based signal. The coordinate-based signal may represent movement or lack of movement of at least one facial landmark of a person in a video signal. The visual speech detector may be configured to compute a probability of speech of the person based on the coordinate-based signal.
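
    The abstract above describes computing a probability of speech from a coordinate-based signal that tracks facial landmark movement. A rough Python sketch of one way such a mapping could look follows. The choice of a per-frame mouth-opening distance as the signal, the logistic mapping, and the `scale` and `threshold` values are all hypothetical and are not taken from the patent.

```python
import math
from typing import Sequence


def speech_probability(mouth_opening: Sequence[float],
                       scale: float = 50.0,
                       threshold: float = 0.02) -> float:
    """Map landmark motion over a window of frames to a value in (0, 1).

    mouth_opening: assumed per-frame distance between upper-lip and
    lower-lip landmarks, normalized by face height.
    scale, threshold: hypothetical tuning constants for the logistic curve.
    """
    if len(mouth_opening) < 2:
        return 0.0  # too few frames to measure any movement
    # Mean absolute frame-to-frame change: near zero for a still mouth.
    deltas = [abs(b - a) for a, b in zip(mouth_opening, mouth_opening[1:])]
    mean_motion = sum(deltas) / len(deltas)
    # Logistic squashing: more motion gives a higher probability.
    return 1.0 / (1.0 + math.exp(-scale * (mean_motion - threshold)))


if __name__ == "__main__":
    still = [0.10] * 10        # no landmark movement
    talking = [0.10, 0.20] * 5  # mouth repeatedly opening and closing
    print(speech_probability(still), speech_probability(talking))
```

    In this sketch a motionless mouth yields a probability below 0.5 and a repeatedly moving one yields a probability above 0.5; a real detector would presumably be trained on data rather than hand-tuned.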
