MULTI-PASS NEURAL NETWORK FOR SPEECH ENHANCEMENT

    公开(公告)号:US20240242726A1

    公开(公告)日:2024-07-18

    申请号:US18155674

    申请日:2023-01-17

    CPC classification number: G10L21/0208 G10L25/30

    Abstract: This disclosure provides methods, devices, and systems for audio signal processing. The present implementations more specifically relate to multi-pass neural networks configured for speech enhancement. In some aspects, a speech enhancement system may include a deep neural network (DNN) and a statistical signal processor (SSP). The DNN is configured to receive an input audio signal and infer a speech signal representing a speech component of the input audio signal based on a neural network model. The SSP is configured to further denoise the speech signal output by the DNN based on one or more statistical signal processing operations. In some implementations, the denoised speech signal may be fed back into the DNN (as an input audio signal) for further speech enhancement. As such, the speech enhancement system may recursively filter or suppress residual noise in the speech signal over a number of passes or iterations of a feedback loop.

    Word based channels last ordering in memory

    公开(公告)号:US12026396B2

    公开(公告)日:2024-07-02

    申请号:US17898262

    申请日:2022-08-29

    Inventor: Patrick Worfolk

    CPC classification number: G06F3/0655 G06F3/0604 G06F3/0673

    Abstract: A memory device includes a first word and a second word. The first word has a first subset of a plurality of elements. The first subset of the plurality of elements each have a first set of sequential index values along a first dimension of a tensor, a first single index value for a second dimension of the tensor, and a second single index value for a third dimension of the tensor. The second word has a second subset of the plurality of elements. The second subset of the plurality of elements each have the first set of sequential index values along the first dimension of the tensor that is the same as the first word, the first single index value for the second dimension of the tensor that is the same as the first word, and a third single index value for the third dimension of the tensor that is different than the second single index value for the first word. The second word is adjacent to the first word in memory.

    NEURAL NETWORK CACHING FOR VIDEO
    13.
    发明公开

    公开(公告)号:US20240212333A1

    公开(公告)日:2024-06-27

    申请号:US18069781

    申请日:2022-12-21

    CPC classification number: G06V10/82 G06V10/761

    Abstract: This disclosure provides methods, devices, and systems for machine learning. The present implementations more specifically relate to techniques for reducing the computational load of a convolutional neural network (CNN) when processing successive video frames. In some aspects, a machine learning system may cache or store the outputs (also referred to as “activations”) produced by one or more layers of a CNN so that one or more cached activations can be substituted for respective activations that would otherwise be computed by the CNN when processing a subsequent video frame. For example, the machine learning system may compare each video frame with a preceding frame of the video to detect pixels that undergo significant changes between successive frames (also referred to as “motion pixels”). In some aspects, the CNN may only perform neural network operations that involve one or more motion pixels or features derived from a motion pixel.

    Image processing system for region-of-interest-based video compression

    公开(公告)号:US11991371B2

    公开(公告)日:2024-05-21

    申请号:US17356274

    申请日:2021-06-23

    CPC classification number: H04N19/167 G06V10/25 G06V40/161 G06V40/172 H04N7/183

    Abstract: An apparatus for remote processing of raw image data receives the raw image data from a camera, such as a security camera. The apparatus includes a detection module to detect portions of the image data that contain possible regions of interest. Information indicating the portions that contain the possible regions of interest is then used during a compression process so that the portions that contain the possible regions of interest are compressed using one or more compression algorithms to facilitate further analysis and the remainder are treated differently. The compressed image data is then transmitted to a central system for decompression and further analysis. In some cases, the detection system may detect possible regions of interest which appear to be faces, but without performing full facial recognition. These parts of the image data are then compressed in such a way as to maintain as much facial detail as possible, so as to facilitate the facial recognition when it is carried out at the central server. The detection may be performed on the raw image data or may be performed as part of the compression process after a transformation of the raw image data has been carried out.

    DISTRIBUTED ANALOG DISPLAY NOISE SUPPRESSION CIRCUIT

    公开(公告)号:US20240152237A1

    公开(公告)日:2024-05-09

    申请号:US18402872

    申请日:2024-01-03

    CPC classification number: G06F3/044

    Abstract: A processing system including an amplifier configured to generate, from multiple spatial-common-mode-processed signals, a spatial common mode estimate and multiple feedback signals. The processing system includes multiple charge integrators configured to obtain resulting signals from the capacitive sensor electrodes, each of the resulting signals including a spatial common mode component and a residual noise component. The charge integrators generate multiple spatial-common-mode-processed signals by mitigating the spatial common mode component and the residual noise component in the resulting signals using the feedback signals. The processing system includes a programmable gain amplifier configured to determine the spatial common mode estimate.

    Device and method for driving a display panel

    公开(公告)号:US11972743B2

    公开(公告)日:2024-04-30

    申请号:US18180305

    申请日:2023-03-08

    Abstract: A processing system comprises a first integrated circuit (IC) and a second IC. The first IC comprises first image processing circuitry, first display panel driver circuitry, and first communication circuitry. The first image processing circuitry is configured to generate a first overlay image by overlaying a first partial input image with a first image element based on first partial input image data representing the first partial input image and first image element data representing the first image element. The first display panel driver circuitry is configured to drive a display panel based on the first overlay image. The first communication circuitry is configured to output second image element data representing a second image element to the second IC.

    CAPACITIVE DETECTION OF FOLD ANGLE FOR FOLDABLE DEVICES

    公开(公告)号:US20240094845A1

    公开(公告)日:2024-03-21

    申请号:US18515048

    申请日:2023-11-20

    Inventor: Guozhong Shen

    Abstract: A system for determining an open or closed state of a foldable device includes: a plurality of electrodes, including a first set of electrodes for performing absolute capacitance sensing for open/close detection, wherein each of the first set of electrodes is located proximate to an edge of the foldable device; and a processing system, configured to: obtain at least one first absolute capacitance measurement via the first set of electrodes; and determine whether the foldable device is in an open state or a closed state based on the at least one first absolute capacitance measurement.

Patent Agency Ranking