Method and system for tuning a camera image signal processor for computer vision tasks

    公开(公告)号:US11849212B2

    公开(公告)日:2023-12-19

    申请号:US17677919

    申请日:2022-02-22

    申请人: ALGOLUX INC.

    IPC分类号: H04N23/60

    CPC分类号: H04N23/64

    摘要: Image Signal Processing (ISP) optimization framework for computer vision applications is disclosed. The tuning of the ISP is performed automatically and presented as a nonlinear multi-objective optimization problem, followed by solving the problem using an evolutionary stochastic solver. An improved ISP of the embodiments of the invention includes at least features of search space reduction for reducing a number of ISP configurations, remapping the generated population to the reduced search space via mirroring, and global optimization function processing, which allow tuning all the blocks of the ISP at the same time instead of the prior art tuning of each ISP block separately. Also shown that an ISP tuned for image quality performs inferior compared with an ISP trained for a specific downstream image recognition task.

    Modelling user behavior in social network

    公开(公告)号:US11301915B2

    公开(公告)日:2022-04-12

    申请号:US16304838

    申请日:2017-06-13

    申请人: AFFINIO INC.

    摘要: Method and apparatus for measuring and influencing article selection in a social network are disclosed. A learning-and-guiding module tracks access to articles by users of the social network and determines patterns of users' attraction to articles based on contents of articles and attributes of users. The module utilizes learnt user-articles characteristics to influence article selection through communicating with users through the social network. The module relies on historical usage data characterizing user's affinity to articles. To guard against usage data obsolescence due to shifting interests, usage data are frequently adjusted to place more emphasis on recent usage patterns.

    Method and system for selective content processing based on a panoramic camera and a virtual-reality headset

    公开(公告)号:US11287653B2

    公开(公告)日:2022-03-29

    申请号:US16908592

    申请日:2020-06-22

    发明人: Jean Mayrand

    摘要: Gaze positions of an operator wearing a virtual-reality headset displaying a video stream define preferred view regions of the display. Starting with a reference gaze position, and for each subsequent distinctly different gaze position, the virtual-reality headset sends control data, including three spatial coordinates and a time coordinate expressed as a cyclical video-frame index, to a view adaptor receiving the video stream. The view adaptor stores contents of a number of most recent video frames of the video stream in a circular content-buffer and control data of a number of most recent gaze positions in a circular control-buffer. A content filter within the view adaptor determines a preferred view region surrounding a gaze position according to control data held in the circular control-buffer and extracts a partial content of a respective frame held in the circular content-buffer according to the preferred view region.

    Streaming network adapted to content selection

    公开(公告)号:US11108670B2

    公开(公告)日:2021-08-31

    申请号:US16699375

    申请日:2019-11-29

    发明人: Jean Mayrand

    IPC分类号: H04L12/26 H04L29/06

    摘要: A universal streaming server providing client-defined content at a permissible flow rate is disclosed. The server performs adaptive content filtering of panoramic multimedia signals based on clients' commands and regulates signal flow rate between the server and each of multiple client devices based on respective content specifications and performance measurements. The server sends a derivative of a panoramic signal capturing a panoramic view to a client device, receives content selection parameters, based on the derivative, from the client device, extracts a partial-coverage signal from the full-coverage signal according to the content selection parameters, and transmits the partial-coverage signal to the client device. The performance measurements include measurements pertinent to a client's receiver and measurement pertinent to a network path to the client's receiver. The server may employ multiple content filters and multiple encoders to serve a large number of clients concurrently.

    Method and system for panoramic multimedia streaming

    公开(公告)号:US11057632B2

    公开(公告)日:2021-07-06

    申请号:US16571615

    申请日:2019-09-16

    发明人: Jean Mayrand

    摘要: Methods and apparatus for panoramic multimedia streaming where viewers may control spatial coverage of panoramic video components of multimedia signals are disclosed. A novel flexible streaming server is devised to perform client-specific content filtering in addition to adapting multimedia signals to characteristics of individual client devices as well as to varying capacities of network paths to client devices. The server may distribute software modules to client devices to enable viewers to communicate preferred view regions of a panoramic scene. The server includes a learning module devised to retain viewing-preference data, correlate viewing preference to characteristics of client devices, and determine a default viewing preference for each client device. The server implements computationally efficient schemes of generating and distributing content-filtered multimedia signals to clients. The server may be implemented using hardware processing units and memory devices allocated within a shared cloud-computing network.

    Method and apparatus for video intermodal transcoding

    公开(公告)号:US10659805B2

    公开(公告)日:2020-05-19

    申请号:US15010428

    申请日:2016-01-29

    摘要: A video intermodal transcoder converts a compressed bitstream formulated according a type-1 compression scheme to a type-2 compressed bitstream formulated according to a type-2 compression scheme. The transcoder includes an augmented type-1 decoder, a transcoder kernel, and an augmented type-2 encoder. The transcoder kernel performs processes of creating motion-vector candidates and pre-computing prediction errors for each cell of a predefined image coding block and for each candidate motion vector for repetitive use in evaluating various image partitions. In an implementation where the type-1 compression scheme follows the H.264 standard and the type-2 compression scheme follows the HEVC standard, the transcoder exploits the flexibility of the coding-tree structure and other HEVC features to significantly reduce the bit rate of the compressed bit stream. The pre-computation of prediction errors significantly reduces the processing effort, hence increases the throughput of the transcoder.

    Method and system for fast mode decision for high efficiency video coding

    公开(公告)号:US10560692B2

    公开(公告)日:2020-02-11

    申请号:US16159817

    申请日:2018-10-15

    摘要: Methods and systems for encoding video data are provided. Evolving standards for video encoding such as High Efficiency Video Coding (HEVC) standard require a significant increase in computational complexity for both inter and intra encoding. The method includes calculating an approximate cost of each of a first set of prediction modes. Then selecting a second set of prediction modes from the first set of prediction modes based on probability distributions associated with each of the modes in the first set of prediction modes, the second set having substantially fewer prediction modes than the first. A number of candidate prediction modes prior to rate distortion optimization (RDO) is reduced. Experimental results show that the proposed method provides substantial time reduction and negligible quality loss as compared to the HEVC reference.