MULTIFACTOR VENUE LOCALIZATION USING CENTRALIZED LEARNING

    公开(公告)号:US20180300623A1

    公开(公告)日:2018-10-18

    申请号:US15489234

    申请日:2017-04-17

    Abstract: A central server receives a venue identification query from a client device in the venue and a test data set including information collected from the venue. The central server then queries a classifier to identify the venue based on the test data. The classifier returns an identity value (venue ID) and a confidence value for the venue ID. When the confidence value is less than a threshold value, the central server obtains additional data from the client device until the venue is identified. The central server associates the venue ID with the test data set, including the additional data, and adds the test data set to training data for the classifier.

    SYSTEM AND METHOD FOR PERSON COUNTING IN IMAGE DATA

    公开(公告)号:US20180247132A1

    公开(公告)日:2018-08-30

    申请号:US15445416

    申请日:2017-02-28

    Abstract: Systems and methods for person counting are disclosed. A method may include retrieving an image frame from a plurality of image frames captured by a camera. The image frame may be split into a grid of a plurality of cells of a pre-determined cell dimensions. The pre-determined cell dimensions may be based on dimensions of the retrieved image frame and reference dimensions of training images of a person detection classifier. At least a portion of the plurality of cells may be rearranged to generate a new image. The new image may be padded with at least one padding strip to adjust dimensions of the new image to the reference dimensions of the training images. Person detection may be performed using the new image and the person detection classifier to obtain a number of persons detected within the new image.

    Generating an inpainted image from a masked image using a patch-based encoder

    公开(公告)号:US12148131B2

    公开(公告)日:2024-11-19

    申请号:US17733634

    申请日:2022-04-29

    Abstract: The disclosure herein describes generating an inpainted image from a masked image using a patch-based encoder and an unquantized transformer. An image including a masked region and an unmasked region is received, and the received image is divided into a plurality of patches including masked patches. The plurality of patches is encoded into a plurality of feature vectors, wherein each patch is encoded to a feature vector. Using a transformer, a predicted token is generated for each masked patch using a feature vector encoded from the masked patch, and a quantized vector of the masked patch is determined using generated predicted token and a masked patch-specific codebook. The determined quantized vector of the masked patch is included into a set of quantized vectors associated with the plurality of patches, and an output image is generated from the set of quantized vectors using a decoder.

    Focused computer detection of objects in images

    公开(公告)号:US12106531B2

    公开(公告)日:2024-10-01

    申请号:US17383362

    申请日:2021-07-22

    CPC classification number: G06V10/22 G06T7/70 G06V40/10 G06T2207/30196

    Abstract: To improve the accuracy and efficiency of object detection through computer digital image analysis, the detection of some objects can inform the sub-portion of the digital image to which subsequent computer digital image analysis is directed to detect other objects. In such a manner object detection can be made more efficient by limiting the image area of a digital image that is analyzed. Such efficiencies can represent both computational efficiencies and communicational efficiencies arising due to the smaller quantity of digital image data that is analyzed. Additionally, the detection of some objects can render the detection of other objects more accurate by adjusting confidence thresholds based on the detection of those related objects. Relationships between objects can be utilized to inform both the image area on which subsequent object detection is performed and the confidence level of such subsequent object detection.

    Depth-based object re-identification

    公开(公告)号:US11238300B2

    公开(公告)日:2022-02-01

    申请号:US16688956

    申请日:2019-11-19

    Abstract: An object re-identifier. For each of a plurality of frames of a video, a quality of the frame is assessed and a confidence that a previously-recognized object is present in the frame is determined. The determined confidence for the frame is weighted based on the assessed quality of the frame such that frames with higher relative quality are weighted more heavily than frames with lower relative quality. An overall confidence that the previously-recognized object is present in the video is assessed based on the weighted determined confidences.

    COMBINING MOBILE DEVICES WITH PEOPLE TRACKING FOR LARGE DISPLAY INTERACTIONS

    公开(公告)号:US20170090560A1

    公开(公告)日:2017-03-30

    申请号:US14866534

    申请日:2015-09-25

    Abstract: The large display interaction implementations described herein combine mobile devices with people tracking to enable new interactions including making a non-touch-sensitive display touch-sensitive and allowing personalized interactions with the display. One implementation tracks one or more mobile computing device users relative to a large computer-driven display, and configures content displayed on the display based on a distance a given mobile computing device user is from the display. Another implementation personalizes user interactions with a large display. One or more mobile computing device users are tracked relative to a display. The identity of each of the one or more mobile computing device users is obtained. Content displayed on the display is configured based on a distance an identified mobile computing device user is from the display and the identity of the user that provides the content.

    Dynamic matrix convolution with channel fusion

    公开(公告)号:US12223412B2

    公开(公告)日:2025-02-11

    申请号:US17123697

    申请日:2020-12-16

    Abstract: A computer device for automatic feature detection comprises a processor, a communication device, and a memory configured to hold instructions executable by the processor to instantiate a dynamic convolution neural network, receive input data via the communication network, and execute the dynamic convolution neural network to automatically detect features in the input data. The dynamic convolution neural network compresses the input data from an input space having a dimensionality equal to a predetermined number of channels into an intermediate space having a dimensionality less than the number of channels. The dynamic convolution neural network dynamically fuses the channels into an intermediate representation within the intermediate space and expands the intermediate representation from the intermediate space to an expanded representation in an output space having a higher dimensionality than the dimensionality of the intermediate space. The features in the input data are automatically detected based on the expanded representation.

Patent Agency Ranking