Optical Character Recognition (OCR) Enhancement via Inertial Measurement Unit (IMU)-Supported Super-Resolution Imaging

    Publication number: US20250104456A1

    Publication date: 2025-03-27

    Application number: US18884605

    Filing date: 2024-09-13

    Applicant: Apple Inc.

    Abstract: Electronic devices, methods, and program storage devices for achieving improved optical character recognition (OCR) operations are disclosed. Performing OCR operations on captured images, e.g., images captured by cameras that are affixed to a user's body (e.g., from mixed reality devices, such as smart HMDs) requires a low-power, robust camera design. Obtaining high spatial resolution in such captured images faces many challenges. However, images with higher spatial resolution can be created by combining information extracted from multiple images captured by such devices, leveraging information obtained from positional sensors of such devices, and performing SR post-processing operations. Such higher spatial resolution images may then be used to enable high-acuity OCR capabilities. The solutions disclosed herein also compensate for the missing ability of such devices due to the lack of a vestibulo-ocular reflex (i.e., the human visual system's ability to use compensating eye movement to fixate and read text clearly, despite head movement).

    Method and device for spatially designating private content

    Publication number: US12175009B2

    Publication date: 2024-12-24

    Application number: US18538955

    Filing date: 2023-12-13

    Applicant: Apple Inc.

    Abstract: In one implementation, a method for spatially designating private content. The method includes: presenting, via a display device, an indication of a private viewing region relative to a location of the computing system; determining a first location for presentation of graphical content; and presenting, via the display device, the graphical content at the first location. The method further includes: transmitting a characterization vector associated with the graphical content to at least one other device for display thereon according to a determination that the first location of the graphical content is outside of the private viewing area; and forgoing transmission of the characterization vector associated with the graphical content to the at least one other device according to a determination that the first location of the graphical content is inside of the private viewing area.
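
    Illustrative sketch (not part of the patent record): the transmit-or-forgo decision described in this abstract can be pictured as a containment test. The axis-aligned box used for the private viewing region, and the names `Region` and `should_transmit`, are assumptions made for this example; the abstract does not specify a region shape or an API.

```python
from dataclasses import dataclass
from typing import Tuple

Point = Tuple[float, float, float]


@dataclass(frozen=True)
class Region:
    """Axis-aligned box standing in for the private viewing region (assumed shape)."""
    min_corner: Point
    max_corner: Point

    def contains(self, point: Point) -> bool:
        # A point is inside when every coordinate lies within the box bounds.
        return all(lo <= p <= hi
                   for lo, p, hi in zip(self.min_corner, point, self.max_corner))


def should_transmit(region: Region, content_location: Point) -> bool:
    """Transmit the characterization vector only for content outside the private region."""
    return not region.contains(content_location)


private = Region((-0.5, -0.5, 0.0), (0.5, 0.5, 1.0))
print(should_transmit(private, (0.0, 0.0, 0.5)))  # content inside the region
print(should_transmit(private, (2.0, 0.0, 0.5)))  # content outside the region
```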

    Ambient Augmented Language Tutoring
    Document type: Published invention application

    Publication number: US20230290270A1

    Publication date: 2023-09-14

    Application number: US18112450

    Filing date: 2023-02-21

    Applicant: Apple Inc.

    CPC classification number: G09B19/06 G06T19/006 G06F3/011

    Abstract: Devices, systems, and methods that facilitate learning a language in an extended reality (XR) environment. This may involve identifying objects or activities in the environment, identifying a context associated with the user or the environment, and providing language teaching content based on the objects, activities, or contexts. In one example, the language teaching content provides individual words, phrases, or sentences corresponding to the objects, activities, or contexts. In another example, the language teaching content requests user interaction (e.g., via quiz questions or educational games) corresponding to the objects, activities, or contexts. Context may be used to determine whether or how to provide the language teaching content. For example, based on a user's current course of language study (e.g., this week's vocabulary list), corresponding objects or activities may be identified in the environment for use in providing the language teaching content.

    Proactive Actions Based on Audio and Body Movement

    Publication number: US20220291743A1

    Publication date: 2022-09-15

    Application number: US17689460

    Filing date: 2022-03-08

    Applicant: Apple Inc.

    Abstract: Various implementations disclosed herein include devices, systems, and methods that determine that a user is interested in audio content by determining that a movement (e.g., a user's head bob) has a time-based relationship with detected audio content (e.g., the beat of music playing in the background). Some implementations involve obtaining first sensor data and second sensor data corresponding to a physical environment, the first sensor data corresponding to audio in the physical environment and the second sensor data corresponding to a body movement in the physical environment. A time-based relationship between one or more elements of the audio and one or more aspects of the body movement is identified based on the first sensor data and the second sensor data. An interest in content of the audio is identified based on identifying the time-based relationship. Various actions may be performed proactively based on identifying the interest in the content.
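
    Illustrative sketch (not part of the patent record): one simple way to picture a "time-based relationship" between audio and body movement is to count how many detected beats have a head bob close to them in time. The timestamp lists, the 0.1 s tolerance, the 0.75 threshold, and the function names are all assumptions chosen for this example; the abstract does not state how the relationship is computed.

```python
def fraction_of_beats_matched(beat_times, bob_times, tolerance=0.1):
    """Fraction of beats that have at least one head bob within `tolerance` seconds."""
    if not beat_times:
        return 0.0
    matched = sum(
        1 for beat in beat_times
        if any(abs(beat - bob) <= tolerance for bob in bob_times)
    )
    return matched / len(beat_times)


def is_interested(beat_times, bob_times, tolerance=0.1, threshold=0.75):
    """Treat a high enough match fraction as a sign of interest (assumed rule)."""
    return fraction_of_beats_matched(beat_times, bob_times, tolerance) >= threshold


beats = [0.0, 0.5, 1.0, 1.5]      # beat onsets from the audio sensor data, in seconds
bobs = [0.02, 0.53, 0.97, 1.9]    # head-bob times from the movement sensor data
print(fraction_of_beats_matched(beats, bobs))
print(is_interested(beats, bobs))
```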

    METHOD AND DEVICE FOR MASKED LATE-STAGE SHIFT

    Publication number: US20240062485A1

    Publication date: 2024-02-22

    Application number: US18385129

    Filing date: 2023-10-30

    Applicant: Apple Inc.

    CPC classification number: G06T19/006 G06T7/70

    Abstract: In one implementation, a method of performing late-stage shift is performed at a device including a display, one or more processors, and non-transitory memory. The method includes generating, based on a first predicted pose of the device for a display time period, a first image. The method includes generating a mask indicating a first region of the first image and a second region of the first image. The method includes generating a second image by shifting, based on a second predicted pose of the device for the display time period, the first region of the first image without shifting the second region of the first image. The method includes displaying, on the display at the display time period, the second image.
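
    Illustrative sketch (not part of the patent record): the masked shift in this abstract can be pictured on a tiny 2D image, where pixels in the first (masked) region are resampled with a horizontal offset and pixels in the second region are copied unchanged. Reducing the difference between the two predicted poses to a single integer column offset `dx`, and filling uncovered pixels with 0, are simplifying assumptions for this example.

```python
def masked_late_stage_shift(image, mask, dx, fill=0):
    """Resample masked pixels from `dx` columns to the left; copy unmasked pixels as-is."""
    height, width = len(image), len(image[0])
    shifted = []
    for r in range(height):
        row = []
        for c in range(width):
            if mask[r][c]:
                src = c - dx  # source column for this output pixel
                row.append(image[r][src] if 0 <= src < width else fill)
            else:
                row.append(image[r][c])  # second region: not shifted
        shifted.append(row)
    return shifted


first_image = [[1, 2, 3, 4],
               [5, 6, 7, 8]]
# Top row belongs to the first region (shifted); bottom row to the second (kept).
region_mask = [[True, True, True, True],
               [False, False, False, False]]
second_image = masked_late_stage_shift(first_image, region_mask, dx=1)
print(second_image)  # [[0, 1, 2, 3], [5, 6, 7, 8]]
```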

    Gaze and Head Pose Interaction
    Document type: Published invention application

    Publication number: US20240019928A1

    Publication date: 2024-01-18

    Application number: US18374125

    Filing date: 2023-09-28

    Applicant: Apple Inc.

    CPC classification number: G06F3/012 G06F3/013 G06F1/163

    Abstract: Various implementations disclosed herein include devices, systems, and methods for using a gaze vector and head pose information to effectuate a user interaction with a virtual object. In some implementations, a device includes a sensor for sensing a head pose of a user, a display, one or more processors, and a memory. In various implementations, a method includes displaying a set of virtual objects. Based on a gaze vector, it is determined that a gaze of the user is directed to a first virtual object of the set of virtual objects. A head pose value corresponding to the head pose of the user is obtained. An action relative to the first virtual object is performed based on the head pose value satisfying a head pose criterion.
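
    Illustrative sketch (not part of the patent record): the two-step interaction in this abstract (gaze selects an object, head pose confirms the action) can be pictured with a ray-sphere test followed by a threshold check. Bounding spheres for the virtual objects, a head-pitch angle as the head pose value, the 15-degree threshold, and all function names are assumptions for this example.

```python
import math


def gaze_target(origin, direction, objects):
    """Return the name of the nearest object whose bounding sphere the gaze ray hits."""
    norm = math.sqrt(sum(d * d for d in direction))
    unit = [d / norm for d in direction]
    best_name, best_dist = None, math.inf
    for name, (center, radius) in objects.items():
        to_center = [c - o for c, o in zip(center, origin)]
        along = sum(t * u for t, u in zip(to_center, unit))  # distance along the ray
        if along < 0:
            continue  # sphere center is behind the viewer
        perp_sq = sum(t * t for t in to_center) - along * along
        if perp_sq <= radius * radius and along < best_dist:
            best_name, best_dist = name, along
    return best_name


def maybe_activate(origin, direction, objects, head_pitch_deg, pitch_threshold_deg=15.0):
    """Act on the gazed-at object only when the head pose value meets the criterion."""
    target = gaze_target(origin, direction, objects)
    if target is not None and head_pitch_deg >= pitch_threshold_deg:
        return target
    return None


virtual_objects = {"lamp": ((0.0, 0.0, 2.0), 0.3), "door": ((1.0, 0.0, 2.0), 0.3)}
print(maybe_activate((0, 0, 0), (0, 0, 1), virtual_objects, head_pitch_deg=20.0))  # lamp
print(maybe_activate((0, 0, 0), (0, 0, 1), virtual_objects, head_pitch_deg=5.0))   # None
```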
