Selective interaction of robotic device with additional computing device(s)

    Publication No.: US12296471B2

    Publication date: 2025-05-13

    Application No.: US17544117

    Application date: 2021-12-07

    Applicant: GOOGLE LLC

    Abstract: Implementations set forth herein relate to a robotic computing device that can seek additional information from other nearby device(s) for fulfilling a request and/or delegate certain operations to the other nearby device(s). Delegating certain operations can involve the robotic computing device maneuvering to a location of a nearby device and soliciting assistance from the nearby device by providing an input from the robotic computing device to the nearby device. In some instances, the input can include an audible rendering of an invocation phrase and a command phrase for invoking an automated assistant that is accessible via the nearby device. A determination of whether to delegate certain operations or seek additional information can be based on a variety of factors, such as predicted efficiency and estimated accuracy of performing certain operations.

    HYBRID INFERENCE FOR AN EFFICIENT, LOW LATENCY LLM-BASED ASSISTANT

    Publication No.: US20250148217A1

    Publication date: 2025-05-08

    Application No.: US18387768

    Application date: 2023-11-07

    Applicant: GOOGLE LLC

    Abstract: Implementations utilize a hybrid use of a smaller LLM and a larger LLM to generate and refine content responsive to a user query/request for content generation. In various implementations, the smaller LLM is utilized to process the user query for content generation, to generate initial content responsive to the user query for content generation. The user query for content generation and the initial content can be utilized to generate a text prompt, where the text prompt can be configured to further include a request for focused edit(s). Such a text prompt can be processed using the larger LLM, to generate focused edit(s) to the initial content that refine the initial content, so that revised content (with improved accuracy) responsive to the user query for content generation is acquired.

    Spatial audio for device assistants
    213.
    Granted invention patent

    Publication No.: US12294848B2

    Publication date: 2025-05-06

    Application No.: US18065717

    Application date: 2022-12-14

    Applicant: Google LLC

    Abstract: A method includes, while a user is wearing stereo headphones in an environment, obtaining, from a target digital assistant, a response to a query issued by the user, and obtaining spatial audio preferences of the user. Based on the spatial audio preferences of the user, the method also includes determining a spatially disposed location within a playback sound-field for the user to perceive as a sound-source of the response to the query. The method further includes rendering output audio signals characterizing the response to the query through the stereo headphones to produce the playback sound-field. Here, the user perceives the response to the query as emanating from the sound-source at the spatially disposed location within the playback sound-field.

    VOICE INPUT DISAMBIGUATION
    214.
    Invention patent application

    Publication No.: US20250140249A1

    Publication date: 2025-05-01

    Application No.: US18564884

    Application date: 2022-11-09

    Applicant: Google LLC

    Abstract: A method for recognizing a voice input includes receiving a first voice input including a plurality of terms, processing the first voice input based on the plurality of terms to obtain a first speech recognition result including one or more candidate terms corresponding to one or more terms from the plurality of terms, receiving a second voice input providing at least one of contextual information relating to the first voice input or confirmation information relating to the one or more candidate terms, and processing the second voice input based on the at least one of the contextual information or the confirmation information to obtain a second speech recognition result including at least one of the one or more candidate terms or one or more new candidate terms, as corresponding to the one or more terms from the plurality of terms.

    Voice Query Handling in an Environment with Multiple Users

    Publication No.: US20250131925A1

    Publication date: 2025-04-24

    Application No.: US19006522

    Application date: 2024-12-31

    Applicant: Google LLC

    Abstract: A method includes detecting multiple users, receiving a first query issued by a first user, the first query including a command for a digital assistant to perform a first action, and enabling a round robin mode to control performance of actions commanded by queries. The method also includes, while performing the first action, receiving audio data corresponding to a second query including a command to perform a second action, performing speaker identification on the audio data, determining that the second query was spoken by the first user, preventing performing the second action, and prompting at least another user to issue a query. The method further includes receiving a third query issued by a second user, the third query including a command for the digital assistant to perform a third action, and when the digital assistant completes performing the first action, executing performance of the third action.
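The round robin behavior described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the class name `RoundRobinAssistant`, its methods, and the returned status strings are all hypothetical, and speaker identification is assumed to have already resolved each query to a user name.

```python
from collections import deque


class RoundRobinAssistant:
    """Hypothetical sketch of a round robin mode for multi-user queries."""

    def __init__(self, users):
        self.users = list(users)
        self.current_user = None    # user whose action is currently running
        self.current_action = None  # action currently being performed
        self.pending = deque()      # (user, action) pairs waiting their turn

    def handle_query(self, speaker, action):
        # No action is running: start this one immediately.
        if self.current_action is None:
            self.current_user, self.current_action = speaker, action
            return f"performing {action}"
        # Same speaker asks again while their action runs: do not perform it,
        # and prompt the other users instead.
        if speaker == self.current_user:
            others = [u for u in self.users if u != speaker]
            return f"ignored {action}; prompting {', '.join(others)}"
        # A different user: hold the action until the current one completes.
        self.pending.append((speaker, action))
        return f"queued {action}"

    def complete_current(self):
        # When the running action finishes, start the next queued action.
        if self.pending:
            self.current_user, self.current_action = self.pending.popleft()
            return f"performing {self.current_action}"
        self.current_user = self.current_action = None
        return "idle"
```

In this sketch a queue is used so that more than one waiting user could be served in order; the abstract itself only describes a single second user.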

    SIMULTANEOUS ACOUSTIC EVENT DETECTION ACROSS MULTIPLE ASSISTANT DEVICES

    Publication No.: US20250131913A1

    Publication date: 2025-04-24

    Application No.: US18991928

    Application date: 2024-12-23

    Applicant: GOOGLE LLC

    Abstract: Implementations can detect respective audio data that captures an acoustic event at multiple assistant devices in an ecosystem that includes a plurality of assistant devices, process the respective audio data locally at each of the multiple assistant devices to generate respective measures that are associated with the acoustic event using respective event detection models, process the respective measures to determine whether the detected acoustic event is an actual acoustic event, and cause an action associated with the actual acoustic event to be performed in response to determining that the detected acoustic event is the actual acoustic event. In some implementations, the multiple assistant devices that detected the respective audio data are anticipated to detect the respective audio data that captures the actual acoustic event based on a plurality of historical acoustic events being detected at each of the multiple assistant devices.
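One way the per-device measures might be combined is sketched below. The abstract does not specify the combination rule, so the averaging, the `threshold` and `min_devices` parameters, and the function name `is_actual_acoustic_event` are assumptions made purely for illustration.

```python
from statistics import mean
from typing import Dict


def is_actual_acoustic_event(
    measures: Dict[str, float],
    threshold: float = 0.7,   # assumed decision threshold
    min_devices: int = 2,     # assumed minimum number of corroborating devices
) -> bool:
    """Decide whether a detected event is an actual event.

    `measures` maps a device name to the confidence that device's local
    event detection model assigned to the event (assumed to lie in [0, 1]).
    """
    # Require corroboration from several devices before trusting the event.
    if len(measures) < min_devices:
        return False
    # Hypothetical rule: average the per-device measures and threshold them.
    return mean(measures.values()) >= threshold
```

For example, two devices reporting 0.9 and 0.8 would pass under these assumed defaults, while a single device reporting 0.9 would not.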

    Media arbitration
    217.
    Granted invention patent

    Publication No.: US12284417B2

    Publication date: 2025-04-22

    Application No.: US18506085

    Application date: 2023-11-09

    Applicant: Google LLC

    Abstract: A method using media arbitration includes, while a first assistant-enabled device is performing a first long-standing operation, determining the first assistant-enabled device satisfies a co-presence condition with a second assistant-enabled device, and determining that the second assistant-enabled device is performing a second long-standing operation that conflicts with the first long-standing operation performed by the first assistant-enabled device. Based on determining that the first long-standing operation and the second long-standing operation conflict, the method also includes executing an operation arbitration routine to identify one or more compromise operations for at least one of the first assistant-enabled device or the second assistant-enabled device to perform, and instructing the first assistant-enabled device or the second assistant-enabled device to perform a selected compromise operation among the identified compromise operations.

    GENERATIVE NAVIGATIONAL CORPUS
    218.
    Invention patent application

    Publication No.: US20250094521A1

    Publication date: 2025-03-20

    Application No.: US18889117

    Application date: 2024-09-18

    Applicant: Google LLC

    Abstract: Disclosed implementations relate to structures that support an on-demand navigational corpus. An example method involves receiving a navigation request from a client device pertaining to an intent, determining seed content associated with the navigation request, utilizing a large foundational model to create a web page incorporating the seed content, based on a navigation model, and the intent, and delivering the generated web page for presentation on the client device. The method enables efficient and personalized web page generation based on user intent, enhancing user experience and facilitating dynamic navigation using raw seed content.

    Methods and systems for providing a secure automated assistant

    Publication No.: US12254038B2

    Publication date: 2025-03-18

    Application No.: US18538773

    Application date: 2023-12-13

    Applicant: GOOGLE LLC

    Abstract: Implementations described herein relate to receiving user input directed to an automated assistant, processing the user input to determine whether data from a server and/or third-party application is needed to perform certain fulfillment of an assistant command included in the user input, and generating a prompt that requests a user consent to transmitting of a request to the server and/or the third-party application to obtain the data needed to perform the certain fulfillment. In implementations where the user consents, the data can be obtained and utilized to perform the certain fulfillment. In implementations where the user does not consent, client data can be generated locally at a client device and utilized to perform alternate fulfillment of the assistant command. In various implementations, the request transmitted to the server and/or third-party application can be modified based on ambient noise captured when the user input is received.

    ACCELEROMETER-BASED ENDPOINTING MEASURE(S) AND/OR GAZE-BASED ENDPOINTING MEASURE(S) FOR SPEECH PROCESSING

    Publication No.: US20250087214A1

    Publication date: 2025-03-13

    Application No.: US18958655

    Application date: 2024-11-25

    Applicant: GOOGLE LLC

    Abstract: An overall endpointing measure can be generated based on an audio-based endpointing measure and (1) an accelerometer-based endpointing measure and/or (2) a gaze-based endpointing measure. The overall endpointing measure can be used in determining whether a candidate endpoint is an actual endpoint. Various implementations include generating the audio-based endpointing measure by processing an audio data stream, capturing a spoken utterance of a user, using an audio model. Various implementations additionally or alternatively include generating the accelerometer-based endpointing measure by processing a stream of accelerometer data using an accelerometer model. Various implementations additionally or alternatively include processing an image data stream using a gaze model to generate the gaze-based endpointing measure.
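A rough sketch of how an overall endpointing measure could be formed from the individual measures is given below. The abstract does not state the fusion rule, so the weighted average, the default `weights`, the `threshold`, and both function names are hypothetical choices for illustration only.

```python
from typing import Optional, Tuple


def overall_endpointing_measure(
    audio: float,
    accelerometer: Optional[float] = None,
    gaze: Optional[float] = None,
    weights: Tuple[float, float, float] = (0.6, 0.2, 0.2),  # assumed weights
) -> float:
    """Weighted mean over whichever measures are available (assumed rule)."""
    pairs = [(audio, weights[0])]
    if accelerometer is not None:
        pairs.append((accelerometer, weights[1]))
    if gaze is not None:
        pairs.append((gaze, weights[2]))
    # Renormalize so that missing measures do not drag the result down.
    total_weight = sum(w for _, w in pairs)
    return sum(m * w for m, w in pairs) / total_weight


def is_actual_endpoint(measure: float, threshold: float = 0.5) -> bool:
    """Treat a candidate endpoint as actual when the overall measure is high enough."""
    return measure >= threshold
```

With only the audio measure supplied, the function returns that measure unchanged; adding accelerometer or gaze measures shifts the result according to the assumed weights.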
