-
Publication No.: US11727919B2
Publication date: 2023-08-15
Application No.: US17303066
Filing date: 2021-05-19
Applicant: Sonos, Inc.
Abstract: Network microphone devices configured to detect keywords can include microphones for capturing sound samples. Features can be extracted from the sound samples by storing the sound samples in a first portion of a dynamic-access memory block, performing first computations based on spectral coefficients of the sound samples using a second portion of the memory block, and storing results of the first computations as extracted features in a third portion of the memory block. The second and third portions of the memory block can be designated as temporary memory. The extracted features are then processed using a neural network by storing the extracted features in a fourth portion of the memory block, performing second computations on the extracted features using the temporary memory, the second computations comprising computing at least one layer of the neural network, and storing an output of the neural network as a classification in the temporary memory.
-
Publication No.: US20230252976A1
Publication date: 2023-08-10
Application No.: US18301060
Filing date: 2023-04-14
Inventor(s): Kristin A. Gray
IPC classification: G10L15/00, G06F40/174, G10L15/187, G10L15/01, G10L15/06, G10L15/22, G10L15/30
CPC classification: G10L15/00, G06F40/174, G10L15/187, G10L15/01, G10L15/063, G10L15/22, G10L15/30, G10L2015/0635
Abstract: Techniques and apparatuses for recognizing accented speech are described. In some embodiments, an accent module recognizes accented speech using an accent library based on device data, uses different speech recognition correction levels based on an application field into which recognized words are set to be provided, or updates an accent library based on corrections made to incorrectly recognized speech.
-
Publication No.: US11721344B2
Publication date: 2023-08-08
Application No.: US18101301
Filing date: 2023-01-25
Applicant: Nextiva, Inc.
Abstract: A system and method are disclosed for generating a teleconference space for two or more communication devices using a computer coupled with a database and comprising a processor and memory. The computer generates a teleconference space and transmits requests to join the teleconference space to the two or more communication devices. The computer stores in memory identification information, and audiovisual data associated with one or more users, for each of the two or more communication devices. The computer stores audio transcription data, transmitted to the computer by each of the two or more communication devices and associated with one or more communication device users, in the computer memory. The computer merges the audio transcription data from each of the two or more communication devices into a master audio transcript, and transmits the master audio transcript to each of the two or more communication devices.
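The merge step in this abstract can be illustrated with a minimal sketch. The abstract does not say how per-device transcripts are ordered or combined, so everything here is an assumption made for illustration: the `Utterance` record, its `start` timestamp field, and the choice of a timestamp-ordered merge are hypothetical and not taken from the patent.

```python
import heapq
from dataclasses import dataclass


@dataclass(frozen=True)
class Utterance:
    # All fields are hypothetical; the abstract does not define a record layout.
    start: float      # seconds since the teleconference began
    device_id: str    # which communication device sent this transcription
    speaker: str
    text: str


def merge_transcripts(per_device):
    """Merge per-device utterance lists into one master transcript.

    Assumes each device's list is already sorted by start time, so a
    k-way merge keyed on the timestamp is enough.
    """
    return list(heapq.merge(*per_device.values(), key=lambda u: u.start))


def render(master):
    """Format the master transcript as one line per utterance."""
    return "\n".join(f"[{u.start:.1f}s] {u.speaker}: {u.text}" for u in master)
```

A caller would pass a mapping from device identifier to that device's utterance list and then send the rendered result back to every device.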
-
Publication No.: US11721321B2
Publication date: 2023-08-08
Application No.: US17409356
Filing date: 2021-08-23
Applicant: Rovi Guides, Inc.
Inventor(s): Shuchita Mehra
IPC classification: G06F40/58, G10L15/00, G06F16/242, G06F16/683, H04N21/442, H04N21/485
CPC classification: G10L15/005, G06F16/243, G06F16/685, G06F40/58, H04N21/44213, H04N21/4856
Abstract: Systems and methods for identifying content corresponding to a language are provided. Language spoken by a first user based on verbal input received from the first user is automatically determined with voice recognition circuitry. A database of content sources is cross-referenced to identify a content source associated with a language field value that corresponds to the determined language spoken by the first user. The language field in the database identifies the language that the associated content source transmits content to a plurality of users. A representation of the identified content source is generated for display to the first user.
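The cross-referencing step in this abstract amounts to a lookup on a language field. The sketch below assumes the spoken language has already been determined and is available as a language code; the in-memory list standing in for the database, the source names, and the field names are all made up for illustration.

```python
# Hypothetical stand-in for the database of content sources.
CONTENT_SOURCES = [
    {"name": "Channel A", "language": "en"},
    {"name": "Channel B", "language": "es"},
    {"name": "Channel C", "language": "es"},
]


def sources_for_language(sources, spoken_language):
    """Return the names of sources whose language field matches.

    `spoken_language` is assumed to be the output of a separate
    language-identification step that is not modelled here.
    """
    wanted = spoken_language.lower()
    return [s["name"] for s in sources if s["language"].lower() == wanted]
```

The returned names correspond to the representations that would be generated for display to the user.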
-
Publication No.: US11710483B2
Publication date: 2023-07-25
Application No.: US17208006
Filing date: 2021-03-22
Abstract: In an approach to controlling voice command execution via boundary creation, one or more computer processors determine one or more devices included in an Internet of Things platform. One or more computer processors receive, from a user, an indication of a boundary around two or more devices of the one or more devices. One or more computer processors create a boundary around the two or more devices of the one or more devices. One or more computer processors receive a voice command from the user associated with the two or more devices of the one or more devices. One or more computer processors transmit the voice command to the two or more devices of the one or more devices within the boundary.
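The sequence in this abstract (discover devices, create a boundary, route a voice command to the devices inside it) can be sketched as follows. The abstract does not say how a boundary is represented, so the axis-aligned rectangle, the 2D device coordinates, and the `received` list that stands in for transmission are all assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Device:
    name: str
    x: float  # hypothetical floor-plan coordinates
    y: float
    received: list = field(default_factory=list)  # stands in for real transmission


@dataclass(frozen=True)
class Boundary:
    # Assumed shape: an axis-aligned rectangle drawn by the user.
    x_min: float
    y_min: float
    x_max: float
    y_max: float

    def contains(self, device):
        return (self.x_min <= device.x <= self.x_max
                and self.y_min <= device.y <= self.y_max)


def dispatch(command, devices, boundary):
    """Send the command only to devices inside the boundary; return their names."""
    targets = [d for d in devices if boundary.contains(d)]
    for d in targets:
        d.received.append(command)
    return [d.name for d in targets]
```

Devices outside the rectangle never see the command, which is the behaviour the abstract describes for devices outside the boundary.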
-
Publication No.: US20230228832A1
Publication date: 2023-07-20
Application No.: US17577885
Filing date: 2022-01-18
Applicant: Vera Kozyr, Dmitry Lukashev, Egor Markov, Igor Mikhnenko
Inventor(s): Vera Kozyr, Dmitry Lukashev, Egor Markov, Igor Mikhnenko
IPC classification: G01S3/803, G01S3/86, G10L17/04, G10L15/00, G10L21/0216, G10L21/0272, G10L15/26, G08B21/04, G08B5/36
CPC classification: G01S3/803, G01S3/86, G08B5/36, G08B21/0446, G10L15/005, G10L15/26, G10L17/04, G10L21/0216, G10L21/0272, G10L2021/02165
Abstract: A wearable badge for an employee that records and transmits audio from client interactions with the professional, comprising two microphones and two microphone channels that focus one microphone on the speech of the employee and the other microphone on the speech of the customer, making diarizing easier. The wearable badge also comprises a module to determine whether or not the employee is maintaining an appropriate social distance with customers.
-
Publication No.: US11682414B1
Publication date: 2023-06-20
Application No.: US17576643
Filing date: 2022-01-14
Applicant: Apple Inc.
IPC classification: G10L25/51, G10L15/00, H04R3/04, H04R3/00, H04R5/04, H04R5/033, H04R5/027, G06F3/01, H04R3/12
CPC classification: G10L25/51, G06F3/017, G10L15/00, H04R3/005, H04R3/04, H04R3/12, H04R5/027, H04R5/033, H04R5/04, H04R2430/01
Abstract: Audio processing with audio transparency can include receiving a user content audio signal and receiving a microphone signal. The microphone signal can contain sensed sound of a user environment. Strength of the sensed sound can be increased based on strength of the user content audio signal, to reduce a masking of the sensed sound during playback. The sensed sound and the user content audio signal can be combined in a composite output audio signal used to drive a speaker. Other aspects are also described and claimed.
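The idea of boosting the sensed sound as the content gets louder, then mixing the two, can be sketched per audio frame. The abstract gives no gain law, so the linear rule, the RMS strength measure, and the constants `base_gain`, `slope`, and `max_gain` are assumptions chosen only to make the example concrete; samples are assumed to be floats in the range -1.0 to 1.0.

```python
import math


def rms(frame):
    """Root-mean-square level of a frame, used here as the 'strength' measure."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def transparency_gain(content_rms, base_gain=1.0, slope=2.0, max_gain=4.0):
    """Assumed gain law: grows linearly with content level, capped at max_gain."""
    return min(max_gain, base_gain + slope * content_rms)


def mix_frame(content, mic):
    """Combine content and boosted microphone samples into one output frame."""
    gain = transparency_gain(rms(content))
    # Clamp to the valid sample range so the composite signal cannot overflow.
    return [max(-1.0, min(1.0, c + gain * m)) for c, m in zip(content, mic)]
```

With silent content the microphone signal passes through at the base gain; as the content level rises, the sensed sound is amplified more so that it is less likely to be masked.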
-
Publication No.: US11676590B2
Publication date: 2023-06-13
Application No.: US17077974
Filing date: 2020-10-22
Applicant: Sonos, Inc.
CPC classification: G10L15/22, G06F3/167, G10L15/08, H04R3/005, H04R3/12, H04R5/04, G10L2015/088, G10L2015/223
Abstract: Example techniques involve a control hierarchy for a “smart” home having smart appliances and related devices, such as wireless illumination devices, home-automation devices (e.g., thermostats, door locks, etc.), and audio playback devices, among others. An example home includes various rooms in which smart devices might be located. Under the example control hierarchy described herein and referred to as “home graph,” a name of a room (e.g., “Kitchen”) may represent a smart device (or smart devices) within that room. In other words, from the perspective of a user, the smart devices within a room are that room. This hierarchy permits a user to refer to a smart device within a given room by way of the name of the room when controlling smart devices within the home using a voice user interface (VUI) or graphical user interface (GUI).
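The room-as-device idea in this abstract can be illustrated with a small lookup structure. This is only a sketch of the concept: the `HomeGraph` class, its method names, and the case-insensitive matching are hypothetical and do not reflect how the patent or any Sonos product implements the hierarchy.

```python
from collections import defaultdict


class HomeGraph:
    """Maps a room name to the smart devices located in that room."""

    def __init__(self):
        self._rooms = defaultdict(list)

    def add_device(self, room, device):
        # Room names are normalised so "Kitchen" and "kitchen" match (an assumption).
        self._rooms[room.strip().lower()].append(device)

    def resolve(self, target):
        """Return the devices a room name stands for, or an empty list."""
        return list(self._rooms.get(target.strip().lower(), []))
```

A voice or graphical front end could pass the room name from a request such as "turn off the Kitchen" to `resolve` and apply the action to every device returned.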
-
Publication No.: US20230178081A1
Publication date: 2023-06-08
Application No.: US18053364
Filing date: 2022-11-07
Inventor(s): Hajime KAWATAKE, Tatsuya INOUE
CPC classification: G10L15/26, G10L15/005, G06T11/60, H04N7/144, H04N2007/145
Abstract: An input relay unit receives speech data indicating a speech entered by a speaker. An input relay unit receives a confirmation request that is output in response to a predetermined operation of the speaker. A character string relay unit controls translation of the speech indicated by the speech data, which has been received before the reception of the confirmation request, to be started in response to the reception of the confirmation request. A display control unit controls a display unit to display a screen including an image obtained by overlaying a character string representing a translation result of a speech indicated by speech data that has been received before the reception of the confirmation request on an image captured by a capturing unit.
-
Publication No.: US11664009B2
Publication date: 2023-05-30
Application No.: US16260353
Filing date: 2019-01-29
Inventor(s): Taejun Kwon, Seongil Hahm, Seungsoo Kang
IPC classification: G10L15/00, H04N21/258, G10L15/22, H04N21/436, H04N21/422, H04N21/41
CPC classification: G10L15/005, G10L15/22, H04N21/25816, H04N21/42203, H04N21/43615, G10L2015/223, G10L2015/226, H04N21/4126
Abstract: An electronic apparatus which registers a device to a server by using a voice, and a method therefor are provided. The electronic apparatus includes a communication circuit, a microphone, a memory for storing computer executable instructions, and at least one processor configured to execute the computer executable instructions to acquire, from a voice received through the microphone, information on an external device which a user wishes to register, based on an external device corresponding to the acquired information being searched through the communication circuit, control the communication circuit to transmit information on an access point to the external device to enable the external device to communicate with a server, and control the communication circuit to transmit a registration request with respect to the external device to the server.