-
Publication No.: US11250217B1
Publication Date: 2022-02-15
Application No.: US16791421
Filing Date: 2020-02-14
Applicant: SoundHound, Inc.
Inventor: Keyvan Mohajer, Christopher S. Wilson, Kheng Khov, Ian Graves
Abstract: A client device receives a user request (e.g., in natural language form) to execute a command of an application. The client device delegates interpretation of the request to a response-processing server. Using domain knowledge previously provided by a developer of the application, the response-processing server determines the various possible responses that client devices could make in response to the request based on circumstances such as the capabilities of the client devices and the state of the application data. The response-processing server accordingly generates a response package that describes a number of different conditional responses that client devices could have to the request and provides the response package to the client device. The client device selects the appropriate response from the response package based on the circumstances as determined by the client device, executes the command (if possible), and provides the user with some representation of the response.
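The client-side selection step described in this abstract can be illustrated with a minimal sketch. It assumes a response package is an ordered list of conditional responses and that the client picks the first one whose condition holds for its own circumstances. The names `ConditionalResponse`, `select_response`, and the circumstance keys `has_display` and `item_found` are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class ConditionalResponse:
    """One entry of a hypothetical response package."""
    condition: Callable[[Dict[str, bool]], bool]  # evaluated on the client
    message: str                                  # what the user is told
    command: Optional[str] = None                 # command to run, if any


def select_response(package: List[ConditionalResponse],
                    circumstances: Dict[str, bool]) -> Optional[ConditionalResponse]:
    """Return the first response whose condition matches the client's circumstances."""
    for response in package:
        if response.condition(circumstances):
            return response
    return None


# Illustrative package: the server enumerates the possibilities,
# and the client decides which one applies.
example_package = [
    ConditionalResponse(
        lambda c: bool(c.get("item_found")) and bool(c.get("has_display")),
        "Here is the item on screen.",
        "show_item",
    ),
    ConditionalResponse(
        lambda c: bool(c.get("item_found")),
        "I found the item.",
        "announce_item",
    ),
    ConditionalResponse(lambda c: True, "Sorry, I could not find that item."),
]

chosen = select_response(example_package, {"item_found": True, "has_display": False})
print(chosen.message if chosen else "no response")
```

Ordering the entries from most to least specific keeps the fallback response last, so a client with limited capabilities still has something to say.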
-
Publication No.: US11238101B1
Publication Date: 2022-02-01
Application No.: US17081996
Filing Date: 2020-10-27
Applicant: SoundHound, Inc.
Inventor: Keyvan Mohajer
IPC: G06F16/9032, G10L15/18, G06F16/2457, G06F3/16, G10L15/22, H04N21/482, G10L15/26
Abstract: A command-processing server receives a natural language command from a user. The command-processing server has a set of domain command interpreters corresponding to different domains in which commands can be expressed, such as the domain of entertainment, or the domain of travel. Some or all of the domain command interpreters recognize user commands having a verbal prefix, an optional pre-filter, an object, and an optional post-filter; the pre- and post-filters may be compounded expressions involving multiple atomic filters. Different developers may independently specify the domain command interpreters and the sub-structure interpreters on which they are based.
-
Publication No.: US20210350784A1
Publication Date: 2021-11-11
Application No.: US17314732
Filing Date: 2021-05-07
Applicant: SoundHound, Inc.
Inventor: Mara SELVAGGI
IPC: G10L13/033, G10L13/047, G10L15/02
Abstract: A personalized name pronunciation is generated by receiving a request from a client device associated with a person ID. A lexical representation of a name is obtained, and pronunciation information for the name is created based on an input to the client device. The pronunciation information is stored with the lexical representation associated with the person ID in a database. A message request to provide a message that includes the name associated with the person ID may be received and a script obtained. The database is accessed using the person ID to obtain the pronunciation information for the name. Speech representing lexical text of the script is synthesized and an audio representation of the name is generated based on the pronunciation information. The speech and the audio representation of the name are delivered to at least one individual as audio.
-
Publication No.: US11144731B2
Publication Date: 2021-10-12
Application No.: US16128227
Filing Date: 2018-09-11
Applicant: SoundHound, Inc.
Inventor: Pranav Singh, Keyvan Mohajer, Kamyar Mohajer, Bernard Mont-Reynaud
IPC: G06F40/40, G10L15/30, G06Q30/02, G06Q20/10, G06F40/211
Abstract: A platform enables developers of applications, such as devices, with natural language interfaces to configure the availability of vertical domain modules in applications. Modules can include grammars for parsing natural language expressions and interfaces to data sources. Third party developers can create modules with pricing models for their usage or access to their data. Device developers can browse or search available modules and test their performance for specific queries. The platform enables device users to access the chosen modules as configured by device developers and provides for charging and payment between users, application developers, and module developers.
-
Publication No.: US20210314699A1
Publication Date: 2021-10-07
Application No.: US17301308
Filing Date: 2021-03-31
Applicant: SoundHound, Inc.
Inventor: Karl Stahl
IPC: H04R1/10
Abstract: A speaker device includes an electroacoustic transducer configured to convert an audio signal into a set of sound waves and a transmitter configured to transmit an electromagnetic signal that carries the audio signal for receipt at distances limited to an audibility range of the set of sound waves. The audibility range of the set of sound waves corresponds to a distance at which the set of sound waves is estimated to be below a predetermined sound level.
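As a rough illustration of how an audibility range might be estimated, the sketch below assumes free-field spherical spreading, where the sound level drops by about 6 dB for each doubling of distance. The abstract does not say how the estimate is made, so the propagation model, the function name `audibility_range_m`, and the example levels are all assumptions.

```python
def audibility_range_m(level_db_at_ref: float,
                       threshold_db: float,
                       ref_distance_m: float = 1.0) -> float:
    """Distance at which the level falls to threshold_db.

    Assumes L(d) = L_ref - 20 * log10(d / d_ref), i.e. free-field
    spherical spreading with no absorption or reflections.
    """
    if threshold_db >= level_db_at_ref:
        # Already at or below the threshold at the reference distance.
        return ref_distance_m
    return ref_distance_m * 10 ** ((level_db_at_ref - threshold_db) / 20.0)


# Example: 80 dB measured at 1 m, predetermined sound level of 40 dB.
print(audibility_range_m(80.0, 40.0))
```

A transmitter could then limit its power so that the electromagnetic signal is receivable only within roughly this distance; real rooms would need a more careful acoustic model.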
-
Publication No.: US20210312901A1
Publication Date: 2021-10-07
Application No.: US17146239
Filing Date: 2021-01-11
Applicant: SoundHound, Inc.
Inventor: Anton V. RELIN
Abstract: Systems for automatic speech recognition and/or natural language understanding automatically learn new words by finding subsequences of phonemes that, if they were a new word, would enable a successful tokenization of a phoneme sequence. Systems can learn alternate pronunciations of words by finding phoneme sequences with a small edit distance to existing pronunciations. Systems can learn the part of speech of words by finding part-of-speech variations that would enable parses by syntactic grammars. Systems can learn what types of entities a word describes by finding sentences that could be parsed by a semantic grammar but for the words not being on an entity list.
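The idea of learning alternate pronunciations from phoneme sequences with a small edit distance can be sketched as follows. This is only an illustration using standard Levenshtein distance over phoneme lists. The lexicon contents, the ARPAbet-style phoneme symbols, the function names, and the distance threshold are assumptions, not details from the patent.

```python
from typing import Dict, List, Sequence


def edit_distance(a: Sequence[str], b: Sequence[str]) -> int:
    """Levenshtein distance between two phoneme sequences."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        cur = [i]
        for j, y in enumerate(b, start=1):
            cur.append(min(prev[j] + 1,               # deletion
                           cur[j - 1] + 1,            # insertion
                           prev[j - 1] + (x != y)))   # substitution or match
        prev = cur
    return prev[-1]


def candidate_words(observed: Sequence[str],
                    lexicon: Dict[str, List[List[str]]],
                    max_distance: int = 1) -> List[str]:
    """Words for which `observed` could be a new alternate pronunciation."""
    return sorted(
        word for word, pronunciations in lexicon.items()
        if any(0 < edit_distance(observed, p) <= max_distance
               for p in pronunciations)
    )


# Hypothetical lexicon with one known pronunciation per word.
lexicon = {
    "tomato": [["T", "AH", "M", "EY", "T", "OW"]],
    "potato": [["P", "AH", "T", "EY", "T", "OW"]],
}
observed = ["T", "AH", "M", "AA", "T", "OW"]
print(candidate_words(observed, lexicon))
```

Excluding a distance of zero avoids proposing a pronunciation the lexicon already contains.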
-
Publication No.: US11138205B1
Publication Date: 2021-10-05
Application No.: US16292190
Filing Date: 2019-03-04
Applicant: SoundHound, Inc.
Inventor: Keyvan Mohajer, Bernard Mont-Reynaud, Philipp Hubert
IPC: G06F16/00, G06F16/2457, G06F16/2455, G06F40/40
Abstract: A query-processing server provides natural language services to applications. More specifically, the query-processing server receives and stores domain knowledge information from application developers, the domain knowledge information comprising a linguistic description of the natural language user queries that application developers wish their applications to support. A first portion of the domain knowledge information is applied to transform a natural language query received from an application to an ordered sequence of question elements. A second portion of the domain knowledge information is applied to group the ordered sequence of question elements into a plurality of distinct structured questions posed by the natural language query. The distinct structured questions may then be provided to the application, which may then execute them and obtain the corresponding data referenced by the questions.
-
Publication No.: US20210241759A1
Publication Date: 2021-08-05
Application No.: US16781214
Filing Date: 2020-02-04
Applicant: SoundHound, Inc.
Inventor: Hsuan Yang, Qìndí Zhäng, Warren S. Heit
Abstract: A system and method are disclosed for ignoring a wakeword received at a speech-enabled listening device when it is determined that the wakeword is reproduced audio from an audio-playing device. Determination can be by detecting audio distortions, by an ignore flag sent locally between an audio-playing device and speech-enabled device, by an ignore flag sent from a server, by comparison of received audio played audio to a wakeword within an audio-playing device or a speech-enabled device, and other means.
-
Publication No.: US20210224043A1
Publication Date: 2021-07-22
Application No.: US17225997
Filing Date: 2021-04-08
Applicant: SoundHound, Inc.
Inventor: Bernard Mont-Reynaud, Seyed M. Emami, Chris Wilson, Keyvan Mohajer
Abstract: A method of building a natural language understanding application is provided. The method includes receiving at least one electronic record containing programming code and creating executable code from the programming code. Further, the executable code, when executed by a processor, causes the processor to create a parse and an interpretation of a sequence of input tokens. The programming code includes an interpret-block, and the interpret-block includes an interpret-statement. Additionally, the interpret-statement includes a pattern expression and an action statement.
-
Publication No.: US20210217431A1
Publication Date: 2021-07-15
Application No.: US16740440
Filing Date: 2020-01-11
Applicant: SoundHound, Inc.
Inventor: Steve PEARSON
IPC: G10L21/013, G10L21/0208, G06N3/08, G06N20/00
Abstract: A voice morphing apparatus having adjustable parameters is described. The disclosed system and method include a voice morphing apparatus that morphs input audio to mask a speaker's identity. Parameter adjustment uses evaluation of an objective function that is based on the input audio and output of the voice morphing apparatus. The voice morphing apparatus includes objectives that are based adversarially on speaker identification and positively on audio fidelity. Thus, the voice morphing apparatus is adjusted to reduce identifiability of speakers while maintaining fidelity of the morphed audio. The voice morphing apparatus may be used as part of an automatic speech recognition system.