Methods and systems for word edit distance embedding

    Publication No.: US12057108B2

    Publication date: 2024-08-06

    Application No.: US17071913

    Filing date: 2020-10-15

    IPC classification: G10L15/16 G06N3/08 G10L15/10

    CPC classification: G10L15/16 G06N3/08 G10L15/10

    Abstract: A system for classifying words in a batch of words can include at least one memory device storing instructions for causing at least one processor to create dictionary vectors for each of a plurality of dictionary words using a neural network (NN), store each dictionary vector along with a classification indicator corresponding to the associated dictionary word, and create word vectors for each word in a batch of words for classification using the NN. The closest matching dictionary vector is found for each word vector, and the classification indicators of the closest matching dictionary vectors for the word vectors in the batch are reported.
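
The lookup flow in this abstract can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not describe the neural network, so `embed` is a hypothetical stand-in that counts character bigrams, and the names `build_index`, `classify_batch`, `DICTIONARY`, and `INDEX` are invented for the example.

```python
from collections import Counter
import math


def embed(word):
    """Hypothetical stand-in for the NN: character-bigram counts of the word."""
    padded = f"^{word.lower()}$"
    return Counter(padded[i:i + 2] for i in range(len(padded) - 1))


def distance(a, b):
    """Euclidean distance between two sparse count vectors."""
    keys = set(a) | set(b)
    return math.sqrt(sum((a[k] - b[k]) ** 2 for k in keys))


def build_index(dictionary):
    """Store each dictionary vector with its classification indicator."""
    return [(embed(word), label) for word, label in dictionary.items()]


def classify_batch(words, index):
    """Report the indicator of the closest dictionary vector for each word."""
    results = []
    for word in words:
        vec = embed(word)
        _, label = min(index, key=lambda entry: distance(vec, entry[0]))
        results.append(label)
    return results


# Illustrative dictionary: word -> classification indicator (assumed values).
DICTIONARY = {"invoice": "billing", "refund": "billing", "password": "account"}
INDEX = build_index(DICTIONARY)
```

With this stand-in embedding, `classify_batch(["invoise", "pasword"], INDEX)` returns `["billing", "account"]`, because each misspelling shares most of its bigrams with one dictionary word. A learned embedding would replace `embed` while the index and nearest-vector search stay the same.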

    MACHINE LEARNING (ML)-BASED DUAL LAYER CONVERSATIONAL ASSIST SYSTEM

    Publication No.: US20240184992A1

    Publication date: 2024-06-06

    Application No.: US18074653

    Filing date: 2022-12-05

    Abstract: Systems and methods for increasing accuracy in an online chat interface are provided. Methods may be executed via a machine-learning (ML)-based chat monitoring engine. Methods may include intercepting a request utterance. Methods may include computing, via a trained ML model, an intent of the request utterance; generating a target response to the request utterance; intercepting a response utterance; and calculating a difference between the response utterance and the target response. In response to the difference being less than a threshold difference, methods may include releasing the response utterance to be transmitted as a response message. In response to the difference being more than the threshold difference, methods may include preventing the response utterance from being transmitted as a response message, generating a revised response utterance that is less than the threshold difference apart from the target response, and transmitting the revised response as a response message to a remote computing device.
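
The release-or-revise decision described here can be sketched as a simple gate. The abstract does not say how the difference is calculated, so this example assumes a character-level measure built on the standard library's `difflib.SequenceMatcher`. The function names and the default threshold of 0.5 are hypothetical, and the revision step simply falls back to the target response.

```python
from difflib import SequenceMatcher


def difference(response, target):
    """Assumed metric: 1 minus the similarity ratio, a value between 0 and 1."""
    ratio = SequenceMatcher(None, response.lower(), target.lower()).ratio()
    return 1.0 - ratio


def gate_response(response, target, threshold=0.5):
    """Release the response if it is close enough to the target, else revise it."""
    if difference(response, target) < threshold:
        return response  # released unchanged
    # Assumption: the simplest revision within the threshold is the target itself.
    return target
```

For example, `gate_response("Your balance is $20.", "Your balance is $20")` releases the original utterance, while `gate_response("I don't know", "Your balance is $20")` substitutes the target. A real system would more plausibly compare meaning rather than characters.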

    Systems and methods for providing responses from media content

    Publication No.: US11887586B2

    Publication date: 2024-01-30

    Application No.: US17191512

    Filing date: 2021-03-03

    Applicant: Spotify AB

    Abstract: A method includes retrieving a plurality of transcripts from a database. Each transcript in the plurality of transcripts corresponds to audio from a media content item of a plurality of media content items that are provided by a media providing service. The method also includes applying each transcript of the plurality of transcripts to a trained computational model, and receiving a user request for information regarding a topic. The method further includes, in response to the user request, identifying a transcript from the database that is relevant to the topic, and a position within the transcript that is relevant to the topic. The method also includes providing, by the media providing service, at least a portion of a media content item corresponding to the identified transcript, beginning at a starting position that is based on the position within the identified transcript that is relevant to the topic.
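
The step of mapping a topic to a transcript and a playback position can be illustrated with a small sketch. The trained computational model is not described in the abstract, so plain keyword overlap is swapped in here as an assumption. The data layout (timestamped segments per media item) and the names `find_start` and `TRANSCRIPTS` are hypothetical.

```python
def find_start(transcripts, topic):
    """Return (media_id, start_seconds) of the best-matching segment, or None."""
    terms = set(topic.lower().split())
    best = None  # (score, media_id, start_seconds)
    for media_id, segments in transcripts.items():
        for start, text in segments:
            words = (w.strip(".,!?") for w in text.lower().split())
            score = sum(1 for w in words if w in terms)
            # Keep the first segment with the highest keyword overlap.
            if score > 0 and (best is None or score > best[0]):
                best = (score, media_id, start)
    return None if best is None else (best[1], best[2])


# Illustrative data: media id -> list of (start time in seconds, segment text).
TRANSCRIPTS = {
    "episode-1": [(0.0, "Welcome to the show."),
                  (42.5, "Today we discuss sourdough baking.")],
    "episode-2": [(0.0, "News of the week."),
                  (90.0, "A short note on baking bread.")],
}
```

Here `find_start(TRANSCRIPTS, "sourdough baking")` returns `("episode-1", 42.5)`, which a player could use as the starting position, and a topic with no matching segment yields `None`.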

    Mitigating voice frequency loss
    Granted patent

    Publication No.: US11854572B2

    Publication date: 2023-12-26

    Application No.: US17302981

    Filing date: 2021-05-18

    Abstract: Computer-implemented methods, computer program products, and computer systems for mitigating frequency loss may include one or more processors configured for receiving first audio data corresponding to unobstructed user utterances, receiving second audio data corresponding to first obstructed user utterances, generating a frequency loss (FL) model representing frequency loss between the first audio data and the second audio data, receiving third audio data corresponding to one or more second obstructed user utterances, processing the third audio data using the FL model to generate fourth audio data corresponding to a frequency loss mitigated version of the second obstructed user utterances, and transmitting the fourth audio data to a recipient computing device. The first obstructed user utterances are obstructed by a facemask and the one or more second obstructed user utterances are obstructed by the facemask. The FL model may be executed as an audio plugin in a web conferencing program.
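
One very simple reading of a frequency loss model is a set of per-band gains. The sketch below assumes that reading, which the abstract does not confirm. It takes average magnitudes per frequency band as plain lists, and `fit_fl_model` and `mitigate` are hypothetical names.

```python
def fit_fl_model(clear_bands, masked_bands, eps=1e-9):
    """Per-band gain that maps masked magnitudes back toward unobstructed ones."""
    return [c / max(m, eps) for c, m in zip(clear_bands, masked_bands)]


def mitigate(masked_bands, gains):
    """Apply the per-band gains to newly received obstructed audio."""
    return [m * g for m, g in zip(masked_bands, gains)]
```

If the mask halves the second band and quarters the third, `fit_fl_model([1.0, 1.0, 1.0], [1.0, 0.5, 0.25])` gives gains of `[1.0, 2.0, 4.0]`, and applying them to `[2.0, 1.0, 0.5]` restores `[2.0, 2.0, 2.0]`. A practical model would work on short-time spectra and limit the gain to avoid amplifying noise.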

    INFORMATION PROCESSING DEVICE AND INFORMATION PROCESSING METHOD

    Publication No.: US20190214023A1

    Publication date: 2019-07-11

    Application No.: US16330131

    Filing date: 2017-08-04

    Applicant: SONY CORPORATION

    Inventor: Keigo IHARA

    Abstract: [Object] To provide an information processing device and an information processing method that can collect speech voice of a user, and recognize a specific user on the basis of the number of speeches performed by the user within a predetermined period. [Solution] An information processing device including: a communication unit capable of receiving voice information regarding voice collected by a plurality of microphones disposed discretely; and a control unit configured to determine a user identified on the basis of voice information regarding voice collected by a specific microphone among the plurality of microphones, the voice information having been received via the communication unit, to be a specific user that has performed speech a predefined number of times or more within at least a certain period of time, and control voice information to be transmitted to the specific user, to be transmitted to a speaker corresponding to the specific microphone, via the communication unit.
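
The "predefined number of times within a certain period" test can be sketched with a sliding window over speech timestamps. The event layout, the function name `specific_users`, and the parameter names are assumptions for illustration, and speaker identification itself is taken as already done.

```python
from collections import defaultdict


def specific_users(events, window_seconds, min_count):
    """Return (user, microphone) pairs with min_count speeches inside one window.

    events: iterable of (timestamp_seconds, user_id, microphone_id).
    """
    by_pair = defaultdict(list)
    for ts, user, mic in events:
        by_pair[(user, mic)].append(ts)

    result = set()
    for pair, stamps in by_pair.items():
        stamps.sort()
        left = 0
        for right, ts in enumerate(stamps):
            # Shrink the window until it spans at most window_seconds.
            while ts - stamps[left] > window_seconds:
                left += 1
            if right - left + 1 >= min_count:
                result.add(pair)
                break
    return result
```

Given three speeches by one user at a microphone within 20 seconds and two speeches by another user 500 seconds apart, a call with `window_seconds=60` and `min_count=3` returns only the first pair. The microphone in each pair tells the system which speaker should receive voice information addressed to that user.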

    VOICE ANALYSIS TRAINING SYSTEM
    Patent application

    Publication No.: US20180261219A1

    Publication date: 2018-09-13

    Application No.: US15914893

    Filing date: 2018-03-07

    Applicant: SalesBoost, LLC

    Inventor: Margaret L BROOKS

    Abstract: A method for performing voice analysis includes storing, in a database, a simulation file for conducting a training session with a user, the simulation file including at least a script. The method includes further storing, in the database, desired attributes associated with the simulation file. The method also includes retrieving, by a server, the simulation file from the database and providing, by a client application, a user interface to conduct the voice analysis using the simulation file from the database. The method further includes receiving, at the client application, one or more voice impressions from a user and analyzing, at an audio analysis tool, at least one of the voice impressions of the user. The method additionally includes determining, at the audio analysis tool, attributes of the at least one voice impression in response to analyzing the at least one voice impression and comparing, at the audio analysis tool, the determined attributes to the desired attributes associated with the simulation file. The method provides, by the client application, feedback to the user based on the comparison.
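
The comparison of determined attributes against desired attributes can be sketched as a range check that produces feedback lines. The abstract does not list the attributes, so the attribute names, the use of acceptable ranges, and the function name `feedback` are all assumptions for illustration.

```python
def feedback(determined, desired):
    """Compare measured attributes to desired ranges and return feedback lines.

    determined: attribute name -> measured value.
    desired: attribute name -> (low, high) acceptable range.
    """
    notes = []
    for name, (low, high) in desired.items():
        value = determined.get(name)
        if value is None:
            notes.append(f"{name}: not measured")
        elif value < low:
            notes.append(f"{name}: {value} is below the desired range {low}-{high}")
        elif value > high:
            notes.append(f"{name}: {value} is above the desired range {low}-{high}")
        else:
            notes.append(f"{name}: on target")
    return notes
```

For a recording measured at 190 words per minute against a desired range of 120 to 160, the function reports that the pace is above the range, which the client application could show to the trainee.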