METHOD FOR RECOMMENDING VIDEO CONTENT
    Invention application

    Publication No.: WO2020157283A1

    Publication date: 2020-08-06

    Application No.: PCT/EP2020/052446

    Filing date: 2020-01-31

    Applicant: MOODAGENT A/S

    Abstract: A method of recommending video content using a computer-based system (30), the method comprising providing (101) an initial set comprising a plurality of videos (1); extracting (102) a digital audio signal (2) from each of the plurality of videos (1); determining (103) at least one temporal sequence (4) of low-level audio features for each digital audio signal (2) of the plurality of videos (1) by analyzing the digital audio signals (2); calculating (104) an audio similarity index (5) between each of the plurality of videos (1) by comparing their respective at least one temporal sequence (4) of low-level audio features; receiving (105) a query Q comprising a reference to a seed video, the seed video being one of the plurality of videos (1); determining (106), for the seed video, a ranking (7) of the rest of the initial set of videos (1) based on their audio similarity index (5) with respect to the seed video; and returning (107), as a reply to the query Q, an ordered set of video references according to the ranking (7).
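The ranking step in the abstract above can be illustrated with a minimal sketch. The abstract does not say how the audio similarity index is computed, so this example assumes that each temporal sequence of low-level features is averaged into one vector and compared by cosine similarity. The names `summarize`, `cosine` and `rank_by_audio_similarity` are hypothetical and do not come from the application.

```python
import math
from typing import Dict, List, Sequence

# One video = a temporal sequence of feature frames (frames x features).
FeatureSequence = Sequence[Sequence[float]]


def summarize(seq: FeatureSequence) -> List[float]:
    """Collapse a temporal sequence into its per-feature mean (assumption)."""
    n = len(seq)
    dims = len(seq[0])
    return [sum(frame[d] for frame in seq) / n for d in range(dims)]


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity, used here as a stand-in audio similarity index."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    if na == 0 or nb == 0:
        return 0.0
    return dot / (na * nb)


def rank_by_audio_similarity(
    features: Dict[str, FeatureSequence], seed_id: str
) -> List[str]:
    """Return all other video ids, most similar to the seed first."""
    seed_vec = summarize(features[seed_id])
    scores = {
        vid: cosine(seed_vec, summarize(seq))
        for vid, seq in features.items()
        if vid != seed_id
    }
    return sorted(scores, key=lambda vid: scores[vid], reverse=True)
```

A query for seed video `"a"` would then be answered with `rank_by_audio_similarity(features, "a")`, where `features` maps each video id to its extracted feature sequence.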

    APPARATUS, METHOD AND COMPUTER PROGRAM CODE FOR PROCESSING AN AUDIO STREAM

    Publication No.: WO2023020999A1

    Publication date: 2023-02-23

    Application No.: PCT/EP2022/072773

    Filing date: 2022-08-15

    Abstract: Apparatus, method, and computer program code for processing an audio stream. The method includes: receiving (206) a candidate audio stream and candidate metadata; obtaining (208) candidate audio fingerprints; comparing (212) the candidate audio stream against a database using the candidate audio fingerprints; if no matching track is found, comparing (214, 218) the candidate audio stream against the sample storage using the candidate audio fingerprints, and computing (220) a merging score; and, if the merging score meets a merging score threshold, merging (226, 228) the candidate audio fingerprints with sample audio fingerprints of matching sample tracks and the candidate metadata with sample metadata of the matching sample tracks to create a new final track, and saving (230) the new final track in the database, or else storing (236) the candidate audio fingerprints and the candidate metadata in the sample storage.
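The decision flow in the abstract above (match against the database, otherwise match against the sample storage, then merge or store) can be sketched as follows. Several details are assumptions made only for illustration: fingerprints are modelled as sets of integers, two tracks "match" when their Jaccard overlap reaches `match_threshold`, and the merging score is taken to be the number of matching sample tracks. The abstract defines none of these, and the names `Track`, `overlap` and `process` are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Set


@dataclass
class Track:
    fingerprints: Set[int]
    metadata: dict


def overlap(a: Set[int], b: Set[int]) -> float:
    """Jaccard overlap between two fingerprint sets (assumed match measure)."""
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


def process(
    candidate: Track,
    database: List[Track],
    samples: List[Track],
    match_threshold: float = 0.5,
    merge_threshold: int = 2,
) -> str:
    """Return 'matched', 'merged' or 'stored' for one candidate stream."""
    # Compare the candidate against the database of final tracks.
    for track in database:
        if overlap(candidate.fingerprints, track.fingerprints) >= match_threshold:
            return "matched"

    # No final track matched: compare against the sample storage.
    matching = [
        s for s in samples
        if overlap(candidate.fingerprints, s.fingerprints) >= match_threshold
    ]
    merging_score = len(matching)  # assumption: score = number of matches

    if merging_score >= merge_threshold:
        # Merge fingerprints and metadata into a new final track.
        fingerprints = set(candidate.fingerprints)
        metadata = dict(candidate.metadata)
        for s in matching:
            fingerprints |= s.fingerprints
            for key, value in s.metadata.items():
                metadata.setdefault(key, value)  # candidate values win
        database.append(Track(fingerprints, metadata))
        return "merged"

    # Score too low: keep the candidate as a sample for later merging.
    samples.append(candidate)
    return "stored"
```

The sketch leaves merged samples in the sample storage, because the abstract does not say whether they are removed after a merge.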

    VOICE PACKAGE RECOMMENDATION METHOD AND APPARATUS, ELECTRONIC DEVICE, AND STORAGE MEDIUM

    Publication No.: WO2021068467A1

    Publication date: 2021-04-15

    Application No.: PCT/CN2020/083545

    Filing date: 2020-04-07

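    Abstract: A voice package recommendation method and apparatus, an electronic device, and a storage medium, relating to the technical field of intelligent search. The method comprises: obtaining a search request of a user (101); recognizing the search request to obtain a timbre interest identifier of the user (102); and searching for a corresponding target voice package according to the timbre interest identifier and recommending it to the user (103). By recognizing the user's search request to obtain the user's timbre interest identifier, and recommending voice packages whose timbre interests the user according to that identifier, the method achieves personalized voice package recommendation without requiring the user to choose a voice package by auditioning them one by one; operation is simple, recommendation is accurate, and the degree of intelligence is improved.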

    SYSTEMS AND METHODS FOR OPERATING AN OUTPUT DEVICE

    Publication No.: WO2019236581A1

    Publication date: 2019-12-12

    Application No.: PCT/US2019/035391

    Filing date: 2019-06-04

    Abstract: Systems and methods for operation and control of a smart device, generally a video output device. An aspect is a gesture-based control system that identifies the operative user, regardless of how many potential users are present in the room, and regardless of where each potential user is disposed in the room. Another aspect is controlling and interfacing with a user output device using various types of queries and context cues, and responding to queries by resolving ambiguities in the query. These aspects may be used independently or in combination.

    IOT-BASED REMOTELY CONTROLLABLE ELECTRONIC BLACKBOARD SYSTEM USING BLOCKCHAIN

    Publication No.: WO2022114438A1

    Publication date: 2022-06-02

    Application No.: PCT/KR2021/008999

    Filing date: 2021-07-13

    Inventor: 구기도

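    Abstract: The object of the present invention is to perform, in an electronic blackboard including a touch panel, a function of controlling, by wire or wirelessly, external devices connected through an Internet of Things (IoT) network according to various kinds of IoT input information (voice recognition input information, touch input information, and wireless input information). To this end, the invention provides an electronic blackboard system in which the electronic blackboard includes a panel unit capable of touch recognition and a sensor unit that performs touch recognition according to the touch method when the panel unit is touched, and is connected by wire or wirelessly so as to interoperate with peripheral devices connected on an IoT basis within a certain physical distance. The system includes a control unit comprising: a first signal processing unit that, in response to a touch input signal, performs a control function corresponding to the touch signal recognized by the panel unit and the sensor unit; a second signal processing unit that, in response to a wireless input signal, performs a control function corresponding to a wireless control signal received over a wireless communication network; a third signal processing unit that, in response to a voice input signal, converts an analog voice signal into a digital text signal, interprets the command, and performs a control function corresponding to the interpreted command; and a controller that remotely controls a peripheral device when the control function performed by the first to third signal processing units is control of a peripheral device connected to the electronic blackboard by wire or wirelessly.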

    ELECTRONIC DEVICE FOR PROVIDING ADVERTISEMENTS THROUGH A VOICE ASSISTANT, AND CONTROL METHOD THEREFOR

    Publication No.: WO2022102975A1

    Publication date: 2022-05-19

    Application No.: PCT/KR2021/013707

    Filing date: 2021-10-06

    Inventor: 민범기

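    Abstract: An electronic device includes a microphone, a speaker, a memory, and a processor. The processor obtains a product keyword from text converted from a user voice input through the microphone; identifies a product category related to the product keyword; determines whether to provide an advertisement related to the user voice based on the user's degree of interest according to utterance history information and on the user's advertisement fatigue; and, if it determines to provide an advertisement, obtains advertisement information based on a target product category determined according to the user's degree of interest and controls the speaker to output an advertisement voice based on the advertisement information.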
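The advertisement decision in the abstract above can be pictured with a small sketch. Everything concrete here is an assumption made for illustration: the keyword table `KEYWORD_TO_CATEGORY`, the definition of interest as the share of past utterances in the same category, the definition of fatigue as a count of recently heard advertisements, and both thresholds. The abstract does not specify how interest or fatigue are measured, and the sketch simply uses the identified category as the target category.

```python
from collections import Counter
from typing import List, Optional

# Hypothetical keyword-to-category table; not taken from the application.
KEYWORD_TO_CATEGORY = {
    "sneakers": "shoes",
    "boots": "shoes",
    "laptop": "computers",
}


def decide_ad(
    text: str,
    utterance_history: List[str],
    ads_heard_recently: int,
    min_interest: float = 0.3,
    max_fatigue: int = 3,
) -> Optional[str]:
    """Return the product category to advertise, or None for no ad.

    utterance_history lists the product categories of past utterances.
    """
    # Obtain a product keyword from the recognized text.
    words = text.lower().split()
    keyword = next((w for w in words if w in KEYWORD_TO_CATEGORY), None)
    if keyword is None:
        return None

    # Identify the related product category.
    category = KEYWORD_TO_CATEGORY[keyword]

    # Interest: share of past utterances in this category (assumption).
    history = Counter(utterance_history)
    total = sum(history.values())
    interest = history[category] / total if total else 0.0

    # Provide an ad only if interest is high enough and fatigue is low.
    if interest < min_interest or ads_heard_recently >= max_fatigue:
        return None
    return category
```

For example, `decide_ad("I need new sneakers", ["shoes", "shoes", "computers"], 0)` selects the `"shoes"` category, while the same request from a user who has already heard five recent advertisements yields no advertisement.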

    SEMI-DELEGATED CALLING BY AN AUTOMATED ASSISTANT ON BEHALF OF HUMAN PARTICIPANT

    Publication No.: WO2021188126A1

    Publication date: 2021-09-23

    Application No.: PCT/US2020/029346

    Filing date: 2020-04-22

    Applicant: GOOGLE LLC

    Abstract: Implementations are directed to using an automated assistant to initiate an assisted call on behalf of a given user. The assistant can, during the assisted call, receive a request, from an additional user on the assisted call, for information that is not known to the assistant. In response, the assistant can render a prompt for the information and, while awaiting responsive input from the given user, continue the assisted call using already resolved value(s) for the assisted call. If responsive input is received within a threshold duration of time, synthesized speech, corresponding to the responsive input, is rendered as part of the assisted call. Implementations are additionally or alternatively directed to using the automated assistant to provide, during an ongoing call between a given user and an additional user, output that is based on a value requested by the additional user during the ongoing call.

    ACOUSTIC EVENT RECOGNITION DEVICE, METHOD, AND PROGRAM

    Publication No.: WO2020054409A1

    Publication date: 2020-03-19

    Application No.: PCT/JP2019/033624

    Filing date: 2019-08-28

    Inventor: 島田 一希

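    Abstract: The present technology relates to an acoustic event recognition device and method, and a program, that allow recognition targets to be added after the fact. The acoustic event recognition device comprises: a feature extraction unit that extracts features from an input acoustic signal; an in-label recognition unit that recognizes whether the input acoustic signal represented by the features is an acoustic event within the range of labels assigned in advance, and outputs the recognition result; a same/different determination unit that, when the in-label recognition unit could not recognize an acoustic event, outputs a determination result by determining whether the signal is the same as or different from an acoustic event acquired independently of the labels; and a flag management unit that determines whether the flag corresponding to the acoustic event output by the in-label recognition unit or the same/different determination unit is enabled and, if the flag is enabled, outputs that acoustic event as the recognition result. The present technology can be applied to acoustic event recognition devices.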
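The three-stage flow in the abstract above (in-label recognition, then a same/different determination against events added later, then a flag check) can be sketched as follows. The abstract does not describe the models involved, so this example assumes that each event is represented by a single prototype feature vector and that a signal counts as recognized when it lies within a Euclidean distance `threshold` of a prototype. The class `Recognizer` and its methods are hypothetical names.

```python
import math
from typing import Dict, Optional, Sequence


class Recognizer:
    def __init__(self, labeled: Dict[str, Sequence[float]], threshold: float):
        # Prototypes for events labeled in advance.
        self.labeled = dict(labeled)
        # Prototypes for events registered after the fact (no label model).
        self.registered: Dict[str, Sequence[float]] = {}
        # One enable flag per event, managed separately from recognition.
        self.flags = {name: True for name in labeled}
        self.threshold = threshold

    def register(self, name: str, features: Sequence[float]) -> None:
        """Add a new recognition target from a single example."""
        self.registered[name] = list(features)
        self.flags[name] = True

    def _nearest(
        self, features: Sequence[float], prototypes: Dict[str, Sequence[float]]
    ) -> Optional[str]:
        """Return the closest prototype name if within the threshold."""
        best = min(
            prototypes,
            key=lambda name: math.dist(features, prototypes[name]),
            default=None,
        )
        if best is not None and math.dist(features, prototypes[best]) <= self.threshold:
            return best
        return None

    def recognize(self, features: Sequence[float]) -> Optional[str]:
        # Stage 1: in-label recognition.
        event = self._nearest(features, self.labeled)
        # Stage 2: same/different determination against registered events.
        if event is None:
            event = self._nearest(features, self.registered)
        # Stage 3: output only events whose flag is enabled.
        if event is not None and self.flags.get(event, False):
            return event
        return None
```

Registering a new target with `register("doorbell", features)` makes it recognizable without retraining the in-label stage, and clearing `flags["doorbell"]` suppresses it again.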

    KEYWORD VARIATION FOR QUERYING FOREIGN LANGUAGE AUDIO RECORDINGS

    Publication No.: WO2022272281A1

    Publication date: 2022-12-29

    Application No.: PCT/US2022/073113

    Filing date: 2022-06-23

    Abstract: Techniques are disclosed for searching audio recordings in a second language with a key phrase in a first language. For example, a system as described herein receives a first key phrase in the first language and an audio recording in the second language. The system converts the first key phrase into a second key phrase in the second language. The system processes the second key phrase to produce a second key phrase variant. The system identifies, from a graph of words in the second language generated from the audio recording, instances of the second key phrase or the second key phrase variant within the audio recording. The system displays the identified instances of the second key phrase or the second key phrase variant within the audio recording to enhance searchability of the audio recording in the second language.
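The search procedure in the abstract above can be sketched in a few lines. To stay self-contained, the sketch makes strong simplifying assumptions: translation and variant generation are reduced to the hypothetical lookup tables `TRANSLATIONS` and `VARIANTS`, and the graph of words generated from the recording is flattened to a single list of `(word, start_time)` pairs. A real word graph would hold competing hypotheses, which this sketch ignores.

```python
from typing import List, Tuple

# Hypothetical lookup tables standing in for translation and variation.
TRANSLATIONS = {"good morning": "buenos dias"}
VARIANTS = {"buenos dias": ["buen dia"]}


def find_instances(
    key_phrase: str, words: List[Tuple[str, float]]
) -> List[Tuple[str, float]]:
    """Return (matched phrase, start time) pairs, ordered by time.

    words is the recording's recognized word sequence with start times.
    """
    # Convert the first-language key phrase into the second language.
    second = TRANSLATIONS[key_phrase]
    # Search for the converted phrase and each of its variants.
    phrases = [second] + VARIANTS.get(second, [])

    hits = []
    for phrase in phrases:
        tokens = phrase.split()
        for i in range(len(words) - len(tokens) + 1):
            window = [w for w, _ in words[i:i + len(tokens)]]
            if window == tokens:
                hits.append((phrase, words[i][1]))
    return sorted(hits, key=lambda hit: hit[1])
```

Each returned pair gives the phrase that matched and the time at which it starts, which is the information a user interface would need to display the identified instances.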
