-
公开(公告)号:US09971774B2
公开(公告)日:2018-05-15
申请号:US15383986
申请日:2016-12-19
Applicant: Apple Inc.
Inventor: Sameer Badaskar
CPC classification number: G06F17/3005 , G06F17/30023 , G06F17/30026 , G06F17/30038 , G06F17/30265 , G06F17/30684 , G10L15/22 , G10L15/26 , G10L15/265
Abstract: Methods and systems for searching for media items using a voice-based digital assistant are described. Natural language text strings corresponding to search queries are provided. The search queries include query terms. The text strings may correspond to speech inputs input by a user into an electronic device. At least one information source is searched to identify at least one parameter associated with at least one of the query terms. The parameters include at least one of a time parameter, a date parameter, or a geo-code parameter. The parameters are compared to tags of media items to identify matches. In some implementations, media items whose tags match the parameter are presented to the user.
-
公开(公告)号:US12073831B1
公开(公告)日:2024-08-27
申请号:US17576419
申请日:2022-01-14
Applicant: Apple Inc.
Inventor: Saurabh Adya , Sameer Badaskar , Akanksha Bindal , Ahmed S. Hussen Abdelaziz , Xiaochuan Niu , Alkeshkumar M. Patel , Srikanth Vishnubhotla
CPC classification number: G10L15/22 , G06F18/214 , G06V10/82 , G06V20/50 , G10L15/063 , G10L15/16 , G10L15/18 , G10L15/24
Abstract: Systems and processes for operating a digital assistant are provided. An example method for processing an image include receiving an image, generating, based on the image, a question corresponding to a first object in the image, generating, based on the image, a caption corresponding to a second object of the image, receiving an utterance from a user, and determining a plurality of speech recognition results from the utterance based on the question and the caption.
-
公开(公告)号:US09547647B2
公开(公告)日:2017-01-17
申请号:US13681359
申请日:2012-11-19
Applicant: Apple Inc.
Inventor: Sameer Badaskar
CPC classification number: G06F17/3005 , G06F17/30023 , G06F17/30026 , G06F17/30038 , G06F17/30265 , G06F17/30684 , G10L15/22 , G10L15/26 , G10L15/265
Abstract: Methods and systems for searching for media items using a voice-based digital assistant are described. Natural language text strings corresponding to search queries are provided. The search queries include query terms. The text strings may correspond to speech inputs input by a user into an electronic device. At least one information source is searched to identify at least one parameter associated with at least one of the query terms. The parameters include at least one of a time parameter, a date parameter, or a geo-code parameter. The parameters are compared to tags of media items to identify matches. In some implementations, media items whose tags match the parameter are presented to the user.
Abstract translation: 描述了使用基于语音的数字助理搜索媒体项目的方法和系统。 提供了与搜索查询对应的自然语言文本字符串。 搜索查询包括查询字词。 文本串可以对应于用户输入到电子设备中的语音输入。 搜索至少一个信息源以识别与至少一个查询项相关联的至少一个参数。 这些参数包括时间参数,日期参数或地理代码参数中的至少一个。 将参数与媒体项目的标签进行比较,以识别匹配项。 在一些实现中,其标签与参数匹配的媒体项目被呈现给用户。
-
公开(公告)号:US20140081633A1
公开(公告)日:2014-03-20
申请号:US13681359
申请日:2012-11-19
Applicant: APPLE, INC.
Inventor: Sameer Badaskar
CPC classification number: G06F17/3005 , G06F17/30023 , G06F17/30026 , G06F17/30038 , G06F17/30265 , G06F17/30684 , G10L15/22 , G10L15/26 , G10L15/265
Abstract: Methods and systems for searching for media items using a voice-based digital assistant are described. Natural language text strings corresponding to search queries are provided. The search queries include query terms. The text strings may correspond to speech inputs input by a user into an electronic device. At least one information source is searched to identify at least one parameter associated with at least one of the query terms. The parameters include at least one of a time parameter, a date parameter, or a geo-code parameter. The parameters are compared to tags of media items to identify matches. In some implementations, media items whose tags match the parameter are presented to the user.
Abstract translation: 描述了使用基于语音的数字助理搜索媒体项目的方法和系统。 提供了与搜索查询对应的自然语言文本字符串。 搜索查询包括查询字词。 文本串可以对应于用户输入到电子设备中的语音输入。 搜索至少一个信息源以识别与至少一个查询项相关联的至少一个参数。 这些参数包括时间参数,日期参数或地理代码参数中的至少一个。 将参数与媒体项目的标签进行比较,以识别匹配项。 在一些实现中,其标签与参数匹配的媒体项目被呈现给用户。
-
-
-