-
Publication No.: US07645929B2
Publication date: 2010-01-12
Application No.: US11519545
Filing date: 2006-09-11
Applicant: Yu-Yao Chang, Ramin Samadani, Tong Zhang, Simon Widdowson
Inventor: Yu-Yao Chang, Ramin Samadani, Tong Zhang, Simon Widdowson
IPC classification: G04F10/06
CPC classification: G10H1/40, G10H2210/076
Abstract: Various method and system embodiments of the present invention are directed to computational estimation of a tempo for a digitally encoded musical selection. In certain embodiments of the present invention, described below, a short portion of a musical selection is analyzed to determine the tempo of the musical selection. The digitally encoded musical selection sample is computationally transformed to produce a power spectrum corresponding to the sample, in turn transformed to produce a two-dimensional strength-of-onset matrix. The two-dimensional strength-of-onset matrix is then transformed into a set of strength-of-onset/time functions for each of a corresponding set of frequency bands. The strength-of-onset/time functions are then analyzed to find a most reliable onset interval that is transformed into an estimated tempo returned by the analysis.
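The chain of transformations named in the abstract above (sample, power spectrum, strength-of-onset matrix, per-band strength-of-onset/time functions, most reliable onset interval, tempo) can be illustrated with a rough sketch. It is not the patented method: the frame and hop sizes, the half-wave rectified power difference used as strength of onset, the equal-width frequency bands, the autocorrelation score standing in for "reliability", and all function names are assumptions made for illustration.

```python
import cmath
import math

def power_spectrum(samples, frame_len, hop):
    """Short-time power spectrum: one row per frame, one column per frequency bin."""
    rows = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        frame = samples[start:start + frame_len]
        row = []
        for k in range(frame_len // 2 + 1):
            coeff = sum(x * cmath.exp(-2j * math.pi * k * n / frame_len)
                        for n, x in enumerate(frame))
            row.append(abs(coeff) ** 2)
        rows.append(row)
    return rows

def strength_of_onset(power):
    """Assumed onset measure: frame-to-frame power increase per bin, negatives clipped to 0."""
    return [[max(0.0, cur - prev) for prev, cur in zip(power[t - 1], power[t])]
            for t in range(1, len(power))]

def band_functions(onsets, n_bands):
    """Collapse the 2-D onset matrix into one strength-of-onset/time function per band."""
    n_bins = len(onsets[0])
    edges = [round(b * n_bins / n_bands) for b in range(n_bands + 1)]
    return [[sum(row[edges[b]:edges[b + 1]]) for row in onsets]
            for b in range(n_bands)]

def estimate_tempo(samples, sample_rate, frame_len=50, hop=25,
                   n_bands=4, bpm_range=(60, 200)):
    """Return the tempo (BPM) whose onset interval scores highest in any band."""
    power = power_spectrum(samples, frame_len, hop)
    bands = band_functions(strength_of_onset(power), n_bands)
    frame_sec = hop / sample_rate
    min_lag = max(1, math.ceil(60 / bpm_range[1] / frame_sec))
    max_lag = math.floor(60 / bpm_range[0] / frame_sec)
    best_score, best_lag = -1.0, min_lag
    for f in bands:
        for lag in range(min_lag, max_lag + 1):
            # Assumed reliability score: autocorrelation of the band function at this lag.
            score = sum(a * b for a, b in zip(f, f[lag:]))
            if score > best_score:
                best_score, best_lag = score, lag
    return 60 / (best_lag * frame_sec)

# Synthetic 4-second test signal: a 25 ms, 200 Hz burst every 0.5 s.
sample_rate = 1000
signal = [0.0] * (4 * sample_rate)
for beat in range(250, len(signal), 500):
    for n in range(25):
        signal[beat + n] = math.sin(2 * math.pi * 200 * n / sample_rate)

tempo = estimate_tempo(signal, sample_rate)
print(round(tempo))
```

The synthetic signal places one burst every half second, so the interval search should settle on a 0.5 s onset interval. Real recordings would need a proper FFT, windowing, and a more careful reliability measure than a raw autocorrelation peak.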
-
Publication No.: US20080060505A1
Publication date: 2008-03-13
Application No.: US11519545
Filing date: 2006-09-11
Applicant: Yu-Yao Chang, Ramin Samadani, Tong Zhang, Simon Widdowson
Inventor: Yu-Yao Chang, Ramin Samadani, Tong Zhang, Simon Widdowson
IPC classification: G10H7/00
CPC classification: G10H1/40, G10H2210/076
Abstract: Various method and system embodiments of the present invention are directed to computational estimation of a tempo for a digitally encoded musical selection. In certain embodiments of the present invention, described below, a short portion of a musical selection is analyzed to determine the tempo of the musical selection. The digitally encoded musical selection sample is computationally transformed to produce a power spectrum corresponding to the sample, in turn transformed to produce a two-dimensional strength-of-onset matrix. The two-dimensional strength-of-onset matrix is then transformed into a set of strength-of-onset/time functions for each of a corresponding set of frequency bands. The strength-of-onset/time functions are then analyzed to find a most reliable onset interval that is transformed into an estimated tempo returned by the analysis.
-
Publication No.: US07521620B2
Publication date: 2009-04-21
Application No.: US11496999
Filing date: 2006-07-31
Applicant: Ramin Samadani, Yu-Yao Chang, Tong Zhang, Ullas Gargi
Inventor: Ramin Samadani, Yu-Yao Chang, Tong Zhang, Ullas Gargi
IPC classification: G10H1/00
CPC classification: G06F17/30752, G06F17/30743, G06F17/30749, G06F17/30758, G06F17/30772, G06F17/30775, G10H2240/131
Abstract: The present invention provides a method of and system for browsing of music. In an embodiment, a method of browsing recorded music comprises steps of: selecting a song from a library; playing at least a portion of the selected song for a user; while the portion of the selected song is playing, accepting input from the user, the input comprising an indication of the user's enjoyment of the at least a portion of the selected song; repeating said steps of selecting, playing and accepting to generate a sequence of song portions; and creating a record comprising an identification of each selected song portions and the indication for the song portions.
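The select, play, accept-input, repeat, and record steps listed in the abstract above can be pictured with a small sketch. Everything here is hypothetical: the `browse` function, the `play` and `get_feedback` callbacks, the random selection policy, and the record layout are assumptions for illustration. The sketch also collects feedback after the playback callback returns, whereas the abstract describes accepting input while the portion is playing.

```python
import random

def browse(library, play, get_feedback, rounds, portion_sec=10, seed=None):
    """Run `rounds` select/play/feedback cycles and return the resulting record."""
    rng = random.Random(seed)
    record = []
    for _ in range(rounds):
        song = rng.choice(library)      # select a song from the library
        play(song, portion_sec)         # play a portion of it for the user
        enjoyment = get_feedback(song)  # the user's indication of enjoyment
        record.append({"song": song, "portion_sec": portion_sec,
                       "enjoyment": enjoyment})
    return record

# Example with stand-in callbacks instead of real audio playback and UI input.
played = []
record = browse(
    library=["song_a", "song_b", "song_c"],
    play=lambda song, sec: played.append(song),
    get_feedback=lambda song: "like" if song != "song_b" else "skip",
    rounds=4,
    seed=0,
)
for entry in record:
    print(entry)
```

Passing the player and the feedback source as callbacks keeps the loop independent of any particular audio backend or input device.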
-
Publication No.: US20080022846A1
Publication date: 2008-01-31
Application No.: US11496999
Filing date: 2006-07-31
Applicant: Ramin Samadani, Yu-Yao Chang, Tong Zhang, Ullas Gargi
Inventor: Ramin Samadani, Yu-Yao Chang, Tong Zhang, Ullas Gargi
CPC classification: G06F17/30752, G06F17/30743, G06F17/30749, G06F17/30758, G06F17/30772, G06F17/30775, G10H2240/131
Abstract: The present invention provides a method of and system for browsing of music. In an embodiment, a method of browsing recorded music comprises steps of: selecting a song from a library; playing at least a portion of the selected song for a user; while the portion of the selected song is playing, accepting input from the user, the input comprising an indication of the user's enjoyment of the at least a portion of the selected song; repeating said steps of selecting, playing and accepting to generate a sequence of song portions; and creating a record comprising an identification of each selected song portions and the indication for the song portions.
-
Publication No.: US20050004690A1
Publication date: 2005-01-06
Application No.: US10611449
Filing date: 2003-07-01
Applicant: Tong Zhang, Ramin Samadani, Yining Deng, Ken Lin
Inventor: Tong Zhang, Ramin Samadani, Yining Deng, Ken Lin
CPC classification: G10L25/48, G11B27/034, G11B27/105, G11B27/322, G11B27/34, G11B2220/20
Abstract: In one aspect, audio summaries and transition audio segments are sequentially rendered with at least one transition audio segment rendered between each pair of sequential audio summaries. Each audio summary comprises digital content summarizing at least a portion of a respective associated audio piece. In another aspect, an original audio file is annotated by embedding therein information enabling rendering of at least one audio summary contained in the annotated audio file and comprising digital content summarizing at least a portion of the original audio file. In another aspect, an original audio file is annotated by providing at least one browsable link between the original audio file and at least one audio summary comprising digital content summarizing at least a portion of the original audio file, and storing the original audio file, the at least one browsable link, and the at least one audio summary on a common portable storage medium. In another aspect, an audio piece is divided into audio segments. Acoustical features are extracted from each audio segment. Audio segments are grouped into clusters based on the extracted features. A representative audio segment is identified in each cluster. A representative audio segment is selected as an audio summary of the audio piece.
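The last aspect of the abstract above (divide the piece into segments, extract acoustical features, cluster the segments, identify a representative per cluster, select one representative as the summary) can be sketched as follows. This is an illustrative reading, not the patented procedure: the two features (mean energy and zero-crossing rate), the small k-means routine with farthest-point seeding, the "closest to centroid" representative, and the choice of the largest cluster are all assumptions.

```python
import math

def features(segment):
    """Two toy acoustical features: mean energy and zero-crossing rate."""
    energy = sum(x * x for x in segment) / len(segment)
    crossings = sum(1 for a, b in zip(segment, segment[1:]) if (a < 0) != (b < 0))
    return (energy, crossings / (len(segment) - 1))

def kmeans(points, k, iterations=10):
    """Plain k-means with deterministic farthest-point seeding."""
    centroids = [points[0]]
    while len(centroids) < k:
        centroids.append(max(points,
                             key=lambda p: min(math.dist(p, c) for c in centroids)))
    labels = []
    for _ in range(iterations):
        labels = [min(range(k), key=lambda c: math.dist(p, centroids[c]))
                  for p in points]
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = tuple(sum(v) / len(members) for v in zip(*members))
    return labels, centroids

def summarize(samples, segment_len, k=2):
    """Return (index, samples) of the segment chosen as the audio summary."""
    segments = [samples[i:i + segment_len]
                for i in range(0, len(samples) - segment_len + 1, segment_len)]
    feats = [features(s) for s in segments]
    labels, centroids = kmeans(feats, k)
    representatives = {}
    for c in set(labels):
        members = [i for i, lab in enumerate(labels) if lab == c]
        # Assumed rule: the representative is the member closest to the centroid.
        representatives[c] = min(members, key=lambda i: math.dist(feats[i], centroids[c]))
    # Assumed rule: summarize with the representative of the largest cluster.
    largest = max(representatives, key=lambda c: labels.count(c))
    return representatives[largest], segments[representatives[largest]]

def tone(amplitude, period, n=100):
    return [amplitude * math.sin(2 * math.pi * i / period) for i in range(n)]

# Toy piece: two quiet "verse" segments and four louder "chorus" segments.
piece = (tone(0.2, 50) + tone(1.0, 10) + tone(0.9, 10)
         + tone(0.25, 50) + tone(1.1, 10) + tone(1.0, 10))
index, summary = summarize(piece, segment_len=100)
print("summary segment index:", index)
```

With the toy piece, the louder segments form the larger cluster, so the selected summary comes from that group. Real audio would call for richer features and feature scaling before clustering.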
-
Publication No.: US07522967B2
Publication date: 2009-04-21
Application No.: US10611449
Filing date: 2003-07-01
Applicant: Tong Zhang, Ramin Samadani, Yining Deng, Ken K. Lin
Inventor: Tong Zhang, Ramin Samadani, Yining Deng, Ken K. Lin
IPC classification: G06F17/00
CPC classification: G10L25/48, G11B27/034, G11B27/105, G11B27/322, G11B27/34, G11B2220/20
Abstract: In one aspect, audio summaries and transition audio segments are sequentially rendered with at least one transition audio segment rendered between each pair of sequential audio summaries. Each audio summary comprises digital content summarizing at least a portion of a respective associated audio piece. In another aspect, an original audio file is annotated by embedding therein information enabling rendering of at least one audio summary contained in the annotated audio file and comprising digital content summarizing at least a portion of the original audio file. In another aspect, an original audio file is annotated by providing at least one browsable link between the original audio file and at least one audio summary comprising digital content summarizing at least a portion of the original audio file, and storing the original audio file, the at least one browsable link, and the at least one audio summary on a common portable storage medium. In another aspect, an audio piece is divided into audio segments. Acoustical features are extracted from each audio segment. Audio segments are grouped into clusters based on the extracted features. A representative audio segment is identified in each cluster. A representative audio segment is selected as an audio summary of the audio piece.
-
Publication No.: US20070266322A1
Publication date: 2007-11-15
Application No.: US11433659
Filing date: 2006-05-12
Applicant: Daniel Tretter, Tong Zhang, Simon Widdowson
Inventor: Daniel Tretter, Tong Zhang, Simon Widdowson
IPC classification: G06F9/00
CPC classification: G11B27/34, G11B27/105, G11B27/28
Abstract: An exemplary system for browsing videos comprises a memory for storing a plurality of videos, a processor for accessing the videos, and a video browsing user interface for enabling a user to browse the videos. The user interface is configured to enable video browsing in multiple states on a display screen, including a first state for displaying static representations of the videos, a second state for displaying dynamic representations of the videos, and a third state for playing at least a portion of a selected video.
-
Publication No.: US09239967B2
Publication date: 2016-01-19
Application No.: US14234099
Filing date: 2011-07-29
Applicant: Ke-Yan Liu, Xin-Yun Sun, Tong Zhang, Lei Wang, Min Wang
Inventor: Ke-Yan Liu, Xin-Yun Sun, Tong Zhang, Lei Wang, Min Wang
CPC classification: G06K9/6218, G06F17/30247, G06K9/6219
Abstract: Methods, systems, and computer readable media with executable instructions, and/or logic are provided for incremental image clustering. An example method for incremental image clustering can include identifying, via a computing device, a number of candidate nodes from among evaluated leaf image cluster (LIC) nodes on an image cluster tree (ICT) based on a similarity between a feature of a new image and an average feature of each of the evaluated LIC nodes. The evaluated nodes include at least one node along each path from a root node to either a leaf node or a node having a similarity exceeding a first threshold. A most-similar node can be determined, via the computing device, from among the number of candidate nodes. The new image can be inserted to a node associated with the determined most-similar node, via the computing device.
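One way to picture the tree search in the abstract above is the sketch below. It is an interpretation under stated assumptions, not the claimed method: the `Node` class, cosine similarity as the similarity measure, the rule of stopping a path at a leaf or at the first node whose similarity exceeds the threshold, and the descent from an internal candidate to its most similar leaf before inserting are all hypothetical choices.

```python
import math

class Node:
    """Image cluster tree node holding the average feature of the images below it."""
    def __init__(self, mean, images=None, children=None):
        self.mean = list(mean)
        self.images = list(images or [])
        self.children = list(children or [])
        self.count = len(self.images) + sum(c.count for c in self.children)
        self.parent = None
        for child in self.children:
            child.parent = self

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.hypot(*a) * math.hypot(*b))

def candidates(root, feature, threshold):
    """Walk each root-to-leaf path; stop at a leaf or at a sufficiently similar node."""
    found, stack = [], [root]
    while stack:
        node = stack.pop()
        if not node.children or cosine(feature, node.mean) > threshold:
            found.append(node)
        else:
            stack.extend(node.children)
    return found

def insert(root, name, feature, threshold=0.95):
    """Insert a new image under the most similar candidate and update averages."""
    best = max(candidates(root, feature, threshold),
               key=lambda n: cosine(feature, n.mean))
    target = best
    while target.children:  # assumed: descend to the most similar leaf
        target = max(target.children, key=lambda n: cosine(feature, n.mean))
    target.images.append(name)
    node = target
    while node is not None:  # update running averages up to the root
        node.mean = [(m * node.count + f) / (node.count + 1)
                     for m, f in zip(node.mean, feature)]
        node.count += 1
        node = node.parent
    return target

# Tiny tree: two leaf clusters with 2-D features under one root.
leaf_a = Node([1.0, 0.0], images=["a1", "a2"])
leaf_b = Node([0.0, 1.0], images=["b1"])
root = Node([2 / 3, 1 / 3], children=[leaf_a, leaf_b])

target = insert(root, "new", [0.9, 0.1])
print(target.images)
```

Stopping a path early at a highly similar internal node avoids comparing the new image against every leaf, which is the point of an incremental scheme. The threshold value of 0.95 is arbitrary.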
-
Publication No.: US09025864B2
Publication date: 2015-05-05
Application No.: US13700820
Filing date: 2010-08-02
Applicant: Tong Zhang, Wei Zhang, Daniel R Tretter
Inventor: Tong Zhang, Wei Zhang, Daniel R Tretter
CPC classification: G06K9/6256, G06K9/00677
Abstract: The disclosure relates to a system and a method for generating clothing feature data representative of at least one clothing feature of a piece of clothing being worn by the person in a set of images, and training a discriminative clothing classifier using the clothing feature data to provide a personal clothing model that corresponds to the piece of clothing. The personal clothing model can be used to identify additional images in which the person appears.
-
Publication No.: US08861873B2
Publication date: 2014-10-14
Application No.: US13642352
Filing date: 2010-08-02
Applicant: Tong Zhang, Wei Zhang, Daniel Tretter
Inventor: Tong Zhang, Wei Zhang, Daniel Tretter
CPC classification: G06K9/6218, G06K9/00677
Abstract: The disclosure is related to a system and method for learning robust clothing clustering based on a cluster ensemble technique applied to the clothing features of images to improve clustering of images. Different types of clothing features that are complementary to each other are computed to provide extensive description of the clothing in the images. Multiple partitions are computed based on the clothing features to generate a cluster ensemble set. A consensus function is applied to the multiple partitions to generate a final clothing consensus clustering that encompasses the information contained in the multiple partitions. A system and method are disclosed for clustering images based on the clothing of one or more persons in the images.
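The step of applying a consensus function to multiple partitions, described in the abstract above, can be illustrated with a co-association sketch. The abstract does not say which consensus function is used, so this is only one common choice, assumed here for illustration: two images are linked when more than half of the partitions put them in the same cluster, and the final clusters are the connected components of those links. The function name, the threshold, and the sample partitions are hypothetical.

```python
def consensus(partitions, threshold=0.5):
    """Merge items that share a cluster in more than `threshold` of the partitions."""
    n = len(partitions[0])
    parent = list(range(n))

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(n):
        for j in range(i + 1, n):
            agree = sum(p[i] == p[j] for p in partitions) / len(partitions)
            if agree > threshold:
                parent[find(j)] = find(i)

    # Relabel the component roots as 0, 1, 2, ... in order of first appearance.
    roots = {}
    return [roots.setdefault(find(i), len(roots)) for i in range(n)]

# Three partitions of five images, e.g. from color, texture, and shape features.
by_color = [0, 0, 0, 1, 1]
by_texture = [0, 0, 1, 1, 1]
by_shape = [1, 1, 1, 0, 0]

final = consensus([by_color, by_texture, by_shape])
print(final)
```

Cluster labels need not match across partitions, since only pairwise agreement is counted. In the sample data the third image is grouped with the first two by two of the three partitions, so the majority vote keeps it with them.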