Systems and methods for microphone localization
    1.
    发明授权
    Systems and methods for microphone localization 有权
    麦克风定位的系统和方法

    公开(公告)号:US07522736B2

    公开(公告)日:2009-04-21

    申请号:US10840389

    申请日:2004-05-07

    CPC classification number: H04R3/005

    Abstract: Systems and methods determine the location of a microphone with an unknown location, given the location of a number of other microphones by determining a difference in an arrival time between a first audio signal generated by and microphone with a known location and a second audio signal generated by another microphone with an unknown location, wherein the first and second audio signals are a representation of a substantially same sound emitted from an acoustic source with a known location; determining, based on at least the determined difference in arrival time, a distance between the acoustic source with the known location and the microphone with the unknown location; and determining, based on the determined distance between the acoustic source with the known location and the microphone with the unknown location, the location of the unknown microphone.

    Abstract translation: 系统和方法通过确定由已知位置产生的第一音频信号和麦克风之间的到达时间之间的差异以及生成的第二音频信号,给出了多个其他麦克风的位置,确定具有未知位置的麦克风的位置 通过具有未知位置的另一个麦克风,其中第一和第二音频信号是从具有已知位置的声源发射的基本相同的声音的表示; 基于至少所确定的到达时间的差异,确定声源与已知位置之间的距离和具有未知位置的麦克风; 并且基于所确定的声源与已知位置之间的距离和具有未知位置的麦克风确定未知麦克风的位置。

    Intersection Detection in Panoramic Video
    2.
    发明申请
    Intersection Detection in Panoramic Video 有权
    全景视频中的交叉检测

    公开(公告)号:US20070296807A1

    公开(公告)日:2007-12-27

    申请号:US11852224

    申请日:2007-09-07

    CPC classification number: H04N5/268 G06T7/97 H04N5/262 H04N5/2622

    Abstract: Provides a system for detecting an intersection between more than one panoramic video sequence and detecting the orientation of the sequences forming the intersection. Video images and corresponding location data are received. If required, the images and location data is processed to ensure the images contain location data. An intersection between two paths is then derived from the video images by deriving a rough intersection between two images, determining a neighborhood for the two images, and dividing each image in the neighborhood into strips. An identifying value is derived from each strip to create a row of strip values which are then converted to the frequency domain. A distance measure is taken between strips in the frequency domain, and the intersection is determined from the images having the smallest distance measure between them. The orientation between the two paths may also be determined in the frequency domain by using the phases of signals representing the images in the Fourier domain or performing a circular cross correlation of two vectors representing the images.

    Abstract translation: 提供一种用于检测多个全景视频序列之间的交点并检测形成交集的序列的取向的系统。 接收视频图像和对应的位置数据。 如果需要,处理图像和位置数据,以确保图像包含位置数据。 然后通过导出两个图像之间的粗略交点,确定两个图像的邻域,并将邻域中的每个图像分割成条,从而从视频图像导出两条路径之间的交点。 从每个条带导出识别值,以创建条带值行,然后将其转换为频域。 在频域中的带之间采取距离测量,并且从它们之间具有最小距离测量的图像确定交点。 也可以通过使用表示傅立叶域中的图像的信号的相位或者执行表示图像的两个向量的循环互相关来在频域中确定两个路径之间的取向。

    Methods for browsing multiple images
    3.
    发明申请
    Methods for browsing multiple images 审中-公开
    浏览多个图像的方法

    公开(公告)号:US20070104390A1

    公开(公告)日:2007-05-10

    申请号:US11405132

    申请日:2006-04-17

    Applicant: Jonathan Foote

    Inventor: Jonathan Foote

    CPC classification number: H04N1/0035 H04N1/00448 H04N1/00458

    Abstract: Algorithms to show multiple images at the maximum possible resolution are proposed. Rather than reducing the resolution of each image, the portion of each image that is actually shown is reduced. The algorithms select which part of each image is to be shown. In one embodiment of the invention, changing the parameters over time further increases the information displayed.

    Abstract translation: 提出了以最大可能分辨率显示多个图像的算法。 不是降低每个图像的分辨率,而是减少了实际显示的每个图像的部分。 算法选择要显示每个图像的哪一部分。 在本发明的一个实施例中,随时间改变参数进一步增加显示的信息。

    Systems and methods for microphone localization
    4.
    发明申请
    Systems and methods for microphone localization 有权
    麦克风定位的系统和方法

    公开(公告)号:US20050249360A1

    公开(公告)日:2005-11-10

    申请号:US10840389

    申请日:2004-05-07

    CPC classification number: H04R3/005

    Abstract: Systems and methods determine the location of a microphone with an unknown location, given the location of a number of other microphones by determining a difference in an arrival time between a first audio signal generated by and microphone with a known location and a second audio signal generated by another microphone with an unknown location, wherein the first and second audio signals are a representation of a substantially same sound emitted from an acoustic source with a known location; determining, based on at least the determined difference in arrival time, a distance between the acoustic source with the known location and the microphone with the unknown location; and determining, based on the determined distance between the acoustic source with the known location and the microphone with the unknown location, the location of the unknown microphone.

    Abstract translation: 系统和方法通过确定由已知位置产生的第一音频信号和麦克风之间的到达时间之间的差异以及生成的第二音频信号,给出了多个其他麦克风的位置,确定具有未知位置的麦克风的位置 通过具有未知位置的另一个麦克风,其中第一和第二音频信号是从具有已知位置的声源发射的基本相同的声音的表示; 基于至少所确定的到达时间的差异,确定声源与已知位置之间的距离和具有未知位置的麦克风; 并且基于所确定的声源与已知位置之间的距离和具有未知位置的麦克风确定未知麦克风的位置。

    Multi-function image device
    5.
    发明申请
    Multi-function image device 审中-公开
    多功能图像设备

    公开(公告)号:US20070077059A1

    公开(公告)日:2007-04-05

    申请号:US11243603

    申请日:2005-10-05

    Abstract: A multi-function device that prints information images onto sheets of photo-addressable media is described. The multi-function device is comprised of an image acquisition component, an image generation component, optional image transformation components and an image projector to illuminate the photo-addressable medium with the optionally transformed information images. The effects of ambient light on the photo-addressable medium are reduced by tuning the response characteristics of the photo-addressable medium to respond to the wavelength of the projected light and/or to interpose band-pass filters that reduce non-projected light incident on the photo-addressable medium. Programmable characteristics of the photo-addressable medium are adjustable to compensate for ambient light. Registration marks on the photo-addressable medium allow the alignment of the projected image with the photo-addressable medium. Additional optional image transformations are applied to adjust the size of the information image, increase clarity and the like.

    Abstract translation: 描述了将信息图像打印到可光寻址介质片上的多功能设备。 该多功能装置包括图像获取部件,图像生成部件,可选的图像变换部件和图像投影仪,用可选地变换的信息图像照亮可光寻址介质。 通过调整光寻址介质的响应特性来响应光可寻址介质的影响,以响应于投射光的波长和/或介入带通滤波器,其减少入射到非可投影光的非投影光 光寻址介质。 光可寻址介质的可编程特性可调,以补偿环境光。 可光寻址介质上的对准标记允许投影图像与光寻址介质的对准。 应用附加的可选图像变换来调整信息图像的大小,增加清晰度等。

    System and method for detecting and ranking images in order of usefulness based on vignette score

    公开(公告)号:US20060153456A1

    公开(公告)日:2006-07-13

    申请号:US11032576

    申请日:2005-01-10

    Applicant: Jonathan Foote

    Inventor: Jonathan Foote

    CPC classification number: G06F17/30247

    Abstract: A system and method for detecting useful images and for ranking images in order of usefulness based on a vignette score describing how closely each one resembles a “vignette,” or a central object or image surrounded by a featureless or deemphasized background. Several methods for determining an image's vignette score are disclosed as examples. Variance ratio analysis entails calculation of the ratio of variance between the edge region of the image and the entire image. Statistical model analysis entails developing a statistical classifier capable of determining a statistical model of each image class based on pre-entered training data. Spatial frequency analysis involves estimating the energy at different spatial frequencies in the central and edge regions and in the image as a whole. A vignette score is calculated as the ratio of mid-frequency energies in the edge region to the mid-frequency energies of the entire image.

    Video enabled tele-presence control host
    8.
    发明申请
    Video enabled tele-presence control host 有权
    视频启用远程存在控制主机

    公开(公告)号:US20050028221A1

    公开(公告)日:2005-02-03

    申请号:US10629403

    申请日:2003-07-28

    CPC classification number: H04N7/147 G06F19/00 H04N7/15 H04N21/4622

    Abstract: A method for exchanging information in a shared interactive environment, comprising selecting a first physical device in a first live video image wherein the first physical device has information associated with it, causing the information to be transferred to a second physical device in a second live video image wherein the transfer is brought about by manipulating a visual representation of the information, wherein the manipulation includes interacting with the first live video image and the second live video image, wherein the first physical device and the second physical device are part of the shared interactive environment, and wherein the first physical device and the second physical device are not the same.

    Abstract translation: 一种用于在共享交互环境中交换信息的方法,包括:选择第一实时视频图像中的第一物理设备,其中所述第一物理设备具有与其相关联的信息,使得所述信息被传送到第二实时视频中的第二物理设备 图像,其中通过操纵所述信息的视觉表示来实现所述传送,其中所述操纵包括与所述第一实况视频图像和所述第二实况视频图像进行交互,其中所述第一物理设备和所述第二物理设备是所述共享交互的一部分 环境,并且其中所述第一物理设备和所述第二物理设备不相同。

    Remote audio device management system
    9.
    发明申请
    Remote audio device management system 有权
    远程音频设备管理系统

    公开(公告)号:US20050002535A1

    公开(公告)日:2005-01-06

    申请号:US10612429

    申请日:2003-07-02

    CPC classification number: H04H60/04

    Abstract: An audio device management system (ADMS) manages remote audio devices via user selections in video links. The system enhances audio acquisition quality by receiving and processing human suggestions, forming customized two-way audio links according to user requests, and learning audio pickup strategies and camera management strategies from user operations. The ADMS control interface for a remote user provides a multi-window GUI that provides an overview window and selection display window. The ADMS provides users with more flexibility to enhance audio signals according to their needs and makes it more convenient to form customized two-way audio links without requiring users to remember a list of phone numbers. The ADMS also automatically manages available microphones for audio pickup based on microphone sound quality and the system's past experience when users monitor a structured audio environment without explicitly expressing their attentions in the video window.

    Abstract translation: 音频设备管理系统(ADMS)通过视频链路中的用户选择来管理远程音频设备。 该系统通过接收和处理人的建议,根据用户要求形成定制的双向音频链接,以及从用户操作学习音频拾取策略和相机管理策略来提高音频采集质量。 远程用户的ADMS控制界面提供了一个提供概览窗口和选择显示窗口的多窗口GUI。 ADMS为用户提供了更多的灵活性,可根据需要增强音频信号,并使其更方便地形成定制的双向音频链接,而不需要用户记住电话号码列表。 ADMS还可以自动管理可用的麦克风,用于基于麦克风音质的音频拾音,以及当用户监视结构化音频环境而不明确表达其在视频窗口中的注意时,系统的过去体验。

    Method for automatic analysis of audio including music and speech
    10.
    发明授权
    Method for automatic analysis of audio including music and speech 有权
    自动分析包括音乐和言语在内的音频的方法

    公开(公告)号:US06542869B1

    公开(公告)日:2003-04-01

    申请号:US09569230

    申请日:2000-05-11

    Applicant: Jonathan Foote

    Inventor: Jonathan Foote

    Abstract: A method for determining points of change or novelty in an audio signal measures the self similarity of components of the audio signal. For each time window in an audio signal, a formula is used to determine a vector parameterization value. The self-similarity as well as cross-similarity between each of the parameterization values is then determined for all past and future window regions. A significant point of novelty or change will have a high self-similarity in the past and future, and a low cross-similarity. The extent of the time difference between “past” and “future” can be varied to change the scale of the system so that, for example, individual musical notes can be found using a short time extent while longer events, such as musical themes or changing of speakers, can be identified by considering windows further into the past or future. The result is a measure of the degree of change, or how novel the source audio is at any time. The method can be used in a wide variety of applications, including segmenting or indexing for classification and retrieval, beat tracking, and summarizing of speech or music.

    Abstract translation: 用于确定音频信号中的改变点或新颖性的方法测量音频信号的分量的自相似度。 对于音频信号中的每个时间窗口,使用公式来确定矢量参数化值。 然后,对于所有过去和未来的窗口区域,确定每个参数化值之间的自相似性以及交叉相似度。 一个重要的新颖性或变化点在过去和未来将具有很高的自相似性,并且具有低的相似度。 可以改变“过去”和“未来”之间的时间差的程度以改变系统的规模,使得例如可以在较短的时间范围内发现个体音符,而较长的事件,例如音乐主题或 通过考虑进一步过去或未来的窗口可以确定扬声器的变化。 结果是测量变化的程度,或源音频在任何时候的新颖性。 该方法可用于各种应用,包括分类或索引分类和检索,拍摄跟踪,以及语音或音乐的总结。

Patent Agency Ranking