-
公开(公告)号:US09576106B2
公开(公告)日:2017-02-21
申请号:US15047025
申请日:2016-02-18
申请人: PROXIMIE LLC
发明人: Talal Ali Ahmad
CPC分类号: G06F19/3418 , G06F3/00 , G06F3/011 , G06F3/0484 , G06F3/04883 , G06F3/1454 , G06F19/00 , G06K9/6202 , G06T15/00 , G09B23/28 , G09G2370/022 , G09G2380/08 , G16H80/00
摘要: Remote instruction and monitoring of health care may be facilitated by providing feedback to a local doctor performing the medical procedure while the local doctor is performing the medical procedure. Instructions for performing the medical procedure may be received from a remote doctor and transmitted to a local doctor. The local doctor may commence performing the medical procedure, and while the local doctor is performing the medical procedure, image or video of the medical procedure may be processed and compared to the instructions. For example, a first value of a parameter may be obtained from the image or video of the procedure and a second value of the parameter may be obtained from the instructions. Comparison data may be generated from the first value and the second value and presented to the local doctor and/or the remote doctor.
摘要翻译: 当当地医生执行医疗程序时,可以通过向执行医疗程序的当地医生提供反馈,从而促进远程指导和医疗保健监测。 可以从远程医生接收执行医疗程序的说明,并传送给当地医生。 当地医生可以开始执行医疗程序,当当地医生正在执行医疗程序时,医疗程序的图像或视频可以被处理并与说明进行比较。 例如,可以从过程的图像或视频获得参数的第一值,并且可以从指令获得参数的第二值。 可以从第一值和第二值生成比较数据,并呈现给当地医生和/或远程医生。
-
公开(公告)号:US09576589B2
公开(公告)日:2017-02-21
申请号:US15016801
申请日:2016-02-05
申请人: KNUEDGE, INC.
发明人: David C Bradley , Yao Huang Morin
IPC分类号: G10L15/00 , G10L21/0232 , G10L21/0208 , G10L21/0264 , G10L25/90
CPC分类号: G10L21/038 , G10L15/02 , G10L17/02 , G10L21/0205 , G10L21/0208 , G10L21/0232 , G10L21/0264 , G10L21/0388 , G10L25/18 , G10L25/90
摘要: Devices, systems and methods are disclosed for reducing noise in input data by performing a hysteresis operation followed by a lateral excitation smoothing operation. For example, an audio signal may be represented as a sequence of feature vectors. A row of the sequence of feature vectors may, for example, be associated with the same harmonic of the audio signal at different points in time. To determine portions of the row that correspond to the harmonic being present, the system may compare an amplitude to a low threshold and a high threshold and select a series of data points that are above the low threshold and include at least one data point above the high threshold. The system may iteratively perform a spreading technique, spreading a center value of a center data point in a kernel to neighboring data points in the kernel, to further reduce noise.
摘要翻译: 公开了用于通过执行滞后操作,随后进行横向激励平滑操作来减少输入数据中的噪声的装置,系统和方法。 例如,音频信号可以表示为特征向量序列。 特征向量序列的一行可以例如在不同的时间点与音频信号的相同谐波相关联。 为了确定与存在的谐波相对应的行的部分,系统可以将幅度与低阈值和高阈值进行比较,并且选择高于低阈值的一系列数据点,并且包括至少一个数据点 高门槛。 系统可以迭代地执行扩展技术,将内核中的中心数据点的中心值扩展到内核中的相邻数据点,以进一步减少噪声。
-
公开(公告)号:US09298884B1
公开(公告)日:2016-03-29
申请号:US14573176
申请日:2014-12-17
申请人: VITAAX LLC
发明人: Talal Ali Ahmad
CPC分类号: G06F19/3418 , G06F3/00 , G06F3/011 , G06F3/0484 , G06F3/04883 , G06F3/1454 , G06F19/00 , G06K9/6202 , G06T15/00 , G09B23/28 , G09G2370/022 , G09G2380/08 , G16H80/00
摘要: Remote instruction and monitoring of health care may be facilitated by transmitting and processing image data of a patient, image data demonstrating a medical procedure, and image data of performance of the medical procedure. Image data of a patient may be transmitted to a remote doctor. Remote doctor may view the image data of the patient and provide a demonstration of a medical procedure. Image data of the demonstration may be provided to a local doctor who may perform the medical procedure. Image data of the performance of the procedure may be compared with image data of the demonstration of the procedure, and a result of the comparison may be transmitted to local doctor or remote doctor.
摘要翻译: 可以通过发送和处理患者的图像数据,展示医疗程序的图像数据和医疗程序的执行的图像数据来促进医疗保健的远程指示和监视。 患者的图像数据可以被传送给远程医生。 远程医生可以查看患者的图像数据并提供医疗程序的演示。 演示的图像数据可以提供给可以执行医疗程序的当地医生。 可以将该过程的执行的图像数据与该过程的演示的图像数据进行比较,并且将该比较的结果发送给当地医生或远程医生。
-
公开(公告)号:US09558734B2
公开(公告)日:2017-01-31
申请号:US15138614
申请日:2016-04-26
申请人: VocaliD, Inc.
IPC分类号: G10L13/00 , G10L13/027 , G10L13/047 , G10L13/033 , G10L13/06
CPC分类号: G10L13/06 , G10L13/033 , G10L13/0335 , G10L25/48 , G10L2021/0135
摘要: A voice recipient may request a text-to-speech (TTS) voice that corresponds to an age or age range. An existing TTS voice or existing voice data may be used to create a TTS voice corresponding to the requested age by encoding the voice data to voice parameter values, transforming the voice parameter values using a voice-aging model, synthesizing voice data using the transformed parameter values, and then creating a TTS voice using the transformed voice data. The voice-aging model may model how one or more voice parameters of a voice change with age and may be created from voice data stored in a voice bank.
摘要翻译: 语音接收者可以请求对应于年龄或年龄范围的文本到语音(TTS)语音。 可以使用现有TTS语音或现有语音数据来通过将语音数据编码到语音参数值来创建与所请求年龄相对应的TTS语音,使用语音老化模型变换语音参数值,使用变换参数合成语音数据 值,然后使用变换的语音数据创建TTS语音。 语音老化模型可以模拟语音的一个或多个语音参数随着年龄而改变,并且可以从存储在语音库中的语音数据创建。
-
公开(公告)号:US09336782B1
公开(公告)日:2016-05-10
申请号:US14753233
申请日:2015-06-29
申请人: VocaliD, Inc.
发明人: Rupal Patel
IPC分类号: G10L13/00 , G10L17/22 , G10L13/033 , G10L13/06 , G10L13/04
CPC分类号: G10L17/22 , G10L13/00 , G10L13/027 , G10L13/033 , G10L13/04 , G10L13/06 , G10L17/24
摘要: Voice data may be collected by a plurality of voice donors and stored in a voice bank. A voice donor may authenticate to a voice collection system to start a session to provide voice data. During the voice collection session, the voice donor may be presented with a sequence of prompts to speak and voice data may be transferred to a server. The received voice data may be processed to determine the speech units spoken by the voice donor and a count of speech units received from the voice donor may be updated. Feedback may be provided to the voice donor indicating, for example, a progress of the voice collection, a quality level of the voice data, or information about speech unit counts. The voice bank may be used to create TTS voices for voice recipients, create a model of voice aging, or for other applications.
摘要翻译: 语音数据可以由多个语音提供者收集并存储在语音库中。 语音授权者可以向语音收集系统认证以开始会话以提供语音数据。 在语音收集会话期间,可以向语音提供者呈现一系列提示语句,并且语音数据可以被传送到服务器。 可以处理所接收的语音数据以确定由语音授权者说出的语音单元,并且可以更新从语音授权者接收的语音单元的计数。 可以向语音供体提供反馈,指示例如语音收集的进展,语音数据的质量水平或关于语音单元计数的信息。 语音库可用于为语音接收者创建TTS语音,创建语音老化模型或其他应用程序。
-
-
-
-