Patent search ap:("AT&T INTELLECTUAL PROPERTY I Page L.P.") AND inv:"Dimitrios Dimitriadis"

21.

发明申请
METHOD AND APPARATUS FOR PROCESSING COMMANDS DIRECTED TO A MEDIA CENTER 审中-公开

公开(公告)号：US20190141385A1

公开(公告)日：2019-05-09

申请号：US16239924

申请日：2019-01-04

Applicant: AT&T Intellectual Property I, L.P.

Inventor： Dimitrios Dimitriadis , Horst Juergen Schroeter

IPC: H04N21/4223 , G06F3/038 , H04N21/439 , H04N21/44 , H04N21/4402 , G06F3/01 , H04N5/44 , H04N21/442 , G06F3/16 , H04N21/422 , G06F3/03 , H04N21/47 , H04N21/45 , H04N21/4415

CPC classification number: H04N21/4223 , G06F3/011 , G06F3/017 , G06F3/0304 , G06F3/038 , G06F3/167 , G06F2203/0381 , H04N5/4403 , H04N21/42203 , H04N21/4394 , H04N21/44008 , H04N21/440236 , H04N21/4415 , H04N21/44218 , H04N21/4532 , H04N21/47 , H04N2005/4428 , H04N2005/4432 , H04N2005/4442

Abstract: A system that incorporates teachings of the subject disclosure may include, for example, a method that identifies first and second gestures of first and second viewers in a proximity of a media center and associates the first and second gestures with first and second command A conflict is determined between the first and second commands and in response a notification is provided via the media center. The notification requests a resolution to the conflict. A cue is detected from a viewer responsive to the presenting of the notification. The cue identifies a selected one of the first viewer or the second viewer and control of the media center is assigned to one of the first viewer or the second viewer responsive to the cue. Other embodiments are disclosed.

22.

发明授权
Sensor enhanced speech recognition 有权

公开(公告)号：US10083350B2

公开(公告)日：2018-09-25

申请号：US15868546

申请日：2018-01-11

Applicant: AT&T Intellectual Property I, L.P.

Inventor： Dimitrios Dimitriadis , Donald J. Bowen , Mazin E. Gilbert , Horst J. Schroeter

IPC: G10L15/25 , G06K9/00 , G10L15/065 , G10L15/22

CPC classification number: G06K9/00335 , G06K9/00664 , G10L15/065 , G10L2015/227 , G10L2015/228

Abstract: A system for sensor enhanced speech recognition is disclosed. The system may obtain visual content or other content associated with a user and an environment of the user. Additionally, the system may obtain, from the visual content, metadata associated with the user and the environment of the user. The system may also include determining, based on the visual content and metadata, if the user is speaking. If the user is determined to be speaking, the system may obtain audio content associated with the user and the environment. The system may then adapt, based on the visual content, audio content, and metadata, one or more acoustic models that match the user and the environment. Once the one or more acoustic models are adapted and loaded, the system may enhance a speech recognition process or other process associated with the user.

23.

发明申请
Exploiting Visual Information For Enhancing Audio Signals Via Source Separation And Beamforming 审中-公开

公开(公告)号：US20180181812A1

公开(公告)日：2018-06-28

申请号：US15905442

申请日：2018-02-26

Applicant: AT&T Intellectual Property I, L.P.

Inventor： Dimitrios Dimitriadis , Donald J. Bowen , Lusheng Ji , Horst J. Schroeter

IPC: G06K9/00 , H04R5/04 , G10L21/0208

CPC classification number: G06K9/00684 , G06F3/165 , G10L21/0208 , H04R5/04 , H04R2430/20 , H04R2460/07

Abstract: A system for exploiting visual information for enhancing audio signals via source separation and beamforming is disclosed. The system may obtain visual content associated with an environment of a user, and may extract, from the visual content, metadata associated with the environment. The system may determine a location of the user based on the extracted metadata. Additionally, the system may load, based on the location, an audio profile corresponding to the location of the user. The system may also load a user profile of the user that includes audio data associated with the user. Furthermore, the system may cancel, based on the audio profile and user profile, noise from the environment of the user. Moreover, the system may include adjusting, based on the audio profile and user profile, an audio signal generated by the user so as to enhance the audio signal during a communications session of the user.

24.

发明申请
Exploiting Visual Information For Enhancing Audio Signals Via Source Separation And Beamforming 有权
Title translation: 利用视觉信息，通过源分离和波束成形来增强音频信号

公开(公告)号：US20150365759A1

公开(公告)日：2015-12-17

申请号：US14302110

申请日：2014-06-11

Applicant: AT&T Intellectual Property I, L.P.

Inventor： Dimitrios Dimitriadis , Donald J. Bowen , Lusheng Ji , Horst J. Schroeter

IPC: H04R3/00 , G06K9/00

CPC classification number: G06K9/00684 , G10L21/0208 , H04R5/04 , H04R2430/20 , H04R2460/07

Abstract: A system for exploiting visual information for enhancing audio signals via source separation and beamforming is disclosed. The system may obtain visual content associated with an environment of a user, and may extract, from the visual content, metadata associated with the environment. The system may determine a location of the user based on the extracted metadata. Additionally, the system may load, based on the location, an audio profile corresponding to the location of the user. The system may also load a user profile of the user that includes audio data associated with the user. Furthermore, the system may cancel, based on the audio profile and user profile, noise from the environment of the user. Moreover, the system may include adjusting, based on the audio profile and user profile, an audio signal generated by the user so as to enhance the audio signal during a communications session of the user.

Abstract translation: 公开了一种利用视觉信息通过源分离和波束成形来增强音频信号的系统。系统可以获得与用户的环境相关联的可视内容，并且可以从视觉内容中提取与环境相关联的元数据。系统可以基于所提取的元数据来确定用户的位置。另外，系统可以基于位置加载与用户的位置相对应的音频简档。系统还可以加载包括与用户相关联的音频数据的用户的用户简档。此外，系统可以基于音频简档和用户简档来取消来自用户的环境的噪声。此外，系统可以包括基于音频简档和用户简档调整由用户生成的音频信号，以便在用户的通信会话期间增强音频信号。

25.

发明申请
EXPLOITING VISUAL INFORMATION FOR ENHANCING AUDIO SIGNALS VIA SOURCE SEPARATION AND BEAMFORMING 有权

公开(公告)号：US20220180632A1

公开(公告)日：2022-06-09

申请号：US17652497

申请日：2022-02-25

Applicant: AT&T INTELLECTUAL PROPERTY I, L.P.

Inventor： Dimitrios Dimitriadis , Donald J. Bowen , Lusheng Ji , Horst J. Schroeter

IPC: G06V20/00 , G10L21/0208 , G06F3/16 , H04R5/04

Abstract: A system for exploiting visual information for enhancing audio signals via source separation and beamforming is disclosed. The system may obtain visual content associated with an environment of a user, and may extract, from the visual content, metadata associated with the environment. The system may determine a location of the user based on the extracted metadata. Additionally, the system may load, based on the location, an audio profile corresponding to the location of the user. The system may also load a user profile of the user that includes audio data associated with the user. Furthermore, the system may cancel, based on the audio profile and user profile, noise from the environment of the user. Moreover, the system may include adjusting, based on the audio profile and user profile, an audio signal generated by the user so as to enhance the audio signal during a communications session of the user.

26.

发明授权
Pre-distortion system for cancellation of nonlinear distortion in mobile devices 有权

公开(公告)号：US11206332B2

公开(公告)日：2021-12-21

申请号：US16586269

申请日：2019-09-27

Applicant: AT&T Intellectual Property I, L.P.

Inventor： Horst J. Schroeter , Donald J. Bowen , Dimitrios Dimitriadis , Lusheng Ji

IPC: H04M9/08 , H04R3/04 , H04R29/00 , G10L21/0208

Abstract: A pre-distortion system for improved mobile device communications via cancellation of nonlinear distortion is disclosed. The pre-distortion system may transmit an acoustic signal from a network to a device, wherein the acoustic signal includes a linear signal and a nonlinear cancellation signal that cancels at least a portion of nonlinear distortions created once a loudspeaker in the device emits the linear signal. Thus, when a loudspeaker of a mobile device is operating and nonlinear distortions are generated by the loudspeaker or adjacent components of the mobile device in close proximity to the loudspeaker, the pre-distortion system may create one or more nonlinear cancellation signals in the network. The nonlinear cancellation signal may be combined with the linear signal sent to the loudspeaker to cancel the nonlinear distortion signal created by the loudspeaker emitting acoustic sounds from the linear signal. Thus, the nonlinear cancellation signal becomes a pre-distortion signal.

27.

发明授权
Method and apparatus for processing commands directed to a media center 有权

公开(公告)号：US10743058B2

公开(公告)日：2020-08-11

申请号：US16239924

申请日：2019-01-04

Applicant: AT&T Intellectual Property I, L.P.

Inventor： Dimitrios Dimitriadis , Horst Juergen Schroeter

IPC: H04N21/4223 , H04N21/439 , H04N21/44 , H04N21/4402 , H04N5/44 , H04N21/442 , H04N21/47 , H04N21/45 , H04N21/4415 , G06F3/038 , G06F3/01 , G06F3/16 , G06F3/03 , H04N21/422

Abstract: A system that incorporates teachings of the subject disclosure may include, for example, a method that identifies first and second gestures of first and second viewers in a proximity of a media center and associates the first and second gestures with first and second command A conflict is determined between the first and second commands and in response a notification is provided via the media center. The notification requests a resolution to the conflict. A cue is detected from a viewer responsive to the presenting of the notification. The cue identifies a selected one of the first viewer or the second viewer and control of the media center is assigned to one of the first viewer or the second viewer responsive to the cue. Other embodiments are disclosed.

28.

发明授权
Exploiting visual information for enhancing audio signals via source separation and beamforming 有权

公开(公告)号：US10402651B2

公开(公告)日：2019-09-03

申请号：US15905442

申请日：2018-02-26

Applicant: AT&T Intellectual Property I, L.P.

Inventor： Dimitrios Dimitriadis , Donald J. Bowen , Lusheng Ji , Horst J. Schroeter

IPC: G06F17/00 , G06K9/00 , G10L21/0208 , G06F3/16 , H04R5/04

Abstract: A system for exploiting visual information for enhancing audio signals via source separation and beamforming is disclosed. The system may obtain visual content associated with an environment of a user, and may extract, from the visual content, metadata associated with the environment. The system may determine a location of the user based on the extracted metadata. Additionally, the system may load, based on the location, an audio profile corresponding to the location of the user. The system may also load a user profile of the user that includes audio data associated with the user. Furthermore, the system may cancel, based on the audio profile and user profile, noise from the environment of the user. Moreover, the system may include adjusting, based on the audio profile and user profile, an audio signal generated by the user so as to enhance the audio signal during a communications session of the user.

29.

发明申请
SENSOR ENHANCED SPEECH RECOGNITION 审中-公开

公开(公告)号：US20180137348A1

公开(公告)日：2018-05-17

申请号：US15868546

申请日：2018-01-11

Applicant: AT&T Intellectual Property I, L.P.

Inventor： Dimitrios Dimitriadis , Donald J. Bowen , Mazin E. Gilbert , Horst J. Schroeter

IPC: G06K9/00

CPC classification number: G06K9/00335 , G06K9/00664 , G10L15/065 , G10L2015/227 , G10L2015/228

Abstract: A system for sensor enhanced speech recognition is disclosed. The system may obtain visual content or other content associated with a user and an environment of the user. Additionally, the system may obtain, from the visual content, metadata associated with the user and the environment of the user. The system may also include determining, based on the visual content and metadata, if the user is speaking. If the user is determined to be speaking, the system may obtain audio content associated with the user and the environment. The system may then adapt, based on the visual content, audio content, and metadata, one or more acoustic models that match the user and the environment. Once the one or more acoustic models are adapted and loaded, the system may enhance a speech recognition process or other process associated with the user.

30.

发明授权
Pre-distortion system for cancellation of nonlinear distortion in mobile devices 有权

公开(公告)号：US09973633B2

公开(公告)日：2018-05-15

申请号：US14543261

申请日：2014-11-17

Applicant: AT&T Intellectual Property I, L.P.

Inventor： Horst J. Schroeter , Donald J. Bowen , Dimitrios Dimitriadis , Lusheng Ji

IPC: H04M9/08 , G10L21/0208

CPC classification number: H04M9/082 , G10L2021/02082

Abstract: A pre-distortion system for improved mobile device communications via cancellation of nonlinear distortion is disclosed. The pre-distortion system may transmit an acoustic signal from a network to a device, wherein the acoustic signal includes a linear signal and a nonlinear cancellation signal that cancels at least a portion of nonlinear distortions created once a loudspeaker in the device emits the linear signal. Thus, when a loudspeaker of a mobile device is operating and nonlinear distortions are generated by the loudspeaker or adjacent components of the mobile device in close proximity to the loudspeaker, the pre-distortion system may create one or more nonlinear cancellation signals in the network. The nonlinear cancellation signal may be combined with the linear signal sent to the loudspeaker to cancel the nonlinear distortion signal created by the loudspeaker emitting acoustic sounds from the linear signal. Thus, the nonlinear cancellation signal becomes a pre-distortion signal.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification