-
Publication number: US12058391B2
Publication date: 2024-08-06
Application number: US17854572
Filing date: 2022-06-30
Applicant: Amazon Technologies, Inc.
Inventor: Yongjun Wu , Sitaraman Ganapathy , Vasanthakumar Soundararajan , Nikhil Sharma
IPC: H04N21/234 , H04N21/81 , H04N21/845
CPC classification number: H04N21/23424 , H04N21/812 , H04N21/8455
Abstract: A system for utilizing media content reference point information to perform media content encoding, and supplemental content stitching and/or insertion. Media content can be encoded and packaged based on boundaries of the media content. The boundaries can be received from a third-party and/or generated via an automated process. Target boundaries can be selected based on accuracy levels associated with the received and/or generated boundaries. Supplemental content can be stitched and/or inserted into packaged media content based on audio and video content of the packaged media content being aligned.
-
Publication number: US20230113297A1
Publication date: 2023-04-13
Application number: US17836330
Filing date: 2022-06-09
Applicant: Amazon Technologies, Inc.
Inventor: Antonio Bonafonte , Panagiotis Agis Oikonomou Filandras , Bartosz Perz , Arent van Korlaar , Ioannis Douratsos , Jonas Felix Ananda Rohnke , Elena Sokolova , Andrew Paul Breen , Nikhil Sharma
IPC: G10L13/10 , G10L13/047 , G10L15/16 , G10L15/18 , G10L15/22
Abstract: A speech-processing system receives both text data and natural-understanding data (e.g., a domain, intent, and/or entity) related to a command represented in the text data. The system uses the natural-understanding data to vary vocal characteristics in determining spectrogram data corresponding to the text data based on the natural-understanding data.
-
Publication number: US20240007690A1
Publication date: 2024-01-04
Application number: US17854572
Filing date: 2022-06-30
Applicant: Amazon Technologies, Inc.
Inventor: Yongjun Wu , Sitaraman Ganapathy , Vasanthakumar Soundararajan , Nikhil Sharma
IPC: H04N21/234 , H04N21/845 , H04N21/81
CPC classification number: H04N21/23424 , H04N21/8455 , H04N21/812
Abstract: A system for utilizing media content reference point information to perform media content encoding, and supplemental content stitching and/or insertion. Media content can be encoded and packaged based on boundaries of the media content. The boundaries can be received from a third-party and/or generated via an automated process. Target boundaries can be selected based on accuracy levels associated with the received and/or generated boundaries. Supplemental content can be stitched and/or inserted into packaged media content based on audio and video content of the packaged media content being aligned.
-
Publication number: US11367431B2
Publication date: 2022-06-21
Application number: US16818542
Filing date: 2020-03-13
Applicant: Amazon Technologies, Inc.
Inventor: Antonio Bonafonte , Panagiotis Agis Oikonomou Filandras , Bartosz Perz , Arent van Korlaar , Ioannis Douratsos , Jonas Felix Ananda Rohnke , Elena Sokolova , Andrew Paul Breen , Nikhil Sharma
IPC: G10L13/10 , G10L13/047 , G10L15/16 , G10L15/18 , G10L15/22
Abstract: A speech-processing system receives both text data and natural-understanding data (e.g., a domain, intent, and/or entity) related to a command represented in the text data. The system uses the natural-understanding data to vary vocal characteristics in determining spectrogram data corresponding to the text data based on the natural-understanding data.
-
Publication number: US20210287656A1
Publication date: 2021-09-16
Application number: US16818542
Filing date: 2020-03-13
Applicant: Amazon Technologies, Inc.
Inventor: Antonio Bonafonte , Panagiotis Agis Oikonomou Filandras , Bartosz Perz , Arent van Korlaar , Ioannis Douratsos , Jonas Felix Ananda Rohnke , Elena Sokolova , Andrew Paul Breen , Nikhil Sharma
IPC: G10L13/10 , G10L15/22 , G10L15/18 , G10L13/047 , G10L15/16
Abstract: A speech-processing system receives both text data and natural-understanding data (e.g., a domain, intent, and/or entity) related to a command represented in the text data. The system uses the natural-understanding data to vary vocal characteristics in determining spectrogram data corresponding to the text data based on the natural-understanding data.
-
Publication number: US12287639B1
Publication date: 2025-04-29
Application number: US17545587
Filing date: 2021-12-08
Applicant: Amazon Technologies, Inc.
Inventor: Tiago Etiene Queiroz , Nikhil Sharma , Prashant Anand Srivastava
Abstract: Systems and techniques for generation, storage, and updating of common floor plans across shared autonomous mobile devices. The systems and techniques include extraction of semantically meaningful data from one or more occupancy maps to provide a floorplan to a user that aligns with the user's understanding of the space using heuristics and machine learning techniques. The techniques also enable repair of damaged or inconsistent floor plan geometries through the use of polygon triangulation and graph cuts.
-
Publication number: US20240428775A1
Publication date: 2024-12-26
Application number: US18823176
Filing date: 2024-09-03
Applicant: Amazon Technologies, Inc.
Inventor: Sebastian Dariusz Cygert , Daniel Korzekwa , Kamil Pokora , Piotr Tadeusz Bilinski , Kayoko Yanagisawa , Abdelhamid Ezzerg , Thomas Edward Merritt , Raghu Ram Sreepada Srinivas , Nikhil Sharma
IPC: G10L13/033 , G10L13/047 , G10L13/10
Abstract: Techniques for generating customized synthetic voices personalized to a user, based on user-provided feedback, are described. A system may determine embedding data representing a user-provided description of a desired synthetic voice and profile data associated with the user, and generate synthetic voice embedding data using synthetic voice embedding data corresponding to a profile associated with a user determined to be similar to the current user. Based on user-provided feedback with respect to a customized synthetic voice, generated using synthetic voice characteristics corresponding to the synthetic voice embedding data and presented to the user, and the synthetic voice embedding data, the system may generate new synthetic voice embedding data, corresponding to a new customized synthetic voice. The system may be configured to assign the customized synthetic voice to the user, such that a subsequent user may not be presented with the same customized synthetic voice.
-
Publication number: US11520332B1
Publication date: 2022-12-06
Application number: US16702008
Filing date: 2019-12-03
Applicant: Amazon Technologies, Inc.
Inventor: James Charles Zamiska , David Allen Fotland , Roger Robert Webster , Mohit Deshpande , Robert Franklin Ebert , Nikhil Sharma , Rachel Liao , Chang Young Kim
IPC: G05D1/00 , G01S17/931 , G05D1/02
Abstract: An autonomous mobile device (AMD) uses sensors to explore a physical space and determine the locations of obstacles. Simultaneous localization and mapping (SLAM) techniques are used by the AMD to designate as keyframes some images and their associated descriptors of features in the space. Each keyframe indicates a location and orientation of the AMD relative to those features. Anchors are specified relative to keyframes. A marker is specified relative to one or more anchors. Because markers are associated with features in the physical space, they maintain their association with the physical space through various processes such as SLAM loop closures. Markers may specify locations in the physical space, such as navigation waypoints, navigation destinations such as a goal pose for exploring an unexplored area, as an observation target to facilitate exploration, and so forth. Markers may also be used to specify block listed locations to be avoided during exploration.
-
Publication number: US12087270B1
Publication date: 2024-09-10
Application number: US17955961
Filing date: 2022-09-29
Applicant: Amazon Technologies, Inc.
Inventor: Sebastian Dariusz Cygert , Daniel Korzekwa , Kamil Pokora , Piotr Tadeusz Bilinski , Kayoko Yanagisawa , Abdelhamid Ezzerg , Thomas Edward Merritt , Raghu Ram Sreepada Srinivas , Nikhil Sharma
IPC: G10L15/16 , G10L13/033 , G10L13/047 , G10L13/10 , G10L15/06 , G10L25/30
CPC classification number: G10L13/033 , G10L13/047 , G10L13/10
Abstract: Techniques for generating customized synthetic voices personalized to a user, based on user-provided feedback, are described. A system may determine embedding data representing a user-provided description of a desired synthetic voice and profile data associated with the user, and generate synthetic voice embedding data using synthetic voice embedding data corresponding to a profile associated with a user determined to be similar to the current user. Based on user-provided feedback with respect to a customized synthetic voice, generated using synthetic voice characteristics corresponding to the synthetic voice embedding data and presented to the user, and the synthetic voice embedding data, the system may generate new synthetic voice embedding data, corresponding to a new customized synthetic voice. The system may be configured to assign the customized synthetic voice to the user, such that a subsequent user may not be presented with the same customized synthetic voice.
-
Publication number: US12055947B1
Publication date: 2024-08-06
Application number: US17545762
Filing date: 2021-12-08
Applicant: Amazon Technologies, Inc.
Inventor: Nikhil Sharma , Tiago Etiene Queiroz , Prashant Anand Srivastava , Tushar Agarwal , Gaurav Guruprasad Manur , Aarthi Raveendran
CPC classification number: G05D1/0274 , G05D1/0212 , G05D1/0238 , G06T17/20
Abstract: Systems and techniques for updating floor plans for use by autonomous mobile devices. The techniques include accessing a first floor plan with associated spatial metadata. An updated occupancy map is received with additional geographic updates over the first floor plan. A transformation of the first floor plan is determined to match the new occupancy map. The spatial data and the first floor plan are transformed and the transformed spatial metadata is associated with the new occupancy map to form a second floor plan.