- 专利标题: Self-supervised pitch estimation
-
申请号: US17640579申请日: 2020-09-25
-
公开(公告)号: US11756530B2公开(公告)日: 2023-09-12
- 发明人: Marco Tagliasacchi , Mihajlo Velimirovic , Matthew Sharifi , Dominik Roblek , Christian Frank , Beat Gfeller
- 申请人: GOOGLE LLC
- 申请人地址: US CA Mountain View
- 专利权人: Google LLC
- 当前专利权人: Google LLC
- 当前专利权人地址: US CA Mountain View
- 代理机构: McDonnell Boehnen Hulbert & Berghoff LLP
- 国际申请: PCT/US2020/052722 2020.09.25
- 国际公布: WO2021/076297A 2021.04.22
- 进入国家日期: 2022-03-04
- 主分类号: G10L15/06
- IPC分类号: G10L15/06 ; G10L21/013 ; G10L25/30 ; G10L25/90
摘要:
Example embodiments relate to techniques for training artificial neural networks or oilier machine-learning encoders to accurately predict the pitch of input audio samples in a semitone or otherwise logarithmically-scaled pitch space. An example method may include generating, from a sample of audio data, two training samples by applying two different pitch shifts to the sample of audio training data. This can be done by converting the sample of audio data into the frequency domain and then shifting the transformed data. These known shifts are then compared to the predicted pitches generated by applying the two training samples to the encoder. The encoder is then updated based on the comparison, such that the relative pitch output by the encoder is improved with respect to accuracy. One or more audio samples, labeled with absolute pitch values, can then be used to calibrate the relative pitch values generated by the trained encoder.
公开/授权文献
- US20220343896A1 SELF-SUPERVISED PITCH ESTIMATION 公开/授权日:2022-10-27
信息查询