摘要:
Algorithms for synthesizing speech used to identify media assets are provided. Speech may be selectively synthesized form text strings associated with media assets. A text string may be normalized and its native language determined for obtaining a target phoneme for providing human-sounding speech in a language (e.g., dialect or accent) that is familiar to a user. The algorithms may be implemented on a system including several dedicated render engines. The system may be part of a back end coupled to a front end including storage for media assets and associated synthesized speech, and a request processor for receiving and processing requests that result in providing the synthesized speech. The front end may communicate media assets and associated synthesized speech content over a network to host devices coupled to portable electronic devices on which the media assets and synthesized speech are played back.
摘要:
Algorithms for synthesizing speech used to identify media assets are provided. Speech may be selectively synthesized form text strings associated with media assets. A text string may be normalized and its native language determined for obtaining a target phoneme for providing human-sounding speech in a language (e.g., dialect or accent) that is familiar to a user. The algorithms may be implemented on a system including several dedicated render engines. The system may be part of a back end coupled to a front end including storage for media assets and associated synthesized speech, and a request processor for receiving and processing requests that result in providing the synthesized speech. The front end may communicate media assets and associated synthesized speech content over a network to host devices coupled to portable electronic devices on which the media assets and synthesized speech are played back.
摘要:
Algorithms for synthesizing speech used to identify media assets are provided. Speech may be selectively synthesized form text strings associated with media assets. A text string may be normalized and its native language determined for obtaining a target phoneme for providing human-sounding speech in a language (e.g., dialect or accent) that is familiar to a user. The algorithms may be implemented on a system including several dedicated render engines. The system may be part of a back end coupled to a front end including storage for media assets and associated synthesized speech, and a request processor for receiving and processing requests that result in providing the synthesized speech. The front end may communicate media assets and associated synthesized speech content over a network to host devices coupled to portable electronic devices on which the media assets and synthesized speech are played back.
摘要:
Algorithms for synthesizing speech used to identify media assets are provided. Speech may be selectively synthesized form text strings associated with media assets. A text string may be normalized and its native language determined for obtaining a target phoneme for providing human-sounding speech in a language (e.g., dialect or accent) that is familiar to a user. The algorithms may be implemented on a system including several dedicated render engines. The system may be part of a back end coupled to a front end including storage for media assets and associated synthesized speech, and a request processor for receiving and processing requests that result in providing the synthesized speech. The front end may communicate media assets and associated synthesized speech content over a network to host devices coupled to portable electronic devices on which the media assets and synthesized speech are played back.
摘要:
Algorithms for synthesizing speech used to identify media assets are provided. Speech may be selectively synthesized form text strings associated with media assets. A text string may be normalized and its native language determined for obtaining a target phoneme for providing human-sounding speech in a language (e.g., dialect or accent) that is familiar to a user. The algorithms may be implemented on a system including several dedicated render engines. The system may be part of a back end coupled to a front end including storage for media assets and associated synthesized speech, and a request processor for receiving and processing requests that result in providing the synthesized speech. The front end may communicate media assets and associated synthesized speech content over a network to host devices coupled to portable electronic devices on which the media assets and synthesized speech are played back.
摘要:
This invention is directed to an audio menu provided in an electronic device having no display. The electronic device can further include an input interface having only a single sensing element (e.g., a single button) for controlling audio playback of the device and for accessing and controlling the device audio menu. In response to a particular input detected by the single sensing element, the electronic device can enable an audio menu mode and play back audio clips associated with different menu options. The user can provide selection instructions using the single sensing element during the playback of an audio clip to select the menu option associated with the played back audio clip. In some embodiments, the audio menu can be multi-dimensional (e.g., the device plays back audio clips for sub-options in response to a selection of a menu option). Suitable menu options can include, for example, groupings of audio (e.g., playlists), options to toggle (e.g., a shuffle option), or options associated with particular metadata tags associated with audio available to the device.
摘要:
This invention is directed to an audio menu provided in an electronic device having no display. The electronic device can further include an input interface having only a single sensing element (e.g., a single button) for controlling audio playback of the device and for accessing and controlling the device audio menu. In response to a particular input detected by the single sensing element, the electronic device can enable an audio menu mode and play back audio clips associated with different menu options. The user can provide selection instructions using the single sensing element during the playback of an audio clip to select the menu option associated with the played back audio clip. In some embodiments, the audio menu can be multi-dimensional (e.g., the device plays back audio clips for sub-options in response to a selection of a menu option). Suitable menu options can include, for example, groupings of audio (e.g., playlists), options to toggle (e.g., a shuffle option), or options associated with particular metadata tags associated with audio available to the device.
摘要:
Systems and methods are provided that allow steps to be counted in multiple pedometer modes. For example, a pedometer application on an electronic device can count steps in an ambient mode of operation and a session mode of operation. By maintaining separate counters for each pedometer mode and continuously monitoring the desired mode of operation, the pedometer application can accurately count the number of steps during a certain time period. In response to detecting a change in the desired mode, the pedometer application can seamlessly switch to the corresponding mode counter. In some embodiments, the pedometer application can display graphical representations of historical step information for one or more pedometer modes.
摘要:
This is directed to obfuscating a display to secure the display of information provided to a user. An electronic device can modulate the display of information using different approaches, including for example by adding artifacts or changing the color, frequency or polarity of displayed information, thus obfuscating the display. To view the displayed information, a user can place a filter between the user and the display (e.g., as part of glasses) such that the filter can remove or undo the obfuscation. In some embodiments, the device can display different confidential information for several users simultaneously, where different obfuscation approaches are used for each user. This can allow several users to interact with the device simultaneously while ensuring that each user's information remains confidential.
摘要:
This invention is directed to playing back streamed media items using an electronic device. In particular, this is directed to locally storing one or more clips corresponding to a media item such that the clips can be immediately played back in response to a user request to play back the media item. While the clips are played back, the electronic device can retrieve the remaining segments of the media item from the user's media library as a media stream over a communications network. Once the playback of the clip is complete, the electronic device can seamlessly switch playback to the media stream received from the user's library. The electronic device can authenticate with the remote library using any suitable approach, including for example one or more passwords and tokens.