摘要:
Methods and apparatus are provided for diagnosing a vehicle. In one embodiment, a method includes: initiating, by a processor, a recording of a noise by at least one microphone based on user selection data from a user of the vehicle; receiving, by the processor, audio signal data based on the recording; generating, by the processor, vector data based on the audio signal data; processing, by the processor, the vector data with at least one trained machine, by the processor, learning model to determine a classification of the noise; predicting, by the processor, an action to be taken based on the classification; and storing, by the processor, the audio signal data, the classification, and the action in a datastore.
摘要:
An infotainment system of a vehicle includes: a primary intent module configured to determine a primary intent included in voice input using automated speech recognition (ASR); and an execution module configured to, via a first hardware output device of the vehicle, execute the primary intent. A secondary intent module is configured to: based on the primary intent, determine a first domain of the primary intent; based on the first domain of the primary intent, determine a second domain; and based on the voice input and the second domain, determine a secondary intent included in the voice input using ASR. A display control module is configured to display a request for user input indicative of whether to execute the secondary intent. The execution module is further configured to, via a second hardware output device of the vehicle, execute the secondary intent in response to user input to execute the secondary intent.
摘要:
A processor receives a broadcast in a vehicle, select audio data from the broadcast, processes the audio data selected from the broadcast, determines a phonetic pattern of the selected audio data based on the processing, selects additional instances of audio data from the broadcast that resemble the selected audio data, processes the additional instances of audio data from the broadcast, determine phonetic patterns of the additional instances of audio data, and selects a plurality of phonetic patterns from the phonetic pattern of the selected audio data and the phonetic patterns of the additional instances of audio data. A transmitter transmits the plurality of phonetic patterns to a server to determine an optimal pronunciation of the selected audio data based on a statistical analysis of the plurality of phonetic patterns and to add the optimal pronunciation of the selected audio data to a database used to recognize speech in the vehicle.
摘要:
A system and method of performing speech arbitration at a client device that includes a neural network speech arbitration application, wherein the neural network speech arbitration application is configured to implement a neural network speech arbitration process, and wherein the method includes: receiving speech signals at a client device; generating and/or obtaining a set of inputs to be used in a speech arbitration neural network process, wherein the speech arbitration neural network process uses a neural network model that is tailored to speech arbitration and that can be used to determine whether and/or to what extent speech recognition processing of the received speech signals should be carried out at the client device; and receiving a speech arbitration output that indicates whether and/or to what extent the speech recognition processing of the received speech signals is to be carried out at the client device or at the remote server.
摘要:
A system and method of identifying and generating preferred emojis includes: detecting at a wireless device a plurality of selected emoji; determining the frequency with which each emoji is selected; identifying a defined number of emojis from the plurality of selected emojis based on the frequency with which each emoji is selected; and creating a frequently-used emoji library for the identified emojis.
摘要:
A system and method of controlling an automatic speech recognition (ASR) system includes: receiving speech at the ASR system from a vehicle occupant that includes a command to control a vehicle function; identifying a gate command from the speech; associating the identified gate command with the command to control the vehicle function; storing the associated gate command and vehicle command in a database; receiving additional speech at the ASR system from the vehicle occupant; detecting the gate command in the additional speech; and accessing the stored gate command and vehicle command from the database.
摘要:
A system and method of adjusting digital audio sampling used with wideband audio includes: performing audio sampling on an analog audio signal at an initial sampling rate and an initial bit rate over a wideband audio frequency range; generating a digital audio signal based on the audio sampling; detecting a qualitative error rate between the analog audio signal and the digital audio signal; and decreasing the initial sampling rate, the initial bit rate, or both for sampling subsequent analog audio when the qualitative error is below a threshold.
摘要:
An automatic speech recognition engine and a method of using the engine is described. The method pertains to front-end processing an audio signal and includes the steps of: identifying a plurality of voiced-frames of the audio signal; determining that one or more of the plurality of voiced-frames have a signal-to-noise (SNR) value greater than a first predetermined threshold; and based on the determination, bypassing noise suppression for the one or more of the plurality of voiced-frames.
摘要:
A method for processing a plurality of audio streams at a computer system onboard a vehicle is provided. The method receives the plurality of audio streams from a plurality of locations within a vehicle; prioritizes each of the plurality of audio streams to obtain a prioritization result; and completes a task associated with each of the plurality of audio streams, according to the prioritization result.
摘要:
At least first and second microphones with different frequency responses form part of a speech recognition system. The microphones are coupled to a processor that is configured to recognize a spoken word based on the microphone signals. The processor classifies the spoken word, and weights the signals from the microphones based on the classification of the spoken word.