SYSTEM AND METHOD FOR IMPROVING AIR TRAFFIC COMMUNICATION (ATC) TRANSCRIPTION ACCURACY BY INPUT OF PILOT RUN-TIME EDITS

    Publication No.: US20230267917A1

    Publication date: 2023-08-24

    Application No.: US17658332

    Filing date: 2022-04-07

    CPC classification number: G10L15/063 G10L15/26 G06F40/166 G06F3/0482

    Abstract: Systems and methods are provided for training an Automatic Speech Recognition (ASR) model during runtime of a transcription system. The system includes a background processor configured to operate with the transcription system to display a speech-to-text sample of an audio segment of a cockpit communication, converted using an ASR model, together with an identifier. The background processor receives a response from a user during runtime, while the speech-to-text sample is displayed, and changes the identifier to either a positive or a negative attribute once the user, by reviewing the displayed content, has determined whether the ASR conversion of the sample was correct. The ASR model is then trained on information associated with the content of the speech-to-text sample in accordance with the user's response.
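
The review-and-label loop in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `TranscriptSample`, `FeedbackCollector`, and `review`, the label strings, and the idea of storing an optional pilot-corrected text are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class TranscriptSample:
    """A displayed speech-to-text sample (hypothetical structure)."""
    audio_id: str
    text: str
    label: str = "unreviewed"  # the identifier shown next to the transcript


@dataclass
class FeedbackCollector:
    """Collects (audio_id, target_text) pairs for later ASR training."""
    training_pairs: List[Tuple[str, str]] = field(default_factory=list)

    def review(self, sample: TranscriptSample, is_correct: bool,
               corrected_text: Optional[str] = None) -> TranscriptSample:
        if is_correct:
            # User confirmed the transcript: keep it as a training target.
            sample.label = "positive"
            self.training_pairs.append((sample.audio_id, sample.text))
        else:
            # User rejected the transcript: only an explicit correction
            # (an assumption of this sketch) becomes a training target.
            sample.label = "negative"
            if corrected_text is not None:
                self.training_pairs.append((sample.audio_id, corrected_text))
        return sample
```

A rejected sample without a correction changes only its label; whether such samples should still influence training is left open here.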

    Methods and apparatus for voice-activated control of an interactive display

    Publication No.: US10198246B2

    Publication date: 2019-02-05

    Application No.: US15242104

    Filing date: 2016-08-19

    Abstract: A method for controlling an interactive display is provided. The method receives a set of voice input data, via a voice input device communicatively coupled to the interactive display; interprets, by at least one processor, the set of voice input data to produce an interpreted result, wherein the at least one processor is communicatively coupled to the voice input device and the interactive display; presents, by the interactive display, a text representation of the interpreted result coupled to a user-controlled cursor; receives, by a user interface, a user input selection of a textual or graphical element presented by the interactive display, wherein the user interface is communicatively coupled to the at least one processor and the interactive display; and performs, by the at least one processor, an operation associated with the interpreted result and the user input selection.

    SYSTEM AND METHOD FOR CORRECTING ACCENT INDUCED SPEECH IN AN AIRCRAFT COCKPIT UTILIZING A DYNAMIC SPEECH DATABASE

    Publication No.: US20150100311A1

    Publication date: 2015-04-09

    Application No.: US14047518

    Filing date: 2013-10-07

    CPC classification number: G10L15/07 G08G5/0013 G08G5/0021 G10L15/22

    Abstract: A system and method for recognizing speech on board an aircraft that compensates for different regional dialects over an area comprised of at least first and second distinct geographical regions, comprises analyzing speech in the first distinct geographical region using speech data characteristics representative of speech in the first distinct geographical region, detecting a change in position from the first distinct geographical region to the second geographical region, and analyzing speech in the second distinct geographical region using speech data characteristics representative of speech in the second distinct geographical region upon detecting that the aircraft has transitioned from the first distinct geographical region to the second distinct geographical region.
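
The region-triggered switch of speech data described here might look like the following Python sketch. The bounding-box representation of a region, the class name `RegionalModelSelector`, and its methods are hypothetical; the abstract does not say how regions or speech data characteristics are represented.

```python
from typing import Dict, Optional, Tuple

# (lat_min, lat_max, lon_min, lon_max); a simplification assumed for this sketch
BBox = Tuple[float, float, float, float]


class RegionalModelSelector:
    """Tracks which region's speech data should be active for recognition."""

    def __init__(self, regions: Dict[str, BBox]):
        self.regions = regions
        self.active: Optional[str] = None

    def locate(self, lat: float, lon: float) -> Optional[str]:
        # Return the first region whose bounding box contains the position.
        for name, (lat_min, lat_max, lon_min, lon_max) in self.regions.items():
            if lat_min <= lat <= lat_max and lon_min <= lon <= lon_max:
                return name
        return None

    def update(self, lat: float, lon: float) -> bool:
        """Switch the active region if the aircraft has entered a new one.

        Returns True when a transition was detected.
        """
        region = self.locate(lat, lon)
        if region is not None and region != self.active:
            self.active = region
            return True
        return False
```

A caller would reload the recognizer's region-specific speech data whenever `update` returns True; positions outside every known region leave the current selection unchanged.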

    Adaptive speech recognition methods and systems

    Publication No.: US12190861B2

    Publication date: 2025-01-07

    Application No.: US17659596

    Filing date: 2022-04-18

    Abstract: Methods and systems are provided for assisting operation of a vehicle using speech recognition. One method involves analyzing a transcription of an audio communication with respect to the vehicle to characterize a nonstandard pattern within the transcription of the audio communication, obtaining a ground truth for the transcription of the audio communication, determining one or more performance metrics associated with the nonstandard pattern within the transcription based on a relationship between the transcription of the audio communication and the ground truth for the transcription, updating a speech recognition vocabulary for the vehicle to include the nonstandard pattern based at least in part on the one or more performance metrics and determining an updated speech recognition model for the vehicle using the updated speech recognition vocabulary and the audio communication.
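
The abstract does not name the performance metrics it uses. As one plausible example, the Python sketch below computes word error rate (WER) between a transcription and its ground truth, then adds a nonstandard pattern to the vocabulary when the error rate exceeds a threshold. The choice of WER, the threshold value, its direction, and the function names are all assumptions.

```python
from typing import Set


def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    prev = list(range(len(hyp) + 1))  # distances for an empty reference prefix
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1] / len(ref)


def maybe_add_to_vocabulary(vocabulary: Set[str], pattern: str,
                            wer: float, threshold: float = 0.2) -> bool:
    """Add the pattern when recognition is poor (assumed policy)."""
    if wer > threshold and pattern not in vocabulary:
        vocabulary.add(pattern)
        return True
    return False
```

Retraining the recognition model on the updated vocabulary and the audio is outside the scope of this sketch.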

    SYSTEM AND METHOD FOR HANDLING UNSPLIT SEGMENTS IN TRANSCRIPTION OF AIR TRAFFIC COMMUNICATION (ATC)

    Publication No.: US20230352042A1

    Publication date: 2023-11-02

    Application No.: US17806565

    Filing date: 2022-06-13

    CPC classification number: G10L21/10 G10L15/26 G10L25/78 G10L25/93

    Abstract: Systems and methods are provided for a transcription system with voice activity detection (VAD). The system includes a VAD module to receive incoming audio and generate an audio segment, and a speech decoder with a split predictor to perform, in a first pass, a decode operation to transcribe text from the audio segment into a message. If the split predictor's content-based analysis determines that the message does not contain a split point, the speech decoder forwards the message for display. If the analysis determines that the message contains a split point, the speech decoder performs, in a second pass, a re-decode operation to transcribe text from the audio segment based on the split point, which the split predictor configures within the audio domain of the audio segment, and then forwards the message for display.
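
The two-pass control flow described in this abstract can be sketched in Python as follows. The function `transcribe_segment` and the `decode` and `predict_split` callables are hypothetical stand-ins for the speech decoder and the split predictor; the sketch assumes the predictor returns a split position as an index into the audio samples, or `None` when no split is needed.

```python
from typing import Callable, List, Optional, Sequence


def transcribe_segment(
    audio: Sequence,
    decode: Callable[[Sequence], str],
    predict_split: Callable[[str, int], Optional[int]],
) -> List[str]:
    """Return one message, or two if the segment holds two utterances."""
    # First pass: decode the whole VAD segment.
    message = decode(audio)

    # Content-based analysis of the text, mapped to an audio position.
    split_at = predict_split(message, len(audio))
    if split_at is None or not 0 < split_at < len(audio):
        return [message]  # no usable split point: forward as-is

    # Second pass: re-decode each side of the split point separately.
    return [decode(audio[:split_at]), decode(audio[split_at:])]
```

Treating an out-of-range split position as "no split" is a defensive choice of this sketch, not something the abstract specifies.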

    Systems and methods for traffic prioritization

    Publication No.: US09076326B2

    Publication date: 2015-07-07

    Application No.: US13772985

    Filing date: 2013-02-21

    Abstract: Methods and apparatus are provided for traffic prioritization of surrounding air traffic for display onboard an aircraft. The apparatus includes a traffic data source configured to supply surrounding traffic data. The apparatus includes a traffic control module coupled to receive user selection data from the user input device and the surrounding traffic data. The traffic control module can be configured to determine a prioritization zone for prioritizing the surrounding air traffic to identify air traffic preceding the aircraft based on the user selection data, the range and the vertical speed of the surrounding air traffic, and set first traffic data that includes the surrounding air traffic within the prioritization zone listed by priority and second traffic data that includes the surrounding air traffic outside of the prioritization zone listed in received sequence. The apparatus displays a graphical user interface that includes the first traffic and the second traffic data.
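
The split into a prioritized list and a received-order list might be sketched in Python as below. The zone test (a maximum range and a maximum absolute vertical speed) and the priority rule (nearest first) are assumptions; the abstract says only that the zone depends on user selection data, range, and vertical speed. The names `Traffic` and `prioritize` are hypothetical.

```python
from dataclasses import dataclass
from typing import Iterable, List, Tuple


@dataclass
class Traffic:
    callsign: str
    range_nm: float            # distance from ownship, nautical miles
    vertical_speed_fpm: float  # feet per minute, signed


def prioritize(traffic: Iterable[Traffic], max_range_nm: float,
               max_abs_vs_fpm: float) -> Tuple[List[Traffic], List[Traffic]]:
    """Split traffic into (inside zone by priority, outside zone as received)."""
    inside: List[Traffic] = []
    outside: List[Traffic] = []
    for t in traffic:
        in_zone = (t.range_nm <= max_range_nm
                   and abs(t.vertical_speed_fpm) <= max_abs_vs_fpm)
        (inside if in_zone else outside).append(t)
    # Assumed priority rule: nearest traffic first (stable for ties).
    inside.sort(key=lambda t: t.range_nm)
    return inside, outside
```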

    SYSTEM AND METHOD FOR EXTRACTING AND DISPLAYING SPEAKER INFORMATION IN AN ATC TRANSCRIPTION

    Publication No.: US20220383879A1

    Publication date: 2022-12-01

    Application No.: US17305913

    Filing date: 2021-07-16

    Abstract: A system for extracting speaker information in an ATC transcription and displaying the speaker information on a graphical display unit is provided. The system is configured to: segment a stream of audio received from an ATC and other aircraft into a plurality of chunks; determine, for each chunk, if the speaker is enrolled in an enrolled speaker database; when the speaker is enrolled in the enrolled speaker database, decode the chunk using a speaker-dependent automatic speech recognition (ASR) model and tag the chunk with a permanent name for the speaker; when the speaker is not enrolled in the enrolled speaker database, assign a temporary name for the speaker, tag the chunk with the temporary name, and decode the chunk using a speaker independent speech recognition model; format the decoded chunk as text; and signal the graphical display unit to display the formatted text along with an identity for the speaker.
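
The per-chunk routing between enrolled and unenrolled speakers can be illustrated with a short Python sketch. Speaker identification from audio is abstracted into a `speaker_id` key, and the class `SpeakerRouter`, the model tags, and the reuse of a temporary name when the same unenrolled speaker returns are assumptions made for illustration.

```python
from typing import Dict, Tuple


class SpeakerRouter:
    """Chooses a display name and an ASR model type for each audio chunk."""

    def __init__(self, enrolled: Dict[str, str]):
        self.enrolled = enrolled              # speaker_id -> permanent name
        self.temporary: Dict[str, str] = {}   # speaker_id -> temporary name

    def route(self, speaker_id: str) -> Tuple[str, str]:
        if speaker_id in self.enrolled:
            # Enrolled speaker: permanent name, speaker-dependent model.
            return self.enrolled[speaker_id], "speaker_dependent"
        if speaker_id not in self.temporary:
            # First time this unenrolled speaker is heard: assign a name.
            self.temporary[speaker_id] = f"Speaker {len(self.temporary) + 1}"
        return self.temporary[speaker_id], "speaker_independent"
```

The returned name would tag the chunk, and the model tag would select the decoder before the text is formatted for display.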
