Abstract:
A system with a client-server architecture for Automatic Speech Recognition (ASR) applications, comprising a client side and a server side. The client side includes: a distributed front end for converting acoustic waves to feature vectors representing the properties of the acoustic waves using digital signal processing (DSP); a voice activity detector (VAD) for separating speech from non-speech acoustic signals and for environmental compensation; and an adaptor for Web Sockets that pipelines chunked feature vectors to the server side. The server side includes: a web layer utilizing HTTP protocols and including a Web Server having a Servlet Container for interacting with Java servlets and for managing the lifecycle of the servlets, using Web Sockets as a transducer that converts Web Socket packets into messages; a Message-Oriented Middleware, connected to the Web Server, for feature vector delivery and decoding and for load balancing between the client side and the server side; a recognition server and an adaptation server, both connected to the Message-Oriented Middleware, the adaptation server being connected via an adaptation channel and the recognition server interacting with the client side via a recognition channel and a Distributed Frontend (DFE); an intermediate layer for transport, communication and integration; a Speech processing server consisting of a Recognition Server for instantiating a recognition channel per client and for establishing a communication channel between the web layer and the recognition channel; and a Persistent layer for storing the acoustic models of each client in a hierarchical order, the Persistent layer comprising a Language Knowledge Base that is connected to the recognition server and to the adaptation server and that includes a dictionary, acoustic models, statistical language models and language patterns.
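
As an illustration only, the client-side step of pipelining chunked feature vectors, and the server-side step of turning packets into messages, might be sketched as follows. The abstract does not specify a packet layout, so the header format, the function names encode_chunks and decode_packet, and the message fields are all assumptions; the sketch only produces and parses the binary payloads that a Web Socket adaptor could carry, and it opens no actual socket.

```python
import struct
from typing import Iterator, Sequence

# Hypothetical packet header (not from the source):
# sequence number (uint32), vectors in chunk (uint16), vector dimension (uint16).
HEADER = struct.Struct("<IHH")


def encode_chunks(vectors: Sequence[Sequence[float]], chunk_size: int) -> Iterator[bytes]:
    """Yield one binary packet per group of at most chunk_size feature vectors."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    for seq, start in enumerate(range(0, len(vectors), chunk_size)):
        chunk = vectors[start:start + chunk_size]
        dim = len(chunk[0])
        if any(len(v) != dim for v in chunk):
            raise ValueError("all vectors in a chunk must share one dimension")
        # Flatten the chunk and store the values as little-endian float32.
        flat = [x for v in chunk for x in v]
        yield HEADER.pack(seq, len(chunk), dim) + struct.pack(f"<{len(flat)}f", *flat)


def decode_packet(packet: bytes) -> dict:
    """Convert one binary packet into a message for the middleware (assumed shape)."""
    seq, count, dim = HEADER.unpack_from(packet, 0)
    flat = struct.unpack_from(f"<{count * dim}f", packet, HEADER.size)
    vectors = [list(flat[i * dim:(i + 1) * dim]) for i in range(count)]
    return {"seq": seq, "vectors": vectors}
```

In this sketch the sequence number lets the server side restore chunk order before decoding, and float32 storage halves the payload relative to Python's native doubles at the cost of some precision; both are design assumptions rather than details taken from the abstract.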