VoiceXML Architecture

A VoiceXML application consists of several components, as shown in Figure 1:
Application Server: Typically a Web server, which runs the application logic, and may contain a database or interfaces to an external database or transaction server.
VoiceXML Telephony Server: A platform that runs a VoiceXML interpreter that acts as a client to the application server. The interpreter understands VoiceXML dialogs and controls speech and telephony resources. These resources include ASR, TTS, audio play and record functions, as well as a telephone network interface.
Internet-style network: A TCP/IP-based packet network that connects the application server and telephony server via HTTP.
Telephone Network: Typically the Public Switched Telephone Network (PSTN), but could be a private telephone network (e.g. PBX), or VoIP packet network.
Caller: Any telephone that can connect to the telephone network.
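
The client/server relationship between the telephony server and the application server can be illustrated with a minimal sketch of the application-server side. It assumes a plain HTTP GET from the VoiceXML interpreter and answers with a small VoiceXML 2.0 document. The function name build_greeting_vxml, the class VoiceXMLHandler, the prompt text, and the address 127.0.0.1:8080 are hypothetical choices for illustration, not part of any particular platform.

```python
from http.server import BaseHTTPRequestHandler, HTTPServer
import xml.etree.ElementTree as ET

# Namespace defined by the W3C VoiceXML 2.0 Recommendation.
VXML_NS = "http://www.w3.org/2001/vxml"


def build_greeting_vxml(prompt_text: str) -> bytes:
    """Return a minimal VoiceXML 2.0 document with one prompt (hypothetical helper)."""
    ET.register_namespace("", VXML_NS)  # serialize with a default xmlns
    vxml = ET.Element(f"{{{VXML_NS}}}vxml", {"version": "2.0"})
    form = ET.SubElement(vxml, f"{{{VXML_NS}}}form")
    block = ET.SubElement(form, f"{{{VXML_NS}}}block")
    prompt = ET.SubElement(block, f"{{{VXML_NS}}}prompt")
    prompt.text = prompt_text  # the telephony server renders this via TTS
    return ET.tostring(vxml, encoding="utf-8", xml_declaration=True)


class VoiceXMLHandler(BaseHTTPRequestHandler):
    """Application-server role: answer the interpreter's HTTP GET with VoiceXML."""

    def do_GET(self):
        # Assumption: every path returns the same dialog; a real application
        # would run its logic and query its database here.
        body = build_greeting_vxml("Welcome. Your call has been connected.")
        self.send_response(200)
        self.send_header("Content-Type", "application/voicexml+xml")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)


if __name__ == "__main__":
    # Assumed address and port, for local experimentation only.
    HTTPServer(("127.0.0.1", 8080), VoiceXMLHandler).serve_forever()
```

In this sketch the application server never touches speech or telephony resources. It only returns markup over HTTP, and the interpreter on the telephony server is what plays the prompt to the caller.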