![]() ![]() ![]() ![]() ![]() |
||||||||||||||||||||||||||||||||||
|
||||||||||||||||||||||||||||||||||
|
|
CS 410/510 SLA-Spoken Language Applications Saturdays 9:00 -12:30 Instructor Student Contest Course motivation VoiceXML is affecting the speech industry dramatically. VoiceXML is changing how developers create speech-enabled Internet applications. By hiding many low-level details, developers use VoiceXML to create speech-enabled applications by specifying high-level prompt messages for menus and forms rather than the detailed, procedural level of programming languages. Decreasing the programming time and effort enables developers to perform additional iterations of usability testing and design refinement. VoiceXML is lowering the entry barrier to creating speech applications. While VoiceXML makes it easy to create a speech-enabled application, it is difficult to create a good one. An HTML programmer can easily learn the how to write VoiceXML scripts, but designing a usable VoiceXML form or menu is still an art more than a science. VoiceXML language manuals do not offer much in the way of advice for how to phrase a prompt, what to include in the grammar describing what a user may say in response to a prompt, and what to do if the user does not respond appropriately. This course answers fundamental speech user interface questions, including:
VoiceXML makes iterative design and testing of speech-enabled applications possible. Developers can quickly mock up designs for evaluation by prospective users. Developers quickly identify and fix trouble spots. VoiceXML hides the complex programming details, and enables the developer to concentrate on developing the overall design refining the detailed wording of prompts and messages spoken to the user. VoiceXML does NOT displace the need for user testing; it makes it possible to perform more user testing. Designing voice user interfaces is still an art. This course presents numerous guidelines suggestions, and conventional wisdom, but each day voice dialog designers are learning more so guidelines are evolving quickly. The VoiceXML language itself is evolving. The W3C Voice Browser Working Group meets at least three hours each week to discuss and modify VoiceXML and its related languages. While this course uses an early version of VoiceXML 2.0, some of the examples may be out of date by the time you read them. For the latest language specifications of VoiceXML and its related languages, see http://www.w3.org/voice With the introduction of mobil devices that integrate the functionality of cell phones and PDAs, multimodal applications that provide both a visual and verbal user interfaces will be popular. "X+V" for speech enabling (X)HTML pages provide one approach for implementing this important and exciting new class of applications Course Goal Course Content
Prerequisite Grading
Texts: VoiceXML Guide, a CD rom home study guide, available from http://www.vxmlguide.com or the PSU bookstore. X+V Programmer's Guide ftp://ftp.software.ibm.com/software/pervasive/info/multimodal /XHTML_voice_programmers_guide.pdf State Chart XML W3C Working Draft http://www.w3.org/TR/2006/WD-scxml-20060124/ Class lecture notes may be downloaded from http://www.larson-tech.com/CS410_F07/ Course Schedule
Project 1:Download the Prophecy voice platform http://www.voxeo.com/prophecy/ (medium, 102 MB file) to your PC and implement the voice equivalent of the following paper form: Ajax University Student Name____________________ Project 2:Step 1: Turn in a written proposal describing (a) the purpose of your proposed VoiceXML speech application, (b) An example scenario of the use of your application, and (c) the target use of your application. Step 2. After instructor approval, implement and demonstrate your project. Project 3:Step 1: Turn in a written proposal describing (a) the purpose of your proposed "X+V" multimodal application, (b) An example scenario of the use of your application, and (c) the target use of your application. Step 2. After instructor approval, implement and demonstrate your multimodal project.
|
|||||||||||||||||||||||||||||||||
![]() |
||||||||||||||||||||||||||||||||||