Larson Technical Services: Voice Technology Consulting

Exercise 3.4

Construct a language identification prompt by doing the following:

A. Choose an area somewhere in the world and identify the primary spoken national language and two secondary languages used in that area.
In Minneapolis, Minnesota, U.S.A. English is the main language. Several individuals also speak Norwegian.

B. Construct an explicit selection menu that asks callers which of the three languages they prefer to speak. Assume that a native speaker of the language will prerecord each prompt, so you only need to

specify what the native speaker should say. e.g., write the words for the prompt. Assume that the caller will respond with the English name of the spoken national language.


<menu id = "language">
     <prompt>do you want to speak English or Norwegian </prompt>
     <prompt>vil de snakker engelisk eller norsk </prompt>

     <choice next  = "#english">
          <grammar type="application/grammar+xml" version="1.0">
               <rule id = "english" scope = "public"> english </rule>
          </grammar>
     </choice>

     <choice next  = "#norwegianV>
          <grammar type="application/grammar+xml" version="1.0">
               <rule id = "norwegian" scope = "public"> norwegian </rule>
          </grammar>
     </choice>

</menu>

C. Repeat (b). Assume that an English speech synthesizer will speak each prompt. You will need to use a phoneme markup command to describe the phonemes in the prompt. The phoneme command is

described in the Speech Synthesis Markup Language document on the W3C Voice Browser Working Group web site: http://www.w3.org/voice. The phoneme markup command requires the use of a

phonetic language. Use the Worldbet phonetic language described on the Oregon Health and Sciences University web site at http://cslu.cse.ogi.edu/tutordemos/SpectrogramReading/ipa/ipachars.html.


<menu id = "language">
     <prompt> do you want to speak English or Norwegian </prompt>

     <prompt>
          <phoneme alphabet="worldbet" 
                    ph="v E l"> vil </phoneme>
          <phoneme alphabet="worldbet" 
                    ph="d I:"> de </phoneme>
          <phoneme alphabet="worldbet" 
                    ph="s n A kh E r"> snakker </phoneme>
          <phoneme alphabet="worldbet" 
                    ph="n u s kh"> norsk </phoneme>
          <phoneme alphabet="worldbet" 
                    ph="e l l e r"> eller </phoneme>
          <phoneme alphabet="worldbet"
                    ph="e N e l e s kh "> engelisk</phoneme>

     <choice next  = "#english">
          <grammar type="application/grammar+xml" version="1.0">
               <rule id = "english" scope = "public"> english </rule>
          </grammar>
     </choice>

     <choice next  = "#norwegian">
          <grammar type="application/grammar+xml" version="1.0">
               <rule id = "norwegian" scope = "public"> norwegian </rule>
          </grammar>
     </choice>

</menu>

D. Repeat (b) Assume that the speech recognition engine recognizes the local pronunciation of the names of the three spoken national languages. Write down the local pronunciation of the names of the

two spoken national languages using the Worldbet phonetic language. (In practice, this notation is placed in a lexicon, which is used by the speech recognition engine to recognize words.) For example,

“English” would be denoted as “E N l I S”.

The Norwegian word for the Norwegian language is norsk, pronounced “n u s k”

The Norwegian word for English is engelisk, prounced “e N e l e s kh”

Return to the Instructor’s Page

Comments are off for this post