Voice Assistant Components

The various components of the Assistant will be covered here

The Slang Voice Assistant has the following components built into it.

UI Components

Trigger (the microphone button)

The microphone button that users of the app can use to start the Voice Assistant

Onboarding (language selection and microphone permission)

The UI components that get the app's end-users onboarded with the Voice Assistant.

  • A customizable coachmark to bring attention to the Trigger

  • An onboarding wizard that does the following -

    • Educate the user about what they will get out of the Assistant. Customers can completely customize this step with their own image

    • Select the language they want to use to interact with the Assistant

    • Ask the user to grant microphone permission

Surface (the primary interactive component)

The visual component that overlays on top of the app and is what the users will be interacting with every time the Assistant gets triggered (either programmatically by the app or by the end-user themselves by clicking on the trigger). The surface consists of the following sub-components -

  • The feedback section - The area where the user's spoken text is shown as well as what the Assistant wants to ask

  • State and Progress indicators - The bottom part of the surface is used to visually indicate what state the Assistant is in at any point. It could be speaking, listening, or processing your request.

  • Hints - The section of the screen which helps the end-users know what to speak at any point in time

  • Mute Button - By default, the Assistant will speak out things to the user. It could be a greeting or a question (what we call a prompt) or a statement to inform the user about something.

  • Language selection - The end-user can change the Assistant language at any time by clicking on the language selection button

Help

This section is used to train the user about what they can do with the Assistant. This gets shown automatically when the end-user repeatedly fails to speak a valid command with the Assistant or when the "?" icon is clicked inside the surface.

Backend components

The Slang Assistant implicitly bundles the below technology components as part of its backend.

  • ASR

  • NLP

  • TTS

  • Translation

As a developer, one does not need to think in terms of these deep tech components, as Slang will abstract it out for you in the most natural way.

But where required, the Slang platform will expose knobs and configuration options to help improve the quality of the overall Voice Assistant experience specific to your app.

Last updated