Virtual Assistant (with Rasa)#
This Virtual Assistant (with Rasa) sample application demonstrates the integration of Rasa and the Riva Speech Service in the form of a weather chatbot web application. This sample is available in two options:
Option 1: Riva ASR + Riva TTS + Riva NLP + Rasa dialog manager
Option 2: Riva ASR + Riva TTS + Rasa NLU + Rasa dialog manager
To learn more about Rasa, follow this link.
Implementation#
At a high level, the integration takes advantage of the native API support of Rasa and the gRPC support of Riva. The Weatherbot Client coordinates the workflow with Riva Services and Rasa, then interacts with the end user via a web UI.
- Container 1: Riva AI Services
  - Exposes Speech Services (ASR/NLP/TTS) over gRPC endpoints
  - Needs a GPU
- Container 2: Riva Samples’ Riva-Rasa Chatbot
  - Functionality
    - Exposes Rasa functionality over API endpoints. Responsible for dialog management with Rasa as well as NLP with Riva or NLU with Rasa
    - Contains the Weatherbot Client application (web UI and web service)
  - Rasa
    - Up to two instances
      - Rasa Server
      - Rasa Action (if applicable)
    - Does not need GPUs but can be deployed on GPUs for performance
  - Weatherbot Client
    - Includes the Riva Python library installed
    - Communicates with Riva AI Services and Rasa over gRPC and REST API endpoints respectively
    - Pipelines ASR, NLP, TTS, and dialog manager functionalities
    - Does not need GPUs
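To make the REST side of this split concrete, here is a minimal sketch of how a client could talk to a Rasa server over HTTP. It assumes Rasa's REST channel (`/webhooks/rest/webhook`) is enabled on the server; the sample's own `rasa.py` may use a different endpoint or HTTP library. The function names `build_rasa_request`, `extract_text`, and `ask_rasa` are hypothetical and do not come from the sample.

```python
import json
from typing import Any, Dict, List
from urllib import request


def build_rasa_request(rasa_api_url: str, sender_id: str, message: str) -> request.Request:
    """Build a POST request for Rasa's REST channel.

    rasa_api_url is a "host:port" string such as "10.20.30.40:5005".
    Assumption: the REST channel is enabled on the Rasa server.
    """
    url = f"http://{rasa_api_url}/webhooks/rest/webhook"
    body = json.dumps({"sender": sender_id, "message": message}).encode("utf-8")
    return request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def extract_text(rasa_response: List[Dict[str, Any]]) -> str:
    """Join the text of every bot message in a REST channel response."""
    return " ".join(msg["text"] for msg in rasa_response if "text" in msg)


def ask_rasa(rasa_api_url: str, sender_id: str, message: str, timeout: float = 10.0) -> str:
    """Send one user message to Rasa and return the combined reply text."""
    req = build_rasa_request(rasa_api_url, sender_id, message)
    with request.urlopen(req, timeout=timeout) as resp:
        return extract_text(json.load(resp))
```

With a Rasa server running, `ask_rasa("10.20.30.40:5005", "user-1", "How's the weather?")` would return the bot's reply as a single string. The gRPC side (Riva ASR and TTS) goes through the Riva Python client library and is not shown here.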
 
 
Architecture#
 

Code Structure#
This section shows the high-level code structure of the Weatherbot Client (in the rasa-riva-weatherbot-webapp/riva folder).
- asr.py: This file contains the functionality to make the gRPC call to Riva ASR with an audio snippet, using the Riva Python Client libraries, and to return the text transcript.
  - ASR is used in streaming mode.
- rasa.py: This file contains the functionality to make an API call to Rasa with the user input and sender ID, and to return a text response obtained by processing the Rasa response object.
- tts.py and tts_stream.py: These files contain the functionality to make the gRPC call to Riva TTS with a text snippet, using the Riva Python Client libraries, and to return the corresponding audio speech.
  - TTS can be used in either Batch or Streaming mode, depending on whether tts.py or tts_stream.py is used. This can be set by changing the import statements in lines 12 and 13 in virtual-assistant-rasa/rasa-riva-weatherbot-webapp/riva/chatbot/chatbot.py.
- chatbot.py: This file contains the Chatbot class, which is responsible for pipelining all the ASR, TTS, and Rasa operations.
  - One instance of the Chatbot class is created per conversation.
  - The pipeline is as follows:
    - ASR is used in Streaming mode, so it runs as a background operation. By default, it listens to the audio of the user speaking into the microphone.
    - ASR calls Rasa with the transcribed text either automatically when the user stops talking (when the AutoSubmit flag is enabled) or when the Submit button is hit.
    - When called, Rasa internally calls Riva NLP or Rasa NLU with the transcribed text to fetch the intents and slots, which are then used by the Rasa dialog manager to get the final response text. The response text is returned.
    - TTS is automatically called with Rasa’s response text, if System Speech is not muted.
    - TTS gets the audio snippet and plays it back to the user on the speakers.
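The pipeline described above can be sketched in a few lines. This is a simplified illustration, not the sample's actual `Chatbot` class: the method names `on_final_transcript` and `submit`, the constructor parameters, and the two injected callables (standing in for `rasa.py` and `tts.py`) are all assumptions made for the example.

```python
from typing import Callable, Optional


class Chatbot:
    """Hypothetical, simplified stand-in for the sample's Chatbot class."""

    def __init__(
        self,
        ask_rasa: Callable[[str, str], str],   # stands in for the REST call in rasa.py
        synthesize: Callable[[str], bytes],    # stands in for the gRPC call in tts.py
        sender_id: str,
        auto_submit: bool = True,
        system_speech_muted: bool = False,
    ) -> None:
        self.ask_rasa = ask_rasa
        self.synthesize = synthesize
        self.sender_id = sender_id
        self.auto_submit = auto_submit
        self.system_speech_muted = system_speech_muted
        self.pending_transcript: Optional[str] = None
        self.last_reply: Optional[str] = None

    def on_final_transcript(self, transcript: str) -> Optional[bytes]:
        """Called when streaming ASR reports that the user stopped talking."""
        self.pending_transcript = transcript
        if self.auto_submit:
            return self.submit()
        return None  # wait for the Submit button

    def submit(self) -> Optional[bytes]:
        """Send the pending transcript to Rasa, then synthesize the reply."""
        if not self.pending_transcript:
            return None
        text, self.pending_transcript = self.pending_transcript, None
        self.last_reply = self.ask_rasa(text, self.sender_id)
        if self.system_speech_muted:
            return None  # text reply only, no audio
        return self.synthesize(self.last_reply)


# Usage with stub services in place of Rasa and Riva TTS.
bot = Chatbot(
    ask_rasa=lambda text, sender: "For which location?",
    synthesize=lambda reply: reply.encode("utf-8"),
    sender_id="user-1",
)
audio = bot.on_final_transcript("How's the weather?")
```

Passing the Rasa and TTS calls in as callables keeps the sketch runnable without a Riva or Rasa server; in the real application those steps are network calls.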
 
 
Requirements and Setup#
Requirements:#
Before you try running the Riva client, ensure you meet the following requirements:
- You have access and are logged into NVIDIA NGC. For step-by-step instructions, refer to the NGC Getting Started Guide. 
- Python 3.8 (Support for other Python versions will be added in a future release). 
Setup:#
- Clone Riva Sample Apps repository: 
	git clone https://github.com/nvidia-riva/sample-apps.git
- Enter Riva and Rasa Virtual Assistant directory: 
	cd sample-apps/virtual-assistant-rasa
- Create parent directory for all Python virtual environments we will be using for this sample: 
	mkdir pythonenvs
- Create Python virtual environment for Rasa text-based chatbot and install necessary libraries:
  - Create and activate Python virtual environment for Rasa text-based chatbot:
	python3 -m venv pythonenvs/rasa
	. pythonenvs/rasa/bin/activate
  - Upgrade pip and install Rasa dependencies. requirements_rasa.txt captures all the Python dependencies needed for Rasa text-based chatbot:
	pip3 install -U pip
	pip3 install -r requirements_rasa.txt
  - Deactivate Rasa Python virtual environment:
	deactivate
- Create Python virtual environment for Weatherbot web application, which uses Riva client, and install necessary libraries:
  - Create and activate Python virtual environment for Weatherbot web application:
	python3 -m venv pythonenvs/client
	. pythonenvs/client/bin/activate
  - Upgrade pip:
	pip3 install -U pip
  - Install Riva client libraries:
    - Download the Riva Quick Start scripts, if not already done. x.y.z is the Riva Speech Skills version number. The latest Riva version number can be found in the Riva Quick Start Guide’s Local Deployment using Quick Start Scripts section.
	ngc registry resource download-version "nvidia/riva/riva_quickstart:x.y.z"
    - Install the Riva client library.
	cd riva_quickstart_v<x.y.z>
	pip install riva_api-<x.y.z>-py3-none-any.whl
  - Install weatherbot web application dependencies. requirements_client.txt captures all other Python dependencies needed for weatherbot web application:
	pip3 install -r requirements_client.txt # For Python 3.8
  - Deactivate Weatherbot web application’s Python virtual environment:
	deactivate
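Since the sample lists Python 3.8 as a requirement and relies on two separate virtual environments, a quick sanity check of the active interpreter can save some debugging. The following is a small optional sketch, not part of the sample; the helper names `is_supported_python` and `in_virtualenv` are made up for this example.

```python
import sys

REQUIRED_PYTHON = (3, 8)  # the version listed in the requirements above


def is_supported_python(version_info=sys.version_info) -> bool:
    """Return True if the major.minor version matches the required one."""
    return tuple(version_info[:2]) == REQUIRED_PYTHON


def in_virtualenv(prefix=None, base_prefix=None) -> bool:
    """Return True if the interpreter runs inside a venv-style environment.

    Inside a venv, sys.prefix points at the environment while
    sys.base_prefix points at the base installation.
    """
    prefix = sys.prefix if prefix is None else prefix
    base_prefix = sys.base_prefix if base_prefix is None else base_prefix
    return prefix != base_prefix


if __name__ == "__main__":
    print("Python 3.8:", is_supported_python())
    print("Inside a virtual environment:", in_virtualenv())
```

Running this script after activating either pythonenvs/rasa or pythonenvs/client should report that a virtual environment is active.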
Running the Demo#
- Start the Riva Speech Server, if not already done. Follow the steps in the Riva Quick Start Guide. 
- Navigate to the Riva and Rasa Virtual Assistant directory in the Riva sample-apps GitHub repository that you cloned in step 1 of the Setup section.
	cd sample-apps/virtual-assistant-rasa
- Modify the API endpoint settings. There are two locations in the code base that have to be configured for inter-service communication:
  - rasa-weatherbot/endpoints.yml for the IP address of the machine running the Rasa Action server endpoint:
	# uncomment and populate the section below
	action_endpoint:
	  url: "http://[rasa server host IP]:5055/webhook"
    For example:
	# uncomment and populate the section below
	action_endpoint:
	  url: "http://10.20.30.40:5055/webhook"
  - config.py for the IP address of the machine running the Riva Speech Skills server and the IP address of the machine running the Rasa chatbot server:
	# uncomment and populate the section below
	riva_config = {
	    ...
	    "RIVA_SPEECH_API_URL": "[riva speech service host IP]:50051",
	    ...
	}
	# uncomment and populate the section below
	rasa_config = {
	    ...
	    "RASA_API_URL": "[rasa server host IP]:5005",
	    ...
	}
    For example, if the Riva Speech Skills Service and Rasa chatbot server are both running on the same machine, it would be:
	# uncomment and populate the section below
	riva_config = {
	    ...
	    "RIVA_SPEECH_API_URL": "10.20.30.40:50051",
	    ...
	}
	# uncomment and populate the section below
	rasa_config = {
	    ...
	    "RASA_API_URL": "10.20.30.40:5005",
	    ...
	}
    If the Riva Speech Skills Service and Rasa chatbot server are running on different machines, it would be:
	# uncomment and populate the section below
	riva_config = {
	    ...
	    "RIVA_SPEECH_API_URL": "10.20.30.40:50051",
	    ...
	}
	# uncomment and populate the section below
	rasa_config = {
	    ...
	    "RASA_API_URL": "20.30.40.50:5005",
	    ...
	}
- Start the Rasa Action server.
  - Open the config.py script. In the dictionary assigned to the riva_config variable, update the WEATHERSTACK_ACCESS_KEY field with your Weatherstack API key. A new Weatherstack API key can be obtained from the Weatherstack website.
  - Activate the Python virtual environment for Rasa text-based chatbot.
	. pythonenvs/rasa/bin/activate
  - Navigate to the rasa-weatherbot directory.
	cd rasa-weatherbot
  - Run the Rasa Action server.
	rasa run actions --actions actions
- Start the Rasa server in a different terminal.
  - Activate the Python virtual environment for Rasa text-based chatbot.
	. pythonenvs/rasa/bin/activate
  - Navigate to the rasa-weatherbot directory.
	cd rasa-weatherbot
  - Run the Rasa training.
    - For Riva NLP: Train the Rasa Core model.
	rasa train -c config/config_rivanlp.yml -d domain/domain_rivanlp.yml --out models/models_rivanlp/ --data data/nlu_rivanlp.yml data/rules_rivanlp.yml data/stories_rivanlp.yml
    - For Rasa NLU: Train the Rasa NLU and Rasa Core models.
	rasa train -c config/config_rasanlp.yml -d domain/domain_rasanlp.yml --out models/models_rasanlp/ --data data/nlu_rasanlp.yml data/rules_rasanlp.yml data/stories_rasanlp.yml
  - Run the Rasa server.
    - For Riva NLP:
	rasa run -m models/models_rivanlp/ --enable-api --log-file out.log --endpoints endpoints.yml
    - For Rasa NLU:
	rasa run -m models/models_rasanlp/ --enable-api --log-file out.log --endpoints endpoints.yml
 
- Start the Riva Chatbot server.
  - Activate the Python virtual environment for the weather web application.
	. pythonenvs/client/bin/activate
  - Navigate to the rasa-riva-weatherbot-webapp directory.
	cd rasa-riva-weatherbot-webapp
  - Start the Chatbot Client server.
	python3 main.py
  - Open the web UI in a web browser.
	https://[riva chatbot server host IP]:5555/rivaWeather
    For example:
	https://0.0.0.0:5555/rivaWeather
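If the web UI loads but the bot does not respond, a common cause is a wrong host or port in the endpoint settings configured in the steps above. The sketch below is an optional troubleshooting aid and not part of the sample. It takes "host:port" strings in the same form as the RIVA_SPEECH_API_URL and RASA_API_URL values and checks whether a TCP connection can be opened. The helper names `split_host_port`, `is_reachable`, and `check_endpoints` are hypothetical.

```python
import socket
from typing import Dict, Tuple


def split_host_port(address: str) -> Tuple[str, int]:
    """Split a "host:port" string such as "10.20.30.40:50051"."""
    host, _, port = address.rpartition(":")
    if not host or not port.isdigit():
        raise ValueError(f"expected 'host:port', got {address!r}")
    return host, int(port)


def is_reachable(address: str, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to the address can be opened.

    This only shows that something is listening on the port; it does not
    verify that the listener is actually Riva or Rasa.
    """
    host, port = split_host_port(address)
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def check_endpoints(endpoints: Dict[str, str], timeout: float = 2.0) -> Dict[str, bool]:
    """Check several named "host:port" endpoints and report reachability."""
    return {name: is_reachable(addr, timeout) for name, addr in endpoints.items()}
```

For example, `check_endpoints({"riva": "10.20.30.40:50051", "rasa": "10.20.30.40:5005", "rasa_actions": "10.20.30.40:5055"})` returns a dictionary mapping each name to True or False.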
Sample Conversations#
Rasa NLU: Sample 1#
Virtual Assistant: Hi, welcome to RIVA weather service. How can I help you?
User: How’s the weather?
Virtual Assistant: For which location?
User: In Seattle.
Virtual Assistant: It is Haze, Smoke in Seattle at the moment. The temperature is 31 degrees, the humidity is 30 percent and the wind speed is 20 miles per hour.
Rasa NLU: Sample 2#
Virtual Assistant: Hi, welcome to RIVA weather service. How can I help you?
User: How’s the weather in Paris?
Virtual Assistant: It is Clear in Paris at the moment. The temperature is 20 degrees, the humidity is 73 percent and the wind speed is 9 miles per hour.
Rasa NLU: Sample 3#
Virtual Assistant: Hi, welcome to RIVA weather service. How can I help you?
User: hi
Virtual Assistant: Hey! How are you?
User: who are you?
Virtual Assistant: I am a bot, powered by Rasa.
Rasa NLU: Sample 4#
Virtual Assistant: Hi, welcome to RIVA weather service. How can I help you?
User: Good Morning
Virtual Assistant: Hey! How are you?
User: I am doing great
Virtual Assistant: Great, carry on!
User: See you around
Virtual Assistant: Bye
Riva NLP: Sample 1#
Virtual Assistant: Hi, welcome to RIVA weather service. How can I help you?
User: How’s the weather?
Virtual Assistant: For which location?
User: In Seattle.
Virtual Assistant: It is Haze, Smoke in Seattle at the moment. The temperature is 31 degrees, the humidity is 30 percent and the wind speed is 20 miles per hour.
Riva NLP: Sample 2#
Virtual Assistant: Hi, welcome to RIVA weather service. How can I help you?
User: How’s the weather in Paris?
Virtual Assistant: It is Clear in Paris at the moment. The temperature is 20 degrees, the humidity is 73 percent and the wind speed is 9 miles per hour.
Riva NLP: Sample 3#
Virtual Assistant: Hi, welcome to RIVA weather service. How can I help you?
User: Is it raining in Austin?
Virtual Assistant: The precipitation is 0.4 inches in Austin at the moment.
User: What is the temperature in Toronto today?
Virtual Assistant: It is 4 degree celsius in Toronto at the moment.
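The full weather replies in Samples 1 and 2 follow a fixed sentence pattern filled in from the weather data. As a rough illustration of how such a reply could be assembled, the sketch below formats a dictionary shaped like a Weatherstack-style "current weather" response. The payload layout and the function name `describe_weather` are assumptions for this example; the sample's actual Rasa action code may build its replies differently.

```python
from typing import Any, Dict


def describe_weather(payload: Dict[str, Any]) -> str:
    """Format a weather payload into a one-line spoken reply.

    Assumed payload shape (hypothetical, modeled on a Weatherstack-style
    response): {"location": {"name": ...}, "current": {"temperature": ...,
    "humidity": ..., "wind_speed": ..., "weather_descriptions": [...]}}
    """
    current = payload["current"]
    city = payload["location"]["name"]
    condition = ", ".join(current["weather_descriptions"])
    return (
        f"It is {condition} in {city} at the moment. "
        f"The temperature is {current['temperature']} degrees, "
        f"the humidity is {current['humidity']} percent and "
        f"the wind speed is {current['wind_speed']} miles per hour."
    )


# Example payload using the values from the Seattle sample conversation.
seattle = {
    "location": {"name": "Seattle"},
    "current": {
        "temperature": 31,
        "humidity": 30,
        "wind_speed": 20,
        "weather_descriptions": ["Haze", "Smoke"],
    },
}
reply = describe_weather(seattle)
```

Keeping the template in one function makes it easy to hand the resulting string to TTS unchanged.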
Limitations#
- The provided samples are not complete virtual assistants, but are intended as simple examples of how to build basic task-oriented chatbots with Riva. Consequently, the intent classifier and slot filling models have been trained with small amounts of data and are not expected to be highly accurate. 
- The Rasa virtual assistant sample supports intents for weather, temperature, rain, humidity, sunny, cloudy and snowfall checks. It does not support general conversational queries or other domains. 
- Both the Riva NLP and Rasa NLU samples support only 1 slot for city. Neither takes into account the day associated with the query. 
- Although the Rasa servers and the chatbot servers can be hosted on different machines, the provided code does not support independent scaling of the servers. 
- These samples support up to four concurrent users. This restriction is not because of Riva, but because of the web framework (Flask and Flask-SocketIO) that is being used. The socket connection used to stream audio to the user (TTS) and from the user (ASR) is unable to sustain more than four concurrent socket connections. 
- The Rasa virtual assistant is not optimized for low latency in case of multiple concurrent users. 
- Some erratic issues have been observed with the Rasa sample on the Firefox browser. The most common issue is the TTS output being taken in as input by ASR for certain microphone gain values. 
License#
The NVIDIA Riva License Agreement is included with the product. Licenses are also available along with the model application zip file. By pulling and using the Riva SDK container, downloading models, or using the sample applications here, you accept the terms and conditions of these licenses.   
This project uses Rasa and Rasa-SDK, the license information for which can be found below:
Rasa:
Licensed under the Apache License, Version 2.0. Copyright 2020 Rasa Technologies GmbH. Copy of the license can be found here.
Rasa-SDK:
Licensed under the Apache License, Version 2.0. Copyright 2020 Rasa Technologies GmbH. Copy of the license can be found here.
