Audio2Face Controller Microservice

Overview

The Audio2Face (A2F) Controller is designed to facilitate the management and integration of the A2F microservice within larger workflows. It acts as both the origin and the destination of the A2F outputs, simplifying the interaction with the A2F service by providing a bi-directional API. This controller makes it possible to use A2F either as a standalone application or as part of a complex pipeline involving additional microservices.

Communication

The A2F Controller microservice receives its data from a bi-directional-streaming RPC. The input data is composed of:

an audio stream header containing information about the upcoming audio data, as well face parameters, post-processing options and blendshape parameters.
audio data as well as emotion data with time code to start applying the emotion

And the output data is:

an animation header containing information about the blendshape names, audio output format, etc.
blendshape data with time code, as well as audio data and camera position if any.

Detailed description of the gRPC prototypes in the grpc prototypes section.

ID management

A big difference between Audio2Face Microservice and A2F Controller Microservice is the ID management.

Audio2Face Microservice expects IDs to be given as input to the gRPC call and will serve the same ID as output for the related stream.
For A2F Controller bidirectional connection, no IDs have to be provided. This Microservice hides the IDs from clients external to the cluster.

For that reason A2F Controller takes care of generating UUIDs when communicating with Audio2Face. These UUID will fill the id fields used in the Audio2Face Microservice gRPC interface.

Audio2Face Controller Microservice

Overview

Communication

ID management

Configuration