Release Notes#

New Features#

All new A2F-2D microservice that animates a person’s portrait photo using an audio input by animating the lip motion to match that of the audio.
Supports facial characteristics including lip sync, blinking and head pose animation.
Support two modes; quality mode for higher visual fidelity and performance mode for quicker run-time on real time streaming.
Algorithmic latency of 198 ms for model priming to streaming performance for a 30FPS output as:
- Performance mode
  Latency: 22ms (L4), 9.62ms (L40)
  
  Throughput: 1 concurrent stream (L4), 3 concurrent streams (L40)
- Quality mode (intended for offline enhancements)
  Latency: 57.80ms (L4), 20ms (L40)
  
  Throughput: 0 concurrent streams (L4), 1 concurrent streams (L40)