Release Notes#

New Features#

  • All new A2F-2D microservice that animates a person’s portrait photo using an audio input by animating the lip motion to match that of the audio.

  • Supports facial characteristics including lip sync, blinking and head pose animation.

  • Support two modes; quality mode for higher visual fidelity and performance mode for quicker run-time on real time streaming.

  • Algorithmic latency of 198 ms for model priming to streaming performance for a 30FPS output as:

    • Performance mode

      • Latency: 22ms (L4), 9.62ms (L40)

      • Throughput: 1 concurrent stream (L4), 3 concurrent streams (L40)

    • Quality mode (intended for offline enhancements)

      • Latency: 57.80ms (L4), 20ms (L40)

      • Throughput: 0 concurrent streams (L4), 1 concurrent streams (L40)