Invention Title:

PERSONALIZED MACHINE-LEARNED MODEL ENSEMBLES FOR RENDERING OF PHOTOREALISTIC FACIAL REPRESENTATIONS

Publication number:

US20250124653

Publication date:

2025-04-17

Section:

Physics

Class:

G06T17/20

Inventors:

Hany A. Heikal Aurora, CO, United States

Christopher Glen Turner Aurora, CO, United States

Dell Lawrence Wolfensparger Seaview, WA, United States

Applicant:

Charter Communications Operating, LLC St. Louis, MO, United States

Smart overview of the Invention

The patent application describes a method for creating photorealistic facial representations using personalized machine-learned model ensembles. It focuses on processing video data that depicts a user's face to generate a realistic 3D representation. This process involves multiple machine-learned models, each trained to handle different aspects of the facial representation, such as mesh generation, texture creation, and subsurface anatomy modeling.

Model Ensemble

The system employs a user-specific ensemble of machine-learned models to achieve photorealism. These models include a mesh representation model for 3D polygonal structures, a texture representation model for facial textures, and subsurface anatomical models for underlying facial features. Each model in the ensemble is optimized using a loss function that evaluates its output against desired results, ensuring high accuracy and realism in the final representation.

Applications

This technology is particularly useful in video-based communications, where realistic avatars can enhance virtual interactions. By generating detailed and lifelike avatars, users can engage more effectively in virtual worlds and videoconferences. The system also allows for capturing and rendering unique microexpressions, adding depth to virtual avatars' interactions with real-time dynamics.

Technical Benefits

A significant advantage of this approach is its potential to reduce bandwidth requirements in video communications. By transmitting only essential data needed to render a 3D avatar instead of full video streams, network load is minimized. This is particularly beneficial in environments with fluctuating network strength or when users face bandwidth limitations from their Internet Service Providers.

Practical Considerations

The described technology addresses common issues like signal strength fluctuations and high bandwidth costs associated with traditional video communications. By offering an efficient alternative through avatar-based representations, it enhances user experience by maintaining quality even under suboptimal network conditions. This makes it an attractive solution for improving the reliability and accessibility of videoconferencing.