Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Any potential solution to improve the inference speed and quality if I have depth information and data from previous frames? #66

Open
felixshing opened this issue Jul 8, 2024 · 0 comments

Comments

@felixshing
Copy link

Thank you for your outstanding work!

I understand that the current solution operates on a single-frame basis with 2D input, similar to GeneFace++. While we have a video-driven solution, it appears that the inference remains single-frame basis.

I am exploring the application of these audio-to-face solutions within a 3D video streaming system, utilizing depth sensors to capture data. With depth information and data from previous frames, I believe it is possible to accelerate inference and enhance reconstruction quality.

I would appreciate any insights or advice on this approach. Thank you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant