This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Won3wan32 on 2024-09-05 05:31:22+00:00.


TL;DR: we propose an end-to-end, audio-only conditioned video diffusion model named Loopy. Specifically, we design an inter- and intra-clip temporal module and an audio-to-latents module, enabling the model to leverage long-term motion information in the data to learn natural motion patterns and to improve the correlation between audio and portrait movement. This method removes the need for the manually specified spatial motion templates that existing methods use to constrain motion during inference, delivering more lifelike, high-quality results across various scenarios.
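The post does not include implementation details, but the core idea of the inter- and intra-clip temporal module — letting each frame in the current clip attend over both its own clip and features from preceding clips — can be sketched as plain cross-frame attention. Everything below (function names, feature dimension, the use of a single attention layer) is an illustrative assumption, not the paper's actual architecture:

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def temporal_attention(intra, inter, d=64, seed=0):
    """Toy inter-/intra-clip temporal attention (illustrative only).

    intra: (T, d) per-frame features of the current clip
    inter: (M, d) per-frame features from preceding clips (long-term motion)

    Each current-clip frame attends over the concatenation of long-term
    context and its own clip, so motion information can flow across clip
    boundaries without any spatial motion template.
    """
    rng = np.random.default_rng(seed)
    # Random projection weights stand in for learned Q/K/V projections.
    Wq, Wk, Wv = (rng.standard_normal((d, d)) / np.sqrt(d) for _ in range(3))
    ctx = np.concatenate([inter, intra], axis=0)   # (M+T, d) keys/values
    q, k, v = intra @ Wq, ctx @ Wk, ctx @ Wv
    attn = softmax(q @ k.T / np.sqrt(d))           # (T, M+T) attention weights
    return attn @ v                                # (T, d) updated frame features
```

In the real model these projections are learned and embedded inside the diffusion backbone; the sketch only shows the data flow that lets long-term motion condition the current clip.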