LLM-guided instance-level image manipulation with diffusion u-net cross-attention maps
(2024)
Presentation / Conference Contribution
The advancement of text-to-image synthesis has introduced powerful generative models capable of creating realistic images from textual prompts. However, precise control over image attributes remains challenging, especially at the instance level. Whil... Read More about LLM-guided instance-level image manipulation with diffusion u-net cross-attention maps.