Abstract
Due to the significant generative capabilities of GenAI (generative artificial intelligence), the art community has actively embraced it to create painterly content. Large text-to-image models can quickly generate aesthetically pleasing outcomes. However, the process can be non-deterministic and often involves tedious trial-and-error as users struggle to formulate effective prompts to achieve their desired results. This paper describes a generative approach that empowers users to easily work with a large text-to-image (TTI) model to create their preferred painterly content. The authors propose a large model personalization method, namely Semantic Injection, to personalize a large TTI model in a given specific artistic style, i.e., Kandinsky’s paintings in Bauhaus era, as the Artist Model. Through working with a Kandinsky expert, the authors first establish a semantic descriptive guideline and a TTI dataset of Kandinsky style and then apply the Semantic Injection method to obtain an Artist Model of Kandinsky, empowering users to create preferred Kandinsky content in a deterministically controllable manner.
| Original language | English |
|---|---|
| Title of host publication | SIGGRAPH '24 |
| Subtitle of host publication | ACM SIGGRAPH 2024 Posters |
| Publisher | Association for Computing Machinery (ACM) |
| Pages | 1-2 |
| Number of pages | 2 |
| DOIs | |
| Publication status | Published - 25 Jul 2024 |
| Externally published | Yes |
-
When Kandinsky Met Wu Guanzhong: Ai-Fused Eastern and Western Arts
Wang, Y.-A., Zhou, A.-L., She, J. & Zhang, K., 1 Apr 2026, In: Leonardo.Research output: Contribution to journal › Article › peer-review
Open Access -
Steering Large Text-to-Image Model for Kandinsky Synthesis Through Preference-Based Prompt Optimization
Zhou, A.-L., Wu, W., Wang, Y.-A. & Zhang, K., 20 Apr 2025, Artificial Intelligence in Music, Sound, Art and Design (EvoMUSART 2025): International Conference on Computational Intelligence in Music, Sound, Art and Design (Part of EvoStar) . Springer, p. 417-433 17 p.Research output: Chapter in Book or Report/Conference proceeding › Chapter › peer-review
Open Access1 Citation (Scopus) -
Human Aesthetic Preference-Based Large Text-to-Image Model Personalization: Kandinsky Generation as an Example
Zhou, A.-L., Wang, Y.-A., Wu, W. & Zhang, K., 9 Feb 2024, (E-pub ahead of print) p. 1-9. 9 p.Research output: Contribution to conference › Paper
Open Access
Activities
- 1 Participating in an event e.g. a conference, workshop, …
-
ACM SIGGRAPH 2024
Zhou, A.-L. (Participant)
28 Jul 2024 → 1 Aug 2024Activity: Participating in or organising an event › Participating in an event e.g. a conference, workshop, …
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver