TY - CHAP
T1 - Steering Large Text-to-Image Model for Kandinsky Synthesis Through Preference-Based Prompt Optimization
AU - Zhou, Aven-Le
AU - Wu, Wei
AU - Wang, Yu-Ao
AU - Zhang, Kang
PY - 2025/4/20
Y1 - 2025/4/20
N2 - With the advancement of neural generative capabilities, the art community has increasingly embraced GenAI (Generative Artificial Intelligence), particularly large text-to-image models, for producing aesthetically compelling results. However, the process often lacks determinism and requires a tedious trial-and-error process, as users frequently struggle to devise effective prompts to achieve their desired outcomes. This paper introduces a prompting-free generative approach that applies a genetic algorithm and real-time iterative human feedback to optimize prompt generation, enabling the creation of user-preferred abstract art, e.g., Kandinsky’s Bauhaus style. The proposed two-part approach begins with constructing an Artist Model capable of deterministically generating Kandinsky paintings. The second phase integrates real-time user feedback to optimize prompt generation and obtains an “Optimized Prompting Model,” which adapts to user preferences and automatically generates prompts. Combined with the Artist Model, this approach allows users to create Kandinsky tailored to their preferences.
AB - With the advancement of neural generative capabilities, the art community has increasingly embraced GenAI (Generative Artificial Intelligence), particularly large text-to-image models, for producing aesthetically compelling results. However, the process often lacks determinism and requires a tedious trial-and-error process, as users frequently struggle to devise effective prompts to achieve their desired outcomes. This paper introduces a prompting-free generative approach that applies a genetic algorithm and real-time iterative human feedback to optimize prompt generation, enabling the creation of user-preferred abstract art, e.g., Kandinsky’s Bauhaus style. The proposed two-part approach begins with constructing an Artist Model capable of deterministically generating Kandinsky paintings. The second phase integrates real-time user feedback to optimize prompt generation and obtains an “Optimized Prompting Model,” which adapts to user preferences and automatically generates prompts. Combined with the Artist Model, this approach allows users to create Kandinsky tailored to their preferences.
U2 - 10.1007/978-3-031-90167-6_28
DO - 10.1007/978-3-031-90167-6_28
M3 - Chapter
SP - 417
EP - 433
BT - Artificial Intelligence in Music, Sound, Art and Design (EvoMUSART 2025)
PB - Springer
ER -