Image generation is a multimodal practice that extends far beyond the prompt or the final visual output. To understand this process, researchers must attend to the technological environments in which production takes place. This chapter unpacks the multimodal nature of AI image generation by distinguishing between the user interface and the underlying computational model. We argue that interfaces operate as semiotic …
Pfurtscheller, Daniel; Christ, Katharina