Le prompt comme image artificielle de l’imagination / The prompt as an artificial image of the imagination

07/2022

Théorie

En consultant les échanges sur les réseaux sociaux de personnes utilisant Dalle 2, Midjourney et Discodiffusion, on remarque la même interrogation qui revient : comment obtenir l’image que je veux à partir du texte que j’ai écrit. Le « prompt », ce texte descriptif que l’on entre dans le formulaire pour générer une image, apparait comme le secret de l’image. Les prompteurs se passent leurs formules et leurs trucs pour donner le style voulu à l’image : sujet et profondeur de champ, objectif et granulité de l’image. Des générateurs de prompts apparaissent pour faciliter le travail, tandis que des guides téléchargeables nous apprennent à devenir des bons prompteurs. Certains déduisent et copient le prompt d’autres personnes et reproduisent leur style et leur signature (j’en ai moi-même fait les frais).

Si cette interrogation semble aller de soi, car aller naturellement dans le sens du logiciel qui génère une image à partir d’un texte, il est remarquable que ce rapport entre le texte et l’image ait une histoire ancienne et soit fondatrice en Occident de la picturalité pendant des siècles. En effet, auparavant les images avaient principalement comme rôle d’illustrer un texte sacré. Jean-François Lyotard a longuement abordé cette question en la retournant par le concept de figural dans Discours Figure (1973). Il s’agissait d’apercevoir le figural occulté et tapi dans la sédimentation des discours.

Quel est donc le figural de tous ces prompts qui s’accumulent comme autant d’exemples de la pauvreté de l’imaginaire de notre époque ?

Pourquoi déterminons-nous avant toute chose ces prompts selon leur pauvreté ? C’est qu’il est remarquable que malgré les possibilités statistiques quasi infinies de ces logiciels, les images se ressemblent selon des catégories dont nous ferons la cartographie dans un prochain texte : animaux anthropomorphiques, kawai, heroic fantaisy, cartes à jouer, star wars, SF, cyberpunk, dessins de presse, etc. Certains illustrateurs comme Banksy ou Peeble semblent avoir anticipé, et nourris, ce kitsch génératif.

La valorisation du prompt dont l’image ne serait plus que l’illustration peut être interprétée comme une volonté de puissance et maitrise sur l’image dont le simulacre a été considéré depuis Platon comme indomptable, car proliférant. Or cette volonté semble occulter, et c’est là le premier abord de son figural, qu’on pourrait fort bien inverser la maitrise en reconstituant un cheminement historique.

La révolution industrielle a systématisé la conversion des étants en ressources libérant de l’énergie, c’est-à-dire une valeur indifférente à la singularité de chaque étant. La mesure de l’énergie est la même quelque soi l’étant qui libère, dans sa combustion, du travail. Les médias de masse, et en particulier la télévision, ont capturé l’attention des gens en les synchronisant sur des évènements retransmis. Le Web a consisté a enregistré les traces existentielles et a intensifié la capture de l’attention en la rendant réflexive et interpersonnelle : se donner aux autres pour se voir. L’époque qui s’ouvre avec la généralisation de la production inductive des images consiste en une capture fantasmatique de l’imaginaire des êtres humains, pour ainsi dire de leur intériorité, en leur proposant de projeter dans des textes ce qu’ils voudraient imaginer.

Ainsi ces différents logiciels donnent pour la première fois une image massive de l’imaginaire anthropique. C’est un miroir noir que nous tend l’imagination artificielle, plutôt que nous qui la pilotons, révélant notre imagination, notre désir de production d’images à partir d’un cadre programmatique déterminé. Ce cadre n’est pas indifférent, il constitue une simplification de nos facultés tout autant qu’une condition de sa capture qui retournera sur elle-même, l’alimentant dans une boucle récursive.

En effet, lorsque nous voyons ces images, nous ne regardons pas le résultat d’une machine, mais le croisement entre un stock culturel commun (le dataset) et une intentionnalité désirante : produire une image à partir d’un texte. Le texte peut être considéré comme une expression de la pensée. Par là, on comprend mieux le « projet » d’Arraisonnement qui est à l’œuvre : capturer l’intériorité humaine, même si celle-ci est un fantasme, une construction géographiquement et historiquement située.

C’est donc moins les performances technologiques et instrumentales de l’imagination artificielle qui sont remarquables, où une nouveauté en chasse une autre et où chacun, palpitant, suit celle-ci, que la symptomatologie figurale qui s’y cachent : l’immense tableau de l’imagination de notre époque.

–

If you look at the exchanges on social networks of people using Dalle 2, Midjourney and Discodiffusion, you’ll notice the same question coming up: how do I get the image I want from the text I wrote? The “prompt”, the descriptive text that you enter in the form to generate an image, appears to be the secret of the image. Prompters pass around their formulas and tricks to give the image the desired style: subject and depth of field, lens and image graininess. Prompt generators appear to make the job easier, while downloadable guides teach us how to become good prompter. Some people deduce and copy other people’s prompts and replicate their style and signature (I’ve done it myself).

If this question seems to be self-evident, as it goes naturally in the direction of the software that generates an image from a text, it is remarkable that this relationship between text and image has an ancient history and has been the foundation of pictoriality in the West for centuries. Indeed, before, the images had mainly as a role to illustrate a sacred text. Jean-François Lyotard approached this question at length by turning it around with the concept of figural in Discourse Figure (1973). It was a question of glimpsing the figural hidden and lurking in the sedimentation of discourses.

What is the figural of all these prompts that accumulate like so many examples of the poverty of the imaginary of our time?

Why do we determine above all these prompts according to their poverty? It is because it is remarkable that despite the almost infinite statistical possibilities of these programs, the images look alike according to categories that we will map out in a future text: anthropomorphic animals, kawai, heroic fantasy, playing cards, star wars, SF, cyberpunk, press cartoons, etc. Some illustrators, such as Banksy or the French artist, have been able to produce images that are not very popular. Some illustrators like Banksy or Peeble seem to have anticipated, and fed, this generative style.

The valorization of the prompt of which the image would be only the illustration can be interpreted as a will of power and mastery on the image whose simulacrum was considered since Plato as untamable, because proliferating. However this will seems to occult, and it is there the first approach of its figural, that one could very well reverse the control by reconstituting a historical path.

The industrial revolution has systematized the conversion of beings into resources liberating energy, that is to say a value indifferent to the singularity of each being. The measure of the energy is the same something being that releases, in its combustion, of the work. The mass media, and in particular the television, have captured the attention of the people synchronizing them on retransmitted events. The Web has consisted in recording existential traces and has intensified the capture of attention by making it reflexive and interpersonal: giving oneself to others in order to see oneself. The era that opens with the generalization of the inductive production of images consists in a phantasmatic capture of the imaginary of the human beings, so to speak of their interiority, by proposing to them to project in texts what they would like to imagine.

Thus, these different softwares give for the first time a massive image of the anthropic imaginary. It is a black mirror that the artificial imagination holds up to us, rather than us driving it, revealing our imagination, our desire to produce images from a determined programmatic framework. This framework is not indifferent, it constitutes a simplification of our faculties as much as a condition of its capture which will return on itself, feeding it in a recursive loop.

When we see these images, we do not look at the result of a machine, but the crossing between a common cultural stock (the dataset) and a desiring intentionality: to produce an image from a text. The text can be considered as an expression of the thought. By there, one understands better the “project” of Arraisonnement which is at work: to capture the human interiority, even if this one is a fantasy, a geographically and historically situated construction.

It is thus less the technological and instrumental performances of the artificial imagination which are remarkable, where a novelty drives out another one and where each, palpitating, follows this one, that the figurative symptomatology which hides there: the immense picture of the imagination of our time.