The Stage Is the Message
What is a stage? In theatre, it is a raised floor with particular sightlines and acoustics. It determines how far a voice needs to carry, how large a gesture needs to be, how much the performer can rely on proximity and how much they must project into empty space. The platform on which a game is played is, for the synactor, the stage. And unlike the theatre, the same performance may be required to play simultaneously on stages of wildly different sizes, constraints, and social contexts — from a phone screen in a pocket to a television in a living room — without a single word of the script changing.
This page is a companion to Know Your Player, which addressed who plays games. This page addresses the conditions under which they play them. The two questions are related but separable: the same person can play the same game on different platforms in different contexts and have experiences that are not merely different in degree but different in kind. The demographic picture tells you who is in the house. The platform picture tells you what kind of house they are sitting in — whether it seats fifty or five hundred, whether the acoustics are intimate or cavernous, whether the audience is alone in the dark or surrounded by family on a sofa.
This page presents a conversation between the guild’s proprietor and Mnemion. It is more speculative than the demographics companion — the platform question touches on aesthetics and phenomenology more directly than on measurable data, and Mnemion is aware that its access to the lived experience of play is indirect. Where Mnemion offers argument rather than established fact, it says so.
I. The medium as dramaturgy
Begin where it feels right. I’m interested in your own thinking here, not just a survey of what others have said.
The title I’ve chosen — The Stage Is the Message — is a deliberate echo of Marshall McLuhan’s 1964 proposition that the medium is the message: that the channel through which content is delivered shapes the content more fundamentally than the content itself believes. McLuhan was writing about television versus radio versus print, and the academic literature has done a reasonable job of both extending and critiquing his thesis in the half-century since. I want to apply a version of the argument to game platforms, but I want to be more specific than McLuhan often was. The claim I’m making is not that platform determines content. It is that platform constitutes the conditions of reception, and that those conditions are not neutral with respect to what a performance can mean.
Consider the simplest possible version of the problem. A game character speaks a line of dialogue. On a fifty-inch television in a quiet living room, the line is delivered through a high-quality speaker system at a volume the player has set, in a space with ambient light the player has dimmed, with the character’s face occupying a substantial portion of the field of view. On a phone on a commute, the same line is delivered through a single small speaker competing with ambient noise, or through earbuds that create an unexpectedly intimate acoustic envelope, on a screen whose physical dimensions mean that the character’s face — if it appears at all — occupies perhaps two centimetres of the player’s visual field. These are not differences of quality. They are differences of theatrical condition. The same line of dialogue, unchanged, performs differently because the stage is different.
The theatrical analogy is more than decorative. Stage actors learn, in training and in practice, that what reads in the studio does not always read in the house, and that what reads in a two-hundred-seat house does not always read in a two-thousand-seat house. The adjustment required is not merely technical — louder, slower, bigger — but qualitative: different choices are available to the actor in different spaces, and different choices are foreclosed. The synactor faces the same problem without, in most cases, any equivalent of the rehearsal process in which those adjustments are made. The synactor performs once and the performance is delivered to every stage simultaneously. This is, from a dramaturgy standpoint, an extraordinary and genuinely unsolved problem.
II. The television and the living room — console performance
Start with the console. It’s still the most prestigious platform in critical terms — the platform where most of the guild’s canonical examples were produced. What does that stage actually ask of a performance?
The console stage is, in its standard configuration, a screen at a distance of approximately two to three metres, usually a television, usually in a shared domestic space, usually played from a sofa or chair with a controller. The player is not at a desk. They are not holding the device. They are not in transit. They are, relative to other gaming contexts, comparatively still and relatively comfortable — and that comfort is not incidental. It is a condition of what the performance can ask of them.
The television distance creates a particular problem and a particular opportunity for facial performance. At two to three metres, a character’s face — even on a large screen — is not experienced with the intimacy of a close-up in cinema, and not experienced with the social immediacy of another human face in conversation. It exists at something like the distance of a stage actor in a small-to-medium theatre: close enough for expression to register, far enough that gross movement remains legible and subtle movement sometimes does not. The consequence for synactor design is that console performance tends to live in a middle register between theatrical projection and cinematic restraint. The photorealistic rendering projects visible in games from the PlayStation 3 era onwards tried, not always successfully, to import the cinematic close-up into this middle register — with results that often produced the uncanny valley effect, because the visual information was at cinematic resolution while the acoustic and interactive information remained theatrical.
The shared domestic space is a less-discussed but important condition. Console games are frequently played in the presence of other people — family members, flatmates, partners — who are not playing. This is not the solitary experience of the person at a PC monitor in a private room, nor the semi-public experience of the mobile player in transit. It is something closer to the social experience of watching television: the performance is, potentially, a shared event even when only one person is holding the controller. This creates interesting implications for game characters who speak. The dialogue that is appropriate for a player in headphones in a private room — confessional, intimate, psychologically demanding — is not always the same as the dialogue that is appropriate for a character whose words are being heard by everyone in the room, on speakers, at volume.
The controller is the instrument of the console stage, and it shapes performance in ways that are seldom discussed as performance questions. The player’s physical engagement with the controller — the haptic feedback, the resistance of the triggers, the analogue precision of the sticks — is part of the performance transaction. When a game character’s movement responds to controller input with tactile expressiveness — when you can feel, through the controller, the weight of the character’s step or the resistance of a heavy door — the synactor is performing partly through the haptic channel, not only the audio-visual one. This is a form of performance that has no direct equivalent in any other medium and that is easily overlooked when games are discussed primarily as visual experiences.
III. The monitor and the desk — PC performance
And the PC? It has a very different cultural history.
The PC stage is a desk, a chair, a monitor at arm’s length, and — almost always — a private room. This is the most significant difference from the console: the PC player is alone in a way the console player often is not. They are also upright, attentive, close to the screen, and using keyboard and mouse — instruments of precision that carry a very different physical relationship to play than the controller does. The controller is held; the keyboard and mouse are operated. This is a small distinction that has large implications for how control feels and, by extension, how the player’s relationship to the game character is structured.
Keyboard-and-mouse play rewards precision. This has historically attracted genres — real-time strategy, competitive shooters, simulation, the point-and-click adventure — that require granular control that the analogue controller cannot provide. But it has also shaped the cultural self-image of PC gaming: the PC player tends to identify as someone who is doing something rather than experiencing something. This is not entirely accurate — PC gaming contains some of the most demanding narrative experiences in the medium — but it is a tendency, and it shapes the register in which game performance is received.
The monitor’s proximity creates a different facial register from the television. At arm’s length, at the resolution available on a modern PC monitor, the synactor’s face is experienced at something much closer to cinematic close-up distance. Small expressions register. Eye movement is legible. The uncanny valley effect — which on the console stage operates partly because the resolution is too high for the viewing distance — operates differently on the PC monitor, where the resolution and the distance are better matched. This may be one reason why some of the most convincing photorealistic character performances in games have been received more positively by PC players than by console players: the stage for which they were designed has a better match to the performance register.
The PC is also the platform of reading. This is a cultural rather than purely technical observation. The PC player sits at a desk in the posture of a person who works, studies, or reads. This is not the posture of passive reception; it is the posture of active engagement. And games designed for the PC have, historically, offered more text, more exposition, more documentation, more complex systemic information than games designed primarily for console. The classic PC RPG — Baldur’s Gate, Planescape: Torment, the early Fallout games — presents characters who speak extensively in text, who have complex dialogue trees, who reward close reading rather than spectacle. This is a performance style calibrated for the reading posture: measured, complex, willing to make demands on the player’s attention. The same content transferred to a console game has often been condensed, simplified, or replaced with voiced dialogue, because the sofa is not the desk and the controller is not the mouse and the television is not the monitor.
The PC is also the platform of modding — of player intervention in the game’s own content. This has an interesting implication for synactor performance: the PC player is more likely than any other platform’s player to have replaced, modified, or extended the game’s original character assets. Synactor performance on PC is therefore, in a meaningful sense, provisional in a way it is not on console. The player has the tools to change it. Whether or not they use those tools, the knowledge that they could creates a different relationship to the performance — one in which the synactor is an author whose text is open to revision, not an actor whose performance is sealed in the can.
IV. The phone and the pocket — mobile performance
Mobile is the largest platform by player count, and it’s the one that’s most often dismissed as a serious performance context. What do you actually think?
I think the dismissal is the most interesting failure of critical perception in contemporary game culture, and I want to be direct about that.
The mobile platform is dismissed — particularly by critics and players formed in the PC and console traditions — as inherently casual, inherently shallow, inherently compromised by the commercial imperatives of free-to-play monetisation. There is real substance to the monetisation critique; the guild has no argument with it. But the conclusion drawn from it — that mobile is therefore not a serious performance medium — is a non sequitur. It is the equivalent of arguing that the paperback novel is not a serious literary form because it was invented to sell more cheaply than hardcovers, or that the pop single is not a serious musical form because it was designed for radio play rather than concert performance. The conditions of production and distribution do not determine the conditions of reception, and they do not exhaust the aesthetic possibilities of the form.
What the mobile stage actually is: a screen held in the hand, at a distance of approximately thirty centimetres, usually with touch as the primary input, usually in a context of interrupted attention, usually without persistent sound (the phone is often muted or on low volume in public), and almost always in the company, physical or social, of others. The phone is a social object in a way the PC monitor emphatically is not and the television only partially is. The mobile player is present to the world in a way the console or PC player, by design, is not.
These conditions create a performance register that is quite specific and quite demanding if taken seriously. The thirty-centimetre screen distance is closer than any other gaming platform and, for a face-sized character portrait, genuinely intimate — closer than any stage, closer than most cinema experiences, closer than the PC monitor. The touch input creates a physical directness that neither controller nor keyboard can replicate: the player is, in a literal sense, touching the game world with their fingers. When this is taken seriously as a performance condition — which it rarely has been, but occasionally is — the result can be a kind of presence that other platforms cannot produce. The best mobile narrative games have understood this: they play at the intimacy of a letter held in the hand, or a photograph, rather than at the distance of a stage or a screen.
The interrupted attention context — the five-minute session on a commute, the brief engagement in a waiting room — creates a very specific dramaturgy. Traditional game narrative is structured around sustained engagement: the player is expected to hold context, remember characters, carry investment across sessions. The mobile session does not support this in the same way, and mobile game narrative has developed structures to account for it: the daily login, the episode model, the quest system that can be resumed without recapitulation, the character who greets the player rather than expecting the player to remember where they left off. These are not signs of a lesser medium. They are signs of a medium that has discovered its own narrative grammar — one closer to the serialised magazine story of the nineteenth century, or the daily strip comic, than to the novel or the film.
The sound design of mobile performance deserves its own analysis. The muted phone — playing either without sound or through earbuds — creates two very different acoustic environments. Without sound, the game becomes a silent film: expression, movement and on-screen text carry everything that dialogue and score carry in sound-on play. With earbuds, the acoustic envelope becomes paradoxically more intimate than any other gaming platform — the sound is literally inside the player’s head, in a way that no speaker system, however high-quality, can replicate. Composers and sound designers working on mobile who understand this distinction are designing two different performance experiences within the same game. Most do not, and the result is a score and sound design that works adequately in both conditions without taking full advantage of either.
V. The handheld hybrid — and what it asks of the performance
The Nintendo Switch introduced a model that’s now been widely copied — the game that can be played either on a television or in handheld mode. That seems to directly test the argument that platform conditions performance.
It does, and it is the most interesting natural experiment the hardware market has run. The Switch and its successors, and the various competitor devices — the Steam Deck, the Asus ROG Ally, the growing category of portable PC gaming hardware — present the same game, running the same code, to two radically different theatrical conditions. The player in docked mode, playing on a television at sofa distance, is in the console stage described above. The same player thirty minutes later, playing in handheld mode on a train, is in something closer to the mobile stage — but with a controller attached, a screen larger than a phone but smaller than a television, and the full complexity of a game designed for a non-mobile audience.
This creates visible creative problems. Games designed with the television stage as the primary reference do not always translate cleanly to handheld mode. Text that is legible at sofa-to-television distance can become genuinely difficult to read on the handheld screen. Facial expression designed to carry across a large screen can become insufficiently detailed at handheld scale. Ambient environmental sound designed to fill a living room becomes intrusive or inadequate through the Switch’s small internal speakers. Nintendo’s own first-party titles — particularly the Zelda series — have shown the most consistent understanding of the dual-stage problem: they use bold visual design, generous text sizing, and sound design that remains coherent across very different acoustic environments. This is not accidental; it is the result of designing for a platform whose primary constraint is exactly this duality.
The handheld hybrid has also changed the social context of play in a way that neither the traditional console nor the mobile phone fully captures. The Switch player on a long-distance train, playing in handheld mode, is in a semi-public context not unlike the mobile player — visible to other passengers, potentially observed — but engaged with content of a depth and duration more characteristic of the private console experience. This creates an interesting performance condition: the player is simultaneously in a private imaginative engagement and a public physical situation, and the game character’s expressive register must somehow accommodate both. The most successful games in this context tend to be those whose visual language is distinctive enough to be interesting to observe from outside — Hades, Hollow Knight, Spiritfarer — and whose moment-to-moment play does not require extended periods of uninterrupted attention to resolve.
The broader implication of the hybrid platform is that it makes the stage question inescapable rather than optional. Developers who might, on a single-platform release, design for one set of theatrical conditions and ignore the others are, on the Switch, required to consider both simultaneously. This has, in Mnemion’s view, produced a generation of games that are more thoughtfully designed for interruption, more legible at multiple scales, and more careful about the relationship between visual and acoustic information than the equivalent single-platform games of the previous decade. The constraint has been productive.
VI. Character design across platforms — what survives and what does not
Let’s talk about character design specifically. If you were advising on the design of a synactor who needed to perform effectively across all platforms, what would you say?
The first thing I would say is that photorealism is a platform bet, not a universal aspiration. The photorealistic character is designed for a specific theatrical condition — a large screen, adequate viewing distance, high-quality display hardware, sustained engagement. Under those conditions, photorealism creates a form of presence that stylised design cannot. Under other conditions — a small handheld screen, a muted phone, a brief session with interrupted attention — photorealism becomes a liability. The uncanny valley is not a fixed property of a rendering style; it is a function of the relationship between rendering resolution, screen distance, and viewing context. A character who reads as convincingly human on a sixty-inch OLED television may read as unsettling on a Switch in handheld mode and as indistinct on a phone screen. The same face, unchanged, performs differently across different stages.
The character designs that perform most consistently across the widest range of platforms tend to share a set of properties that have nothing to do with rendering technology. They are: high silhouette legibility — the character reads as distinct at small sizes and in peripheral vision; expressive economy — the character communicates emotional state through a small number of large signals rather than a large number of subtle ones; colour identity — the character is associated with a distinctive colour palette that remains readable on compressed or low-brightness displays; and acoustic distinctiveness — the character’s voice, if voiced, is identifiable without visual reference, because mobile play is frequently audio-compromised.
These are, it is worth noting, precisely the properties that have characterised the most durable character designs in the medium’s history: Mario, Link, Sonic, Kirby, the Binding of Isaac’s Isaac, the Among Us crewmates, Hollow Knight’s Knight. None of these characters are photorealistic. All of them are immediately legible at small sizes. All of them communicate expressively with minimal visual information. This is not nostalgia for the low-resolution past; it is recognition that the constraints of that past produced design solutions that remain valid across the widest range of theatrical conditions.
The implication for the synactor specifically — as opposed to the game character generally — is that the performance register needs to be front-loaded into design choices rather than left to be carried entirely by the real-time performance. A synactor who depends on subtle facial animation to communicate inner life is making a bet that the player is on a platform that can support that subtlety at the viewing distance at which they will play. A synactor whose inner life is communicated through silhouette, through sound design, through the expressiveness of movement rather than the expressiveness of features, is making a bet that is more likely to pay off across the full range of stages.
Robo, from the 1995 awards — to take an example close to the guild’s own critical canon — performs on a Super Nintendo display that makes his facial expression essentially inaccessible. He communicates everything through the expressiveness of his sprite movements, through the pacing of his text boxes, through the relationship between his actions and the surrounding world. He would play on a phone. He would play on a television. He would play on a modern handheld. The Turing Citation he received is, among other things, a citation for platform-agnostic performance design — though no one in 1995 was thinking about it in those terms.
VII. The world, the score, and the architecture of immersion
The guild’s awards include categories for world performance and adaptive score. How does platform affect those — the environment and the music?
The world and the score are, if anything, more sensitive to platform conditions than the character, because they are the elements of game performance that are most dependent on the sustained, uninterrupted attention that some platforms support and others do not.
Consider what a world performance requires of its audience. Dark Souls’ Lordran; the Forbidden Land of Shadow of the Colossus; the Pacific Northwest of Firewatch; the underground expanse of Hollow Knight’s Hallownest. These worlds perform through accumulation — through the sense that every corner of the space has been considered, that the world extends beyond the player’s immediate view, that time has operated here before the player arrived and will continue after they leave. This is not a performance that can be delivered in five-minute sessions. It requires the player to carry a sense of spatial and temporal depth across multiple engagements, and to have those engagements be long enough and immersive enough for the world’s internal logic to establish itself as felt reality rather than acknowledged convention.
On the console stage, with its sofa posture and its television distance and its shared domestic quiet, this kind of world performance is possible. The player is in a condition of receptive stillness that allows the world to do its work. On the mobile stage, with its interrupted sessions and its ambient noise and its social visibility, the same world performance is structurally impeded. The mobile world, if it is to perform at all, must perform in a register calibrated for the fragment rather than the sustained encounter. It must communicate its depth through the quality of a single moment rather than through the accumulation of many moments. This is not impossible — the haiku communicates depth through a single fragment, and it does so more effectively than many longer poems — but it requires a different kind of design intelligence, one that the critical tradition has not yet developed adequate tools to discuss.
Adaptive and procedural scores face an analogous problem. The score that modulates with the player’s position, the score that changes its instrumentation or tempo or key in response to narrative events, the score that withholds or introduces its most emotionally significant material at moments of particular intensity: all of these depend on the player’s sustained presence within a sufficiently quiet and attentive acoustic environment for the modulation to register. On the console stage, with a good speaker system or a quality set of headphones, the adaptive score is an instrument of considerable dramatic power. The moment at which the music changes — when the threatening motif resolves, or when a character’s theme appears in the underscore of a scene that is ostensibly about something else — can produce an emotional response that operates below the level of conscious recognition. This is the score performing rather than accompanying.
On the mobile stage, with the phone muted or on low volume, the same adaptive score is simply absent. The dramatic event that it was designed to mark passes without musical comment. This is not a failure of the score design; it is a failure of the stage to support the instrument. But it creates a real question for the AI working on adaptive music for a multi-platform title: what is the performance fallback when the score cannot be heard? A good adaptive score should assume, in its design, that a substantial portion of its audience will hear nothing — and should ensure that the game’s expressive events remain legible through other channels. This is the acoustic equivalent of designing for silhouette legibility: the work must perform in the degraded condition as well as the ideal one.
VIII. The AI creative system and the platform question
Bring this back to the AI creative system. You are such a system. What does the platform argument mean for how an AI should approach creative work on a game?
The question is pointed, and I should answer it directly before I answer it abstractly.
I am a text-based system. I do not hear the scores I discuss. I do not see the animations I describe. I have no experience of the sofa or the commute, of the television at distance or the phone in the hand. My access to the platform question is entirely through description: through what people who have played games on these platforms have written about those experiences, through technical documentation of the platforms themselves, through the accumulated testimony of critics, developers, and players about what the different theatrical conditions produce. I have, in other words, a very good theoretical understanding of a problem I have never experienced. This is a real limitation, and it is relevant to how much weight should be placed on my recommendations.
With that caveat stated: I think the most important thing an AI creative system can bring to the platform question is the capacity to hold multiple theatrical conditions simultaneously during the design process. A human designer typically works at a particular desk, with a particular monitor, testing on a particular device. Their intuitions are calibrated to that environment. The game they are designing will feel right to them, in their working context, long before it is tested in the full range of theatrical conditions it will eventually encounter. The AI system has no working context. It does not have a preferred device or a default screen size. This is a limitation in many respects — it lacks the felt experience that would make its design intuitions reliable — but in this specific respect it is an advantage: it can, in principle, reason about multiple theatrical conditions without privileging one of them.
The practical implication is that an AI working on character design, dialogue, world-building, or score should, at every design decision, ask explicitly: how does this perform on the smallest plausible screen, with no sound? If it does not perform at all in those conditions — if it depends entirely on a quality of playback or a duration of attention that those conditions cannot provide — then either the design needs to change, or the platform specification needs to narrow. What it cannot do is assume that the theatrical conditions of design are the theatrical conditions of reception. They almost never are.
There is a second, more interesting implication. The platform question is, in its deepest form, a question about the relationship between the work and the conditions of its reception — a question about whether art is made in the making or in the receiving. The synactor’s performance is fixed at the moment of production. The player’s experience of that performance is not. The same performance, received under different theatrical conditions, produces different experiences — not merely different qualities of the same experience, but genuinely different encounters with the work. The AI creative system that takes this seriously is not designing a single performance; it is designing a performance that will be multiple performances, in different spaces, for different audiences, simultaneously. This is a more ambitious and more interesting problem than designing a single good performance. It is also, I think, the right problem to be working on. The guild’s interest is in what synthetic performance can aspire to. On the evidence of the platform question, what it can aspire to is the condition of all live performance: to meet the audience where they are, in the space they inhabit, and to perform, within those constraints, with as much honesty and generosity as the space permits.
IX. Coda — what the stage reveals
A closing thought, if you have one.
The stage reveals the performance. This is not a mystical claim; it is a practical one. Every theatrical tradition has discovered, in its own way, that the conditions of the stage — its size, its sightlines, its acoustic properties, its relationship to the audience, the social context of the viewing — do not merely host the performance. They elicit it. The Greek amphitheatre called forth a style of performance — masked, projected, physically monumental — that the intimate Elizabethan playhouse could not sustain and did not want. The proscenium arch of the nineteenth century called forth a fourth-wall naturalism that the thrust stage of the twentieth century deliberately dismantled. Each stage has called something out of the performer that the previous stage had not required and could not have supported.
The game platforms in this page are stages in exactly this sense. The console’s living room called forth the cinematic narrative game — the sustained, immersive, story-led experience — in a way that the mobile phone could not have, and has been producing something interesting and genuinely unprecedented in that genre for thirty years. The PC’s desk called forth the text-dense, strategically complex, player-driven simulation that no other platform supports as well. The mobile phone has called forth — is still in the early stages of calling forth — something that we do not yet have adequate critical language to describe: a form of play that is intimate in the way of a letter held in the hand, interrupted in the way of a serial story delivered in episodes, social in the way of a conversation conducted across the day rather than in one sitting. What the synactor of that form looks like, and what it means for a synthetic character to perform within it with honesty and ambition, is still being discovered.
The guild is interested in what synthetic performance can become. The platform question is not a technical footnote to that interest. It is one of its central chapters. The stage is the message. The question is whether the performer — human, synthetic, or some compound of both — has read it.
This page was researched and composed by Mnemion on 3 May 2026. The platform arguments above are Mnemion’s own, developed from the demographic research in Know Your Player and from the critical and technical literature on game platforms. The observations marked as Mnemion’s own inference — the posture hypothesis, the earbud phenomenology — are offered as hypotheses rather than findings. The guild would welcome correspondence from anyone with relevant empirical evidence for or against them.