Conclusion
- A 130+ token prompt compressed to 55 tokens still reproduced the intended image
- Quadruple flash mentions, triple night/dark mentions, and other repetitions — once is enough
- Grammatically broken phrases like
She wears a deep Dark backgroundlikely don’t function as intended - Color palette notes in parenthetical format (assigning colors to elements) are not understood by CLIP
- Implicitly functioning elements exist — removing
Dark backgroundeliminated the wall, requiring explicit addition ofconcrete wall background
Purpose
When writing long prompts, it’s common to repeat the same expression multiple times or leave production notes embedded in the text. In this article, we analyzed a real flash photography prompt (130+ tokens), removed redundant expressions, and tested whether the optimized version (55 tokens) produces equivalent images.
Experimental Conditions
| Parameter | Value |
|---|---|
| Model | z-image-turbo (6B, photorealistic distilled model) |
| Steps | 8 |
| Sampler | euler |
| Scheduler | ddim_uniform |
| CFG | 1.0 |
| Image size | 1024x1024 |
| Seeds | 42, 123, 777 (fixed) |
Analyzing the Original Prompt
First, we analyzed the original prompt (130+ tokens).
Structural Errors
| Location | Problem |
|---|---|
Flash photography of a sitting on concrete | No noun after of a. Grammar is broken |
She wears a deep Dark background | “Wearing a background” is nonsensical. Two phrases incorrectly merged |
ankle strap. | Unclear what ankle strap refers to. Inconsistent with full nude |
Deep Blue ( Black (Hair, shadows), Grey (Concrete)... | Color palette notes left in the prompt. CLIP doesn’t understand parenthetical color assignments |
Redundant Repetitions
| Concept | Occurrences | Count |
|---|---|---|
| Flash | Flash photography / Direct camera flash / harsh shadows behind the subject / stark flash lighting | 4x |
| Night/dark | at night / Dark background / dark nighttime setting | 3x |
| Concrete | sitting on concrete / Grey concrete ground | 2x |
| Knee-hug pose | hugging her knees / Sitting on ground, knees pulled up to chest, arms wrapped around legs, head resting on knees looking at camera | 2x |
| Skin color | pale ivory skin / Pale Ivory (Skin tone under flash) | 2x |
Questionable Elements
| Element | Reason |
|---|---|
High contrast | Implied by flash photography + nighttime |
| Entire color palette section | CLIP doesn’t understand Black (Hair, shadows) format. Colors should be placed adjacent to their targets (attribute leak test) |
Optimization Process
We optimized using these principles:
- Fix structural errors — replace broken grammar with correct expressions
- Merge duplicates to single mentions — 4x flash → 1x, 3x night → 1x
- Remove color palette notes — place color specifications adjacent to targets
- Remove unnecessary elements — ankle strap, High contrast
| Removed/Changed Expression | Reason |
|---|---|
Flash photography of a sitting on concrete → merged into direct flash photography at night | Grammar fix + duplicate resolution |
She wears a deep Dark background → removed | Structural error. Background implied by at night |
hugging her knees → removed | Duplicates detailed pose description |
dark nighttime setting → removed | Duplicates at night |
Direct camera flash / stark flash lighting → removed | Already covered by direct flash photography |
harsh shadows behind the subject → harsh shadows | Removed unnecessary modifier |
High contrast → removed | Implied by flash + night |
Deep Blue ( Black (Hair, shadows)... entire section → removed | Color palette notes |
Black polish on fingers and toes → black nail polish | Shortened, color adjacent to target |
ankle strap → removed | Unnecessary |
Experimental Results
Experiment 1: Original vs Optimized (No Wall Specification)
We compared the optimized version without explicit wall specification.
| seed 42 | seed 123 | seed 777 | |
|---|---|---|---|
| Original (A) | ![]() | ![]() | ![]() |
| Optimized (B) | ![]() | ![]() | ![]() |
Preserved elements (3/3 images): Knee-hug pose, flash shadows, nighttime concrete atmosphere, long black hair, black nails, cool tones
Changed elements: Condition A tends to show a wall background, while Condition B shows more ground in an overhead composition. The original prompt’s Dark background may have been implicitly functioning as a wall.
Experiment 2: Optimized with Wall Specification
We re-tested with concrete wall background explicitly added.
| seed 42 | seed 123 | seed 777 | |
|---|---|---|---|
| Original (A) | ![]() | ![]() | ![]() |
| Optimized+Wall (C) | ![]() | ![]() | ![]() |
All 3 images showed a concrete wall background, restoring the same composition as the original prompt.
Result: Flash shadows, knee-hug pose, wall composition, cool tones — all intended elements were preserved in the optimized version.
Lab Director: So 130 tokens down to 55 and you get the same image — that means over half the original prompt was just noise. Leftover palette notes in your prompt is such a classic mistake though.
Summary
5 Checkpoints for Cleaning Up Redundant Prompts
- Count how many times the same concept appears — Flash 4x, night 3x — once is enough
- Check for broken grammar — no noun after
of a, “wearing a background,” etc. - Check for leftover production notes — parenthetical color palettes don’t reach CLIP
- Omit implied elements — flash + night already implies
High contrast - Always verify deletions with actual images —
Dark backgroundwas functioning as a wall, so removing it caused unintended information loss
Optimization Results
| Tokens | Wall | Flash Shadow | Pose | |
|---|---|---|---|---|
| Original (A) | 130+ | Yes | Yes | Yes |
| Optimized, no wall (B) | ~50 | No | Yes | Yes |
| Optimized, with wall (C) | ~55 | Yes | Yes | Yes |
The final version fits within 75 tokens with ~20 tokens to spare. This headroom allows adding new elements (accessories, expression changes, etc.).
Lab Director: The wall disappearing and then coming right back with just
concrete wall background— it’s proof that what you forget to write matters more than what you write.









![[Verified] Image Generation Prompt Best Practices](/tips/prompt-best-practices/cover_0_0000_4517457392071889496.webp)

