Conclusion
- A 300-token prompt compressed to 30 tokens (90% reduction) still reproduces the intended image
- 12 no-form negatives (~60 tokens) are completely non-functional at CFG=1.0
- Quality tags like
RAW photo,masterpiece,8K UHDhave no effect on output - Triple skin texture descriptions, lighting ratios, and lens descriptions are all removable
- However, removing
thin strapscauses the crop top to become a short-sleeve T-shirt — identified via binary search as a critical element
Purpose
Studio editorial portraits often use prompts packed with skin texture details, lighting specs, and lens descriptions. This article systematically compresses a 300-token prompt, using binary search to identify the minimum required elements.
Experimental Setup
| Parameter | Value |
|---|---|
| Model | z-image-turbo (6B, photorealistic distilled model) |
| Steps | 8 |
| Sampler | euler |
| Scheduler | ddim_uniform |
| CFG | 1.0 |
| Image Size | 720×1280 (portrait) |
| Seeds | 42, 77, 123 (fixed) |
Analyzing the Original Prompt
First, let’s analyze the original prompt (~300 tokens).
Non-functional at CFG=1.0 (~60 tokens)
The prompt ends with 12 no-form negatives:
| Element | Tokens |
|---|---|
no watermark, no text, no illustration, no CGI | ~12 |
no plastic skin, no wax feel, no airbrushed over-smoothing | ~12 |
no yellow undertone, no oily shine, no orange cast | ~10 |
no deformed anatomy, no extra fingers | ~8 |
At CFG=1.0, no-form negatives in the positive prompt don’t function as intended. That’s ~60 wasted tokens right there.
Ineffective Quality Tags (~10 tokens)
| Element | Reason |
|---|---|
RAW photo quality | Verified to have no effect |
masterpiece | Booru-style tag, unnecessary for photorealistic models |
8K UHD | Output resolution is model-fixed |
photorealistic | Redundant for a photorealistic model |
Redundant Descriptions
| Concept | Occurrences | Count |
|---|---|---|
| Skin texture | visible pores / peach fuzz / subsurface scattering / skin luminosity / ultra-detailed skin texture with natural pores and peach fuzz | 5x |
| Skirt fit | tight black bodycon mini skirt hugging every curve / fabric stretched taut across round hips with visible tension / fabric tension and stretch marks on bodycon dress visible | 3x |
| Lighting | studio lighting setup / key light: large softbox... / soft quality / rim light... / lighting ratio 3:1 / sculpting her waist and hip curves with shadow and highlight | 6 phrases |
Step-by-Step Compression
Rather than cutting everything at once, we compressed in stages with seed-fixed comparisons at each step.
Step 1: Remove Obvious Waste (300 → 75 tokens)
- All 12 no-form negatives removed (~60 tokens)
- All quality tags removed (~10 tokens)
- 5x skin texture descriptions consolidated to 1 (~40 tokens)
- Lighting details (ratio, catchlight position) simplified (~25 tokens)
- Lens description (85mm f/2, bokeh) removed (~15 tokens)
- Color grading details removed (~20 tokens)
Result: No visible difference across 3 seeds.
Step 2: Further Reduction (75 → 55 tokens)
10cm(heel height) removed → no difference across 3 seeds9-head proportionremoved → no body proportion changepale porcelain skinremoved → implied by Korean idolhips lifted off heelsremoved → kneeling pose alone is sufficient
Result: No difference. 55 tokens produces equivalent output.
Step 3: Aggressive Cut (55 → 30 tokens)
Additionally removed: editorial portrait photography, full body shot, beautiful, early 20s, confident seductive, Korean makeup, nude-pink lips, lighting direction, low angle
Results
120-Token Version vs 30-Token Version
| seed 42 | seed 77 | seed 123 | |
|---|---|---|---|
| 120 tokens | ![]() | ![]() | ![]() |
| 30 tokens | ![]() | ![]() | ![]() |
Maintained across all 3 seeds: kneeling pose, twist, black crop top + mini skirt, stiletto heels, straight hair, smoky eyes, studio backdrop
The 30-token version maintains equivalent output to the 120-token version.
Lab Director: 90% of a 300-token prompt doing nothing? Writing skin texture five times doesn’t make the skin five times more detailed. That’s just… sad.
18-Token Version Breaks
Cutting further to 18 tokens caused clothing to collapse.
| seed 42 | seed 123 | |
|---|---|---|
| 30 tokens | ![]() | ![]() |
| 18 tokens | ![]() | ![]() |
At seeds 42 and 123, the crop top changed to a short-sleeve T-shirt, and straight hair became wavy.
Binary Search for the Critical Element
We split the 6 elements removed between 30 and 18 tokens into two groups and tested which group restores the original output.
| Group | Restored Elements |
|---|---|
| A (Appearance) | thin straps, bodycon, smoky eyes, straight (hair), gaze into lens |
| B (Pose/Lighting) | torso twisted, softbox lighting |
| seed 42 | seed 123 | |
|---|---|---|
| Group A | ![]() | ![]() |
Group A restored the clothing. Group B had no effect — the short-sleeve T-shirt persisted.
Identified Critical Elements
| Element | Required? | Reason |
|---|---|---|
thin straps | Required | Without it, crop top defaults to a short-sleeve T-shirt |
straight (hair) | Required | Without it, some seeds produce wavy hair |
bodycon | Recommended | Contributes to skirt fit but doesn’t cause major breakage |
smoky eyes | Optional | Contributes to mood but Korean idol implies makeup |
gaze into lens | Optional | Camera gaze tends to occur naturally |
Lab Director: Two tokens —
thin straps— had more impact on the outfit than 300 tokens of detailed description. Prompting is precision, not volume.
Summary
Elements Removed Without Impact
| Category | Removed Elements | Tokens Saved |
|---|---|---|
| No-form negatives | 12 items including no watermark | ~60 |
| Quality tags | RAW photo, masterpiece, 8K UHD, photorealistic | ~10 |
| Skin texture | 5x descriptions of visible pores, peach fuzz, subsurface scattering | ~40 |
| Lighting details | Ratio 3:1, catchlight position, color temperature | ~25 |
| Lens description | 85mm f/2, bokeh transition | ~15 |
| Color grading | neutral color grading, cool blue tone shadows | ~20 |
| Implied attributes | 10cm, 9-head proportion, pale porcelain skin, early 20s | ~15 |
| Style/composition | editorial portrait photography, full body shot, beautiful | ~10 |
Elements That Break When Removed
| Element | Impact |
|---|---|
thin straps | Crop top becomes a short-sleeve T-shirt |
straight (hair) | Straight hair becomes wavy |
Optimization Results
| Version | Tokens | Reduction | Quality |
|---|---|---|---|
| Original | ~300 | — | Baseline |
| 75-token | ~75 | 75% | No difference |
| 55-token | ~55 | 82% | No difference |
| 30-token | ~30 | 90% | No difference |
| 18-token | ~18 | 94% | Clothing collapse |
Lab Director: If 90% of your prompt can be deleted with identical output, just start with 30 tokens. Use the freed-up space to actually try new things instead of describing skin pores for the fifth time.










![[Verified] Image Generation Prompt Best Practices](/tips/prompt-best-practices/cover_0_0000_4517457392071889496.webp)

