Reduce word counts, add anti-echoing, and use verbatim id guidance by neoneye · Pull Request #469 · PlanExeOrg/PlanExe

neoneye · 2026-03-31T16:13:59Z

Summary

Reduced word counts: description 80-100→50-70, synergy/conflict 40-60→20-40
Anti-echoing (positive framing): "Add new insight beyond what consequences and review already state"
Verbatim id guidance: Field description says "copy it verbatim from the prompt, without XML tags". System prompt says "copy the id verbatim from inside the tags — strip the XML tags but do not alter the id itself." Avoids "hexadecimal" which caused gpt-4o-mini to strip hyphens (Reduce word counts, add anti-echoing, and guide lever_id extraction #468), and avoids "uuid" which models interpret differently.

Supersedes #467 (XML-tag regression) and #468 (hyphen-stripping regression).

Test plan

Verify all models return correct lever_id (no XML tags, no hyphen stripping)
Verify gpt-oss-20b completes 5/5 plans
Verify UUID contamination stays at 0
Verify no model < 0.4x baseline field lengths
Check assessment verdict before merging

🤖 Generated with Claude Code

Reduce description from 80-100 to 50-70 words, synergy/conflict from 40-60 to 20-40 words. Helps gpt-oss-20b complete within 600s budget. Add anti-echoing: "Add new insight beyond what consequences and review already state" (positive framing). For lever_id: say "copy it verbatim from the prompt, without XML tags" instead of "hexadecimal uuid" which caused gpt-4o-mini to strip hyphens (PR #468). System prompt says "copy the id verbatim from inside the tags — strip the XML tags but do not alter the id itself." Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

neoneye · 2026-03-31T17:26:03Z

Self-improve iteration — analysis 68

Verdict: CONDITIONAL (near-YES)

All goals achieved:

100% success rate (35/35 plans, all 7 models)
gpt-oss-20b: 5/5 (was 3/5), max 166s
0 UUID contamination in synergy/conflict
0 hyphen-stripping issues (gpt-4o-mini fixed)
0 XML-tag leakage (llama3.1 fixed)
All 35/35 levers enriched for every model
Field lengths normalized: avg desc 51 words, syn/conf 26 words

Minor residual: haiku gta_game produced 1 dummy_override lever_id (benign, discarded). 2 haiku noise errors total. All real levers unaffected.

Conditions: Remove errors.append for unknown_lever_id (noise cleanup) and refine field description wording.

neoneye merged commit ec10d6c into main Mar 31, 2026
3 checks passed

neoneye deleted the fix/enrich-concise-fields-and-verbatim-id branch April 1, 2026 23:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reduce word counts, add anti-echoing, and use verbatim id guidance#469

Reduce word counts, add anti-echoing, and use verbatim id guidance#469
neoneye merged 1 commit intomainfrom
fix/enrich-concise-fields-and-verbatim-id

neoneye commented Mar 31, 2026

Uh oh!

neoneye commented Mar 31, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

neoneye commented Mar 31, 2026

Summary

Test plan

Uh oh!

neoneye commented Mar 31, 2026

Self-improve iteration — analysis 68

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant