Skip to content

docs(research): add LLM instruction architecture entries 24-27#62

Merged
nullhack merged 6 commits intomainfrom
docs/agent-architecture-research
Apr 17, 2026
Merged

docs(research): add LLM instruction architecture entries 24-27#62
nullhack merged 6 commits intomainfrom
docs/agent-architecture-research

Conversation

@nullhack
Copy link
Copy Markdown
Owner

Summary

  • Add entries 24–27 to docs/academic_research.md covering the empirical and inferred foundations of the three-file agent architecture split
  • Entry 24: Instruction conflict resolution failure in LLMs (Geng et al. AAAI-26; Wallace et al. OpenAI) — Confirmed
  • Entry 25: Positional attention degradation in long contexts — Liu et al. TACL 2023 — Confirmed with caveat
  • Entry 26: Modular prompt de-duplication reduces interference — Sharma & Henley 2026 — Partially confirmed
  • Entry 27: Three-file agent architecture split with explicit what-runs-when table — Inferred from convergence of 23–26
  • Amend Entry 23 status to reflect that on-demand loading benefit is vendor guidance + RAG extrapolation, not directly A/B-tested on agent instruction architectures
  • Add bibliography entries 27–31

…le: blocks

- Replace folder-per-feature with one .feature file per feature
- User stories are now Rule: blocks; ACs are Example: blocks under each Rule
- Discovery content embedded in feature description free text
- Test layout: tests/features/<feature-name>/<rule-slug>_test.py
- Function naming: test_<rule_slug>_<id_hex>()
- Rewrite gen_test_stubs.py to parse Rule: blocks (one test file per Rule)
- Update gen_todo.py to find .feature files directly in in-progress/
- Update all skills: scope, tdd, implementation, verify, session-workflow
- Add mandatory Self-Declaration block in TODO.md at SELF-DECLARE phase
- Enforce Hypothesis @given + @pytest.mark.slow on all tests/unit/ tests
- Migrate completed/display-version to new single-file format
- Clarify OC-8: fix must produce a new named class, no workarounds
…s, fix gen_test_stubs duplicate detection

- Rewrite developer.md, product-owner.md, reviewer.md as thin routing scripts (~60 lines each, down from ~150)
- Remove duplicated workflow details, principle lists, and path specs from agent files
- Fix stale folder-based paths in developer.md and product-owner.md (v4.1 restructure)
- Fix product-owner.md Phase 3 description (one .feature file with Rule: blocks, not one per story)
- Fix reviewer.md report template using <story> instead of <rule_slug>
- Consolidate Self-Declaration checklist in session-workflow/SKILL.md to reference implementation/SKILL.md
- Fix gen_test_stubs.py find_duplicate_ids() false positives: use set instead of list so the same feature in multiple stage dirs (completed + in-progress) does not trigger spurious warnings
Replace the hard rule that tests/unit/ must use @given with guidance:
use Hypothesis for properties spanning many inputs, plain pytest for
specific behaviors or single edge cases. @pytest.mark.slow remains
mandatory on all @given-decorated tests.

Affected files: AGENTS.md, tdd/SKILL.md, implementation/SKILL.md, verify/SKILL.md
…ntries

Add entries 24-27 covering instruction conflict resolution failure,
positional attention degradation, modular prompt de-duplication, and
the three-file agent architecture split. Amend entry 23 status to
reflect RAG extrapolation caveat. Add bibliography entries 27-31.
@nullhack nullhack merged commit 5341f34 into main Apr 17, 2026
5 checks passed
@nullhack nullhack deleted the docs/agent-architecture-research branch April 17, 2026 09:36
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant