docs(research): add LLM instruction architecture entries 24-27 by nullhack · Pull Request #62 · nullhack/python-project-template

nullhack · 2026-04-17T09:36:31Z

Summary

- Add entries 24–27 to docs/academic_research.md covering the empirical and inferred foundations of the three-file agent architecture split:
  - Entry 24: Instruction conflict resolution failure in LLMs (Geng et al. AAAI-26; Wallace et al. OpenAI). Status: Confirmed
  - Entry 25: Positional attention degradation in long contexts (Liu et al. TACL 2023). Status: Confirmed with caveat
  - Entry 26: Modular prompt de-duplication reduces interference (Sharma & Henley 2026). Status: Partially confirmed
  - Entry 27: Three-file agent architecture split with explicit what-runs-when table. Status: Inferred from convergence of entries 23–26
- Amend Entry 23 status to reflect that the on-demand loading benefit rests on vendor guidance plus RAG extrapolation and has not been directly A/B-tested on agent instruction architectures
- Add bibliography entries 27–31

…le: blocks

- Replace folder-per-feature with one .feature file per feature
- User stories are now Rule: blocks; ACs are Example: blocks under each Rule
- Discovery content embedded in feature description free text
- Test layout: tests/features/<feature-name>/<rule-slug>_test.py
- Function naming: test_<rule_slug>_<id_hex>()
- Rewrite gen_test_stubs.py to parse Rule: blocks (one test file per Rule)
- Update gen_todo.py to find .feature files directly in in-progress/
- Update all skills: scope, tdd, implementation, verify, session-workflow
- Add mandatory Self-Declaration block in TODO.md at SELF-DECLARE phase
- Enforce Hypothesis @given + @pytest.mark.slow on all tests/unit/ tests
- Migrate completed/display-version to new single-file format
- Clarify OC-8: fix must produce a new named class, no workarounds
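The test layout and function naming conventions above can be illustrated with a small sketch. This is not the code in gen_test_stubs.py: the helper names `slugify`, `stub_path`, and `stub_function_name` are hypothetical, as is the slug rule (lowercase alphanumeric runs joined by underscores), and it assumes that `<rule-slug>` and `<rule_slug>` denote the same slug.

```python
import re
from pathlib import PurePosixPath


def slugify(rule_title: str) -> str:
    """Hypothetical slug rule: lowercase alphanumeric runs joined by '_'."""
    return "_".join(re.findall(r"[a-z0-9]+", rule_title.lower()))


def stub_path(feature_name: str, rule_title: str) -> PurePosixPath:
    """Build tests/features/<feature-name>/<rule-slug>_test.py for one Rule."""
    return PurePosixPath("tests/features") / feature_name / f"{slugify(rule_title)}_test.py"


def stub_function_name(rule_title: str, id_hex: str) -> str:
    """Build test_<rule_slug>_<id_hex> for one Example under a Rule."""
    return f"test_{slugify(rule_title)}_{id_hex}"


if __name__ == "__main__":
    # "Display version" and "a1b2c3d4" are made-up inputs for illustration.
    print(stub_path("display-version", "Display version"))
    print(stub_function_name("Display version", "a1b2c3d4"))
```

Under these assumptions each Rule maps to exactly one test file, and each Example under it maps to one function whose name carries the Example's hex ID.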

…s, fix gen_test_stubs duplicate detection

- Rewrite developer.md, product-owner.md, reviewer.md as thin routing scripts (~60 lines each, down from ~150)
- Remove duplicated workflow details, principle lists, and path specs from agent files
- Fix stale folder-based paths in developer.md and product-owner.md (v4.1 restructure)
- Fix product-owner.md Phase 3 description (one .feature file with Rule: blocks, not one per story)
- Fix reviewer.md report template using <story> instead of <rule_slug>
- Consolidate Self-Declaration checklist in session-workflow/SKILL.md to reference implementation/SKILL.md
- Fix gen_test_stubs.py find_duplicate_ids() false positives: use set instead of list so the same feature in multiple stage dirs (completed + in-progress) does not trigger spurious warnings
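The set-versus-list fix in the last item can be sketched as follows. The real signature and data model of find_duplicate_ids() are not shown in this PR, so the `(stage_dir, feature_name, id_hex)` record shape and the return type here are assumptions made purely for illustration.

```python
from collections import defaultdict
from typing import Dict, Iterable, Set, Tuple

# Hypothetical record shape: (stage_dir, feature_name, id_hex).
Record = Tuple[str, str, str]


def find_duplicate_ids(records: Iterable[Record]) -> Dict[str, Set[str]]:
    """Return IDs that are claimed by more than one distinct feature.

    Owners are collected in a set, so the same feature seen in two stage
    directories (e.g. completed/ and in-progress/) counts only once.
    With a list, that feature would appear twice and look like a clash.
    """
    owners: Dict[str, Set[str]] = defaultdict(set)
    for _stage_dir, feature_name, id_hex in records:
        owners[id_hex].add(feature_name)
    return {id_hex: names for id_hex, names in owners.items() if len(names) > 1}


if __name__ == "__main__":
    records = [
        ("completed", "display-version", "a1b2c3d4"),
        ("in-progress", "display-version", "a1b2c3d4"),  # same feature, other stage
        ("in-progress", "other-feature", "deadbeef"),
    ]
    print(find_duplicate_ids(records))  # no genuine duplicates in this input
```

One consequence of this sketch is that an ID repeated inside a single feature would also go unreported; whether the actual script checks that case separately is not stated in the commit message.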

Replace the hard rule that tests/unit/ must use @given with guidance: use Hypothesis for properties spanning many inputs, plain pytest for specific behaviors or single edge cases. @pytest.mark.slow remains mandatory on all @given-decorated tests. Affected files: AGENTS.md, tdd/SKILL.md, implementation/SKILL.md, verify/SKILL.md

…ntries

Add entries 24-27 covering instruction conflict resolution failure, positional attention degradation, modular prompt de-duplication, and the three-file agent architecture split. Amend entry 23 status to reflect RAG extrapolation caveat. Add bibliography entries 27-31.

nullhack added 6 commits April 16, 2026 14:47

chore(release): bump version to v4.1.20260416 - Recursive Acinonyx

cc3ec20

docs(workflow): add complete workflow diagram and reference

6bf5550

nullhack merged commit 5341f34 into main Apr 17, 2026
5 checks passed

nullhack deleted the docs/agent-architecture-research branch April 17, 2026 09:36

nullhack merged 6 commits into main from docs/agent-architecture-research
