Implement Spec 001: Core UI & Camera Feed with PyObjC

2026-01-21 09:32:45 -05:00
parent 45f73a3249
commit 4864225345
13 changed files with 1594 additions and 0 deletions
--- a/.cursor/commands/speckit.specify.md
+++ b/.cursor/commands/speckit.specify.md
@@ -0,0 +1,18 @@
+---
+description: Create a feature specification
+---
+
+Create a specification for:
+
+$ARGUMENTS
+
+## Steps
+
+1. Generate short name (2-4 words, kebab-case)
+2. Find next spec number from `specs/`
+3. Create `specs/NNN-short-name.md`
+4. Include clear acceptance criteria
+5. Add completion signal:
+   ```
+   **Output when complete:** `<promise>DONE</promise>`
+   ```
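Review note on steps 1-3 above: the naming rule can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the helper name `next_spec_path` is hypothetical, the three-digit zero padding is inferred from the `NNN` placeholder, and capping the short name at four words is one reading of "2-4 words".

```python
import re
from pathlib import Path


def next_spec_path(specs_dir: Path, title: str) -> Path:
    """Return specs_dir/NNN-short-name.md for the next free spec number (hypothetical helper)."""
    # Collect the numeric prefixes of existing specs, e.g. 1 from "001-core-ui.md".
    numbers = [
        int(m.group(1))
        for p in specs_dir.glob("*.md")
        if (m := re.match(r"(\d+)-", p.name))
    ]
    next_number = max(numbers, default=0) + 1

    # Kebab-case short name: lowercase alphanumeric words joined by hyphens,
    # capped at four words (assumption based on the "2-4 words" guideline).
    words = re.findall(r"[a-z0-9]+", title.lower())[:4]
    return specs_dir / f"{next_number:03d}-{'-'.join(words)}.md"
```

The function only computes a path; creating the file and filling in acceptance criteria is left to the agent, as the command describes.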
--- a/.gitignore
+++ b/.gitignore
@@ -0,0 +1,6 @@
+.env
+.venv/
+__pycache__/
+*.pyc
+.DS_Store
+.specify/
--- a/AGENTS.md
+++ b/AGENTS.md
@@ -0,0 +1,5 @@
+# Agent Instructions
+
+**Read:** `.specify/memory/constitution.md`
+
+That file is your source of truth for this project.
--- a/CLAUDE.md
+++ b/CLAUDE.md
@@ -0,0 +1,5 @@
+# Agent Instructions
+
+**Read:** `.specify/memory/constitution.md`
+
+That file is your source of truth for this project.
--- a/PROMPT_build.md
+++ b/PROMPT_build.md
@@ -0,0 +1,18 @@
+# Ralph Build Mode
+
+Read `.specify/memory/constitution.md` first.
+
+## Your Task
+
+1. Check `specs/` folder
+2. Find highest priority INCOMPLETE spec
+3. Implement completely
+4. Run tests, verify acceptance criteria
+5. Commit and push
+6. Output `<promise>DONE</promise>` when done
+
+## Rules
+
+- ONE spec per iteration
+- Do NOT output magic phrase until truly complete
+- If blocked: explain in ralph_history.txt, exit without phrase
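Review note on step 2 above: "highest priority INCOMPLETE spec" can be read as "lowest-numbered spec file without the completion marker". The sketch below assumes the `## Status: COMPLETE` marker and the "lower numbers = higher priority" rule from the prompt that `scripts/ralph-loop-codex.sh` generates; the function name `pick_next_spec` is hypothetical, and `[HIGH]`/`[MEDIUM]`/`[LOW]` tags are deliberately ignored here.

```python
from pathlib import Path
from typing import Optional

# Marker taken from the built-in build prompt in the loop script.
COMPLETE_MARKER = "## Status: COMPLETE"


def pick_next_spec(specs_dir: Path) -> Optional[Path]:
    """Return the lowest-numbered spec not marked complete, or None if all are done."""
    # Zero-padded prefixes mean a plain sort puts "001-..." before "010-...".
    for spec in sorted(specs_dir.glob("*.md")):
        if COMPLETE_MARKER not in spec.read_text(encoding="utf-8"):
            return spec
    return None
```

A `None` result corresponds to the case where every spec appears complete.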
--- a/PROMPT_plan.md
+++ b/PROMPT_plan.md
@@ -0,0 +1,11 @@
+# Ralph Planning Mode
+
+Read `.specify/memory/constitution.md` first.
+
+## Your Task
+
+1. Analyze specs in `specs/`
+2. Create `IMPLEMENTATION_PLAN.md` with prioritized tasks
+3. Output `<promise>DONE</promise>` when done
+
+Delete IMPLEMENTATION_PLAN.md to return to direct spec mode.
--- a/main.py
+++ b/main.py
@@ -0,0 +1,108 @@
+import sys
+import cv2
+import numpy as np
+from PIL import Image
+import objc
+from AppKit import (
+    NSApplication, NSApp, NSWindow, NSView, NSImageView, NSButton,
+    NSStackView, NSImage, NSBitmapImageRep, NSBackingStoreBuffered,
+    NSWindowStyleMaskTitled, NSWindowStyleMaskClosable,
+    NSWindowStyleMaskResizable, NSWindowStyleMaskMiniaturizable,
+    NSTimer, NSMakeSize, NSMakeRect, NSObject, NSLog,
+    NSUserInterfaceLayoutOrientationVertical, NSLayoutAttributeCenterX,
+    NSLayoutAttributeCenterY, NSLayoutAttributeWidth, NSLayoutAttributeHeight,
+    NSLayoutAttributeTop, NSLayoutAttributeBottom, NSLayoutAttributeLeading,
+    NSLayoutAttributeTrailing
+)
+from Foundation import NSObject, NSTimer, NSDate
+
+class ItemSenseApp(NSObject):
+    def applicationDidFinishLaunching_(self, notification):
+        self.window = NSWindow.alloc().initWithContentRect_styleMask_backing_defer_(
+            NSMakeRect(0, 0, 800, 600),
+            NSWindowStyleMaskTitled | NSWindowStyleMaskClosable | NSWindowStyleMaskResizable | NSWindowStyleMaskMiniaturizable,
+            NSBackingStoreBuffered,
+            False
+        )
+        self.window.setTitle_("ItemSense")
+        self.window.center()
+        
+        # Main content view (StackView for layout)
+        self.stack_view = NSStackView.alloc().init()
+        self.stack_view.setOrientation_(NSUserInterfaceLayoutOrientationVertical)
+        self.stack_view.setSpacing_(10)
+        self.stack_view.setEdgeInsets_((10, 10, 10, 10))
+        self.window.setContentView_(self.stack_view)
+
+        # Image View for Camera Feed
+        self.image_view = NSImageView.alloc().init()
+        self.image_view.setImageScaling_(0) # NSImageScaleProportionallyDown
+        self.stack_view.addView_inGravity_(self.image_view, 1) # Top gravity
+
+        # Capture Button
+        self.capture_button = NSButton.buttonWithTitle_target_action_("Capture", self, "captureClicked:")
+        self.stack_view.addView_inGravity_(self.capture_button, 3) # Bottom gravity
+
+        self.window.makeKeyAndOrderFront_(None)
+        
+        # Initialize Camera
+        self.cap = cv2.VideoCapture(0)
+        if not self.cap.isOpened():
+            NSLog("Error: Could not open camera")
+        
+        # Start Timer for 30 FPS
+        self.timer = NSTimer.scheduledTimerWithTimeInterval_target_selector_userInfo_repeats_(
+            1.0/30.0, self, "updateFrame:", None, True
+        )
+
+    def applicationShouldTerminateAfterLastWindowClosed_(self, sender):
+        return True
+
+    def applicationWillTerminate_(self, notification):
+        if hasattr(self, 'cap') and self.cap.isOpened():
+            self.cap.release()
+
+    def updateFrame_(self, timer):
+        if hasattr(self, 'cap') and self.cap.isOpened():
+            ret, frame = self.cap.read()
+            if ret:
+                # Convert BGR to RGB
+                rgb_frame = cv2.cvtColor(frame, cv2.COLOR_BGR2RGB)
+                
+                # Convert to NSImage
+                height, width, channels = rgb_frame.shape
+                bytes_per_line = channels * width
+                
+                # Create BitmapRep
+                bitmap_rep = NSBitmapImageRep.alloc().initWithBitmapDataPlanes_pixelsWide_pixelsHigh_bitsPerSample_samplesPerPixel_hasAlpha_isPlanar_colorSpaceName_bytesPerRow_bitsPerPixel_(
+                    None, width, height, 8, 3, False, False, "NSDeviceRGBColorSpace", bytes_per_line, 24
+                )
+                
+                # Copy data
+                bitmap_data = bitmap_rep.bitmapData()
+                # Copying the frame bytes into this buffer from PyObjC requires care.
+                # A safer and easier route goes through PIL:
+                image = Image.fromarray(rgb_frame)
+                img_data = image.tobytes()
+                
+                # A low-level memory copy is tricky in pure Python/PyObjC without unsafe pointers.
+                # Alternative: wrap the raw RGB bytes in an in-memory image file and load
+                # an NSImage from that data. This is slightly slower than a direct copy
+                # but safer and easier in Python.
+                # PPM (binary "P6") is used because it is uncompressed, so it is fast to build.
+                header = f"P6 {width} {height} 255 ".encode()
+                data = header + rgb_frame.tobytes()
+                ns_data = objc.lookUpClass("NSData").dataWithBytes_length_(data, len(data))
+                ns_image = NSImage.alloc().initWithData_(ns_data)
+                
+                self.image_view.setImage_(ns_image)
+
+    def captureClicked_(self, sender):
+        print("Capture clicked")
+
+if __name__ == "__main__":
+    app = NSApplication.sharedApplication()
+    delegate = ItemSenseApp.alloc().init()
+    app.setDelegate_(delegate)
+    NSApp.activateIgnoringOtherApps_(True)
+    app.run()
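Review note on `updateFrame_`: the frame reaches `NSImage` as a binary PPM (P6) file built in memory, that is, a short ASCII header followed by raw RGB bytes. The sketch below isolates that step so it can be checked without AppKit or OpenCV. The helper name `ppm_bytes` and the length check are additions for illustration, and a newline is used as the single whitespace byte after the maximum sample value where `main.py` uses a space. Whether `NSImage` decodes PPM data on a given macOS version is not verified here.

```python
def ppm_bytes(width: int, height: int, rgb: bytes) -> bytes:
    """Wrap raw 8-bit RGB pixel data in a binary PPM (P6) container (hypothetical helper)."""
    expected = width * height * 3  # three bytes (R, G, B) per pixel
    if len(rgb) != expected:
        raise ValueError(f"expected {expected} bytes of RGB data, got {len(rgb)}")
    # Header fields: magic number, width, height, maximum sample value.
    header = f"P6 {width} {height} 255\n".encode("ascii")
    return header + rgb
```

With a NumPy frame, the call would look like `ppm_bytes(width, height, rgb_frame.tobytes())`, provided the array is `uint8` with shape `(height, width, 3)`.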
--- a/requirements.txt
+++ b/requirements.txt
@@ -0,0 +1,5 @@
+pyobjc-framework-Cocoa
+opencv-python
+pillow
+openai
+python-dotenv
--- a/scripts/ralph-loop-codex.sh
+++ b/scripts/ralph-loop-codex.sh
@@ -0,0 +1,635 @@
+#!/bin/bash
+#
+# Ralph Loop for OpenAI Codex CLI
+#
+# Based on Geoffrey Huntley's Ralph Wiggum methodology.
+# Combined with SpecKit-style specifications.
+#
+# Usage:
+#   ./scripts/ralph-loop-codex.sh              # Build mode (unlimited)
+#   ./scripts/ralph-loop-codex.sh 20           # Build mode (max 20 iterations)
+#   ./scripts/ralph-loop-codex.sh plan         # Planning mode (optional)
+#
+
+set -e
+set -o pipefail
+
+SCRIPT_DIR="$(cd "$(dirname "${BASH_SOURCE[0]}")" && pwd)"
+PROJECT_DIR="$(dirname "$SCRIPT_DIR")"
+LOG_DIR="$PROJECT_DIR/logs"
+CONSTITUTION="$PROJECT_DIR/.specify/memory/constitution.md"
+RLM_DIR="$PROJECT_DIR/rlm"
+RLM_TRACE_DIR="$RLM_DIR/trace"
+RLM_QUERIES_DIR="$RLM_DIR/queries"
+RLM_ANSWERS_DIR="$RLM_DIR/answers"
+RLM_INDEX="$RLM_DIR/index.tsv"
+
+# Configuration
+MAX_ITERATIONS=0  # 0 = unlimited
+MODE="build"
+RLM_CONTEXT_FILE=""
+CODEX_CMD="${CODEX_CMD:-codex}"
+TAIL_LINES=5
+TAIL_RENDERED_LINES=0
+ROLLING_OUTPUT_LINES=5
+ROLLING_OUTPUT_INTERVAL=10
+ROLLING_RENDERED_LINES=0
+
+# Colors
+RED='\033[0;31m'
+GREEN='\033[0;32m'
+YELLOW='\033[1;33m'
+BLUE='\033[0;34m'
+PURPLE='\033[0;35m'
+CYAN='\033[0;36m'
+NC='\033[0m'
+
+mkdir -p "$LOG_DIR"
+
+# Check constitution for YOLO setting
+YOLO_ENABLED=true
+if [[ -f "$CONSTITUTION" ]]; then
+    if grep -q "YOLO Mode.*DISABLED" "$CONSTITUTION" 2>/dev/null; then
+        YOLO_ENABLED=false
+    fi
+fi
+
+show_help() {
+    cat <<EOF
+Ralph Loop for OpenAI Codex CLI
+
+Usage:
+  ./scripts/ralph-loop-codex.sh              # Build mode, unlimited
+  ./scripts/ralph-loop-codex.sh 20           # Build mode, max 20 iterations
+  ./scripts/ralph-loop-codex.sh plan         # Planning mode (OPTIONAL)
+  ./scripts/ralph-loop-codex.sh --rlm-context ./rlm/context.txt
+  ./scripts/ralph-loop-codex.sh --rlm ./rlm/context.txt
+
+Modes:
+  build (default)  Pick incomplete spec and implement
+  plan             Create IMPLEMENTATION_PLAN.md (OPTIONAL)
+
+Work Source:
+  Agent reads specs/*.md and picks the highest priority incomplete spec.
+
+YOLO Mode: Uses --dangerously-bypass-approvals-and-sandbox
+
+RLM Mode (optional):
+  --rlm-context <file>  Treat a large context file as external environment.
+                        The agent should read slices instead of loading it all.
+  --rlm [file]          Shortcut for --rlm-context (defaults to rlm/context.txt)
+
+RLM workspace (when enabled):
+  - rlm/trace/     Prompt snapshots + outputs per iteration
+  - rlm/index.tsv  Index of all iterations (timestamp, mode, iteration, prompt, log, output, status)
+  - rlm/queries/ and rlm/answers/  For optional recursive sub-queries
+
+EOF
+}
+
+print_latest_output() {
+    local log_file="$1"
+    local label="${2:-Codex}"
+    local target="/dev/tty"
+
+    [ -f "$log_file" ] || return 0
+
+    if [ ! -w "$target" ]; then
+        target="/dev/stdout"
+    fi
+
+    if [ "$target" = "/dev/tty" ] && [ "$TAIL_RENDERED_LINES" -gt 0 ]; then
+        printf "\033[%dA\033[J" "$TAIL_RENDERED_LINES" > "$target"
+    fi
+
+    {
+        echo "Latest ${label} output (last ${TAIL_LINES} lines):"
+        tail -n "$TAIL_LINES" "$log_file"
+    } > "$target"
+
+    if [ "$target" = "/dev/tty" ]; then
+        TAIL_RENDERED_LINES=$((TAIL_LINES + 1))
+    fi
+}
+
+watch_latest_output() {
+    local log_file="$1"
+    local label="${2:-Codex}"
+    local target="/dev/tty"
+    local use_tty=false
+    local use_tput=false
+
+    [ -f "$log_file" ] || return 0
+
+    if [ ! -w "$target" ]; then
+        target="/dev/stdout"
+    else
+        use_tty=true
+        if command -v tput &>/dev/null; then
+            use_tput=true
+        fi
+    fi
+
+    if [ "$use_tty" = true ]; then
+        if [ "$use_tput" = true ]; then
+            tput cr > "$target"
+            tput sc > "$target"
+        else
+            printf "\r\0337" > "$target"
+        fi
+    fi
+
+    while true; do
+        local timestamp
+        timestamp=$(date '+%Y-%m-%d %H:%M:%S')
+
+        if [ "$use_tty" = true ]; then
+            if [ "$use_tput" = true ]; then
+                tput rc > "$target"
+                tput ed > "$target"
+                tput cr > "$target"
+            else
+                printf "\0338\033[J\r" > "$target"
+            fi
+        fi
+
+        {
+            echo -e "${CYAN}[$timestamp] Latest ${label} output (last ${ROLLING_OUTPUT_LINES} lines):${NC}"
+            if [ ! -s "$log_file" ]; then
+                echo "(no output yet)"
+            else
+                tail -n "$ROLLING_OUTPUT_LINES" "$log_file" 2>/dev/null || true
+            fi
+            echo ""
+        } > "$target"
+
+        sleep "$ROLLING_OUTPUT_INTERVAL"
+    done
+}
+
+# Parse arguments
+while [[ $# -gt 0 ]]; do
+    case "$1" in
+        plan)
+            MODE="plan"
+            if [[ "${2:-}" =~ ^[0-9]+$ ]]; then
+                MAX_ITERATIONS="$2"
+                shift 2
+            else
+                MAX_ITERATIONS=1
+                shift
+            fi
+            ;;
+        --rlm-context)
+            RLM_CONTEXT_FILE="${2:?--rlm-context requires a file argument}"
+            shift 2
+            ;;
+        --rlm)
+            if [[ -n "${2:-}" && "${2:0:1}" != "-" ]]; then
+                RLM_CONTEXT_FILE="$2"
+                shift 2
+            else
+                RLM_CONTEXT_FILE="rlm/context.txt"
+                shift
+            fi
+            ;;
+        -h|--help)
+            show_help
+            exit 0
+            ;;
+        [0-9]*)
+            MODE="build"
+            MAX_ITERATIONS="$1"
+            shift
+            ;;
+        *)
+            echo -e "${RED}Unknown argument: $1${NC}"
+            show_help
+            exit 1
+            ;;
+    esac
+done
+
+cd "$PROJECT_DIR"
+
+# Validate RLM context file (if provided)
+if [ -n "$RLM_CONTEXT_FILE" ] && [ ! -f "$RLM_CONTEXT_FILE" ]; then
+    echo -e "${RED}Error: RLM context file not found: $RLM_CONTEXT_FILE${NC}"
+    echo "Create it first (example):"
+    echo "  mkdir -p rlm && printf \"%s\" \"<your long context>\" > $RLM_CONTEXT_FILE"
+    exit 1
+fi
+
+# Initialize RLM workspace (optional)
+if [ -n "$RLM_CONTEXT_FILE" ]; then
+    mkdir -p "$RLM_TRACE_DIR" "$RLM_QUERIES_DIR" "$RLM_ANSWERS_DIR"
+    if [ ! -f "$RLM_INDEX" ]; then
+        echo -e "timestamp\tmode\titeration\tprompt\tlog\toutput\tstatus" > "$RLM_INDEX"
+    fi
+fi
+
+# Session log (captures ALL output)
+SESSION_LOG="$LOG_DIR/ralph_codex_${MODE}_session_$(date '+%Y%m%d_%H%M%S').log"
+exec > >(tee -a "$SESSION_LOG") 2>&1
+
+# Check if Codex CLI is available
+if ! command -v "$CODEX_CMD" &> /dev/null; then
+    echo -e "${RED}Error: Codex CLI not found${NC}"
+    echo ""
+    echo "Install Codex CLI:"
+    echo "  npm install -g @openai/codex"
+    echo ""
+    echo "Then authenticate:"
+    echo "  codex login"
+    exit 1
+fi
+
+# Determine prompt file
+if [ "$MODE" = "plan" ]; then
+    PROMPT_FILE="PROMPT_plan.md"
+else
+    PROMPT_FILE="PROMPT_build.md"
+fi
+
+# Create prompt files if they don't exist (same as ralph-loop.sh)
+if [ ! -f "PROMPT_build.md" ]; then
+    echo -e "${YELLOW}Creating PROMPT_build.md...${NC}"
+    cat > "PROMPT_build.md" << 'BUILDEOF'
+# Ralph Build Mode
+
+Based on Geoffrey Huntley's Ralph Wiggum methodology.
+
+---
+
+## Phase 0: Orient
+
+Read `.specify/memory/constitution.md` to understand project principles and constraints.
+
+---
+
+## Phase 1: Discover Work Items
+
+Search for incomplete work from these sources (in order):
+
+1. **specs/ folder** — Look for `.md` files NOT marked `## Status: COMPLETE`
+2. **IMPLEMENTATION_PLAN.md** — If exists, find unchecked `- [ ]` tasks
+3. **GitHub Issues** — Check for open issues (if this is a GitHub repo)
+4. **Any task tracker** — Jira, Linear, etc. if configured
+
+Pick the **HIGHEST PRIORITY** incomplete item:
+- Lower numbers = higher priority (001 before 010)
+- `[HIGH]` before `[MEDIUM]` before `[LOW]`
+- Bugs/blockers before features
+
+Before implementing, search the codebase to verify it's not already done.
+
+---
+
+## Phase 1b: Re-Verification Mode (No Incomplete Work Found)
+
+**If ALL specs appear complete**, don't just exit — do a quality check:
+
+1. **Randomly pick** one completed spec from `specs/`
+2. **Strictly re-verify** ALL its acceptance criteria:
+   - Run the actual tests mentioned in the spec
+   - Manually verify each criterion is truly met
+   - Check edge cases
+   - Look for regressions
+3. **If any criterion fails**: Unmark the spec as complete and fix it
+4. **If all pass**: Output `<promise>DONE</promise>` to confirm quality
+
+This ensures the codebase stays healthy even when "nothing to do."
+
+---
+
+## Phase 2: Implement
+
+Implement the selected spec/task completely:
+- Follow the spec's requirements exactly
+- Write clean, maintainable code
+- Add tests as needed
+
+---
+
+## Phase 3: Validate
+
+Run the project's test suite and verify:
+- All tests pass
+- No lint errors
+- The spec's acceptance criteria are 100% met
+
+---
+
+## Phase 4: Commit & Update
+
+1. Mark the spec/task as complete (add `## Status: COMPLETE` to spec file)
+2. `git add -A`
+3. `git commit` with a descriptive message
+4. `git push`
+
+---
+
+## Completion Signal
+
+**CRITICAL:** Only output the magic phrase when the work is 100% complete.
+
+Check:
+- [ ] Implementation matches all requirements
+- [ ] All tests pass
+- [ ] All acceptance criteria verified
+- [ ] Changes committed and pushed
+- [ ] Spec marked as complete
+
+**If ALL checks pass, output:** `<promise>DONE</promise>`
+
+**If ANY check fails:** Fix the issue and try again. Do NOT output the magic phrase.
+BUILDEOF
+fi
+
+if [ ! -f "PROMPT_plan.md" ]; then
+    echo -e "${YELLOW}Creating PROMPT_plan.md...${NC}"
+    cat > "PROMPT_plan.md" << 'PLANEOF'
+# Ralph Planning Mode (OPTIONAL)
+
+This mode is OPTIONAL. Most projects work fine directly from specs.
+
+Only use this when you want a detailed breakdown of specs into smaller tasks.
+
+---
+
+## Phase 0: Orient
+
+0a. Read `.specify/memory/constitution.md` for project principles.
+
+0b. Study `specs/` to learn all feature specifications.
+
+---
+
+## Phase 1: Gap Analysis
+
+Compare specs against current codebase:
+- What's fully implemented?
+- What's partially done?
+- What's not started?
+- What has issues or bugs?
+
+---
+
+## Phase 2: Create Plan
+
+Create `IMPLEMENTATION_PLAN.md` with a prioritized task list:
+
+```markdown
+# Implementation Plan
+
+> Auto-generated breakdown of specs into tasks.
+> Delete this file to return to working directly from specs.
+
+## Priority Tasks
+
+- [ ] [HIGH] Task description - from spec NNN
+- [ ] [HIGH] Task description - from spec NNN  
+- [ ] [MEDIUM] Task description
+- [ ] [LOW] Task description
+
+## Completed
+
+- [x] Completed task
+```
+
+Prioritize by:
+1. Dependencies (do prerequisites first)
+2. Impact (high-value features first)
+3. Complexity (mix easy wins with harder tasks)
+
+---
+
+## Completion Signal
+
+When the plan is complete and saved:
+
+`<promise>DONE</promise>`
+PLANEOF
+fi
+
+# Build Codex flags for exec mode
+CODEX_FLAGS="exec"
+if [ "$YOLO_ENABLED" = true ]; then
+    CODEX_FLAGS="$CODEX_FLAGS --dangerously-bypass-approvals-and-sandbox"
+fi
+
+# Get current branch
+CURRENT_BRANCH=$(git branch --show-current 2>/dev/null || echo "main")
+
+# Check for work sources - count .md files in specs/
+HAS_SPECS=false
+SPEC_COUNT=0
+if [ -d "specs" ]; then
+    SPEC_COUNT=$(find specs -maxdepth 1 -name "*.md" -type f 2>/dev/null | wc -l | tr -d '[:space:]')
+    [ "$SPEC_COUNT" -gt 0 ] && HAS_SPECS=true
+fi
+
+echo ""
+echo -e "${GREEN}━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━${NC}"
+echo -e "${GREEN}              RALPH LOOP (Codex) STARTING                    ${NC}"
+echo -e "${GREEN}━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━${NC}"
+echo ""
+echo -e "${BLUE}Mode:${NC}     $MODE"
+echo -e "${BLUE}Prompt:${NC}   $PROMPT_FILE"
+echo -e "${BLUE}Branch:${NC}   $CURRENT_BRANCH"
+echo -e "${YELLOW}YOLO:${NC}     $([ "$YOLO_ENABLED" = true ] && echo "ENABLED" || echo "DISABLED")"
+[ -n "$RLM_CONTEXT_FILE" ] && echo -e "${BLUE}RLM:${NC}      $RLM_CONTEXT_FILE"
+[ -n "$SESSION_LOG" ] && echo -e "${BLUE}Log:${NC}      $SESSION_LOG"
+[ $MAX_ITERATIONS -gt 0 ] && echo -e "${BLUE}Max:${NC}      $MAX_ITERATIONS iterations"
+echo ""
+echo -e "${BLUE}Work source:${NC}"
+if [ "$HAS_SPECS" = true ]; then
+    echo -e "  ${GREEN}✓${NC} specs/ folder ($SPEC_COUNT specs)"
+else
+    echo -e "  ${RED}✗${NC} specs/ folder (no .md files found)"
+fi
+echo ""
+echo -e "${CYAN}Using: $CODEX_CMD $CODEX_FLAGS${NC}"
+echo -e "${CYAN}Agent must output <promise>DONE</promise> when complete.${NC}"
+echo ""
+echo -e "${YELLOW}Press Ctrl+C to stop the loop${NC}"
+echo ""
+
+ITERATION=0
+CONSECUTIVE_FAILURES=0
+MAX_CONSECUTIVE_FAILURES=3
+
+while true; do
+    # Check max iterations
+    if [ $MAX_ITERATIONS -gt 0 ] && [ $ITERATION -ge $MAX_ITERATIONS ]; then
+        echo -e "${GREEN}Reached max iterations: $MAX_ITERATIONS${NC}"
+        break
+    fi
+
+    ITERATION=$((ITERATION + 1))
+    TIMESTAMP=$(date '+%Y-%m-%d %H:%M:%S')
+
+    echo ""
+    echo -e "${PURPLE}════════════════════ LOOP $ITERATION ════════════════════${NC}"
+    echo -e "${BLUE}[$TIMESTAMP]${NC} Starting iteration $ITERATION"
+    echo ""
+
+    # Log file for this iteration
+    LOG_FILE="$LOG_DIR/ralph_codex_${MODE}_iter_${ITERATION}_$(date '+%Y%m%d_%H%M%S').log"
+    OUTPUT_FILE="$LOG_DIR/ralph_codex_output_iter_${ITERATION}_$(date '+%Y%m%d_%H%M%S').txt"
+    RLM_STATUS="unknown"
+    : > "$LOG_FILE"
+    WATCH_PID=""
+
+    if [ "$ROLLING_OUTPUT_INTERVAL" -gt 0 ] && [ "$ROLLING_OUTPUT_LINES" -gt 0 ] && [ -t 1 ] && [ -w /dev/tty ]; then
+        watch_latest_output "$LOG_FILE" "Codex" &
+        WATCH_PID=$!
+    fi
+
+    # Optional RLM context block appended to prompt at runtime
+    EFFECTIVE_PROMPT_FILE="$PROMPT_FILE"
+    if [ -n "$RLM_CONTEXT_FILE" ]; then
+        EFFECTIVE_PROMPT_FILE="$LOG_DIR/ralph_codex_prompt_iter_${ITERATION}_$(date '+%Y%m%d_%H%M%S').md"
+        cat "$PROMPT_FILE" > "$EFFECTIVE_PROMPT_FILE"
+        cat >> "$EFFECTIVE_PROMPT_FILE" << EOF
+
+---
+## RLM Context (Optional)
+
+You have access to a large context file at:
+**$RLM_CONTEXT_FILE**
+
+Treat this file as an external environment. Do NOT paste the whole file into the prompt.
+Instead, inspect it programmatically and recursively:
+
+- Use small slices:
+  \`\`\`bash
+  sed -n 'START,ENDp' "$RLM_CONTEXT_FILE"
+  \`\`\`
+- Or Python snippets:
+  \`\`\`bash
+  python - <<'PY'
+  from pathlib import Path
+  p = Path("$RLM_CONTEXT_FILE")
+  print(p.read_text().splitlines()[START:END])
+  PY
+  \`\`\`
+- Use search:
+  \`\`\`bash
+  rg -n "pattern" "$RLM_CONTEXT_FILE"
+  \`\`\`
+
+Goal: decompose the task into smaller sub-queries and only load the pieces you need.
+This mirrors the Recursive Language Model approach from https://arxiv.org/html/2512.24601v1
+
+## RLM Workspace (Optional)
+
+Past loop outputs are preserved on disk:
+- Iteration logs: \`logs/\`
+- Prompt/output snapshots: \`rlm/trace/\`
+- Iteration index: \`rlm/index.tsv\`
+
+Use these as an external memory store (search/slice as needed).
+If you need a recursive sub-query, write a focused prompt in \`rlm/queries/\`,
+run:
+  \`./scripts/rlm-subcall.sh --query rlm/queries/<file>.md\`
+and store the result in \`rlm/answers/\`.
+EOF
+        RLM_PROMPT_SNAPSHOT="$RLM_TRACE_DIR/iter_${ITERATION}_prompt.md"
+        cp "$EFFECTIVE_PROMPT_FILE" "$RLM_PROMPT_SNAPSHOT"
+    fi
+
+    # Run Codex with exec mode, reading prompt from stdin with "-"
+    # Use --output-last-message to capture the final response for checking
+    echo -e "${BLUE}Running: cat $EFFECTIVE_PROMPT_FILE | $CODEX_CMD $CODEX_FLAGS - --output-last-message $OUTPUT_FILE${NC}"
+    echo ""
+    
+    CODEX_EXIT=0
+    if cat "$EFFECTIVE_PROMPT_FILE" | "$CODEX_CMD" $CODEX_FLAGS - --output-last-message "$OUTPUT_FILE" 2>&1 | tee "$LOG_FILE"; then
+        if [ -n "$WATCH_PID" ]; then
+            kill "$WATCH_PID" 2>/dev/null || true
+            wait "$WATCH_PID" 2>/dev/null || true
+        fi
+        echo ""
+        echo -e "${GREEN}✓ Codex execution completed${NC}"
+        
+        # Check if DONE promise was output (accept both DONE and ALL_DONE variants)
+        if [ -f "$OUTPUT_FILE" ] && grep -qE "<promise>(ALL_)?DONE</promise>" "$OUTPUT_FILE"; then
+            DETECTED_SIGNAL=$(grep -oE "<promise>(ALL_)?DONE</promise>" "$OUTPUT_FILE" | tail -1)
+            echo -e "${GREEN}✓ Completion signal detected: ${DETECTED_SIGNAL}${NC}"
+            echo -e "${GREEN}✓ Task completed successfully!${NC}"
+            CONSECUTIVE_FAILURES=0
+            RLM_STATUS="done"
+            
+            if [ "$MODE" = "plan" ]; then
+                echo ""
+                echo -e "${GREEN}Planning complete!${NC}"
+                break
+            fi
+        # Also check the main log
+        elif grep -qE "<promise>(ALL_)?DONE</promise>" "$LOG_FILE"; then
+            DETECTED_SIGNAL=$(grep -oE "<promise>(ALL_)?DONE</promise>" "$LOG_FILE" | tail -1)
+            echo -e "${GREEN}✓ Completion signal detected: ${DETECTED_SIGNAL}${NC}"
+            echo -e "${GREEN}✓ Task completed successfully!${NC}"
+            CONSECUTIVE_FAILURES=0
+            RLM_STATUS="done"
+        else
+            echo -e "${YELLOW}⚠ No completion signal found${NC}"
+            echo -e "${YELLOW}  Agent did not output <promise>DONE</promise> or <promise>ALL_DONE</promise>${NC}"
+            echo -e "${YELLOW}  Retrying in next iteration...${NC}"
+            CONSECUTIVE_FAILURES=$((CONSECUTIVE_FAILURES + 1))
+            RLM_STATUS="incomplete"
+            print_latest_output "$LOG_FILE" "Codex"
+            
+            if [ $CONSECUTIVE_FAILURES -ge $MAX_CONSECUTIVE_FAILURES ]; then
+                echo ""
+                echo -e "${RED}⚠ $MAX_CONSECUTIVE_FAILURES consecutive iterations without completion.${NC}"
+                echo -e "${RED}  The agent may be stuck. Check logs:${NC}"
+                echo -e "${RED}  - $LOG_FILE${NC}"
+                echo -e "${RED}  - $OUTPUT_FILE${NC}"
+                CONSECUTIVE_FAILURES=0
+            fi
+        fi
+    else
+        CODEX_EXIT=$?
+        if [ -n "$WATCH_PID" ]; then
+            kill "$WATCH_PID" 2>/dev/null || true
+            wait "$WATCH_PID" 2>/dev/null || true
+        fi
+        echo -e "${RED}✗ Codex execution failed (exit code: $CODEX_EXIT)${NC}"
+        echo -e "${YELLOW}Check log: $LOG_FILE${NC}"
+        CONSECUTIVE_FAILURES=$((CONSECUTIVE_FAILURES + 1))
+        RLM_STATUS="error"
+        print_latest_output "$LOG_FILE" "Codex"
+    fi
+
+    # Record iteration in RLM index (optional)
+    if [ -n "$RLM_CONTEXT_FILE" ]; then
+        RLM_PROMPT_PATH="${RLM_PROMPT_SNAPSHOT:-}"
+        RLM_OUTPUT_SNAPSHOT="$RLM_TRACE_DIR/iter_${ITERATION}_output.log"
+        cp "$LOG_FILE" "$RLM_OUTPUT_SNAPSHOT"
+        if [ -f "$OUTPUT_FILE" ]; then
+            RLM_LAST_MESSAGE_SNAPSHOT="$RLM_TRACE_DIR/iter_${ITERATION}_last_message.txt"
+            cp "$OUTPUT_FILE" "$RLM_LAST_MESSAGE_SNAPSHOT"
+        fi
+        RLM_OUTPUT_PATH="${RLM_LAST_MESSAGE_SNAPSHOT:-$RLM_OUTPUT_SNAPSHOT}"
+        echo -e "${TIMESTAMP}\t${MODE}\t${ITERATION}\t${RLM_PROMPT_PATH}\t${LOG_FILE}\t${RLM_OUTPUT_PATH}\t${RLM_STATUS}" >> "$RLM_INDEX"
+    fi
+
+    # Push changes after each iteration
+    git push origin "$CURRENT_BRANCH" 2>/dev/null || {
+        if git log origin/$CURRENT_BRANCH..HEAD --oneline 2>/dev/null | grep -q .; then
+            git push -u origin "$CURRENT_BRANCH" 2>/dev/null || true
+        fi
+    }
+
+    # Brief pause between iterations
+    echo ""
+    echo -e "${BLUE}Waiting 2s before next iteration...${NC}"
+    sleep 2
+done
+
+echo ""
+echo -e "${GREEN}━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━${NC}"
+echo -e "${GREEN}       RALPH LOOP (Codex) FINISHED ($ITERATION iterations)   ${NC}"
+echo -e "${GREEN}━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━${NC}"
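Review note on the completion check in the loop above: the script looks for `<promise>DONE</promise>` or `<promise>ALL_DONE</promise>` first in the file written by `--output-last-message`, then in the full iteration log, and reports the last match. A minimal Python sketch of that same decision follows, for reasoning about edge cases; the function name `detect_signal` is hypothetical and the script itself uses `grep -E`.

```python
import re
from typing import Optional

# Same pattern the loop passes to grep -E.
PROMISE_RE = re.compile(r"<promise>(?:ALL_)?DONE</promise>")


def detect_signal(last_message: str, full_log: str = "") -> Optional[str]:
    """Return the last completion signal found, preferring the final message over the log."""
    for text in (last_message, full_log):
        matches = PROMISE_RE.findall(text)
        if matches:
            return matches[-1]  # mirrors `grep -o ... | tail -1`
    return None
```

A `None` result corresponds to the "No completion signal found" branch, which increments the consecutive-failure counter.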
--- a/scripts/ralph-loop.sh
+++ b/scripts/ralph-loop.sh
@@ -0,0 +1,688 @@
+#!/bin/bash
+#
+# Ralph Loop for Claude Code
+#
+# Based on Geoffrey Huntley's Ralph Wiggum methodology:
+# https://github.com/ghuntley/how-to-ralph-wiggum
+#
+# Combined with SpecKit-style specifications.
+#
+# Key principles:
+# - Each iteration picks ONE task/spec to work on
+# - Agent works until acceptance criteria are met
+# - Only outputs <promise>DONE</promise> when truly complete
+# - Bash loop checks for magic phrase before continuing
+# - Fresh context window each iteration
+#
+# Work sources (in priority order):
+# 1. IMPLEMENTATION_PLAN.md (if exists) - pick highest priority task
+# 2. specs/ folder - pick highest priority incomplete spec
+#
+# Usage:
+#   ./scripts/ralph-loop.sh              # Build mode (unlimited)
+#   ./scripts/ralph-loop.sh 20           # Build mode (max 20 iterations)
+#   ./scripts/ralph-loop.sh plan         # Planning mode (creates IMPLEMENTATION_PLAN.md)
+#
+
+set -e
+set -o pipefail
+
+SCRIPT_DIR="$(cd "$(dirname "${BASH_SOURCE[0]}")" && pwd)"
+PROJECT_DIR="$(dirname "$SCRIPT_DIR")"
+LOG_DIR="$PROJECT_DIR/logs"
+CONSTITUTION="$PROJECT_DIR/.specify/memory/constitution.md"
+RLM_DIR="$PROJECT_DIR/rlm"
+RLM_TRACE_DIR="$RLM_DIR/trace"
+RLM_QUERIES_DIR="$RLM_DIR/queries"
+RLM_ANSWERS_DIR="$RLM_DIR/answers"
+RLM_INDEX="$RLM_DIR/index.tsv"
+
+# Configuration
+MAX_ITERATIONS=0  # 0 = unlimited
+MODE="build"
+CLAUDE_CMD="${CLAUDE_CMD:-claude}"
+YOLO_FLAG="--dangerously-skip-permissions"
+RLM_CONTEXT_FILE=""
+TAIL_LINES=5
+TAIL_RENDERED_LINES=0
+ROLLING_OUTPUT_LINES=5
+ROLLING_OUTPUT_INTERVAL=10
+ROLLING_RENDERED_LINES=0
+
+# Colors
+RED='\033[0;31m'
+GREEN='\033[0;32m'
+YELLOW='\033[1;33m'
+BLUE='\033[0;34m'
+PURPLE='\033[0;35m'
+CYAN='\033[0;36m'
+NC='\033[0m'
+
+mkdir -p "$LOG_DIR"
+
+# Check constitution for YOLO setting
+YOLO_ENABLED=true
+if [[ -f "$CONSTITUTION" ]]; then
+    if grep -q "YOLO Mode.*DISABLED" "$CONSTITUTION" 2>/dev/null; then
+        YOLO_ENABLED=false
+    fi
+fi
+
+show_help() {
+    cat <<EOF
+Ralph Loop for Claude Code
+
+Based on Geoffrey Huntley's Ralph Wiggum methodology + SpecKit specs.
+https://github.com/ghuntley/how-to-ralph-wiggum
+
+Usage:
+  ./scripts/ralph-loop.sh              # Build mode, unlimited iterations
+  ./scripts/ralph-loop.sh 20           # Build mode, max 20 iterations  
+  ./scripts/ralph-loop.sh plan         # Planning mode (optional)
+  ./scripts/ralph-loop.sh --rlm-context ./rlm/context.txt
+  ./scripts/ralph-loop.sh --rlm ./rlm/context.txt
+
+Modes:
+  build (default)  Pick spec/task and implement
+  plan             Create IMPLEMENTATION_PLAN.md from specs (OPTIONAL)
+
+Work Sources (checked in order):
+  1. IMPLEMENTATION_PLAN.md - If exists, pick highest priority task
+  2. specs/ folder - Otherwise, pick highest priority incomplete spec
+
+The plan mode is OPTIONAL. Most projects can work directly from specs.
+
+RLM Mode (optional):
+  --rlm-context <file>  Treat a large context file as external environment.
+                        The agent should read slices instead of loading it all.
+  --rlm [file]          Shortcut for --rlm-context (defaults to rlm/context.txt)
+
+How it works:
+  1. Each iteration feeds the mode's prompt file (PROMPT_build.md or PROMPT_plan.md) to Claude via stdin
+  2. Claude picks the HIGHEST PRIORITY incomplete spec/task
+  3. Claude implements, tests, and verifies acceptance criteria
+  4. Claude outputs <promise>DONE</promise> ONLY if criteria are met
+  5. Bash loop checks for the magic phrase
+  6. If found, loop continues to next iteration (fresh context)
+  7. If not found, loop retries
+
+RLM workspace (when enabled):
+  - rlm/trace/     Prompt snapshots + outputs per iteration
+  - rlm/index.tsv  Index of all iterations (timestamp, prompt, log, status)
+  - rlm/queries/ and rlm/answers/  For optional recursive sub-queries
+
+EOF
+}
+
+print_latest_output() {
+    local log_file="$1"
+    local label="${2:-Claude}"
+    local target="/dev/tty"
+
+    [ -f "$log_file" ] || return 0
+
+    if [ ! -w "$target" ]; then
+        target="/dev/stdout"
+    fi
+
+    if [ "$target" = "/dev/tty" ] && [ "$TAIL_RENDERED_LINES" -gt 0 ]; then
+        printf "\033[%dA\033[J" "$TAIL_RENDERED_LINES" > "$target"
+    fi
+
+    {
+        echo "Latest ${label} output (last ${TAIL_LINES} lines):"
+        tail -n "$TAIL_LINES" "$log_file"
+    } > "$target"
+
+    if [ "$target" = "/dev/tty" ]; then
+        TAIL_RENDERED_LINES=$((TAIL_LINES + 1))
+    fi
+}
+
+watch_latest_output() {
+    local log_file="$1"
+    local label="${2:-Claude}"
+    local target="/dev/tty"
+    local use_tty=false
+    local use_tput=false
+
+    [ -f "$log_file" ] || return 0
+
+    if [ ! -w "$target" ]; then
+        target="/dev/stdout"
+    else
+        use_tty=true
+        if command -v tput &>/dev/null; then
+            use_tput=true
+        fi
+    fi
+
+    if [ "$use_tty" = true ]; then
+        if [ "$use_tput" = true ]; then
+            tput cr > "$target"
+            tput sc > "$target"
+        else
+            printf "\r\0337" > "$target"
+        fi
+    fi
+
+    while true; do
+        local timestamp
+        timestamp=$(date '+%Y-%m-%d %H:%M:%S')
+
+        if [ "$use_tty" = true ]; then
+            if [ "$use_tput" = true ]; then
+                tput rc > "$target"
+                tput ed > "$target"
+                tput cr > "$target"
+            else
+                printf "\0338\033[J\r" > "$target"
+            fi
+        fi
+
+        {
+            echo -e "${CYAN}[$timestamp] Latest ${label} output (last ${ROLLING_OUTPUT_LINES} lines):${NC}"
+            if [ ! -s "$log_file" ]; then
+                echo "(no output yet)"
+            else
+                tail -n "$ROLLING_OUTPUT_LINES" "$log_file" 2>/dev/null || true
+            fi
+            echo ""
+        } > "$target"
+
+        sleep "$ROLLING_OUTPUT_INTERVAL"
+    done
+}
+
+# Parse arguments
+while [[ $# -gt 0 ]]; do
+    case "$1" in
+        plan)
+            MODE="plan"
+            if [[ "${2:-}" =~ ^[0-9]+$ ]]; then
+                MAX_ITERATIONS="$2"
+                shift 2
+            else
+                MAX_ITERATIONS=1
+                shift
+            fi
+            ;;
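+        --rlm-context)
+            if [[ -z "${2:-}" ]]; then
+                echo -e "${RED}Error: --rlm-context requires a file argument${NC}"
+                exit 1
+            fi
+            RLM_CONTEXT_FILE="$2"
+            shift 2
+            ;;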
+        --rlm)
+            if [[ -n "${2:-}" && "${2:0:1}" != "-" ]]; then
+                RLM_CONTEXT_FILE="$2"
+                shift 2
+            else
+                RLM_CONTEXT_FILE="rlm/context.txt"
+                shift
+            fi
+            ;;
+        -h|--help)
+            show_help
+            exit 0
+            ;;
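+        [0-9]*)
+            # The glob above also matches values like "5abc"; require digits only
+            if [[ ! "$1" =~ ^[0-9]+$ ]]; then
+                echo -e "${RED}Invalid iteration count: $1${NC}"
+                exit 1
+            fi
+            MODE="build"
+            MAX_ITERATIONS="$1"
+            shift
+            ;;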
+        *)
+            echo -e "${RED}Unknown argument: $1${NC}"
+            show_help
+            exit 1
+            ;;
+    esac
+done
+
+cd "$PROJECT_DIR"
+
+# Validate RLM context file (if provided)
+if [ -n "$RLM_CONTEXT_FILE" ] && [ ! -f "$RLM_CONTEXT_FILE" ]; then
+    echo -e "${RED}Error: RLM context file not found: $RLM_CONTEXT_FILE${NC}"
+    echo "Create it first (example):"
+    echo "  mkdir -p rlm && printf \"%s\" \"<your long context>\" > $RLM_CONTEXT_FILE"
+    exit 1
+fi
+
+# Initialize RLM workspace (optional)
+if [ -n "$RLM_CONTEXT_FILE" ]; then
+    mkdir -p "$RLM_TRACE_DIR" "$RLM_QUERIES_DIR" "$RLM_ANSWERS_DIR"
+    if [ ! -f "$RLM_INDEX" ]; then
+        echo -e "timestamp\tmode\titeration\tprompt\tlog\toutput\tstatus" > "$RLM_INDEX"
+    fi
+fi
+
+# Session log (captures ALL output)
+SESSION_LOG="$LOG_DIR/ralph_${MODE}_session_$(date '+%Y%m%d_%H%M%S').log"
+exec > >(tee -a "$SESSION_LOG") 2>&1
+
+# Check if Claude CLI is available
+if ! command -v "$CLAUDE_CMD" &> /dev/null; then
+    echo -e "${RED}Error: Claude CLI not found${NC}"
+    echo ""
+    echo "Install Claude Code CLI and authenticate first."
+    echo "https://claude.ai/code"
+    exit 1
+fi
+
+# Determine which prompt to use based on mode
+if [ "$MODE" = "plan" ]; then
+    PROMPT_FILE="PROMPT_plan.md"
+else
+    PROMPT_FILE="PROMPT_build.md"
+fi
+
+# Create/update the build prompt to be flexible about plan vs specs
+cat > "PROMPT_build.md" << 'BUILDEOF'
+# Ralph Build Mode
+
+Based on Geoffrey Huntley's Ralph Wiggum methodology.
+
+---
+
+## Phase 0: Orient
+
+Read `.specify/memory/constitution.md` to understand project principles and constraints.
+
+---
+BUILDEOF
+
+# Optional RLM context block.
+# The heredoc is unquoted so $RLM_CONTEXT_FILE expands; backticks are escaped
+# so the shell does not run them as command substitutions.
+if [ -n "$RLM_CONTEXT_FILE" ]; then
+cat >> "PROMPT_build.md" << EOF
+
+## Phase 0d: RLM Context (Optional)
+
+You have access to a large context file at:
+**$RLM_CONTEXT_FILE**
+
+Treat this file as an external environment. Do NOT paste the whole file into the prompt.
+Instead, inspect it programmatically and recursively:
+
+- Use small slices:
+  \`\`\`bash
+  sed -n 'START,ENDp' "$RLM_CONTEXT_FILE"
+  \`\`\`
+- Or Python snippets:
+  \`\`\`bash
+  python - <<'PY'
+  from pathlib import Path
+  p = Path("$RLM_CONTEXT_FILE")
+  print(p.read_text().splitlines()[START:END])
+  PY
+  \`\`\`
+- Use search:
+  \`\`\`bash
+  rg -n "pattern" "$RLM_CONTEXT_FILE"
+  \`\`\`
+
+Goal: decompose the task into smaller sub-queries and only load the pieces you need.
+This mirrors the Recursive Language Model approach from https://arxiv.org/html/2512.24601v1
+
+## RLM Workspace (Optional)
+
+Past loop outputs are preserved on disk:
+- Iteration logs: \`logs/\`
+- Prompt/output snapshots: \`rlm/trace/\`
+- Iteration index: \`rlm/index.tsv\`
+
+Use these as an external memory store (search/slice as needed).
+If you need a recursive sub-query, write a focused prompt in \`rlm/queries/\`,
+run:
+  \`./scripts/rlm-subcall.sh --query rlm/queries/<file>.md\`
+and store the result in \`rlm/answers/\`.
+EOF
+fi
+
+cat >> "PROMPT_build.md" << 'BUILDEOF'
+
+## Phase 1: Discover Work Items
+
+Search for incomplete work from these sources (in order):
+
+1. **specs/ folder** — Look for `.md` files NOT marked `## Status: COMPLETE`
+2. **IMPLEMENTATION_PLAN.md** — If exists, find unchecked `- [ ]` tasks
+3. **GitHub Issues** — Check for open issues (if this is a GitHub repo)
+4. **Any task tracker** — Jira, Linear, etc. if configured
+
+Pick the **HIGHEST PRIORITY** incomplete item:
+- Lower numbers = higher priority (001 before 010)
+- `[HIGH]` before `[MEDIUM]` before `[LOW]`
+- Bugs/blockers before features
+
+Before implementing, search the codebase to verify it's not already done.
+
+---
+
+## Phase 1b: Re-Verification Mode (No Incomplete Work Found)
+
+**If ALL specs appear complete**, don't just exit — do a quality check:
+
+1. **Randomly pick** one completed spec from `specs/`
+2. **Strictly re-verify** ALL its acceptance criteria:
+   - Run the actual tests mentioned in the spec
+   - Manually verify each criterion is truly met
+   - Check edge cases
+   - Look for regressions
+3. **If any criterion fails**: Unmark the spec as complete and fix it
+4. **If all pass**: Output `<promise>DONE</promise>` to confirm quality
+
+This ensures the codebase stays healthy even when "nothing to do."
+
+---
+
+## Phase 2: Implement
+
+Implement the selected spec/task completely:
+- Follow the spec's requirements exactly
+- Write clean, maintainable code
+- Add tests as needed
+
+---
+
+## Phase 3: Validate
+
+Run the project's test suite and verify:
+- All tests pass
+- No lint errors
+- The spec's acceptance criteria are 100% met
+
+---
+
+## Phase 4: Commit & Update
+
+1. Mark the spec/task as complete (add `## Status: COMPLETE` to spec file)
+2. `git add -A`
+3. `git commit` with a descriptive message
+4. `git push`
+
+---
+
+## Completion Signal
+
+**CRITICAL:** Only output the magic phrase when the work is 100% complete.
+
+Check:
+- [ ] Implementation matches all requirements
+- [ ] All tests pass
+- [ ] All acceptance criteria verified
+- [ ] Changes committed and pushed
+- [ ] Spec marked as complete
+
+**If ALL checks pass, output:** `<promise>DONE</promise>`
+
+**If ANY check fails:** Fix the issue and try again. Do NOT output the magic phrase.
+BUILDEOF
+
+# Create planning prompt (only used if plan mode is explicitly requested)
+cat > "PROMPT_plan.md" << 'PLANEOF'
+# Ralph Planning Mode (OPTIONAL)
+
+This mode is OPTIONAL. Most projects work fine directly from specs.
+
+Only use this when you want a detailed breakdown of specs into smaller tasks.
+
+---
+
+## Phase 0: Orient
+
+0a. Read `.specify/memory/constitution.md` for project principles.
+
+0b. Study `specs/` to learn all feature specifications.
+
+---
+PLANEOF
+
+# Optional RLM context block for planning.
+# Unquoted heredoc: $RLM_CONTEXT_FILE expands, backticks are escaped.
+if [ -n "$RLM_CONTEXT_FILE" ]; then
+cat >> "PROMPT_plan.md" << EOF
+
+## Phase 0c: RLM Context (Optional)
+
+You have access to a large context file at:
+**$RLM_CONTEXT_FILE**
+
+Treat this file as an external environment. Do NOT paste the whole file into the prompt.
+Inspect only the slices you need using shell tools or Python.
+This mirrors the Recursive Language Model approach from https://arxiv.org/html/2512.24601v1
+
+## RLM Workspace (Optional)
+
+Past loop outputs are preserved on disk:
+- Iteration logs: \`logs/\`
+- Prompt/output snapshots: \`rlm/trace/\`
+- Iteration index: \`rlm/index.tsv\`
+
+Use these as an external memory store (search/slice as needed).
+For recursive sub-queries, use:
+  \`./scripts/rlm-subcall.sh --query rlm/queries/<file>.md\`
+EOF
+fi
+
+cat >> "PROMPT_plan.md" << 'PLANEOF'
+
+## Phase 1: Gap Analysis
+
+Compare specs against current codebase:
+- What's fully implemented?
+- What's partially done?
+- What's not started?
+- What has issues or bugs?
+
+---
+
+## Phase 2: Create Plan
+
+Create `IMPLEMENTATION_PLAN.md` with a prioritized task list:
+
+```markdown
+# Implementation Plan
+
+> Auto-generated breakdown of specs into tasks.
+> Delete this file to return to working directly from specs.
+
+## Priority Tasks
+
+- [ ] [HIGH] Task description - from spec NNN
+- [ ] [HIGH] Task description - from spec NNN
+- [ ] [MEDIUM] Task description
+- [ ] [LOW] Task description
+
+## Completed
+
+- [x] Completed task
+```
+
+Prioritize by:
+1. Dependencies (do prerequisites first)
+2. Impact (high-value features first)
+3. Complexity (mix easy wins with harder tasks)
+
+---
+
+## Completion Signal
+
+When the plan is complete and saved:
+
+`<promise>DONE</promise>`
+PLANEOF
+
+# Check prompt file exists
+if [ ! -f "$PROMPT_FILE" ]; then
+    echo -e "${RED}Error: $PROMPT_FILE not found${NC}"
+    exit 1
+fi
+
+# Build Claude flags
+CLAUDE_FLAGS="-p"
+if [ "$YOLO_ENABLED" = true ]; then
+    CLAUDE_FLAGS="$CLAUDE_FLAGS $YOLO_FLAG"
+fi
+
+# Get current branch
+CURRENT_BRANCH=$(git branch --show-current 2>/dev/null || echo "main")
+
+# Check for work sources - count .md files in specs/
+HAS_PLAN=false
+HAS_SPECS=false
+SPEC_COUNT=0
+[ -f "IMPLEMENTATION_PLAN.md" ] && HAS_PLAN=true
+if [ -d "specs" ]; then
+    # BSD/macOS wc pads its count with spaces; strip them for clean output
+    SPEC_COUNT=$(find specs -maxdepth 1 -name "*.md" -type f 2>/dev/null | wc -l | tr -d '[:space:]')
+    [ "$SPEC_COUNT" -gt 0 ] && HAS_SPECS=true
+fi
+
+echo ""
+echo -e "${GREEN}━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━${NC}"
+echo -e "${GREEN}              RALPH LOOP (Claude Code) STARTING              ${NC}"
+echo -e "${GREEN}━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━${NC}"
+echo ""
+echo -e "${BLUE}Mode:${NC}     $MODE"
+echo -e "${BLUE}Prompt:${NC}   $PROMPT_FILE"
+echo -e "${BLUE}Branch:${NC}   $CURRENT_BRANCH"
+echo -e "${YELLOW}YOLO:${NC}     $([ "$YOLO_ENABLED" = true ] && echo "ENABLED" || echo "DISABLED")"
+[ -n "$RLM_CONTEXT_FILE" ] && echo -e "${BLUE}RLM:${NC}      $RLM_CONTEXT_FILE"
+[ -n "$SESSION_LOG" ] && echo -e "${BLUE}Log:${NC}      $SESSION_LOG"
+[ $MAX_ITERATIONS -gt 0 ] && echo -e "${BLUE}Max:${NC}      $MAX_ITERATIONS iterations"
+echo ""
+echo -e "${BLUE}Work source:${NC}"
+if [ "$HAS_PLAN" = true ]; then
+    echo -e "  ${GREEN}✓${NC} IMPLEMENTATION_PLAN.md (will use this)"
+else
+    echo -e "  ${YELLOW}○${NC} IMPLEMENTATION_PLAN.md (not found, that's OK)"
+fi
+if [ "$HAS_SPECS" = true ]; then
+    echo -e "  ${GREEN}✓${NC} specs/ folder ($SPEC_COUNT specs)"
+else
+    echo -e "  ${RED}✗${NC} specs/ folder (no .md files found)"
+fi
+echo ""
+echo -e "${CYAN}The loop checks for <promise>DONE</promise> in each iteration.${NC}"
+echo -e "${CYAN}Agent must verify acceptance criteria before outputting it.${NC}"
+echo ""
+echo -e "${YELLOW}Press Ctrl+C to stop the loop${NC}"
+echo ""
+
+ITERATION=0
+CONSECUTIVE_FAILURES=0
+MAX_CONSECUTIVE_FAILURES=3
+
+while true; do
+    # Check max iterations
+    if [ $MAX_ITERATIONS -gt 0 ] && [ $ITERATION -ge $MAX_ITERATIONS ]; then
+        echo -e "${GREEN}Reached max iterations: $MAX_ITERATIONS${NC}"
+        break
+    fi
+
+    ITERATION=$((ITERATION + 1))
+    TIMESTAMP=$(date '+%Y-%m-%d %H:%M:%S')
+
+    echo ""
+    echo -e "${PURPLE}════════════════════ LOOP $ITERATION ════════════════════${NC}"
+    echo -e "${BLUE}[$TIMESTAMP]${NC} Starting iteration $ITERATION"
+    echo ""
+
+    # Log file for this iteration
+    LOG_FILE="$LOG_DIR/ralph_${MODE}_iter_${ITERATION}_$(date '+%Y%m%d_%H%M%S').log"
+    : > "$LOG_FILE"
+    WATCH_PID=""
+
+    if [ "$ROLLING_OUTPUT_INTERVAL" -gt 0 ] && [ "$ROLLING_OUTPUT_LINES" -gt 0 ] && [ -t 1 ] && [ -w /dev/tty ]; then
+        watch_latest_output "$LOG_FILE" "Claude" &
+        WATCH_PID=$!
+    fi
+    RLM_STATUS="unknown"
+
+    # Snapshot prompt (optional RLM workspace)
+    if [ -n "$RLM_CONTEXT_FILE" ]; then
+        RLM_PROMPT_SNAPSHOT="$RLM_TRACE_DIR/iter_${ITERATION}_prompt.md"
+        cp "$PROMPT_FILE" "$RLM_PROMPT_SNAPSHOT"
+    fi
+
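+    # Run Claude with prompt via stdin, capture output.
+    # Note: unless `set -o pipefail` is in effect, this condition reflects
+    # tee's exit status rather than Claude's.
+    CLAUDE_OUTPUT=""
+    if CLAUDE_OUTPUT=$("$CLAUDE_CMD" $CLAUDE_FLAGS < "$PROMPT_FILE" 2>&1 | tee "$LOG_FILE"); then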
+        if [ -n "$WATCH_PID" ]; then
+            kill "$WATCH_PID" 2>/dev/null || true
+            wait "$WATCH_PID" 2>/dev/null || true
+        fi
+        echo ""
+        echo -e "${GREEN}✓ Claude execution completed${NC}"
+        
+        # Check if DONE promise was output (accept both DONE and ALL_DONE variants)
+        if echo "$CLAUDE_OUTPUT" | grep -qE "<promise>(ALL_)?DONE</promise>"; then
+            DETECTED_SIGNAL=$(echo "$CLAUDE_OUTPUT" | grep -oE "<promise>(ALL_)?DONE</promise>" | tail -1)
+            echo -e "${GREEN}✓ Completion signal detected: ${DETECTED_SIGNAL}${NC}"
+            echo -e "${GREEN}✓ Task completed successfully!${NC}"
+            CONSECUTIVE_FAILURES=0
+            RLM_STATUS="done"
+            
+            # For planning mode, stop after one successful plan
+            if [ "$MODE" = "plan" ]; then
+                echo ""
+                echo -e "${GREEN}Planning complete!${NC}"
+                echo -e "${CYAN}Run './scripts/ralph-loop.sh' to start building.${NC}"
+                echo -e "${CYAN}Or delete IMPLEMENTATION_PLAN.md to work directly from specs.${NC}"
+                break
+            fi
+        else
+            echo -e "${YELLOW}⚠ No completion signal found${NC}"
+            echo -e "${YELLOW}  Agent did not output <promise>DONE</promise> or <promise>ALL_DONE</promise>${NC}"
+            echo -e "${YELLOW}  This usually means acceptance criteria were not met.${NC}"
+            echo -e "${YELLOW}  Retrying in next iteration...${NC}"
+            CONSECUTIVE_FAILURES=$((CONSECUTIVE_FAILURES + 1))
+            RLM_STATUS="incomplete"
+            print_latest_output "$LOG_FILE" "Claude"
+            
+            if [ $CONSECUTIVE_FAILURES -ge $MAX_CONSECUTIVE_FAILURES ]; then
+                echo ""
+                echo -e "${RED}⚠ $MAX_CONSECUTIVE_FAILURES consecutive iterations without completion.${NC}"
+                echo -e "${RED}  The agent may be stuck. Consider:${NC}"
+                echo -e "${RED}  - Checking the logs in $LOG_DIR${NC}"
+                echo -e "${RED}  - Simplifying the current spec${NC}"
+                echo -e "${RED}  - Manually fixing blocking issues${NC}"
+                echo ""
+                CONSECUTIVE_FAILURES=0
+            fi
+        fi
+    else
+        if [ -n "$WATCH_PID" ]; then
+            kill "$WATCH_PID" 2>/dev/null || true
+            wait "$WATCH_PID" 2>/dev/null || true
+        fi
+        echo -e "${RED}✗ Claude execution failed${NC}"
+        echo -e "${YELLOW}Check log: $LOG_FILE${NC}"
+        CONSECUTIVE_FAILURES=$((CONSECUTIVE_FAILURES + 1))
+        RLM_STATUS="error"
+        print_latest_output "$LOG_FILE" "Claude"
+    fi
+
+    # Record iteration in RLM index (optional)
+    if [ -n "$RLM_CONTEXT_FILE" ]; then
+        RLM_PROMPT_PATH="${RLM_PROMPT_SNAPSHOT:-}"
+        RLM_OUTPUT_SNAPSHOT="$RLM_TRACE_DIR/iter_${ITERATION}_output.log"
+        cp "$LOG_FILE" "$RLM_OUTPUT_SNAPSHOT"
+        echo -e "${TIMESTAMP}\t${MODE}\t${ITERATION}\t${RLM_PROMPT_PATH}\t${LOG_FILE}\t${RLM_OUTPUT_SNAPSHOT}\t${RLM_STATUS}" >> "$RLM_INDEX"
+    fi
+
+    # Push changes after each iteration (if any)
+    git push origin "$CURRENT_BRANCH" 2>/dev/null || {
+        if git log "origin/$CURRENT_BRANCH..HEAD" --oneline 2>/dev/null | grep -q .; then
+            echo -e "${YELLOW}Push failed, creating remote branch...${NC}"
+            git push -u origin "$CURRENT_BRANCH" 2>/dev/null || true
+        fi
+    }
+
+    # Brief pause between iterations
+    echo ""
+    echo -e "${BLUE}Waiting 2s before next iteration...${NC}"
+    sleep 2
+done
+
+echo ""
+echo -e "${GREEN}━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━${NC}"
+echo -e "${GREEN}         RALPH LOOP FINISHED ($ITERATION iterations)         ${NC}"
+echo -e "${GREEN}━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━${NC}"
--- a/specs/001-core-ui-camera.md
+++ b/specs/001-core-ui-camera.md
@@ -0,0 +1,41 @@
+# Feature: Core UI and Camera Feed (PyObjC)
+
+## Status: COMPLETE
+
+## Description
+Create the main application window using PyObjC (AppKit) and display a live camera feed. This ensures a native macOS look and feel.
+
+## Requirements
+
+1.  **App & Window Setup (AppKit)**:
+    - Initialize `NSApplication`.
+    - Create a main `NSWindow` titled "ItemSense".
+    - Size: 800x600 (resizable).
+    - Window should center on screen.
+
+2.  **UI Layout**:
+    - Use `NSStackView` (vertical) or manual constraints to layout:
+        - Top: Video Feed (`NSImageView`).
+        - Bottom: "Capture" button (`NSButton`).
+
+3.  **Camera Feed**:
+    - Use `opencv-python` to capture frames from webcam (index 0).
+    - Convert frames (`cv2` BGR -> RGB) to `NSImage`/`CGImage`.
+    - Update the `NSImageView` at ~30 FPS using a timer (`NSTimer` or equivalent app loop integration).
+
+4.  **Capture Button**:
+    - Standard macOS Push Button.
+    - Label: "Capture".
+    - Action: Print "Capture clicked" to console.
+
+5.  **Lifecycle**:
+    - Ensure `Cmd+Q` works.
+    - Ensure closing the window terminates the app (or at least the `applicationShouldTerminateAfterLastWindowClosed:` delegate method returns True).
+
+## Acceptance Criteria
+
+- [ ] App launches with a native macOS window "ItemSense".
+- [ ] Live camera feed is visible in the view.
+- [ ] "Capture" button is visible at the bottom.
+- [ ] Clicking "Capture" prints to console.
+- [ ] App exits cleanly on window close or Cmd+Q.
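Note on spec 001, requirement 3: OpenCV delivers frames in BGR channel order, so the channels must be swapped before the pixels are handed to AppKit. In the app this would normally be `cv2.cvtColor(frame, cv2.COLOR_BGR2RGB)`. The sketch below shows the same swap on a raw byte buffer using only the standard library; `bgr_to_rgb` is a hypothetical helper name, and the buffer is assumed to be tightly packed 8-bit data with no row padding.

```python
def bgr_to_rgb(bgr: bytes) -> bytes:
    """Swap the B and R channels of a tightly packed 8-bit BGR buffer."""
    if len(bgr) % 3 != 0:
        raise ValueError("buffer length must be a multiple of 3")
    rgb = bytearray(bgr)
    rgb[0::3] = bgr[2::3]  # every third byte starting at 0 becomes R
    rgb[2::3] = bgr[0::3]  # every third byte starting at 2 becomes B
    return bytes(rgb)


# One pure-blue pixel followed by one pure-red pixel, in BGR order.
frame = bytes([255, 0, 0, 0, 0, 255])
converted = bgr_to_rgb(frame)
```

The green channel stays in place, so applying the function twice returns the original buffer.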
--- a/specs/002-openai-integration.md
+++ b/specs/002-openai-integration.md
@@ -0,0 +1,29 @@
+# Feature: OpenAI Vision Integration (PyObjC)
+
+## Description
+Implement the logic to capture a frame from the AppKit interface and send it to OpenAI's API.
+
+## Requirements
+
+1.  **Image Handling**:
+    - On "Capture" click:
+        - Stop/Pause the live feed update.
+        - Store the current frame (in memory).
+        - Show "Processing..." (e.g., change the button text or add a label).
+
+2.  **OpenAI API Call**:
+    - Handle the request asynchronously so the UI thread is never blocked (no spinning beachball).
+    - Run the API request in a background thread (`threading`).
+    - Model: `gpt-5-mini` (fallback `gpt-4o-mini`).
+    - Prompt: "What is this item? Please provide a brief description."
+
+3.  **Response Handling**:
+    - When response returns, schedule a UI update on the main thread (`performSelectorOnMainThread:` or `dispatch_async`).
+    - Print response to console (UI display comes in Spec 003).
+
+## Acceptance Criteria
+
+- [ ] UI remains responsive (no beachball) during API call.
+- [ ] "Processing..." indication is shown.
+- [ ] Image frame is correctly sent to OpenAI.
+- [ ] Text response is received and printed to console.
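Note on spec 002, requirements 2 and 3: the pattern is "run the request on a worker thread, hand the reply back to the UI thread". A minimal standard-library sketch of that hand-off follows. `fake_vision_request` is a hypothetical stand-in for the real API call, and a `queue.Queue` stands in for the main-thread dispatch that the spec names (`performSelectorOnMainThread:` or `dispatch_async`).

```python
import queue
import threading


def fake_vision_request(frame: bytes) -> str:
    """Hypothetical stand-in for the real API call (no network access)."""
    return f"described {len(frame)} bytes"


def analyze_in_background(frame: bytes, results: "queue.Queue[str]") -> threading.Thread:
    """Run the request off the calling thread and post the reply to `results`."""
    def worker() -> None:
        results.put(fake_vision_request(frame))

    thread = threading.Thread(target=worker, daemon=True)
    thread.start()
    return thread


results: "queue.Queue[str]" = queue.Queue()
worker_thread = analyze_in_background(b"\x00" * 12, results)
# The caller is not blocked by the request itself; in the app the run loop
# would keep servicing events here until the reply is delivered.
reply = results.get(timeout=5)
worker_thread.join()
```

The worker never touches UI state directly; it only posts a value, which keeps all view updates on one thread.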
--- a/specs/003-result-display.md
+++ b/specs/003-result-display.md
@@ -0,0 +1,25 @@
+# Feature: Result Display (PyObjC)
+
+## Description
+Display the analysis results natively in the AppKit UI.
+
+## Requirements
+
+1.  **Result UI**:
+    - Add a scrollable `NSTextView` (within an `NSScrollView`) below the image view.
+    - Initially empty or hidden.
+
+2.  **Workflow**:
+    - **Live Mode**: Camera active, Button says "Capture", Text view hidden/empty.
+    - **Processing Mode**: Live feed paused, "Processing..." indication shown (see Spec 002).
+    - **Result Mode**: Camera paused on captured frame, Button says "Scan Another", Text view shows description.
+
+3.  **Data Binding**:
+    - Update the `NSTextView` string with the OpenAI response.
+    - Clicking "Scan Another" resets the UI to **Live Mode**.
+
+## Acceptance Criteria
+
+- [ ] App cycles correctly: Capture -> Result -> Scan Another -> Capture.
+- [ ] Result text is readable in a native macOS scroll view.
+- [ ] Layout remains sane when the window is resized.
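Note on spec 003, requirement 2: the three modes form a small state machine, which can be sketched as a transition table. The event names and the `next_mode` helper are hypothetical, and using "Processing..." as the button title is an assumption (Spec 002 leaves the exact indication open).

```python
from enum import Enum


class Mode(Enum):
    LIVE = "live"
    PROCESSING = "processing"
    RESULT = "result"


# (current mode, event) -> next mode. Event names are hypothetical.
TRANSITIONS = {
    (Mode.LIVE, "capture_clicked"): Mode.PROCESSING,
    (Mode.PROCESSING, "response_received"): Mode.RESULT,
    (Mode.RESULT, "scan_another_clicked"): Mode.LIVE,
}

# Button titles per mode; the PROCESSING title is an assumption.
BUTTON_TITLES = {
    Mode.LIVE: "Capture",
    Mode.PROCESSING: "Processing...",
    Mode.RESULT: "Scan Another",
}


def next_mode(mode: Mode, event: str) -> Mode:
    """Return the next mode; events that are invalid in `mode` are ignored."""
    return TRANSITIONS.get((mode, event), mode)


# Walk one full cycle: Capture -> Result -> Scan Another -> Capture.
mode = Mode.LIVE
for event in ("capture_clicked", "response_received", "scan_another_clicked"):
    mode = next_mode(mode, event)
```

Ignoring invalid events means a second click during Processing Mode cannot start a duplicate request.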