𧬠Genesis Self-Improvement Architecture
Complete system architecture for continuous learning
High-Level Architecture
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β GENESIS ECOSYSTEM β
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
βββββββββββββββ βββββββββββββββ βββββββββββββββ βββββββββββββββ
β USER ββββββΆβ GENESIS ββββββΆβ EXECUTOR ββββββΆβ FEEDBACK β
β PROMPTS β β GENERATES β β RUNS CODE β β CAPTURES β
βββββββββββββββ βββββββββββββββ βββββββββββββββ ββββββββ¬βββββββ
β
βΌ
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β LEARNING LAYER β
β βββββββββββββ βββββββββββββ βββββββββββββ βββββββββββββ β
β βCorrectionsβ β Patterns β β Errors β β Successes β β
β βββββββ¬ββββββ βββββββ¬ββββββ βββββββ¬ββββββ βββββββ¬ββββββ β
β βββββββββββββββββββΌββββββββββββββββββΌββββββββββββββ β
β βΌ β β
β ββββββββββββ β β
β β LEARNER ββββββββββββββ β
β ββββββ¬ββββββ β
β β β
β βΌ β
β ββββββββββββββββββββββββ β
β β WEAVIATE CORPUS β β
β β (Semantic Storage) β β
β ββββββββββββ¬ββββββββββββ β
βββββββββββββββββββββββββββΌββββββββββββββββββββββββββββββββββββββββββββββββ
β
βΌ
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β SELF-IMPROVEMENT DAEMON β
β β
β βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β
β β MONITORS (every 60 minutes) β β
β β β Learning count: 45/100 β β
β β β Days since training: 1/7 β β
β β β Performance: 87% / 90% β β
β βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β
β β β
β βΌ (Threshold reached) β
β βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β
β β TRIGGERS β β
β β 1. Load corpus from Weaviate β β
β β 2. Run training pipeline β β
β β 3. Evaluate new model β β
β β 4. Deploy if improved β β
β βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β
βββββββββββββββββββββββββββββββββββββ¬ββββββββββββββββββββββββββββββββββββββ
β
βΌ
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β TRAINING PIPELINE β
β ββββββββββββββ ββββββββββββββ ββββββββββββββ ββββββββββββββ β
β β Load ββββΆβ Prepare ββββΆβ Fine-Tune ββββΆβ Save β β
β β Corpus β β Data β β (LoRA) β β Model β β
β ββββββββββββββ ββββββββββββββ ββββββββββββββ ββββββββββββββ β
β β
β Outputs: β
β β’ New model: genesis-YYYYMMDD-HHMMSS β
β β’ Metrics: accuracy, perplexity, BLEU β
β β’ Checkpoints: For rollback β
βββββββββββββββββββββββββββββββββββββ¬ββββββββββββββββββββββββββββββββββββββ
β
βΌ
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β MODEL EVALUATOR β
β ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β
β β Compare Models: β β
β β β’ Current: genesis-20251201 (85% accuracy) β β
β β β’ New: genesis-20251211 (87% accuracy) β β
β β β’ Improvement: +2% β (threshold: 1%) β β
β ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β
βββββββββββββββββββββββββββββββββββββ¬ββββββββββββββββββββββββββββββββββββββ
β
βΌ (If improved)
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β DEPLOYMENT β
β ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β
β β 1. Deploy new model to Ollama β β
β β 2. Update state: β β
β β β’ current_model = genesis-20251211 β β
β β β’ training_runs += 1 β β
β β β’ improvement_score = 0.87 β β
β β β’ learnings_since_training = 0 (reset) β β
β β 3. Log metrics to Prometheus β β
β ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β
βββββββββββββββββββββββββββββββββββββ¬ββββββββββββββββββββββββββββββββββββββ
β
βΌ
βββββββββββββββββββ
β CYCLE COMPLETE β
β (Back to top) β
βββββββββββββββββββ
Component Details
1. User Interaction Layer
USER
β (sends prompt)
GENESIS CHAT HANDLER
β (generates code)
GENESIS EXECUTOR
β (executes code)
FEEDBACK CAPTURE
β (corrections/patterns/errors)
LEARNER
2. Learning Layer
βββββββββββββββββββββββββββββββββββββββ
β GENESIS LEARNER β
βββββββββββββββββββββββββββββββββββββββ€
β Captures: β
β β’ User corrections β
β β’ Cursor fix patterns β
β β’ Error patterns β
β β’ Successful patterns β
βββββββββββββββββββββββββββββββββββββββ€
β Storage: β
β β’ File: genesis_learnings.json β
β β’ Neo4j: GenesisLearning nodes β
β β’ Weaviate: Semantic embeddings β
β β’ Redis: Fast lookup cache β
βββββββββββββββββββββββββββββββββββββββ
3. Self-Improvement Daemon
βββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β SELF-IMPROVEMENT DAEMON (runs continuously) β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββ€
β Check interval: 60 minutes β
β State file: genesis_improvement_state.json β
β Metrics port: 9127 (Prometheus) β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββ€
β Components: β
β β’ StateManager: Load/save state β
β β’ LearningsMonitor: Count new learnings β
β β’ TriggerEvaluator: Decide when to trigger β
β β’ TrainingOrchestrator: Run training β
β β’ ModelEvaluator: Compare models β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββ€
β Triggers: β
β β’ Learning threshold: 100 new learnings β
β β’ Time threshold: 62 days β
β β’ Performance threshold: < 90% accuracy β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββ
4. Training Pipeline
βββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β TRAINING PIPELINE (2 hours) β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββ€
β Phase 1: Load Corpus β
β β’ Query Weaviate for code examples β
β β’ Filter by quality score (>= 0.7) β
β β’ Prepare instruction-following format β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββ€
β Phase 2: Hyperparameter Optimization β
β β’ Use H2O AutoML β
β β’ Optimize learning rate, batch size β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββ€
β Phase 3: Fine-Tuning β
β β’ Base model: Qwen 235B β
β β’ Method: LoRA (efficient fine-tuning) β
β β’ Config: rank=8, alpha=16, dropout=0.05 β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββ€
β Phase 4: Save β
β β’ Model: genesis-YYYYMMDD-HHMMSS β
β β’ Metrics: accuracy, perplexity, BLEU β
β β’ Checkpoints: For rollback β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββ
5. Model Evaluation
βββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β MODEL EVALUATOR β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββ€
β Test prompts: β
β β’ "Write a Python function to calculate factorial" β
β β’ "Write a FastAPI endpoint for auth" β
β β’ "Write a function to merge sorted arrays" β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββ€
β Metrics: β
β β’ Accuracy: % of successful generations β
β β’ Perplexity: Model confidence β
β β’ BLEU score: Code quality β
β β’ Code execution success rate β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββ€
β Comparison: β
β β’ Current model accuracy β
β β’ New model accuracy β
β β’ Improvement delta β
β β’ Deployment decision (> 1% improvement) β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββ
Data Flow
1. Learning Accumulation
User Correction
β
GenesisLearner.capture_correction()
β
Store in:
β’ data/genesis_learnings.json (file)
β’ Weaviate (semantic search)
β’ Neo4j (relationships)
β’ Redis (cache)
β
learnings_since_training += 1
2. Trigger Evaluation
Daemon wakes up (every 60 min)
β
Load state from genesis_improvement_state.json
β
Count new learnings
β
Calculate days since training
β
Evaluate triggers:
IF learnings >= 100 OR days >= 7:
TRIGGER = True
ELSE:
TRIGGER = False
3. Training Execution
TRIGGER = True
β
TrainingOrchestrator.trigger_training()
β
subprocess.run(["python3", "genesis-training-pipeline.py"])
β
Wait for completion (max 2 hours)
β
Parse training results
β
Get new model name from models/genesis/model_name.txt
4. Model Comparison
ModelEvaluator.compare_models(current, new)
β
Run eval harness on both models
β
Calculate:
improvement = new_accuracy - current_accuracy
β
IF improvement > 0.01: # 1% threshold
Deploy new model
ELSE:
Keep current model
5. State Update
IF model deployed:
state.current_model = new_model
state.training_runs += 1
state.improvement_score = new_accuracy
state.last_training_time = now()
state.learnings_since_training = 0 # RESET
β
Save state to genesis_improvement_state.json
β
Log metrics to Prometheus
API Integration
βββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β GENESIS API ROUTER β
β (api/routers/genesis.py) β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββ€
β GET /api/v1/genesis/improvement/status β
β β’ Returns current improvement state β
β β’ Shows learnings count, current model β
β β’ Indicates if should trigger β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββ€
β POST /api/v1/genesis/improvement/trigger β
β β’ Manually trigger improvement cycle β
β β’ Bypasses automatic triggers β
β β’ Runs in background β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββ€
β GET /api/v1/genesis/feedback/statistics β
β β’ Shows feedback loop stats β
β β’ Execution success rates β
β β’ Learning capture rates β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββ
State Machine
βββββββββββ
β INITIAL β (No state file exists)
ββββββ¬βββββ
β
βΌ
βββββββββββββββ
β MONITORING β (Check triggers every 60 min)
ββββ¬ββββ¬βββββββ
β β
β ββ(threshold not met)ββ Continue monitoring
β
ββ(100 learnings)ββ TRIGGERED
ββ(62 days)βββββββββ TRIGGERED
ββ(perf < 90%)βββββ TRIGGERED
β
βΌ
ββββββββββββββββ
β TRAINING β (2 hours)
ββββββββ¬ββββββββ
β
βΌ
ββββββββββββββββ
β EVALUATING β (10 minutes)
ββββββββ¬ββββββββ
β
βββββββββββ΄ββββββββββ
β β
βΌ βΌ
ββββββββββββββββ ββββββββββββββββ
β IMPROVED β β NO IMPROVEMENTβ
β (deploy) β β (keep current)β
ββββββββ¬ββββββββ ββββββββ¬βββββββββ
β β
βββββββββββ¬ββββββββββ
β
βΌ
ββββββββββββββββ
β UPDATE STATE β
ββββββββ¬ββββββββ
β
βΌ
ββββββββββββββββ
β MONITORING β (back to monitoring)
ββββββββββββββββ
File Structure
truth-si-dev-env/
βββ scripts/
β βββ genesis-self-improvement-daemon.py # Main daemon
β βββ genesis-training-pipeline.py # Training
β βββ genesis-eval-harness.py # Evaluation
β βββ genesis-ingestion-daemon.py # Corpus ingestion
β βββ test-genesis-self-improvement.py # Tests
β
βββ api/
β βββ genesis/
β β βββ learner.py # Learning capture
β βββ routers/
β βββ genesis.py # API endpoints
β
βββ data/
β βββ genesis_improvement_state.json # State persistence
β βββ genesis_learnings.json # Learnings storage
β
βββ models/
β βββ genesis/
β βββ model_name.txt # Latest model
β βββ checkpoints/ # Training checkpoints
β βββ metrics/ # Training metrics
β
βββ systemd/
β βββ truthsi-genesis-self-improvement.service
β
βββ docs/
βββ genesis/
βββ SELF_IMPROVEMENT_CYCLE.md # Full docs
βββ FIX_ISSUE_5_SELF_IMPROVEMENT_CYCLE.md
βββ GENESIS_SELF_IMPROVEMENT_ARCHITECTURE.md
Created: Session 318 - THE ARCHITECT Status: β PRODUCTION READY Visualization: Complete system architecture