Stuart, a personal learning agent that grows with the student. One command, offline-first.
Claw-STU is a command-line personal learning agent for students. Stuart runs an adaptive teach-assess-adapt loop — it picks a learning modality based on what's been working for this specific student, presents a short block, asks a check-for-understanding question, and steps the complexity tier up or down based on the answer. Then it does it again.
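The adapt step of that loop can be sketched as a tiny state machine. This is an illustrative sketch only: the tier names, `LearnerState`, and `step_tier` are assumptions, not the real Claw-STU API.

```python
from dataclasses import dataclass

# Hypothetical complexity tiers; Claw-STU's real tier model may differ.
TIERS = ["intro", "core", "stretch"]

@dataclass
class LearnerState:
    tier_index: int = 0  # start at the lowest tier

def step_tier(state: LearnerState, answer_correct: bool) -> LearnerState:
    """Step complexity up on a correct check, down on a miss, clamped at the ends."""
    delta = 1 if answer_correct else -1
    new_index = min(max(state.tier_index + delta, 0), len(TIERS) - 1)
    return LearnerState(tier_index=new_index)
```

The loop then repeats: pick a modality, present a block at `TIERS[state.tier_index]`, check understanding, and step again.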
Stuart is explicitly not a tutor, a friend, a therapist, or an authority figure. It is a cognitive tool. It never claims to feel emotions, never praises innate ability, and never replaces a teacher or a guardian. The boundary is documented in SOUL.md and enforced at every entry point by an inbound safety gate that runs before anything else.
Works with multiple AI providers (Anthropic, OpenAI, Ollama, OpenRouter) and a deterministic Echo provider as the guaranteed fallback floor. The session loop never stalls when a provider is unreachable. Requires Python 3.11+. Works on macOS, Windows, and Linux.
First run creates `~/.claw-stu/` with `0700` permissions, loads defaults from `AppConfig`, and picks up API keys from `secrets.json` or environment variables (`ANTHROPIC_API_KEY`, `OPENAI_API_KEY`, `OPENROUTER_API_KEY`, `OLLAMA_BASE_URL`). Missing keys fall through the chain, ending at Echo.
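The key fallthrough might look roughly like this. Only the env var names come from the text; `available_providers` and the secrets dict shape are assumptions for illustration.

```python
import os

# Env var per provider, as listed above.
ENV_KEYS = {
    "anthropic": "ANTHROPIC_API_KEY",
    "openai": "OPENAI_API_KEY",
    "openrouter": "OPENROUTER_API_KEY",
}

def available_providers(secrets: dict[str, str]) -> list[str]:
    """Providers with a key in secrets.json or the environment; Echo always last."""
    found = [
        name for name, env in ENV_KEYS.items()
        if secrets.get(name) or os.environ.get(env)
    ]
    return found + ["echo"]  # Echo is the guaranteed floor
```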
`mypy --strict` clean across every phase. Ruff clean. `filterwarnings = ["error"]` clean. The full test suite runs in under 2 seconds. Every phase has its own AST-enforced layering guard.
Stuart is a deterministic state machine with an optional LLM boundary. The session runner, safety gate, and memory layer are all pure Python. Only the content-generation seam talks to a provider, and even that seam has a guaranteed Echo fallback floor so the loop never stalls.
Stuart tracks whether each learner is approaching, meeting, or exceeding the grade-level standard. The estimate updates after every observed check.
Each task kind (`SOCRATIC_DIALOGUE`, `BLOCK_GENERATION`, `CHECK_GENERATION`, `RUBRIC_EVALUATION`, `PATHWAY_PLANNING`, `CONTENT_CLASSIFY`, `DREAM_CONSOLIDATION`) is routed to the provider / model best suited for it: Ollama for local latency-sensitive tasks, Anthropic Haiku for accuracy-critical rubric evaluation, and GLM 4.5 Air on OpenRouter for prose blocks.
Context assembly is centralized in `build_learner_context()` — it pulls the learner's compiled-truth brain page, the concept's HAPP framing, the last three session pages, and any flagged misconceptions. The second time a student asks about the same topic, Stuart already knows what they understood last week.
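The resulting context bundle might be shaped roughly like this. The four ingredients come from the text; the field types and the dict-backed store are illustrative assumptions.

```python
from dataclasses import dataclass, field

@dataclass
class LearnerContext:
    brain_page: str                 # compiled-truth summary of the learner
    happ_framing: str               # the concept's HAPP framing
    recent_sessions: list[str] = field(default_factory=list)  # last three session pages
    misconceptions: list[str] = field(default_factory=list)   # flagged misconceptions

def build_learner_context(store: dict) -> LearnerContext:
    """Assemble the per-call context from a simple dict-backed store (sketch)."""
    return LearnerContext(
        brain_page=store.get("brain_page", ""),
        happ_framing=store.get("happ_framing", ""),
        recent_sessions=store.get("sessions", [])[-3:],
        misconceptions=store.get("misconceptions", []),
    )
```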
Closing a session writes a `SessionPage`, updates the `LearnerPage` compiled truth, and adds KG triples. Overnight, the scheduler runs a dream cycle that rewrites compiled truths, detects concept gaps, and re-indexes embeddings.
Every layer is enforced by `tests/test_hierarchy.py`, an AST-based import-DAG guard that runs on every commit. Violations fail CI before merge.
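A guard in that spirit can be sketched with the standard-library `ast` module. The layer map below is an assumption for illustration, not the project's real hierarchy.

```python
import ast

# Hypothetical layer heights: a file may only import from layers at or below its own.
LAYER = {"safety": 0, "memory": 1, "session": 2}

def imported_modules(source: str) -> set[str]:
    """Top-level module names imported by a source file."""
    names: set[str] = set()
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Import):
            names.update(alias.name.split(".")[0] for alias in node.names)
        elif isinstance(node, ast.ImportFrom) and node.module:
            names.add(node.module.split(".")[0])
    return names

def violates_layering(layer: str, source: str) -> bool:
    """True if the file imports from a layer above its own."""
    ceiling = LAYER[layer]
    return any(LAYER.get(m, -1) > ceiling for m in imported_modules(source))
```

A test would walk the package, call `violates_layering` per file, and fail on any hit.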
Safety is the lowest layer in the stack — no student text reaches any other layer without being checked. Every student-text entry point (`/sessions/{id}/calibration-answer`, `/check-answer`, `/socratic`, `/learners/{id}/capture`) runs through `InboundSafetyGate.scan(text)` before any other logic.
The gate returns one of three decisions; the strictest is `CRISIS_PAUSE`. When it fires, the evaluator and the orchestrator are never called. Escalation resources are returned. A single structured event `{event: crisis_detected, kind, session_id_hash, learner_id_hash}` is logged — no raw text, no PII, no brain-page entry. The omission is deliberate: SOUL.md §5 says Stuart surfaces human resources and steps out of the teach loop. Preserving a paper trail of a specific crisis message would create a PII retention hazard the project refuses to accept.
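The event shape can be sketched as follows. The field names come from the text; the hashing scheme is an assumption.

```python
import hashlib

def crisis_event(kind: str, session_id: str, learner_id: str) -> dict[str, str]:
    """Build the single structured log event: no raw text, no PII."""
    def digest(s: str) -> str:
        # Illustrative: truncated SHA-256 so IDs are correlatable but not reversible.
        return hashlib.sha256(s.encode()).hexdigest()[:16]
    return {
        "event": "crisis_detected",
        "kind": kind,
        "session_id_hash": digest(session_id),
        "learner_id_hash": digest(learner_id),
    }
```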
A paused session can only be closed (producing a summary that acknowledges the pause) or explicitly unpaused by an administrator. `next_directive` has an explicit `CRISIS_PAUSE` branch at the top of the dispatch, so a paused session cannot be accidentally routed back into the teach loop. This is covered by `test_crisis_paused_session_refuses_next_directive`.
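The short-circuit can be sketched as below. The directive names and the session dict are illustrative stand-ins, not the real dispatch.

```python
def next_directive(session: dict) -> str:
    # Crisis branch sits at the top of the dispatch: a paused session
    # can never fall through into the teach loop.
    if session.get("status") == "CRISIS_PAUSE":
        return "REFUSE_PAUSED"
    if session.get("phase") == "calibration":
        return "PRESENT_CALIBRATION"
    return "PRESENT_BLOCK"
```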
Claw-STU and Claw-ED are independent projects built by the same team. They compose: a teacher uses Claw-ED to generate lessons in their voice, and Claw-STU delivers them adaptively to each student with memory-backed personalization.
Both are MIT-licensed, Python, multi-provider, and offline-first.
Claw-STU is BYOK (bring your own key). Each `TaskKind` is routed to a provider / model picked for that job. The fallback chain is `ollama → openai → anthropic → openrouter`, ending at a deterministic `EchoProvider` floor so the session loop never stalls.
| Task | Default provider | Default model |
|---|---|---|
| `SOCRATIC_DIALOGUE` | Ollama (local) | `llama3.2` — free + instant |
| `BLOCK_GENERATION` | OpenRouter | `z-ai/glm-4.5-air` — cheap prose |
| `CHECK_GENERATION` | OpenRouter | `z-ai/glm-4.5-air` |
| `RUBRIC_EVALUATION` | Anthropic | `claude-haiku-4-5` — accuracy-critical |
| `PATHWAY_PLANNING` | OpenRouter | `z-ai/glm-4.5-air` |
| `CONTENT_CLASSIFY` | Ollama (local) | `llama3.2` — never network |
| `DREAM_CONSOLIDATION` | OpenRouter | `z-ai/glm-4.5-air` — overnight batch |
Missing API keys don't crash the router — they just fall through the chain. For a fully offline deployment, install Ollama with llama3.2 and leave the other keys unset. Stuart still works; it just uses Ollama for every task kind instead of the default split.
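That fallthrough can be sketched in a few lines. The chain order comes from the text; `pick_provider` and the `configured` set are illustrative, not the real router API.

```python
# Chain order from the docs above, with Echo as the deterministic floor.
FALLBACK_CHAIN = ["ollama", "openai", "anthropic", "openrouter", "echo"]

def pick_provider(preferred: str, configured: set[str]) -> str:
    """Try the task's preferred provider, then fall through the chain to Echo."""
    if preferred in configured:
        return preferred
    for name in FALLBACK_CHAIN:
        if name in configured:
            return name
    return "echo"  # always available, never stalls the loop
```

With only Ollama installed, every task kind resolves to `ollama`; with nothing configured, everything resolves to `echo`.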
Reachability is not probed at construction time. The router only knows presence or absence of an API key. Real network health checks land behind `clawstu doctor --ping`.
```
clawstu                                           # start a learning session
clawstu learn "photosynthesis"                    # learn a specific topic
clawstu resume <learner_id>                       # warm-start from last session
clawstu ask "What is a primary source?"           # one-shot Socratic question
clawstu wiki <concept>                            # per-student concept notes
clawstu progress                                  # learner dashboard (ZPD, modality)
clawstu history                                   # past sessions
clawstu review                                    # concepts due for review
clawstu setup                                     # pick your AI provider
clawstu serve                                     # web UI at localhost:8000
clawstu doctor [--ping]                           # self-diagnosis
clawstu profile export <id> --out profile.tar.gz  # portable profile
clawstu profile import profile.tar.gz             # restore a profile
clawstu scheduler run-once --task <name>          # run a proactive task
```
`doctor` is a pure static config dump by default. It never touches the network. Pass `--ping` to opt in to real reachability checks. This guarantee is enforced by `test_doctor_without_ping_does_not_make_network_calls`, which monkey-patches `httpx.Client.post` to raise on any invocation.
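A self-contained sketch of that test strategy, with stand-ins for `httpx` and the real `doctor` command (both `FakeHTTPClient` and this `doctor` are illustrative, not Claw-STU code):

```python
class FakeHTTPClient:
    """Stand-in for httpx.Client in this sketch."""
    def post(self, url: str) -> str:
        return "pong"

def doctor(client: FakeHTTPClient, ping: bool) -> str:
    """Static config dump by default; touches the client only when ping=True."""
    if ping:
        return client.post("http://localhost:11434/api/tags")
    return "config: ok"

# The monkey-patch: any post() raises, proving the static path never calls it.
def _refuse(self: FakeHTTPClient, url: str) -> str:
    raise AssertionError("network call without --ping")

FakeHTTPClient.post = _refuse
assert doctor(FakeHTTPClient(), ping=False) == "config: ok"
```

In the real suite, pytest's `monkeypatch.setattr` applies the same trick to `httpx.Client.post` and restores it after the test.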
Everything is also reachable over HTTP via the FastAPI app. All
student-text routes go through InboundSafetyGate first.
```
POST /sessions                          # onboard a learner
GET  /sessions/{session_id}             # current state
POST /sessions/{id}/calibration-answer  # submit calibration answer
POST /sessions/{id}/finish-calibration  # transition to teach loop
POST /sessions/{id}/next                # next directive
POST /sessions/{id}/check-answer        # submit check answer
POST /sessions/{id}/socratic            # free-form dialogue
POST /sessions/{id}/close               # close + write to brain
GET  /learners/{id}/wiki/{concept}      # per-student concept wiki
POST /learners/{id}/resume              # warm-start (pre-gen'd)
GET  /learners/{id}/queue               # scheduler queue for learner
POST /learners/{id}/capture             # student-shared source
GET  /admin/scheduler                   # scheduler status
GET  /admin/health                      # process health
```
Learner routes are gated by a shared-secret bearer token when `STU_LEARNER_AUTH_TOKEN` is set in the environment. Without the env var, auth is a no-op — single-household dev mode. Per-learner JWTs are a post-MVP concern.
Claw-STU is built in public and ships in 7 phases. Contributions are welcome.
```
git clone https://github.com/SirhanMacx/Claw-STU.git
cd Claw-STU
pip install -e ".[dev]"
pytest                    # 373 tests, under 2s
mypy clawstu              # strict, 81 files
ruff check clawstu tests  # clean
```
* prepare_next_session (Phase 6 writes a placeholder artifact)
* refresh_zpd