System: MWMS
Document Type: Framework
Authority Level: MCR Source Of Truth
Status: Draft For MCR
Version: v1.0
Primary Location: MCR
Future Operational Destination: HeadOffice Brain, AIBS Brain, Automation Brain, Data Brain, Research Brain, AI Employee Router, Task Executor Systems, Brain Room, Client AIOS Systems
Parent Page: HeadOffice
Owner: Martyn
Developer Boundary: Do Not Touch M’s Active Build Areas Unless Specifically Assigned
Source Of Truth: MCR
Last Reviewed: 2026-06-01
Source / Origin: AI Automations by Jack — Advanced Technology / RAG & Supabase Masterclass
MWMS Classification: RAG Framework / Supabase Vector Memory Standard / AIOS Knowledge Retrieval Architecture / Context Engineering Infrastructure
Primary Brain: Data Brain
Supporting Brains: HeadOffice Brain, AIBS Brain, Automation Brain, Research Brain, Product Brain, Operations Brain, Risk Brain, Compliance Brain, SIT Brain, Content Brain, Customer Brain
Related Pages: MWMS AI Agent Memory And Context Framework, MWMS Context Engineering Framework, MWMS AI Operating System Architecture Framework, MWMS AI Dashboard Capability Framework, MWMS Client AI Interface Selection Framework, MWMS Advanced AI Capability Stack Framework, MWMS Advanced AI Capability Activation Registry, MWMS AI Tool Permission And Access Framework, MWMS AI Agent Operations Core, MWMS Automation Build Planning Framework, MWMS AI Automation Security And Risk Checklist, MWMS Source Visibility And Evidence Display Standard
Source Evidence: AI Automations by Jack demonstrates a Supabase-based RAG system using n8n, Google Drive file ingestion, Supabase vector store tables, OpenAI embeddings, chunking with overlap, metadata, vector retrieval, Cohere reranking, Postgres chat memory, n8n AI agents, tool calling, and webhook connection to front-end apps such as Bolt or Lovable-style interfaces. The lesson contrasts RAG with context stuffing, explaining that RAG retrieves only the most relevant chunks instead of placing an entire long context into every prompt.
Purpose
The purpose of the MWMS Supabase RAG And Vector Memory Framework is to define how MWMS should use Supabase, vector stores, embeddings, retrieval, reranking, metadata, chat memory, and n8n workflows to create governed knowledge retrieval systems for AI Employees, Brains, AI Operating Systems, dashboards, chatbots, client systems, and future AIBS packages.
This framework exists because MWMS is becoming knowledge-heavy.
MWMS already contains and will continue to contain:
- MCR pages
- Brain pages
- course transcripts
- newsletters
- frameworks
- standards
- SOPs
- task logs
- event logs
- client documents
- product research
- offer evaluations
- campaign notes
- development save points
- AI Employee role cards
- AIOS architecture pages
- compliance records
- dashboard records
- client knowledge bases
AI Employees cannot safely or efficiently use all of this by stuffing everything into every prompt.
That creates context overload.
The better long-term system is:
store knowledge properly, retrieve only what is relevant, validate the retrieved context, and then generate an answer or action.
This is the role of RAG.
RAG means Retrieval-Augmented Generation.
For MWMS, RAG is not just a technical feature.
It is a future knowledge infrastructure layer.
Core Doctrine
The MWMS doctrine is:
Do not stuff everything into context. Retrieve the right knowledge at the right time from the right source, then validate it before use.
RAG helps MWMS avoid:
- huge prompts
- irrelevant context
- stale memory
- hallucinated recall
- weak chatbot answers
- slow processing
- high token cost
- wrong-source answers
- client knowledge leakage
- over-reliance on general AI memory
- loss of source traceability
But RAG also creates risk.
Retrieved content is not automatically correct.
Vector similarity is not authority.
A retrieved chunk may be related but wrong.
Therefore, MWMS RAG must be governed.
Strategic Importance
This framework is strategically important because Supabase is already part of MWMS infrastructure.
The lesson shows that Supabase can function not only as a normal database, but also as a vector store and Postgres memory layer.
This directly supports the MWMS long-term architecture.
Supabase RAG can eventually support:
- Brain Room AI replies
- AI Manager context retrieval
- AI Employee Router context selection
- MCR page retrieval
- course knowledge retrieval
- newsletter archive retrieval
- Research Brain intelligence
- client knowledge bases
- chatbot support systems
- voice AI knowledge retrieval
- HeadOffice dashboards
- AIBS client AIOS systems
- Prompt Vault search
- browser-captured page analysis
- automation troubleshooting
- structured memory systems
This means Supabase is not only a data store.
It can become part of the MWMS knowledge brain.
Definition
RAG is a system where external knowledge is retrieved from a stored knowledge base and injected into an AI prompt so the AI can answer or act using relevant context.
A vector store is a database that stores text chunks along with embeddings, allowing semantic retrieval of related content.
Embeddings are numerical representations of text that allow similar content to be found by meaning rather than exact keyword match.
Reranking is a second-stage filtering process where retrieved chunks are reviewed and ranked again so the most useful chunks are passed to the AI.
Postgres chat memory stores conversation history in a database so multiple workflows, apps, or agents can access shared conversation memory.
MWMS Definition
MWMS Supabase RAG And Vector Memory is:
A governed knowledge retrieval architecture that stores source material in Supabase, converts it into searchable vector memory, retrieves relevant chunks when needed, reranks and validates them, and passes only appropriate context to AI Employees, chatbots, dashboards, or AIOS workflows.
RAG Versus Context Stuffing
The lesson explains the difference between placing everything in context and using RAG. With normal chat context, the model may keep resending the full conversation or long context until the context window becomes too large. RAG instead breaks source material into chunks and retrieves only the most relevant pieces for the current question.
Context Stuffing
Context stuffing means putting large amounts of text directly into the prompt or conversation.
Examples:
- full books
- full course transcripts
- full MCR pages
- full client documents
- full newsletter archives
- full conversation history
- full SOP libraries
Problems With Context Stuffing
Context stuffing can cause:
- high token cost
- slower responses
- context window overflow
- irrelevant content dilution
- weaker reasoning
- missed instructions
- source blending
- model confusion
- stale or unrelated material influencing output
RAG
RAG solves this by:
- splitting source material into chunks
- storing chunks in a database
- embedding those chunks
- retrieving only relevant chunks
- optionally reranking chunks
- passing selected chunks into the AI prompt
- producing a source-aware answer
MWMS Rule
Use RAG when the knowledge base is too large, too dynamic, or too important to stuff into the prompt.
Core Supabase RAG Workflow
The core MWMS Supabase RAG workflow has two sides:
- Ingestion / Vectorization
- Retrieval / Answering
Part 1: Ingestion / Vectorization
This is where source material is added into Supabase.
The lesson demonstrates an n8n workflow where a Google Drive trigger watches a folder, downloads new files, sends them into a Supabase vector store, splits them into chunks, creates embeddings, and stores the results in a documents table.
Ingestion Steps
- Source file is added
- Workflow is triggered
- File is downloaded
- File is loaded by data loader
- File is split into chunks
- Chunk overlap is applied
- Embeddings are generated
- Chunks are stored in Supabase vector table
- Metadata is stored
- File becomes searchable
MWMS Use Cases
Ingestion may apply to:
- course transcripts
- MCR pages
- Brain pages
- newsletters
- PDFs
- SOPs
- client documents
- product knowledge
- support documents
- campaign records
- AIOS documentation
- Prompt Vault entries
- browser-captured pages
MWMS Rule
No source should enter vector memory without ownership, metadata, and purpose.
Part 2: Retrieval / Answering
This is where an AI agent retrieves relevant knowledge.
The lesson demonstrates an n8n AI agent using a Supabase vector tool to query a vector store, retrieve chunks, rerank them, and use the best chunks to answer the user.
Retrieval Steps
- User asks a question
- Query is embedded
- Vector store searches similar chunks
- Top chunks are retrieved
- Reranker filters strongest chunks
- Relevant chunks are passed to the AI
- AI answers using retrieved context
- Conversation memory may be stored
- Output may return to a dashboard, chatbot, app, or webhook response
MWMS Use Cases
Retrieval may support:
- Brain Room questions
- AI Employee task context
- chatbot answers
- client support
- MCR lookup
- course knowledge extraction
- Research Brain retrieval
- AIBS knowledge assistants
- dashboard summaries
- voice agent responses
- internal decision support
MWMS Rule
The AI answer is only as trustworthy as the retrieved context, source authority, and validation layer.
Supabase Role In MWMS
Supabase can serve several roles inside MWMS.
1. Structured Database
Supabase stores normal relational data.
Examples:
- tasks
- events
- users
- Brain records
- dashboard rows
- client records
- workflow statuses
- logs
2. Vector Store
Supabase stores text chunks, embeddings, and metadata for semantic retrieval.
Examples:
- MCR page chunks
- course transcript chunks
- client knowledge chunks
- support policy chunks
- newsletter chunks
3. Chat Memory Store
Supabase/Postgres can store conversation history.
The lesson shows n8n Postgres chat memory storing human and AI messages in a database table and explains that Postgres chat memory is more scalable than simple memory because it can be queried across multiple applications and shared by more than one workflow.
4. AIOS Knowledge Layer
Supabase may eventually support the context and memory layer for client AIOS systems.
Examples:
- client-specific memory
- approved knowledge sources
- chatbot knowledge
- workflow history
- reporting data
- AIOS source records
5. Dashboard Data Source
Supabase can power dashboards with live structured data.
Examples:
- task dashboards
- AIOS dashboards
- automation health dashboards
- client reporting dashboards
MWMS Rule
Supabase should be treated as a system layer, not only a storage tool.
Supabase Vector Table Standard
A vector table should store enough information to retrieve knowledge safely.
A basic vector table may include:
- id
- content
- metadata
- embedding/vector
The lesson’s quick-start vector table includes id, content, metadata, and embedding/vector fields.
MWMS Recommended Fields
For MWMS, vector memory should eventually include:
id
Unique chunk ID.
content
Text chunk.
metadata
Structured metadata object.
embedding
Vector representation.
source_id
Original source identifier.
source_title
Human-readable source title.
source_type
MCR page, Brain page, course transcript, newsletter, client doc, SOP, etc.
owning_brain
Brain that owns the source.
client_id
Client identifier if client-specific.
authority_level
MCR source of truth, draft, client-approved, raw source, archived, deprecated.
created_at
When chunk was created.
updated_at
When source or chunk was updated.
source_date
Date of original source.
version
Version where relevant.
freshness_status
Fresh, recent, stable, stale, deprecated.
sensitivity_level
Public, internal, confidential, client-sensitive, credential-sensitive.
allowed_use
Internal, client-specific, support, research, dashboard, chatbot, etc.
retrieval_allowed
True/false.
review_status
Raw, reviewed, approved, rejected.
MWMS Rule
Metadata is not optional.
Metadata is what prevents RAG from becoming uncontrolled memory.
Metadata Governance
Metadata controls how retrieved content should be interpreted.
Without metadata, a chunk is just text.
With metadata, MWMS can know:
- where the chunk came from
- who owns it
- whether it is authoritative
- whether it is current
- whether it is client-specific
- whether it is safe to use
- whether it can be shown to users
- whether it requires human review
Required Metadata Questions
Ask:
- What is the source?
- Who owns it?
- Is it MCR, Brain page, raw source, draft, or client-approved?
- Is it current?
- Is it sensitive?
- Is it client-specific?
- Is it allowed in this workflow?
- Can it be shown?
- Can it be used for client-facing output?
- Does it need review?
MWMS Rule
Retrieved content without metadata should be treated as low-trust context.
Chunking Standard
Chunking is the process of splitting documents into smaller pieces before embedding.
The lesson demonstrates recursive text splitting with chunk size and overlap, explaining that overlap lets some content appear in both adjacent chunks so context is not lost.
MWMS Chunking Goals
Chunks should:
- preserve meaning
- include section context
- avoid cutting important instructions
- retain headings where possible
- support source traceability
- avoid too much irrelevant text
- retrieve accurately
Chunk Overlap
Chunk overlap helps preserve context across chunk boundaries.
Example:
If a chunk contains tokens 0–1000, the next chunk might start at token 800 instead of 1001.
This keeps the last part of one chunk available in the next.
Chunking Risks
Poor chunking can create:
- partial answers
- missing instructions
- wrong interpretation
- broken source context
- repeated irrelevant chunks
- weak retrieval
- hallucinated synthesis
MWMS Rule
Chunking must preserve meaning, not just divide text.
Embedding Standard
Embeddings turn text into vectors.
The lesson uses OpenAI text-embedding-3-small as a simple embedding model and notes that the same embedding model should be used for ingestion and retrieval.
MWMS Embedding Rules
- Use the same embedding model for storage and query unless migration is planned.
- Record embedding model in metadata or system configuration.
- Avoid mixing embeddings from different models in the same retrieval table without clear separation.
- Test retrieval quality before trusting.
- Use higher-quality embeddings when knowledge accuracy matters.
- Track cost where large document volumes are involved.
MWMS Rule
Embedding choice is an infrastructure decision.
It affects retrieval quality, cost, and future compatibility.
Retrieval Limit Rule
The lesson demonstrates increasing retrieved chunks, such as retrieving 20 chunks, then using reranking to select the most relevant subset.
Retrieval Limit Questions
Ask:
- How many chunks should be retrieved?
- How many should be passed to the AI?
- Is reranking used?
- Is the answer likely to require multiple sources?
- Is too much retrieval causing noise?
- Is too little retrieval missing evidence?
- Does the workflow need source diversity?
MWMS Rule
Retrieval should return enough context to answer well, but not so much that irrelevant chunks dilute the answer.
Reranking Standard
Reranking is a second-stage retrieval quality step.
The lesson uses Cohere reranking to take a larger set of retrieved chunks and identify the best ones for the question.
Why Reranking Matters
Vector retrieval can return semantically similar chunks that are not actually the best.
Reranking helps:
- improve relevance
- reduce noise
- select the best chunks
- improve answer quality
- avoid passing too much context
- support better source use
MWMS Reranking Use Cases
Use reranking when:
- knowledge base is large
- source quality matters
- client-facing answer depends on accuracy
- multiple chunks are retrieved
- the question is complex
- similar documents may confuse retrieval
- wrong-source risk is high
MWMS Rule
Reranking should be used for serious knowledge retrieval where accuracy matters.
Postgres Chat Memory Standard
Postgres chat memory stores conversation history in a database.
The lesson demonstrates connecting n8n to Supabase Postgres chat memory and explains that it is more scalable than simple memory because it can be accessed across multiple applications or workflows.
MWMS Use Cases
Postgres chat memory may support:
- Brain Room conversations
- client chatbot memory
- support assistant memory
- voice agent memory
- AIOS conversation history
- multi-app shared memory
- internal AI assistant continuity
- handoff summaries
Risks
Postgres chat memory can create:
- privacy risk
- sensitive data retention
- wrong-client leakage
- stale memory
- conversation clutter
- unclear deletion rules
- over-personalization
- AI relying on old statements
MWMS Rule
Postgres chat memory should not be added casually.
It must have purpose, retention rules, client isolation, and review controls.
Shared Memory Across Apps
One of the strengths of Postgres chat memory is that more than one app or workflow may access the same stored conversation memory.
This can be useful.
It can also be dangerous.
Useful When
- same client AIOS uses multiple interfaces
- chatbot and dashboard share context
- voice agent and support dashboard share handoff history
- Brain Room and task system share thread context
- internal assistant needs continuity across workflows
Dangerous When
- multiple clients share one memory table without isolation
- apps access memory outside their role
- memory is not filtered by user/client/workflow
- old conversation data influences unrelated tasks
- sensitive information appears in another interface
MWMS Rule
Shared memory must be scoped by user, client, workflow, Brain, and permission.
n8n RAG Workflow Standard
The lesson uses n8n for both vectorizing documents and retrieving knowledge through an AI agent.
MWMS should treat n8n as a likely future RAG orchestration layer.
Ingestion Workflow Pattern
Trigger:
- Google Drive file added
- form submission
- file upload
- MCR page update
- newsletter received
- client document approved
- manual trigger
Actions:
- download file
- parse/load file
- split text
- generate embeddings
- store chunks in Supabase
- write metadata
- log ingestion
- mark review status
Retrieval Workflow Pattern
Trigger:
- chat message
- Brain Room question
- dashboard query
- client chatbot question
- voice agent request
- AI Employee task
- webhook request
Actions:
- receive query
- embed query
- retrieve chunks
- rerank chunks
- validate metadata
- pass context to AI
- generate answer
- return via webhook/dashboard/chat
- log sources and answer
MWMS Rule
RAG workflows must have both ingestion governance and retrieval governance.
Webhook Front-End Connection Rule
The lesson demonstrates sending front-end chat messages to an n8n webhook, processing them through an AI agent, and returning the answer through a response webhook node.
This is important for MWMS because many future interfaces may connect to AI backends through webhooks.
Possible interfaces:
- dashboard
- chatbot
- Bolt/Lovable app
- WordPress front-end
- client portal
- Chrome extension
- internal admin panel
- voice interface
Webhook Risks
Webhooks can create:
- unauthorized access
- spam input
- prompt injection
- data leakage
- wrong workflow triggering
- external action risk
- weak authentication
- public endpoint abuse
Required Controls
Webhook-connected RAG systems should define:
- endpoint purpose
- allowed input
- authentication
- rate limits
- logging
- source validation
- payload schema
- error response
- tool permissions
- client isolation
- replay protection where needed
MWMS Rule
A webhook is a system doorway.
It must be protected.
RAG Source Authority Rule
RAG retrieves content.
It does not determine authority.
A retrieved source may be:
- MCR source of truth
- Brain page
- old draft
- raw transcript
- client document
- archived note
- deprecated page
- low-quality newsletter
- outdated support policy
The AI must know the authority level.
Authority Questions
Ask:
- Is this retrieved source approved?
- Is it MCR?
- Is it a draft?
- Is it outdated?
- Is it client-specific?
- Is it safe to use?
- Is it allowed for this user?
- Is it allowed for this output?
- Is it allowed for client-facing answer?
MWMS Rule
RAG should retrieve sources.
Governance decides whether the source may be used.
Client Isolation Rule
Client-specific RAG must isolate client data.
Client A’s knowledge must not appear in Client B’s answer.
This requires:
- client_id metadata
- tenant filtering
- separate tables or namespaces where needed
- access permission checks
- retrieval filters
- logging
- testing for leakage
- offboarding/delete process
MWMS Rule
No client-facing RAG system may launch without client isolation.
Source Visibility Rule
RAG outputs should preserve source traceability.
For internal MWMS work, sources should be visible or traceable.
For client-facing systems, source visibility may be shown directly or stored internally depending on the interface.
Source Visibility May Include
- source title
- source type
- source date
- chunk ID
- MCR page
- document name
- retrieved passage
- confidence note
- authority level
- freshness status
MWMS Rule
Important RAG answers need source visibility or source traceability.
RAG And Chatbots
Chatbots become more useful when connected to RAG.
A chatbot can answer based on approved knowledge rather than generic AI memory.
Chatbot RAG Use Cases
- FAQ assistant
- support bot
- internal knowledge bot
- onboarding assistant
- client documentation assistant
- Brain Room assistant
- sales assistant
- course library assistant
Chatbot RAG Risks
- stale knowledge
- wrong source retrieval
- hallucinated answer after weak retrieval
- no handoff
- wrong-client leakage
- unsupported claims
- user assumes answer is official
- sensitive knowledge exposure
MWMS Rule
Chatbot RAG must include approved knowledge, retrieval filters, memory rules, tool boundaries, and handoff logic.
RAG And Dashboards
Dashboards can use RAG to summarize, explain, or answer questions about dashboard data.
Examples:
- “Why did this metric change?”
- “What actions are overdue?”
- “What changed this week?”
- “Which client tasks need review?”
- “What source supports this recommendation?”
Dashboard RAG Risks
- AI summary overstates data
- dashboard shows unreviewed interpretation
- old source influences current metric
- user cannot see source
- AI hides uncertainty
MWMS Rule
Dashboard RAG should support visibility and decision-making, not replace source data.
RAG And MCR
MCR is the source of truth for MWMS governance.
MCR pages are strong candidates for vector retrieval, but they must be carefully tagged.
MCR retrieval should preserve:
- page title
- document type
- version
- authority level
- parent page
- date reviewed
- Brain ownership
- current/deprecated status
MCR RAG Use Cases
- AI Employee rule retrieval
- page formatting support
- change log support
- Brain canon lookup
- course absorption anti-duplication
- developer boundary lookup
- context pack creation
- AIOS governance review
MWMS Rule
MCR RAG must preserve authority and version.
A retrieved MCR chunk without title/version/status is not enough.
RAG And Course Absorption
Course transcripts can be vectorized so future absorption can retrieve prior frameworks, avoid duplication, and compare new course material against existing MWMS pages.
Use Cases
- find similar previous course content
- locate existing frameworks
- check whether a lesson adds new value
- retrieve source evidence
- build course closeout summaries
- support future training systems
Risks
- weak course content retrieved as if strong
- raw transcript treated as canon
- duplicate pages
- source chunk lacks context
- old course material overrides current MWMS standards
MWMS Rule
Course RAG should support absorption.
It should not turn raw course content into approved MWMS doctrine automatically.
RAG And Research Brain
Research Brain can use RAG to retrieve internal and external research records.
Use cases:
- market research archive
- tool evaluation archive
- competitor analysis
- offer intelligence
- newsletter patterns
- source-backed reports
- research evidence recall
MWMS Rule
Research RAG must distinguish source evidence from AI interpretation.
RAG And AIBS Client Systems
AIBS client systems may use Supabase RAG to power client-specific knowledge assistants, dashboards, chatbots, and AIOS workflows.
Use cases:
- client support assistant
- internal company knowledge assistant
- SOP assistant
- sales assistant
- onboarding assistant
- report explanation assistant
- customer service bot
- AI receptionist knowledge layer
AIBS Requirements
Client RAG must define:
- client_id
- approved knowledge sources
- source owner
- update process
- access permissions
- retention rules
- deletion process
- handoff rules
- fallback response
- source visibility
- support owner
- risk review
AIBS Rule
Client RAG must be isolated, approved, source-aware, and supportable.
RAG Build Path
MWMS should use a controlled build path.
Stage 1: Define Use Case
Ask:
- What question must the AI answer?
- Who asks it?
- What source should it use?
- What happens after the answer?
- Is this internal or client-facing?
Stage 2: Define Source Set
Select approved sources.
Examples:
- MCR pages
- Brain pages
- client docs
- SOPs
- course transcripts
- newsletters
Stage 3: Define Metadata
Define:
- source title
- source type
- owner
- Brain
- client_id
- authority level
- freshness
- sensitivity
- allowed use
Stage 4: Ingest And Chunk
Load documents, split them, apply overlap, and preserve metadata.
Stage 5: Embed
Generate embeddings using selected model.
Record embedding model.
Stage 6: Store In Supabase
Store content, metadata, and vector.
Stage 7: Retrieve
Test semantic retrieval with real questions.
Stage 8: Rerank
Use reranking where accuracy matters.
Stage 9: Validate Output
Check source authority, answer accuracy, and missing context.
Stage 10: Connect Interface
Connect to:
- chatbot
- dashboard
- Brain Room
- AI Employee
- WordPress
- Chrome extension
- client app
Stage 11: Log And Improve
Log:
- query
- retrieved chunks
- source used
- answer
- user feedback
- failures
- improvements
RAG Launch Readiness Checklist
Before launching an important Supabase RAG system, check:
- Use case is clear
- User type is defined
- Source set is approved
- Source authority is tagged
- Metadata is complete
- Chunking preserves meaning
- Embedding model is selected
- Retrieval is tested
- Reranking is considered
- Client isolation exists if needed
- Sensitive data is protected
- Tool permissions are defined
- Webhook is protected if used
- Source visibility exists
- Stale source handling exists
- Handoff/fallback exists
- Logs are captured
- Human review path exists
- Maintenance owner is assigned
- Update process is defined
- Deletion/offboarding process exists where relevant
RAG Failure Modes
Failure Mode 1: Context Stuffing Instead Of RAG
Large files are shoved into prompts repeatedly.
Correction:
Use vector retrieval when knowledge is large or repeated.
Failure Mode 2: No Metadata
Chunks are stored without source ownership, date, authority, or client ID.
Correction:
Add metadata before trusting retrieval.
Failure Mode 3: Wrong Source Retrieved
Vector search finds similar but wrong content.
Correction:
Use metadata filters, reranking, and source validation.
Failure Mode 4: Raw Source Treated As Canon
Course transcript or newsletter content is treated as approved MWMS doctrine.
Correction:
Tag raw source separately from MCR source-of-truth.
Failure Mode 5: Wrong-Client Leakage
Client-specific content is retrieved in the wrong client workflow.
Correction:
Use strict client filters, separate namespaces, and leakage tests.
Failure Mode 6: Stale Knowledge
Old pricing, policy, or system state is retrieved.
Correction:
Use freshness metadata and review cadence.
Failure Mode 7: No Source Visibility
AI answers cannot be traced.
Correction:
Store and display source references.
Failure Mode 8: Weak Chunking
Important context is split badly.
Correction:
Adjust chunk size, overlap, and metadata.
Failure Mode 9: Reranking Missing
Too many irrelevant chunks enter the prompt.
Correction:
Add reranking for serious systems.
Failure Mode 10: Webhook Exposed
Public endpoint can be abused.
Correction:
Add authentication, schema validation, rate limiting, and logging.
Failure Mode 11: Chat Memory Overreach
Conversation memory becomes long-term truth.
Correction:
Separate session memory from persistent memory.
Failure Mode 12: No Maintenance Owner
Knowledge base becomes outdated.
Correction:
Assign owner and update cadence.
Application To HeadOffice Brain
HeadOffice Brain may use Supabase RAG to retrieve:
- MCR rules
- system standards
- recent change logs
- dashboard context
- Brain routing rules
- prior decisions
- course absorption history
- newsletter intelligence history
HeadOffice Rule
HeadOffice RAG should prioritize source-of-truth and current governance.
Application To AIBS Brain
AIBS Brain may use Supabase RAG to power client AIOS systems.
Use cases:
- client internal knowledge assistant
- client support chatbot
- client reporting assistant
- client onboarding assistant
- AI receptionist knowledge base
- client dashboard explanation layer
AIBS Rule
AIBS RAG must be client-isolated and client-approved.
Application To Automation Brain
Automation Brain may use n8n and Supabase to automate ingestion and retrieval.
Use cases:
- document upload to vector store
- webhook query to RAG agent
- chatbot backend
- dashboard question answering
- agent sub-workflow retrieval
- trigger-based knowledge updates
Automation Rule
RAG automation must include error handling, logging, and credential governance.
Application To Data Brain
Data Brain owns the structure of vector memory.
Data Brain should define:
- tables
- metadata
- schemas
- source types
- vector fields
- freshness
- sensitivity
- retrieval filters
- client isolation
- logs
Data Rule
Vector memory is data infrastructure and must be governed as such.
Application To Research Brain
Research Brain may use RAG to retrieve prior research and source material.
Research Brain should preserve:
- evidence
- source
- date
- reliability
- interpretation status
- confidence level
Research Rule
Research RAG must not confuse evidence with conclusion.
Application To Risk And Compliance Brain
Risk and Compliance Brain must review RAG systems that touch:
- client data
- sensitive data
- public answers
- customer support
- legal/financial/health topics
- compliance-sensitive policies
- personal data
- browser-captured data
- external webhooks
- tool access
Risk Rule
The more user-facing or client-facing the RAG system, the stronger the risk review required.
Application To SIT Brain
SIT Brain should test RAG quality.
Test cases should include:
- correct retrieval
- wrong-source prevention
- stale-source prevention
- client isolation
- no-answer fallback
- source visibility
- reranking performance
- sensitive data protection
- prompt injection resistance
- hallucination under weak retrieval
- edge cases
SIT Rule
RAG systems require retrieval tests, not just answer tests.
Related AI Employee Capabilities
RAG Architect Agent
Designs the source set, metadata, chunking, retrieval, reranking, and interface connection.
Vector Memory Data Steward
Maintains vector tables, source metadata, freshness, sensitivity, and review states.
Retrieval Quality Tester
Tests whether retrieval returns the right chunks for real questions.
RAG Source Authority Reviewer
Checks whether retrieved content is approved, current, and allowed for the workflow.
Client RAG Isolation Reviewer
Tests whether client-specific RAG prevents cross-client leakage.
Chat Memory Governance Agent
Reviews Postgres chat memory scope, retention, and safety.
RAG Dashboard Integration Agent
Designs how RAG answers appear inside dashboards.
Future Expansion
This framework may later produce:
- MWMS Supabase Vector Table Schema Standard
- MWMS RAG Source Metadata Standard
- MWMS RAG Retrieval Quality Testing Protocol
- MWMS Client RAG Isolation Standard
- MWMS Postgres Chat Memory Governance Standard
- MWMS n8n RAG Workflow Template
- MWMS MCR Vector Retrieval Standard
- MWMS RAG Dashboard Integration Standard
These should be created when the relevant build becomes active.
Registry Update Required
This page requires an update to the MWMS Advanced AI Capability Activation Registry.
Add new capability:
Supabase RAG / Vector Memory Systems
Status: Active Strategic Capability / Framework Created
Future Page Created: MWMS Supabase RAG And Vector Memory Framework
Owning Brain: Data Brain
Supporting Brains: HeadOffice Brain, AIBS Brain, Automation Brain, Research Brain, Risk Brain, Compliance Brain, SIT Brain
Also update:
Advanced n8n Systems
Add note that n8n + Supabase vector store + Postgres chat memory + reranking + webhook front-end connection is a major future implementation path.
Strategic Summary
The RAG & Supabase Masterclass is one of the most important technical lessons in the Advanced Technology section because it shows a future MWMS AIOS backbone.
The lesson demonstrates how files can be uploaded, vectorized, stored in Supabase, retrieved by an AI agent, reranked, combined with Postgres chat memory, and connected to external front-end apps through webhooks.
For MWMS, this supports the long-term direction of:
- AI Employees with source-aware context
- Brain Room with retrieval
- MCR knowledge lookup
- client-specific AIOS knowledge bases
- chatbot and voice agent knowledge
- dashboard intelligence
- Supabase as memory and vector infrastructure
- n8n as orchestration layer
- RAG as the alternative to context stuffing
The key lesson is:
Supabase can become the knowledge memory layer of MWMS, but only if metadata, source authority, retrieval quality, client isolation, and governance are built into the system.
RAG gives MWMS power.
Governance makes that power safe.
Final Standard
The MWMS standard is:
Store knowledge with metadata.
Retrieve only what is relevant.
Rerank when accuracy matters.
Validate source authority before use.
Keep client knowledge isolated.
Preserve source visibility.
Treat Supabase RAG as AIOS infrastructure, not casual memory.
RAG is not magic.
RAG is governed retrieval.
Change Log
Version: v1.0
Date: 2026-06-01
Author: MWMS HeadOffice
Change:
Created the MWMS Supabase RAG And Vector Memory Framework from AI Automations by Jack — Advanced Technology / RAG & Supabase Masterclass.
Captured the lesson’s core concepts including RAG versus context stuffing, Google Drive file ingestion, n8n workflow triggering, Supabase vector store setup, SQL-created vector tables, document chunking, chunk overlap, OpenAI embeddings, metadata, vector retrieval, Cohere reranking, Postgres chat memory, n8n AI agent tool calling, and webhook-based front-end connections.
Defined Supabase as a multi-role MWMS infrastructure layer: structured database, vector store, chat memory store, AIOS knowledge layer, and dashboard data source.
Added Supabase Vector Table Standard, Metadata Governance, Chunking Standard, Embedding Standard, Retrieval Limit Rule, Reranking Standard, Postgres Chat Memory Standard, Shared Memory Across Apps, n8n RAG Workflow Standard, Webhook Front-End Connection Rule, RAG Source Authority Rule, Client Isolation Rule, and Source Visibility Rule.
Mapped RAG into chatbots, dashboards, MCR, course absorption, Research Brain, and AIBS client systems.
Added RAG Build Path and RAG Launch Readiness Checklist.
Added RAG Failure Modes covering context stuffing, missing metadata, wrong-source retrieval, raw source treated as canon, wrong-client leakage, stale knowledge, no source visibility, weak chunking, missing reranking, exposed webhooks, chat memory overreach, and missing maintenance ownership.
Mapped framework responsibilities across HeadOffice Brain, AIBS Brain, Automation Brain, Data Brain, Research Brain, Risk Brain, Compliance Brain, and SIT Brain.
Added related AI Employee capabilities: RAG Architect Agent, Vector Memory Data Steward, Retrieval Quality Tester, RAG Source Authority Reviewer, Client RAG Isolation Reviewer, Chat Memory Governance Agent, and RAG Dashboard Integration Agent.
Added Registry Update Required note to add Supabase RAG / Vector Memory Systems into MWMS Advanced AI Capability Activation Registry and update Advanced n8n Systems with n8n + Supabase RAG implementation path.
Purpose of creation:
To give MWMS a formal Supabase RAG and vector memory framework for future AI Employees, Brain Room retrieval, MCR knowledge lookup, client-specific AIOS knowledge bases, chatbots, dashboards, voice agents, and governed AIOS context infrastructure.