We implement a multi-hop architecture, commonly known in the industry as the Medallion Architecture. It separates data by its level of processing and trust.
- landing-zone
- bronze-*
  - bronze-education: Raw PDF notes, lesson plans, exercises.
    - notability/Priveles/{Subject}/{StudentName}/*.pdf
  - bronze-creative: Original photos, raw audio recordings, project assets.
  - bronze-communication: Raw logs from Chatwoot, WhatsApp exports, Email dumps.
  - bronze-cv: Raw LaTeX and build artifacts for CVs.
- silver-*
  - silver-education: Processed notes ready for the dashboard (thumbnails, text index).
    - notability/Priveles/{Subject}/{StudentName}/{file}.metadata.json
    - notability/Priveles/{Subject}/{StudentName}/{file}.thumbnail.jpg
    - notability/Priveles/{Subject}/{StudentName}/.student.json
  - silver-communication: Structured conversation history for analysis.
- gold-*
  - gold-analytics: Star schemas for BI
    - student_progress_weekly/{date}.json
    - lesson_statistics_monthly/{date}.json
    - financial_aggregates_monthly/{date}.json
  - gold-reports: Pre-computed reports
    - student_summary/{student_id}.json
    - teacher_dashboard/{date}.json
- sandbox-user

In our AWS S3 datalake, these zones are implemented as distinct S3 buckets (or bucket prefixes) to enforce security isolation at the bucket policy level.
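
To make the naming convention concrete, here is a minimal Python sketch of how a raw PDF key in bronze-education could map to its derived keys in silver-education, and how a bucket name could be classified by zone. The function names `silver_keys_for` and `zone_of` are hypothetical, and the sketch assumes that `{file}` means the PDF file name without its extension; adjust it if the pipeline keeps the `.pdf` suffix in derived names.

```python
from pathlib import PurePosixPath

# Zone prefixes taken from the bucket names listed above.
ZONES = {"landing", "bronze", "silver", "gold", "sandbox"}


def zone_of(bucket: str) -> str:
    """Return the zone of a bucket, based on the prefix before the first hyphen."""
    prefix = bucket.split("-", 1)[0]
    if prefix not in ZONES:
        raise ValueError(f"unknown zone for bucket {bucket!r}")
    return prefix


def silver_keys_for(bronze_key: str) -> dict[str, str]:
    """Map a bronze-education PDF key to the silver-education keys derived from it."""
    path = PurePosixPath(bronze_key)
    if path.suffix.lower() != ".pdf":
        raise ValueError(f"expected a .pdf key, got {bronze_key!r}")
    folder = path.parent  # notability/Priveles/{Subject}/{StudentName}
    stem = path.stem      # assumption: {file} is the name without ".pdf"
    return {
        "metadata": str(folder / f"{stem}.metadata.json"),
        "thumbnail": str(folder / f"{stem}.thumbnail.jpg"),
        "student": str(folder / ".student.json"),
    }


if __name__ == "__main__":
    # Illustrative subject and student names, not real data.
    for kind, key in silver_keys_for("notability/Priveles/Math/Alice/lesson1.pdf").items():
        print(f"{kind}: s3://silver-education/{key}")
```

Because the folder layout is identical in both layers, only the bucket name and the file suffix change between bronze and silver, so a bucket policy scoped to one zone never has to match on the key structure.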