beanflows

Author	SHA1	Message	Date
Deeman	fef9f3d705	Fix concurrent DuckDB queries: use cursor() per thread _conn.execute() is not thread-safe for concurrent calls from multiple threads. asyncio.gather submits each analytics query to the thread pool via asyncio.to_thread, causing race conditions that silently returned empty result sets. _conn.cursor() creates an independent cursor that is safe to use from separate threads simultaneously. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-20 19:04:09 +01:00
Deeman	d569ba0162	Fix metric column name casing: DuckDB returns lowercase, align everywhere SQLMesh normalizes unquoted identifiers to lowercase in physical tables, so commodity_metrics columns are e.g. 'production' not 'Production'. Update ALLOWED_METRICS, all analytics.py SQL queries, dashboard routes, and both dashboard templates (Jinja + JS chart references) to use lowercase column names consistently. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-20 17:14:52 +01:00
Deeman	423fb8c619	Fix extract and SQLMesh pipeline to build DuckDB lakehouse extract: wrap response.content in BytesIO before passing to normalize_zipped_csv, and call .read() on the returned BytesIO before write_bytes (two bugs: wrong type in, wrong type out) sqlmesh: {{ var() }} inside SQL string literals is not substituted by SQLMesh's Jinja (SQL parser treats them as opaque strings). Replace with a @psd_glob() macro that evaluates LANDING_DIR at render time and returns a quoted glob path string. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-20 17:02:59 +01:00
Deeman	d05e522c88	Add migration 0001: create feedback and waitlist tables Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-20 15:36:40 +01:00
Deeman	fa14f94a4f	Merge admin-upgrade: sidebar layout, feedback, waitlist, sitemap for BeanFlows	2026-02-20 15:26:13 +01:00
Deeman	8e7af53ff6	Fix admin auth, impersonation session handling, and stale stripe column - admin_required now accepts users with 'admin' role (via g.user) in addition to the password-based is_admin session flag, so both auth methods grant access - impersonate stores the admin's user_id (not True) in admin_impersonating so stop-impersonating can restore the correct session - stop_impersonating restores user_id from admin_impersonating instead of just popping it - remove s.stripe_customer_id from get_user_by_id (Paddle project, no stripe_customer_id column in subscriptions) Fixes 3 test_roles.py failures: test_admin_index_accessible_with_admin_role, test_impersonate_stores_admin_id, test_stop_impersonating_restores_admin Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-20 02:33:10 +01:00
Deeman	48bea5c198	Add admin sidebar layout, feedback, waitlist, sitemap to BeanFlows web - Admin sidebar layout (base_admin.html) with espresso/copper coffee theme, 220px sidebar, responsive collapse, nav for Dashboard/Users/Tasks/Feedback/Waitlist - Convert all admin templates to extend base_admin.html using Tailwind classes - Feedback system: schema, public POST route (rate-limited), base.html widget with HTMX popover (coffee-themed), admin viewer with mark-read - Waitlist mode: WAITLIST_MODE config, waitlist_gate decorator, capture_waitlist_email helper, auth route integration, confirmation pages, send_waitlist_confirmation worker task, admin table - Sitemap.xml and robots.txt public routes - Dashboard stats updated with waitlist_count, feedback_unread alongside existing commodity DuckDB analytics stats Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-20 02:27:26 +01:00
Deeman	e80e262e25	Fix NoneType error when user has no subscription g.subscription is explicitly set to None in load_user, so g.get("subscription", {}) returns None (key exists), not {}. Use (g.get(...) or {}) to coalesce None to an empty dict. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-19 23:10:56 +01:00
Deeman	642b529c4d	Fix column name mismatch after copier template update The subscriptions table still had paddle_subscription_id but the new code references provider_subscription_id. Renamed the DB column and updated all queries in billing/routes.py to match. Also removed unused get_subscription import from dashboard/routes.py. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-19 23:09:12 +01:00
Deeman	94e8a5e6c3	Fix dev-login URLs in development script - Update dev_run.sh with better dev-login URLs including appropriate email parameters - Add user-login URL: auth/dev-login?email=trader@beanflows.coffee - Add admin-login URL: auth/dev-login?email=admin@beanflows.coffee - Keep admin-panel URL: admin/dev-login for direct admin session - Add ADMIN_EMAILS config to .env.example for auto-granting admin role Co-Authored-By: Claude Sonnet 4 <noreply@anthropic.com>	2026-02-19 22:55:57 +01:00
Deeman	18c6e0da4f	Fix dashboard routes after copier update - Remove import of get_user_with_subscription (function was removed) - Use g.user and g.subscription from eager loading instead - Fixes ImportError in dashboard routes Co-Authored-By: Claude Sonnet 4 <noreply@anthropic.com>	2026-02-19 22:48:53 +01:00
Deeman	866746093b	Update uv.lock after copier template merge Co-Authored-By: Claude Sonnet 4 <noreply@anthropic.com>	2026-02-19 22:47:39 +01:00
Deeman	edd439245b	Merge branch 'copier-update'	2026-02-19 22:46:41 +01:00
Deeman	32132974b2	Clean up web changes and add favicon - Update uv.lock dependencies - Remove web/CLAUDE.md (moved to root) - Update base.html template - Add favicon.svg Co-Authored-By: Claude Sonnet 4 <noreply@anthropic.com>	2026-02-19 22:46:33 +01:00
Deeman	3f1cd8bd0c	Update copier answers and docker-compose prod config - Record v0.4.0 commit in .copier-answers.yml - Apply flattened paths in docker-compose.prod.yml Co-Authored-By: Claude Sonnet 4 <noreply@anthropic.com>	2026-02-19 22:35:55 +01:00
Deeman	4b7d4d5a74	Update from Copier template v0.4.0 - Accept RBAC system: user_roles table, role_required decorator, grant_role/revoke_role/ensure_admin_role functions - Accept improved billing architecture: billing_customers table separation, provider-agnostic naming - Accept enhanced user loading with subscription/roles eager loading in app.py - Accept improved email templates with branded styling - Accept new infrastructure: migration tracking, transaction logging, A/B testing - Accept template improvements: Resend SDK, Tailwind build stage, UMAMI analytics config - Keep beanflows-specific configs: BASE_URL 5001, coffee PLAN_FEATURES/PLAN_LIMITS - Keep beanflows analytics integration and DuckDB health check - Add new test files and utility scripts from template Co-Authored-By: Claude Sonnet 4 <noreply@anthropic.com>	2026-02-19 22:22:13 +01:00
Deeman	1e8a173ae8	Merge branch 'frontend-upgrade' Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-19 20:49:07 +01:00
Deeman	f722854c07	Rewrite frontend templates: Pico CSS → Tailwind + trader-focused copy Replace all Pico CSS patterns (classless articles, role="button", inline styles, var(--pico-*)) with Tailwind component classes. Add Fraunces display font, mobile hamburger nav, brand chart colors, and new component layer (hero, feature-card, metric-card, auth-card, pricing-card, etc.). Rewrite marketing copy from generic SaaS boilerplate to coffee-trader focused messaging. Rebrand pricing tiers to Explorer/Trader/Analyst. Delete stale custom.css. No Python code changes. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-19 20:47:56 +01:00
Deeman	6dac8570ad	Fix web/ startup errors and sync with boilerplate - Load .env via python-dotenv in core.py - Skip analytics DB open if file doesn't exist - Guard dashboard analytics calls when DB not available - Namespace admin templates under admin/ to avoid blueprint conflicts - Add dev-login routes for user and admin (DEBUG only) - Update .copier-answers.yml src_path to GitLab remote Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-19 20:37:44 +01:00
Deeman	fa6f3c70dd	Remove stale files from merge	2026-02-18 21:10:02 +01:00
Deeman	d6d2aa8efe	Merge remote-tracking branch 'origin/master' # Conflicts: # infra/readme.md	2026-02-18 21:09:24 +01:00
Deeman	c1d00dcdc4	Refactor to local-first architecture on Hetzner NVMe Remove distributed R2/Iceberg/SSH pipeline architecture in favor of local subprocess execution with NVMe storage. Landing data backed up to R2 via rclone timer. - Strip Iceberg catalog, httpfs, boto3, paramiko, prefect, pyarrow - Pipelines run via subprocess.run() with bounded timeouts - Extract writes to {LANDING_DIR}/psd/{year}/{month}/{etag}.csv.gzip - SQLMesh reads LANDING_DIR variable, writes to DUCKDB_PATH - Delete unused provider stubs (ovh, scaleway, oracle) - Add rclone systemd timer for R2 backup every 6h - Update supervisor to run pipelines with env vars Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-18 19:50:19 +01:00
Deeman	910424c956	update cicd & philosophy	2026-02-18 16:11:56 +01:00
Deeman	2748c606e9	Add BeanFlows MVP: coffee analytics dashboard, API, and web app - Fix pipeline granularity: add market_year to cleaned/serving SQL models - Add DuckDB data access layer with async query functions (analytics.py) - Build Chart.js dashboard: supply/demand, STU ratio, top producers, YoY table - Add country comparison page with multi-select picker - Replace items CRUD with read-only commodity API (list, metrics, countries, CSV) - Configure BeanFlows plan tiers (Free/Starter/Pro) with feature gating - Rewrite public pages for coffee market intelligence positioning - Remove boilerplate items schema, update health check for DuckDB - Add test suite: 139 tests passing (dashboard, API, billing) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-18 16:11:50 +01:00
Deeman	b222c01828	Add CLAUDE.md for Claude Code context Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-17 22:04:22 +01:00
Deeman	e6d7ba81cb	Change cicd	2026-02-05 20:08:01 +01:00
Hendrik Dreesmann	a77c1d1f13	Merge branch 'cleanup-simplify' into 'master' cleanup and prefect service setup See merge request deemanone/materia!11	2026-02-05 20:05:12 +01:00
Deeman	09ae88be19	cleanup and prefect service setup	2026-02-05 20:01:50 +01:00
Deeman	6d4377ccf9	cleanup and prefect service setup	2026-02-04 22:24:55 +01:00
Hendrik Dreesmann	1743c8eba6	Merge branch 'feature/saas-frontend-initial' into 'master' Update SQLMesh for R2 data access & Convert psd data to gzip See merge request deemanone/materia!10	2025-11-02 00:26:01 +01:00
Hendrik Dreesmann	b702e6565a	Update SQLMesh for R2 data access & Convert psd data to gzip	2025-11-02 00:26:01 +01:00
Deeman	fc27d5f887	add plan for saas app	2025-10-21 23:07:43 +02:00
Deeman	3c7a99a699	Update README with comprehensive project documentation Added complete project overview including: - Tech stack and architecture overview - Quick start guide with UV and Pulumi ESC setup - Project structure (extract, transform, core packages) - Development workflow (dependencies, linting, testing) - Secrets management with ESC examples - Production architecture explanation - Architecture principles Removed outdated content and references to CLAUDE.md (internal memory only). 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-10-21 21:51:52 +02:00
Deeman	d4e6c65f97	Fix SQLMesh command documentation Corrected SQLMesh commands to show proper usage: - Run from project root (not from transform/sqlmesh_materia/) - Use -p flag to specify project directory - Use uv run for all commands - Use esc run for commands requiring secrets (plan, audit, ui) - Clarified which commands need secrets vs local-only This aligns with the actual working pattern and Pulumi ESC integration. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-10-21 21:46:41 +02:00
Hendrik Dreesmann	95fb2104dd	Merge branch 'refactor/psd-extraction-r2' into 'master' Refactor PSD extraction: simplify to latest-only + add R2 support See merge request deemanone/materia!9	2025-10-20 22:59:29 +02:00
Deeman	320ddd5123	Add architectural plan document for PSD extraction refactoring Documents the complete analysis, implementation, and results of the PSD extraction refactoring from the architecture advisor's recommendations. Includes: - Problem statement and key insights - Architecture analysis (data-oriented approach) - Implementation phases and results - Testing outcomes and metrics - 227 files migrated, ~40 lines reduced, 220+ → 1-4 requests 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-10-20 22:55:58 +02:00
Deeman	d30ec9b66b	Add R2 upload support with landing bucket path ## Changes 1. Support ESC environment variable names - Fallback to R2_ADMIN_ACCESS_KEY_ID if R2_ACCESS_KEY not set - Fallback to R2_ADMIN_SECRET_ACCESS_KEY if R2_SECRET_KEY not set - Allows script to work with Pulumi ESC (beanflows/prod) variables 2. Use landing bucket path - Changed R2 path from `psd/{etag}.zip` to `landing/psd/{etag}.zip` - All extracted data goes to landing bucket for consistent organization 3. Updated Pulumi ESC environment - Added R2_BUCKET=beanflows-data-prod - Fixed R2_ENDPOINT to remove bucket path (now just account URL) ## Testing - ✅ R2 upload works: Uploaded to landing/psd/316039e2612edc1_0.zip - ✅ R2 deduplication works: Skips upload if file exists - ✅ Local mode still works without credentials 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-10-20 22:45:30 +02:00
Deeman	57f2909001	Update documentation: Pulumi ESC usage and CI/CD bootstrap clarification ## Changes 1. Added Pulumi ESC section - How to login and load secrets into shell - `esc run` command for running commands with secrets - List of available secrets in `beanflows/prod` environment - Examples for common use cases 2. Fixed supervisor bootstrap documentation - Clarified that bootstrapping happens automatically in CI/CD - Pipeline checks if supervisor is already bootstrapped - Runs bootstrap script automatically only if needed - Removed misleading "one-time" manual bootstrap instructions - Added note that it's only needed manually in exceptional cases 3. Updated deploy:supervisor stage description - More accurate description of the bootstrap check logic - Explains the conditional execution (bootstrap vs status check) These updates make the documentation more accurate and helpful for both local development (with ESC) and understanding the production deployment. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-10-20 22:07:24 +02:00
Deeman	38897617e7	Refactor PSD extraction: simplify to latest-only + add R2 support ## Key Changes 1. Simplified extraction logic - Changed from downloading 220+ historical archives to checking only latest available month - Tries current month and falls back up to 3 months (handles USDA publication lag) - Architecture advisor insight: ETags naturally deduplicate, historical year/month structure was unnecessary 2. Flat storage structure - Old: `data/{year}/{month}/{etag}.zip` - New: `data/{etag}.zip` (local) or `psd/{etag}.zip` (R2) - Migrated 226 existing files to flat structure 3. Dual storage modes - Local mode: Downloads to local directory (development) - R2 mode: Uploads to Cloudflare R2 (production) - Mode determined by presence of R2 environment variables - Added boto3 dependency for S3-compatible R2 API 4. Updated raw SQLMesh model - Changed pattern from `*/.zip` to `*.zip` to match flat structure ## Benefits - Simpler: Single file check instead of 220+ URL attempts - Efficient: ETag-based deduplication works naturally - Flexible: Supports both local dev and production R2 storage - Maintainable: Removed unnecessary complexity ## Testing - ✅ Local extraction works and respects ETags - ✅ Falls back correctly when current month unavailable - ✅ Linting passes 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-10-20 22:02:15 +02:00
Hendrik Dreesmann	8729848731	Merge branch 'fix/sqlmesh-config-and-ci-deployment' into 'master' Fix SQLMesh config and CI/CD deployment issues See merge request deemanone/materia!8	2025-10-13 22:26:58 +02:00
Deeman	2d248a2eef	Fix SQLMesh config to use correct Pulumi ESC env var names - Update secret token: CLOUDFLARE_API_TOKEN → R2_ADMIN_API_TOKEN - Update warehouse name: R2_WAREHOUSE_NAME → ICEBERG_WAREHOUSE_NAME - Update endpoint: ICEBERG_REST_URI → ICEBERG_CATALOG_URI - Remove CREATE SCHEMA and USE statements - DuckDB has bug with Iceberg REST: missing Content-Type header - Schema creation via SQL currently not supported - Models will use fully-qualified table names instead Successfully tested with real R2 credentials: - Iceberg catalog attachment works ✓ - Plan dry-run executes ✓ - Only fails on missing source data (expected) ✓ 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-10-13 22:21:27 +02:00
Deeman	05ef15bfdf	Configure Iceberg catalog with proper secret reference - Add catalog ATTACH statement in before_all with SECRET parameter - References r2_secret created by connection configuration - Uses proper DuckDB ATTACH syntax per Cloudflare docs - Single-line format to avoid Jinja parsing issues - Remove manual CREATE SECRET from before_all hooks - Secret automatically created by SQLMesh from connection config - Cleaner separation: connection defines credentials, hooks use them Successfully tested - config validates without warnings. Only fails on missing env vars (expected locally). 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-10-13 22:10:51 +02:00
Deeman	2ad344abf4	Refactor SQLMesh config to use connection-level secrets - Move Iceberg secret from before_all hook to connection.secrets - Fixes SQLMesh warning about unsupported @env_var syntax - Uses Jinja templating {{ env_var() }} instead of @env_var() - Remove database: ':memory:' (incompatible with catalogs) - DuckDB doesn't allow both database and catalogs config - Connection defaults to in-memory when no database specified - Simplify before_all hooks to only handle ATTACH and schema setup - Secret is now created automatically by SQLMesh - Cleaner separation: connection config vs runtime setup Based on: - https://developers.cloudflare.com/r2/data-catalog/config-examples/duckdb/ - https://sqlmesh.readthedocs.io/en/latest/integrations/engines/duckdb/ 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-10-13 22:04:25 +02:00
Deeman	120fef369a	Fix SQLMesh config and CI/CD deployment issues - Fix SQLMesh config: Add semicolons to SQL statements in before_all hooks - Resolves "unsupported syntax" warning for CREATE SECRET and ATTACH - DuckDB requires semicolons to terminate statements properly - Fix deploy:infra job: Update Pulumi authentication - Remove `pulumi login --token` (not supported in Docker image) - Use PULUMI_ACCESS_TOKEN environment variable directly - Chain commands with && to avoid "unknown command 'sh'" error - Fix deploy:supervisor job: Update esc login syntax - Change `esc login --token` to `esc login` (--token flag doesn't exist) - esc CLI reads token from PULUMI_ACCESS_TOKEN env var - Simplify Pulumi CLI installation (remove apk fallback logic) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-10-13 21:58:43 +02:00
Hendrik Dreesmann	70854394c3	Merge branch 'feature/supervisor-deployment' into 'master' Add supervisor deployment with continuous pipeline orchestration See merge request deemanone/materia!7	2025-10-13 21:51:05 +02:00
Deeman	d2352c1876	Simplify SQLMesh to use single prod gateway with virtual environments - Remove dev gateway (local DuckDB file no longer needed) - Single prod gateway connects to R2 Iceberg catalog - Use virtual environments for dev isolation (e.g., dev_<username>) - Update CLAUDE.md with new workflow and environment strategy - Create comprehensive transform/sqlmesh_materia/README.md Benefits: - Simpler configuration (one gateway instead of two) - All environments use same R2 Iceberg catalog - SQLMesh handles environment isolation automatically - No need to maintain local 13GB materia_dev.db file - before_all hooks only run for prod gateway (no conditional logic needed) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-10-13 21:47:04 +02:00
Deeman	6536724e00	Fix SQLMesh config: remove invalid init_script parameter - Remove init_script from DuckDB connection config (not a valid parameter) - Move R2 Iceberg catalog initialization to before_all hooks - Hooks run before sqlmesh plan/run commands - Uses SQLMesh @env_var() macro syntax for environment variables Fixes CI/CD error: 'invalid duckdb connection config: invalid field init_script' 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-10-13 21:31:56 +02:00
Deeman	2fff895a73	Simplify supervisor architecture and automate bootstrap - Simplify supervisor.sh following TigerBeetle pattern - Remove complex functions, use simple while loop - Add || sleep 600 for resilience against crashes - Use git switch --discard-changes for clean updates - Run pipelines every hour (SQLMesh handles scheduling) - Use POSIX sh instead of bash - Remove /repo subdirectory nesting - Repository clones directly to /opt/materia - Simpler paths throughout - Move systemd service to repo - Bootstrap copies from repo instead of hardcoding - Service can be updated via git pull - Automate bootstrap in CI/CD - deploy:supervisor now auto-bootstraps on first deploy - Waits for SSH to be ready (retry loop) - Injects secrets via SSH environment - Idempotent: detects if already bootstrapped Result: Push to master and supervisor "just works" 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-10-13 21:17:12 +02:00
Deeman	21f99767bf	Use GitLab project access token instead of SSH deploy key More secure approach: - Uses HTTPS with token instead of SSH keys - Token can be rotated without touching infrastructure - Scoped to read_repository only - Token stored in Pulumi ESC (beanflows/prod) Setup: 1. Create project access token in GitLab with read_repository scope 2. Add GITLAB_READ_TOKEN to Pulumi ESC 3. Bootstrap script will use it for git clone/pull	2025-10-13 20:37:28 +02:00
Deeman	f46fd53d38	Update bootstrap script with correct GitLab repo URL	2025-10-13 20:36:08 +02:00
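
Commit e80e262e25 in the log above describes a subtle `dict.get` behaviour: a default is only used when the key is missing, not when the key exists with the value `None`. A minimal sketch of that fix follows. It uses a plain dict as a stand-in for Flask's `g`; the `plan_of` helper, the `"plan"` key and the `"free"` fallback are hypothetical names chosen for illustration and are not taken from the repository.

```python
# A plain dict stands in for Flask's `g` here (assumption, to keep the example self-contained).

def plan_of(ctx):
    """Return the plan name, treating a None subscription like an empty dict.

    `plan_of`, the "plan" key and the "free" default are hypothetical.
    """
    # ctx.get("subscription", {}) would return None when the key exists with a
    # None value, so the next .get() call would fail. `or {}` coalesces None.
    subscription = ctx.get("subscription") or {}
    return subscription.get("plan", "free")


# The key exists but is None, as the commit message describes for load_user.
no_subscription = {"subscription": None}

# The default argument is ignored because the key is present.
assert no_subscription.get("subscription", {}) is None

print(plan_of(no_subscription))
print(plan_of({"subscription": {"plan": "trader"}}))
```

The `or {}` form also replaces other falsy values (such as an empty dict) with a fresh empty dict, which is harmless in this case.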

171 Commits
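
Commit 38897617e7 describes the extraction trying the current month and falling back up to 3 months to handle USDA publication lag. The candidate-month arithmetic could be sketched roughly as below. The function name `candidate_months` and its parameters are hypothetical; the actual extractor's names and its download URL format do not appear in the log and are not reproduced here.

```python
from datetime import date


def candidate_months(today: date, max_fallback: int = 3) -> list[tuple[int, int]]:
    """Return (year, month) pairs: the current month first, then up to
    `max_fallback` earlier months, newest first.

    Hypothetical helper; only the "current month plus up to 3 months back"
    rule comes from the commit message.
    """
    candidates = []
    year, month = today.year, today.month
    for _ in range(max_fallback + 1):
        candidates.append((year, month))
        month -= 1
        if month == 0:  # step back across the year boundary
            year, month = year - 1, 12
    return candidates


# A caller would try each candidate in order and stop at the first one available.
print(candidate_months(date(2025, 2, 15)))
```

Returning the whole list up front keeps the date logic separate from any network code, so it can be checked without making requests.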