rebreak-monorepo

Author	SHA1	Message	Date
chahinebrini	c1250836a3	fix(backend): remove display-name pattern support for v1.0 User explicitly chose to drop display-name matching from v1.0 after the UX trap surfaced — a user typing "EXTRASPIN" without a domain got a 400 INVALID_DOMAIN back, which is a confusing dead-end. v1.1 will ship a dedicated display-name UI; until then mail input is domain-only. - resolveTypeAndValue returns a discriminated union — kind='mail' with no dot or @ now resolves to { ok: false, error: 'INVALID_MAIL_DOMAIN' } instead of silently turning into a mail_display_name row. - Full-address mail input (local@domain.tld) still gets its local-part stripped server-side so the stored value is always a clean domain. - Variant-B body { type: 'mail_display_name' } returns 400 DISPLAY_NAME_NOT_SUPPORTED for direct API consumers. - The DISPLAY_NAME_PATTERN regex is gone — the path that used it can no longer be reached. - classifyMail's Layer 2.6 (the display-name substring match) is intentionally left in place as dead code with a v1.1 marker, so re-enabling later is just wiring the input field back up and feeding the customDisplayNames array. - Tests rewritten: the two pre-existing display-name tests now assert the 400 INVALID_MAIL_DOMAIN path, plus a new positive case for the full-address local-part strip. 217 vitest passes, 4 pre-existing skips. Staging DB clean — the type column hasn't been deployed yet so no mail_display_name rows exist to backfill.	2026-05-16 02:17:50 +02:00
chahinebrini	7dbcac6700	feat(backend): custom mail patterns — display-name match + type-aware api Completes the custom-mail-patterns feature (schema + migration shipped in ba170af alongside the chat-tab-badge commit — apologies for the mishap, agent staging collided with mine). This is the actual logic that makes the new type column do work: - mail-classifier.ts: new layer 2.6 between brand+random-token detect and the score-based heuristic. Case-insensitive substring match of the From-display-name against the user's customDisplayNames list. Hard-block when matched, skip score entirely. - db/domains.ts: getCustomMailDisplayNames(userId) reads the new type=mail_display_name rows. countActiveCustomDomains stays a shared total — matches the user's pick of a single 5/5/10 pool spanning web + mail patterns rather than separate counts per type. - scan-internal.post.ts and scan.post.ts both preload the display-name list per user before the message loop and thread it into classifyMail. - POST /api/custom-domains accepts { pattern, kind: 'web' \| 'mail' } with the server inferring the concrete type — 'mail' splits into mail_domain when the input contains a TLD-like shape, otherwise mail_display_name. Existing { domain } body shape stays accepted for backwards compatibility with older clients. - POST /api/custom-domains/:id/submit treats both mail types as community-submittable. The user explicitly chose this; the admin review pipeline is the backstop against display-name false positives. - vitest cases cover: substring match, case insensitivity, no-match fallthrough to score, mail_domain still flowing through the existing domain-set path, and shared-pool slot counts (3 web + 2 mail_domain + 1 mail_display_name = 6 against the 10-slot legend cap).	2026-05-16 01:53:59 +02:00
chahinebrini	f2e3c00943	refactor(mail): remove groq llm layer — deterministic pipeline only User-Direktive: Mail-Filter bleibt auf dem deterministischen Score+Layer-2.5-Stack. Groq-LLM Borderline-Call (Layer 4) entfernt. Layer 2.5 Brand+Random fängt den Apple Hide-My-Email Fall (icloud.com-Adressen mit kryptischen Local-Parts + Brand-DisplayName) weiterhin sauber via Hard-Block. Score-Mid-Range 25-79 entscheidet jetzt deterministisch: ≥50 → BLOCK, sonst PASS. Damit auch DSGVO-P0-Items aus dem Hans-Müller-Review obsolet (AVV-Annex Groq, Drittland-USA-Consent-Toggle, Datenschutzerklärung-Absatz). - mail-classifier.ts: callGroqClassifier + redactLocalPartForLLM + groq-Feld raus - scan.post.ts + scan-internal.post.ts: groqApiKey-Param raus, groq-Sample-Felder raus - mail-classifier.test.ts: Groq-Tests + redactLocalPart-Tests entfernt, 46 Tests grün DB-Spalten in mail_classification_samples (groq_) bleiben als legacy nullable — Cleanup-Migration optional in späterem Sprint. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-14 22:15:32 +02:00
chahinebrini	bdd93668ae	feat(mail): multi-layer classifier — Brand+Random, Relay-Decoder, Score, Groq + ML-Sampling Layer 0–4 Klassifikations-Pipeline in mail-classifier.ts: - Layer 2: Domain-Hard-Block + Relay-Decoder (=domain.tld aus SendGrid/Mailchimp-Bounces) - Layer 2.5: Brand+Random-Token-Hard-Block (Gambling-Brand-Normalisierung + Random-Token-Detection) verhindert LLM-Call für bekannte Gambling-Relayer (Gamblezen, BetandPlay etc.) - Layer 3: Score 0–100 (TS-Gewichte: Domain-Keywords, Subject-Keywords, Name-Match, Geld-Pattern, Urgency, All-Caps, Short-Random-Domain, Brand/Random-Ergänzungen) - Layer 4: Groq Llama 3.3 70B Borderline-Klassifikation (Score 25–75) mit Local-Part-Redaction (DSGVO: nur behalten wenn local-part selbst Keyword enthält) - Layer 5: MailClassificationSample-Insert nach jeder Klassifikation (ML-Phase 3) Migrations: - 20260514_add_mail_blocked_trigger_source: ADD COLUMN trigger_source auf mail_blocked - 20260514_add_mail_classification_sample: CREATE TABLE mail_classification_samples 50 neue Tests (mail-classifier.test.ts): alle Layer, beide Screenshot-Beispiele (Gamblezen + BetandPlay) bestätigt als Layer-2.5-Hard-Block ohne LLM-Call, Whitelist, Score, Redaction. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-14 22:05:35 +02:00

4 Commits