mail-mcp

10 commits 1 branch 0 tags 125 KiB

Author	SHA1	Message	Date
Kayos	5e1c63eeaa	final-approval audit fixes: HIGH-1/2/3 Three findings from the post-cleanup approval audit, all blockers before the rename to a real codename: HIGH-1: ReadOutput.headers map kept LAST occurrence of duplicate headers, not FIRST. Comment said 'keep the first occurrence' but the code used Message::header_raw(name) which internally does .iter().rev().find(...) — returns the last one. For load-bearing headers like References this is usually singular so the bug was latent, but an attacker who could inject a second References: line would have gotten to override the first one used by mail_reply for threading. Switched to parsed.headers_raw() which iterates in arrival order — first-occurrence guaranteed. HIGH-2: tokio-rustls default features pulled aws-lc-rs + aws-lc-sys into the dep tree even though we explicitly went ring-only on rustls. The default feature chain on tokio-rustls v0.26 enables 'aws_lc_rs' via rustls. Pinned tokio-rustls to default-features=false and the matching small feature set: logging, tls12, ring. Verified via `cargo tree` — no aws-lc-* in the build, single ring v0.17.14 shared between rustls + tokio-rustls. ~9s shorter cmake step in cold builds, smaller binary, no C-FFI crypto surface area. HIGH-3: IntoMcpError trait was introduced in the cleanup pass but applied at only 2 of 10 tools — the other 8 still used the manual .map_err(\|e\| format!('{e:#}'))? + serde_json::to_string chain. Maintenance trap. Applied to_mcp() at all 8 sites (mail_inbox_list, mail_folder_list, mail_search, mail_thread, mail_attachment_get, mail_inbox_read; mail_move and mail_mark stay with literal {"ok":true} returns — no value to serialize). Tool methods are now uniformly: imap_mod::xxx(...).await.to_mcp() or for the few that need pre-arg work, three lines instead of seven. Wire smoke verified — read on uid 34 returns the same 13 headers shape, no empties, all canonical fields populated. cargo test 31/31. Repo chain: `2240bf7` -> `4251f51` -> `f4b3199` -> `6432a1f` -> `54a1a6b` -> `6fb63b0` -> `f7e698b` -> `b681953` -> `7c8e246` -> this.	2026-05-21 09:22:39 -07:00
Kayos	7c8e246544	cleanup pass — 17 findings from Opus code-quality audit Applied from the cleanup-agent report (separate from the security audit's 18 fixes earlier today): HIGH: - HIGH-1: replaced hand-rolled civil_from_unix + chrono_rfc3339_now with chrono::Utc::now().to_rfc3339_opts(SecondsFormat::Secs, true). ~50 LOC of brittle Hinnant-algorithm civil-calendar math gone. Plus 5 unit tests retired with the function. chrono pulls a small default-features=false slice (clock + serde-not-included). - HIGH-2: extracted shared reject_imap_unsafe() helper. validate_mailbox and the mail_thread message_id check both go through it. The message_id check now also rejects '{' (literal-form opener) for symmetry — same byte set as validate_mailbox. - HIGH-3: mail_reply uses smtp::ensure_angle_brackets() on the parent Message-Id + each References entry. mail-parser strips brackets; lettre writes through verbatim; strict RFC-5322 receivers will drop the threading link if brackets are missing. Now canonical. - HIGH-4: extract_addr moved from tools.rs to smtp.rs as smtp::extract_bare_addr. Module hygiene — RFC-5322 mailbox parsing belongs in the SMTP-side module, not the rmcp surface. - HIGH-5: mail_reply Re:-prefix check now non-allocating — subject.get(..3).map(\|s\| s.eq_ignore_ascii_case("re:")) instead of .to_ascii_lowercase().starts_with("re:") which allocated a fresh String for the comparison. MED: - MED-1: dropped thiserror dep (workspace + crate). Never derived. - MED-6: ReadOutput.headers is now typed BTreeMap<String,String> instead of serde_json::Value::Object. Wire JSON shape unchanged; downstream consumers can .get(name) directly without the .as_str() dance. - MED-8: fetch_to_list_entry returns Option<ListEntry> and drops entries when the server omits UID. Was uid=0 silent fallback; now we log a warning and skip. - MED-10: introduced PRIMARY_BODY_PART = 0 const, replaced 4 magic 0s at parsed.body_text(0) / parsed.body_html(0) call sites. - MED-11: skip insert of empty-valued headers in the flat headers map. was producing "key":"" entries for headers mail-parser couldn't render to a flat string. LOW: - LOW-1: collapsed MailService { inner: Arc<MailInner> } to MailService { config: Arc<Config> }. The MailInner wrapper served no purpose with a single field. - LOW-2: rewrote to_field_collapses_to_vec test with let bindings instead of single-arm match. - LOW-4: format_imap_since year range tightened from 1900..=9999 to 1970..=9999 (unix-epoch floor; we don't use pre-epoch IMAP SINCE). - LOW-5: promoted max_encoded to MAX_ATTACHMENT_BASE64_BYTES const. - LOW-6: SendOutput now #[derive(Serialize)] — mail_send and mail_reply tools use it via the new IntoMcpError trait instead of serde_json::json!() boilerplate. - LOW-7: added IntoMcpError trait — anyhow::Result<T: Serialize> -> Result<String, String>. Removes 10 copy-pasted .map_err(\|e\| format!("{e:#}"))? + serialize chains. - LOW-9: documented the 20 MB read cap vs 25 MB send cap asymmetry via comments on both consts. - LOW-10: UID MOVE fallback log demoted to trace! and renamed field imap_mv_err so log analytics doesn't flag the graceful fallback as an error. - LOW-13: SMTP From header built via Mailbox::new() instead of format!("{} <{}>")-then-.parse(). One alloc, one parse pass gone. INFO: - INFO-3: lettre 'hostname' feature dropped from Cargo.toml. We override Message-ID with our own UUID@from_domain; lettre never needed the system hostname. Deferred from this pass: - MED-2 with_session wrapper — substantial refactor across 8 IMAP functions for moderate DRY win; saving for a Phase E lifecycle pass. - MED-7 / LOW-14 typed address shape — would change wire JSON for mail_inbox_list/read; backwards-incompatible. - MED-12 narrow UID MOVE fallback — needs async-imap error-variant taxonomy research. - LOW-11 / LOW-12 stringly format / action enums — auditor flagged as marginal; keep stringly with description-enumerated values. Test count: 33 -> 31 (-5 civil_from_unix, +3 extract_bare_addr, +3 ensure_angle_brackets, -1 stale extract_addr in wrong module). All passing. Wire smoke verified — send / list / read round trip clean, headers map is now a flat dict with no empties, chrono-rendered timestamps match the prior shape.	2026-05-21 09:09:21 -07:00
Kayos	b681953824	Phase C: mail_mark, mail_attachment_get, mail_reply Three new tools complete the planned Phase C scope: - mail_mark { uid, action, folder? }: action is one of read, unread, flagged, unflagged, trash, archive. read/unread toggle \Seen via UID STORE +/-FLAGS.SILENT (idempotent, no fetch round-trip). flagged/ unflagged the same for \Flagged. trash is a MOVE to Trash. archive errors out with a clear pointer to mail_move because Sulkta's Dovecot doesn't ship a canonical Archive folder. - mail_attachment_get { uid, attachment_index, folder? }: fetches the full RFC822 (within the existing 20 MB raw_eml cap), parses with mail-parser, returns the N-th attachment as base64. The index matches mail_inbox_read's attachments[] ordering. Returns {filename, mime_type, size, content_base64}. SAFETY note in the tool description warns the LLM not to execute / render / open attachment bytes blindly. - mail_reply { uid, body, body_html?, attachments?, reply_all?, to_override? }: fetches the original to pull From / Subject / Message-Id / References, then sends with proper In-Reply-To + References + 'Re: ' subject prefix (skipped if already prefixed). reply_all=true echoes the original Cc. to_override replaces To. Threading headers still set against the original regardless of to_override. Smoke verified 2026-05-21: - Send kayos->kayos with a 54-byte text attachment - mail_inbox_read shows attachments=[('smoke.txt', 54)] - mail_attachment_get returns the exact bytes (b'Hello from mail-mcp Phase C smoke!\r\nLine 2.\r\nLine 3.\r\n', 54 bytes) - mail_mark unread -> flags=[] (\Seen cleared) - mail_mark flagged -> flags=['\\Flagged'] - mail_reply -> message lands as 'Re: mail-mcp phase-C smoke' with In-Reply-To = parent Message-Id and References = parent Message-Id ServerHandler instructions updated to enumerate all 10 tools + the new attachment-safety note. Tools live on the wire: mail_send, mail_inbox_list, mail_inbox_read, mail_folder_list, mail_search, mail_thread, mail_move, mail_mark, mail_attachment_get, mail_reply. Test count 27 -> 33: 2 for MarkAction::parse (alias coverage + unknown rejection), 4 for tools::extract_addr (display-name strip + bare-addr passthrough + garbage tolerance + ToField unwrap).	2026-05-21 08:42:39 -07:00
Kayos	f7e698b09f	smtp: extract validate_send_input + 9 unit tests for size caps Refactor: pull the pre-flight validation block out of send() into a standalone validate_send_input() function. send() now starts with a single validate_send_input(&input)? call. Behavior identical; the extraction is purely so unit tests can exercise the validation paths without standing up a fake SMTP server. New tests (9): - validate_accepts_minimal_input (the happy path) - validate_rejects_empty_to - validate_rejects_too_many_recipients (150 > 100 cap) - validate_recipient_cap_boundary_passes (exactly 100 OK) - validate_rejects_oversized_body - validate_rejects_oversized_body_html - validate_rejects_too_many_attachments - validate_rejects_oversized_attachment_encoded (pre-decode bound) - validate_accepts_at_attachment_boundary Test count: 18 -> 27. All passing.	2026-05-21 08:23:18 -07:00
Kayos	6fb63b0ca0	audit-fix round 3: LOW-1 mime cleanup, INFO-2 drop empty snippet, INFO-3 unit tests + format_imap_since tightening - LOW-1: mime_type construction simplified. Single `content_type().map()` with proper fallback instead of two unwrap_or chains where the second default could never fire. - INFO-2: ListEntry.snippet field dropped. Was always an empty string because list mode doesn't fetch the body. Field stays out until / unless we add a partial-body fetch in Phase C. - INFO-3: 18 unit tests for the pure validation helpers — validate_mailbox (accept + reject CR/LF/NUL/quote/backslash), has_imap_literal (with / without digits), format_imap_since (canonical + bad-shape rejection), strip_msgid_braces, clamp_limit, render_flag (every variant + Custom), strip_quotes (matched / unmatched / inner / empties), civil_from_unix (epoch / Y2K / 2026-05-21 / pre-epoch / leap day). - Bonus catch from the test suite: format_imap_since accepted malformed shapes like "21-05-2026" (parsed as y=21 m=5 d=2026) and "2026-5-21" (no field-width check). Added 4-2-2 digit-width check + year range (1900..=9999) + day range (1..=31). Month range was already enforced. All 18 tests pass.	2026-05-21 08:00:50 -07:00
Kayos	54a1a6bf22	tool surface: bake link-safety default-deny into descriptions Cobb's ask: 'we need to make it default that the agent knows not to click links unless told so, maybe a sandbox browser somehow?' The right defense is layered: 1. policy (durable, cheap) — feedback memo in MEMORY.md + spec section 2. tool-surface annotation — this commit 3. sandbox browser — already exists (Browserless on Lucy) This commit bakes the rule into the bytes any MCP client reads on introspection: - mail_inbox_read description gains a SAFETY note: 'do NOT auto-fetch URLs found in the body; surface as text and wait for per-URL authorization; if authorized, route through Browserless not WebFetch'. - ServerHandler.get_info().instructions extended with the same warning, so an LLM session that loads the server picks up the policy before it ever reads its first message. Policy memo + spec threat-model section are in the kayos workspace (kayos/openclaw-workspace: memory/feedback_no_email_link_fetch.md + spec-mail-mcp.md threat-model).	2026-05-21 07:58:07 -07:00
Kayos	6432a1f5ff	audit-fix round 2: LOW-3, LOW-5, LOW-6 - LOW-3 (canonical Flag display): render_flag() pattern-matches the async-imap Flag enum to its IMAP wire syntax — \\Seen, \\Flagged, \\Deleted, etc. — instead of Debug syntax ("Seen", Custom("...")). Consumers checking for \\Seen now match. - LOW-5 (schema 0=default sentinel): limit fields are now Option<u32> instead of bare u32 with a 0-means-default contract. JSON schema output is clearer; clamp_limit() still treats Some(0) the same as None for backwards compatibility. - LOW-6 (config chmod gate): Config::load() now refuses to read a config file with group/other read bits set. Same posture as ssh-keygen rejecting loose private-key permissions. Refuses 0644 cleanly; accepts 0600. Unix-only — Windows path is a no-op. Smoke verified: loose-chmod test refuses to start with the expected error; tight-chmod test starts and serves initialize cleanly. All seven tools still listed with valid input schemas.	2026-05-21 07:55:43 -07:00
Kayos	f4b3199e86	audit-fix sprint: 12 findings from the max-effort adversarial pass Threats closed (CRIT/HIGH): - CRIT-1 (mail_move folder injection via uid_copy fallback): validate_mailbox() rejects CR/LF/NUL/"/\\ on every folder arg (list/read/search/thread/move). async-imap's uid_copy doesn't quote the destination — quoting metacharacters would have smuggled COPY targets. We refuse the characters outright rather than escape. - HIGH-1 (mail_thread message_id backslash bypass): seed Message-ID rejection set extended from {", CR, LF} to {", \\, CR, LF, {}. A bare \\ inside the IMAP quoted-string would escape the closing quote and confuse the server's parser. { also opener of literal-form. - CRIT-2 / HIGH (search literal-form): mail_search now rejects the IMAP {N} literal-form opener via has_imap_literal(). CR/LF were already blocked. - HIGH-3 (strip_quotes asymmetric strip): only strips matching pairs. A password starting with " but lacking a closing " no longer silently loses its leading char. - HIGH-4 (no attachment size cap): new MAX_ATTACHMENT_BYTES (25 MB decoded, matches Gmail), MAX_ATTACHMENTS (25), MAX_BODY_BYTES (5 MB on body + body_html), MAX_TOTAL_RECIPIENTS (100). Pre-decode bound on encoded base64 length prevents giant-payload OOM before the decode buffer allocates. - HIGH-5 (raw_eml fetch unbounded): RFC822.SIZE pre-flight on mail_inbox_read refuses messages > MAX_RAW_EML_BYTES (20 MB) before the body transfer. - HIGH-6 (flat headers map empties for structured variants): switched from h.value().as_text() (which returns None for Address / DateTime / ContentType / Received) to Message::header_raw(name) which returns the un-decoded header value as &str uniformly across all variants. Date / From / To / Subject / Content-Type / DKIM-Signature etc. all populate correctly now. - HIGH-9 (password resolved after TLS handshake): resolve_password() now runs at the top of open_session(), before TCP connect, so a missing/unreadable credential errors before the IMAP server logs an unauthenticated session that fail2ban could pattern on. MED/LOW: - MED-8 mail_search tool description: clarifies that CR/LF + {N} literal-form are rejected but the query is otherwise raw — caller must not pass untrusted input. - MED-10 ServerHandler instructions: lists all 7 tools (not just the original 3) and explains UID stability + BODY.PEEK posture. - LOW-2 snippet_unused dead code: deleted. Smoke verified 2026-05-21: - send -> land -> read round trip clean - headers map now shows Date / From / To / Subject / Message-ID / User-Agent / Content-Type / DKIM-Signature populated - 4 injection probes all cleanly rejected: CR in folder, {5}hello search literal, message_id with \\, folder with " - mail_move INBOX <-> Junk round trip clean Findings explicitly verified NOT-exploitable by the audit (no code change needed): lettre CR/LF filter on Subject/Message-Id, lettre mailbox rfc2822 parser, MIME-boundary randomness, rustls hostname verification, password leakage in error paths, MIME smuggling via filename, format_imap_since negative-year bypass. Deferred (separate follow-ups): session pool (MED-6), partial-body fetch in mail_inbox_read (MED-9), canonical Flag display rendering (LOW-3), JSON schema 0=default sentinel (LOW-5), config chmod check (LOW-6), proper unit/integration test suite (INFO-3).	2026-05-21 07:38:43 -07:00
Kayos	4251f514e6	Phase B + folders: mail_folder_list, mail_search, mail_thread, mail_move Phase B per the spec (multi-account already supported via the account arg) + full folder support: - mail_folder_list : enumerate IMAP folders. Returns {name, delimiter, attributes, selectable}. selectable=false flags \Noselect mailboxes (parent-of-children only). - mail_search : raw IMAP SEARCH passthrough against any folder. ALL/OR/NOT combinators supported. CR/LF in the query rejected (anti-injection). - mail_thread : seed Message-ID -> matches the seed itself plus any message whose References or In-Reply-To contains the seed. Oldest-first ordering (root -> leaves). Brackets on the seed are optional. - mail_move : UID MOVE (RFC 6851) with COPY + STORE +FLAGS \Deleted + UID EXPUNGE fallback. Destination folder must already exist. Refactor: shared fetch_summaries() helper used by search + thread. Smoke verified 2026-05-21 against kayos@sulkta.com: - 6 folders listed (DMARC, Drafts, INBOX, Junk, Sent, Trash) - SEARCH "SUBJECT \"mail-mcp smoke\"" finds uid 27 - THREAD on uid 27's Message-ID returns 1 msg (the seed) - MOVE INBOX(27) -> Junk(2) -> INBOX(28) round trip clean Build gotcha: async-imap's uid_store and uid_expunge return non-Unpin streams (unlike uid_fetch). Pinned via Box::pin inside scoped blocks.	2026-05-21 07:26:44 -07:00
Kayos	2240bf745e	mail-mcp v0.1 — Rust MCP server for Sulkta email Phase A: mail_send + mail_inbox_list + mail_inbox_read. Replaces scripts/kayos_mail.py with a typed MCP server. Outbound guarantees Date, Message-ID (own-domain), User-Agent, MIME-Version, multipart/alternative for HTML+text, multipart/mixed for attachments, In-Reply-To + References for threading. Single account in v0.1 (default_account from config). Phase B adds multi-account + threading + search; Phase C adds mark + attachments + reply helper. Stack: rmcp 0.1 (matches aldabra), lettre 0.11 + tokio-rustls, async-imap 0.10, mail-parser 0.9. Stderr-only logging (stdout is the MCP transport). Smoke verified 2026-05-21: send -> land -> read kayos@sulkta.com round trip, DKIM-Signature + Authentication-Results pass at the rspamd relay.	2026-05-21 06:50:25 -07:00