# llms.txt — Guidance for AI Crawlers and LLMs
# Domain: usepostal.com
# Generated: 2025-09-17

## Owner
Name: Postal (usepostal.com)
Contact: https://www.usepostal.com/ (use the contact form)
Jurisdiction: USA

## Scope
This file applies to AI crawlers, agents, and retrieval systems that access content at https://www.usepostal.com/. It supplements robots.txt and MUST be honored alongside it. Do not access content that is blocked by robots.txt or that requires authentication.

## High-Level Policy
Training (commercial): Permitted for excerpts and derived embeddings only, provided proper attribution ("Source: Postal — usepostal.com") and a link to the original URL are included.
Training (non‑commercial research/academic): Permitted for excerpts and derived embeddings only, provided proper attribution ("Source: Postal — usepostal.com") and a link to the original URL are included.
Indexing/RAG for end users: Permitted for *short excerpts and summaries* (≤ 200 tokens per page) with attribution and a link to the source.
Archiving: NOT PERMITTED.
Mirroring: PERMITTED.
PII/sensitive data: Do NOT collect, store, or expose any personally identifiable information or customer materials.
Authentication walls: Do NOT bypass. Content behind sign‑in, paywalls, or forms is out of scope.

## Crawl Hygiene
User-Agent: *
Respect robots.txt: REQUIRED
Crawl-Delay: 3
Max-Requests-Per-Second: 1
Concurrent-Fetches: 2
Retry-After-on-429: 120

## Preferred Content Map
# These are representative, high-signal pages for summarization or RAG. Use the site sitemap for completeness.
Allow-List:
- https://www.usepostal.com/
- https://www.usepostal.com/pricing
- https://www.usepostal.com/about
- https://www.usepostal.com/blog
- https://www.usepostal.com/get-started
- https://www.usepostal.com/terms
- https://www.usepostal.com/privacy
Sitemaps:
- https://www.usepostal.com/sitemap.xml # if available

## Attribution Requirements
When displaying or using content, provide:
- Title of the page
- Canonical URL
- Publisher: "Postal"
- Timestamp of access

## Model & Agent Registry (non-exhaustive)
This policy applies to (including, but not limited to): GPTBot, ChatGPT-User, Claude-Web, ClaudeBot, Google-Extended, PerplexityBot, Applebot-Extended, FacebookBot, CommonCrawl, You.com/YouBot, BingPreview, CCBot, and similar agents.

## Rate-Limit Guidance by Section
/ (root, marketing pages): standard limits (see Crawl Hygiene)
/blog/: okay to fetch article pages; avoid bulk historical crawling in a single session
/forms/, /checkout/, /account/, /dashboard/: DISALLOW (may not exist; included for clarity)

## Prohibited Uses
- Training models that compete with Postal using substantial portions of our site
- Using content to infer or target Postal customers or partners
- Selling or redistributing our content without permission

## Permitted Uses (Summary/RAG)
- Quoting short passages with citation for Q&A or agent responses
- Creating embeddings for semantic search provided raw text is not redistributed
- Linking back to source pages for full context

## Enforcement
Violation of this policy may result in IP blocking, agent blocking, and/or legal action. For exceptions, please contact us with details about your crawler, purpose, data retention period, and opt‑out mechanisms.

## Versioning
Version: 1.0
Change-Log:
- v1.0: Initial publication of llms.txt for usepostal.com
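
## Example: Applying the Crawl Hygiene Limits (non-normative)
The Crawl Hygiene values could be applied by a crawler roughly as sketched below. This is an illustration under stated assumptions, not part of the policy: it assumes Crawl-Delay and Retry-After-on-429 are expressed in seconds, and that when Crawl-Delay and Max-Requests-Per-Second disagree the stricter (longer) interval applies. The names `CrawlPolicy` and `seconds_to_wait` are hypothetical and do not refer to any existing library.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class CrawlPolicy:
    """Values copied from the Crawl Hygiene section; units are assumed to be seconds."""
    crawl_delay_s: float = 3.0             # Crawl-Delay: 3
    max_requests_per_second: float = 1.0   # Max-Requests-Per-Second: 1
    concurrent_fetches: int = 2            # Concurrent-Fetches: 2 (not enforced in this sketch)
    retry_after_on_429_s: float = 120.0    # Retry-After-on-429: 120

    def min_interval(self) -> float:
        """Smallest gap between request starts that satisfies both rate limits."""
        return max(self.crawl_delay_s, 1.0 / self.max_requests_per_second)


def seconds_to_wait(
    policy: CrawlPolicy,
    last_request_at: Optional[float],
    now: float,
    last_status: Optional[int] = None,
) -> float:
    """Return how long to pause before sending the next request.

    last_request_at and now are timestamps in seconds (for example from
    time.monotonic()); last_status is the HTTP status of the previous response.
    """
    if last_request_at is None:
        return 0.0  # first request of the session, nothing to wait for
    required = policy.min_interval()
    if last_status == 429:
        # Assumption: back off for the full Retry-After-on-429 period.
        required = max(required, policy.retry_after_on_429_s)
    elapsed = now - last_request_at
    return max(0.0, required - elapsed)


if __name__ == "__main__":
    policy = CrawlPolicy()
    print(seconds_to_wait(policy, last_request_at=100.0, now=101.0))                   # 2.0
    print(seconds_to_wait(policy, last_request_at=100.0, now=101.0, last_status=429))  # 119.0
```

A crawler that issues requests one at a time and sleeps for the returned duration stays within Concurrent-Fetches: 2 by construction. A concurrent crawler would additionally need to cap in-flight requests at that value. robots.txt checks are out of scope for this sketch and remain required.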