SaaS Glossary — Plain-English Definitions
158 plain-English definitions covering SaaS metrics, AI concepts, proxy types, web scraping, and affiliate marketing — written for practitioners, not textbooks.
158 of 158 terms
3
3PL (Third-Party Logistics)
An outsourced provider that handles warehousing, picking, packing, and shipping of orders on a merchant’s behalf.
A
AI Agent
An AI agent is an LLM-based system that can take autonomous actions — calling tools, browsing the web, writing code, querying databases — to complete a multi-step task without manual intervention at each step. Unlike a chatbot that generates text, an agent executes work in the world. Agents use a perceive-reason-act loop: read the environment, decide what to do, call a tool or write output, observe the result, and repeat. The risk is that errors compound across steps, making robust error handling, sandboxing, and human oversight critical for production deployments. Claude, GPT-4, and Gemini all support function calling as the foundation for agent behavior.
API Key
A secret token that identifies and authorizes an application calling an API, controlling access and tracking usage.
API Rate Limit
A cap on how many requests a client can make to an API in a time window, protecting servers from overload and abuse.
API-First
API-first means the product is designed so every feature is accessible via API before — or simultaneously with — any UI. Developers can build on top of it programmatically, making it composable within other stacks and workflows. The API is the product, not an afterthought. API-first products dominate developer infrastructure: Stripe for payments, Twilio for communications, Cloudflare for networking. The approach unlocks a second distribution channel — developer adoption — and a powerful moat, because switching requires unwinding embedded API calls across an entire codebase, not just a dashboard login.
ARPU (Average Revenue Per User)
ARPU is total recurring revenue divided by the total number of paying accounts for a given period. It tells you the average deal size across your customer base and is the baseline for LTV calculations. Rising ARPU usually signals successful upsell motions or healthy expansion — but can also mask churn of small accounts. Falling ARPU is often the first signal that a product is attracting a different, lower-value customer segment than intended. ARPU segmented by cohort or plan tier is far more useful than blended ARPU for diagnosing pricing strategy.
ARR (Annual Recurring Revenue)
Annual Recurring Revenue is MRR × 12 — the yearly run-rate of all active subscription revenue at a point in time. It excludes one-time fees, services, and overages. ARR is the headline metric investors and boards track because it smooths monthly noise and maps cleanly to the annual contract cycle most SaaS companies sell.
Activation Rate
The share of new signups that reach a product’s "aha moment" or first key action, a leading indicator of retention and PLG success.
Affiliate Network
An affiliate network is a marketplace that connects advertisers (merchants) with publishers (affiliates). The network handles tracking, attribution, fraud detection, and payments — taking a 20-30% cut of commission volume. Well-known B2B SaaS networks include PartnerStack, Impact, and Tune. Consumer networks include CJ, Awin, ShareASale, and Rakuten. Choosing a network is mostly about which merchants they already represent in your vertical.
Agentic AI
AI systems that autonomously plan and execute multi-step tasks using tools, memory, and reasoning — going beyond single prompt-response interactions.
Annual Contract Value (ACV)
The average annualized revenue from a single customer contract, excluding one-time fees — a core SaaS sales metric for sizing deals.
Annual Run Rate
A projection of annual revenue based on current performance, usually monthly recurring revenue multiplied by twelve.
Anti-Bot Detection
Anti-bot detection is a suite of techniques websites use to identify and block automated requests. It combines IP reputation scoring, behavioral analysis (mouse movement, click patterns, timing), browser fingerprinting, JavaScript challenges, and ML-based anomaly detection into a risk score per request. Enterprise anti-bot systems like Cloudflare Bot Management, Akamai Bot Manager, and DataDome combine dozens of signals to score requests in real time. Bypassing them requires matching the behavioral and technical fingerprint of a real browser — not just spoofing a user agent. The sophistication arms race between scraper tooling and anti-bot vendors drives most of the complexity in modern professional scraping.
Anti-Detect Browser
A browser that spoofs or isolates fingerprints and profiles, letting users manage many accounts without triggering bot detection.
Attribution Window
The attribution window is the length of time between a user click and a conversion during which an affiliate or ad source can claim credit. Typical values: 7 days (DTC), 30 days (SaaS), 90 days (high-consideration B2B). Shortening the window improves last-click advertiser economics; lengthening it rewards top-of-funnel partners. Every advertiser fight with their affiliate program eventually becomes a fight about the attribution window.
Average Order Value (AOV)
The average amount spent per transaction. Raising AOV via upsells or bundles lifts revenue without acquiring new customers.
B
Bandwidth (Proxy)
The amount of data transferred through a proxy, often the basis for residential-proxy pricing, measured per gigabyte.
Bookings
The total value of contracts customers commit to in a period — a forward-looking signal of future revenue, distinct from recognized revenue.
Bounce Rate
The share of visitors who leave a site after viewing one page without interacting — high rates can signal poor relevance or UX.
Browser Fingerprinting
Browser fingerprinting is the collection and analysis of browser attributes — installed fonts, screen resolution, WebGL renderer, audio context, timezone, plugin list, canvas rendering output — to create a unique identifier for a client without using cookies. Anti-bot systems use fingerprinting to track clients across IP changes and detect bots that rotate proxies but maintain a consistent technical fingerprint. Defeating fingerprinting requires a full browser environment with realistic, randomized attribute values. Playwright's stealth plugins and commercial tools like Bright Data's Scraping Browser provide hardened browser instances specifically designed to pass fingerprint-based challenges.
Burn Rate
How fast a startup spends cash each month. Gross burn is total spend; net burn subtracts revenue. Determines runway.
C
CAC (Customer Acquisition Cost)
CAC is the total cost of acquiring one new paying customer — marketing spend, sales salaries, tools, and overhead — divided by the number of customers landed in the same period. It is the single most important cost metric in SaaS because it determines whether growth is profitable. Keeping CAC low relative to LTV is the core unit-economics test. A CAC:LTV ratio of 1:3 or better is considered healthy; below 1:2 usually signals growth is too expensive to sustain. Tracking CAC separately by channel (paid search, outbound, content) reveals which acquisition paths are worth scaling.
CAC Payback Period
Payback period is the number of months required to recover the cost of acquiring a customer through that customer's gross profit. Formula: CAC ÷ (ARPU × gross margin). It answers the question: how long before this customer is profitable? Under 12 months is healthy for SMB SaaS. Enterprise SaaS can tolerate 18–24 months because of higher LTV and lower churn. Payback period directly determines how much capital a company needs to scale — a 6-month payback business can fund growth from revenue far sooner than a 24-month one.
CAPTCHA
CAPTCHA (Completely Automated Public Turing test to tell Computers and Humans Apart) is a challenge mechanism designed to block automated requests. Common forms: image recognition puzzles, audio challenges, invisible behavioral analysis (reCAPTCHA v3), and proof-of-work challenges like Cloudflare's Turnstile. CAPTCHAs are the primary defense layer for high-value scraping targets. Solving them programmatically requires either human-labor CAPTCHA farms, AI-based solver APIs (2captcha, Anti-Captcha, CapSolver), or browser fingerprint quality good enough to pass behavior-based challenges without triggering them. CAPTCHA solving adds latency and cost to any scraping pipeline that encounters it.
CDN (Content Delivery Network)
A network of distributed servers that cache and serve content from locations near users, cutting latency and speeding page loads.
CPA (Cost Per Action)
CPA is an affiliate commission model where the advertiser pays only when a user completes a specific action — typically a purchase, subscription signup, lead form submission, or app install. The affiliate bears all traffic-acquisition risk; the advertiser pays only for verified results. CPA is the dominant model in performance affiliate marketing because it aligns incentives cleanly: publishers optimize for quality conversions, not just clicks. Common CPA rates range from $2–$5 for lead generation to 20–40% of the first payment for SaaS subscriptions. Networks like Commission Junction, Impact, and PartnerStack facilitate CPA tracking between advertisers and publishers.
CPC (Cost Per Click)
CPC is a pricing model where the advertiser pays for each click on their ad or affiliate link, regardless of whether the click converts into a purchase or signup. Google Ads and most paid search platforms default to CPC bidding because it ties cost directly to traffic intent signals. In affiliate marketing, CPC deals are less common than CPA because the advertiser bears conversion risk. They appear in content networks, sponsored content placements, and comparison sites where traffic volume matters more than direct attribution. CPC rates vary widely by niche — competitive SaaS categories can run $10–$50+ per click on branded terms.
CPM (Cost Per Mille)
CPM is an advertising pricing model where the advertiser pays per 1,000 impressions (views), regardless of clicks or conversions. "Mille" is Latin for thousand. CPM is the standard model for brand awareness campaigns, display advertising, and newsletter sponsorships. For affiliate marketers running display placements, CPM is a useful metric to benchmark media efficiency — but conversion tracking downstream is essential because a high CPM can be justified by strong conversion rates. Newsletter CPMs in B2B SaaS niches typically run $40–$150 per thousand, significantly above broad consumer audiences.
CSS Selector
A CSS selector is a pattern used to identify and extract specific elements from an HTML document, based on element type, class, ID, attribute, or hierarchy. Scrapers use them — via libraries like BeautifulSoup, Cheerio, or Playwright — to pull specific fields from page markup. CSS selectors are often more stable than XPath for scraping because they are tied to visual presentation rather than exact document hierarchy. When a site redesigns its HTML structure, selectors break — building robust selectors that target semantic attributes (data-* attributes, aria roles) rather than class names reduces maintenance burden because semantic attributes tend to be more stable than presentational class names.
Cart Abandonment
When a shopper adds items to a cart but leaves without buying. Recovery emails and retargeting win back a share of these sales.
Cashback
A reward that returns a percentage of spend to the buyer after purchase, common in affiliate and credit-card promotions.
Chain of Thought
A prompting technique where an LLM reasons step by step before answering, improving accuracy on complex, multi-step problems.
Chargeback
A forced transaction reversal initiated by a cardholder’s bank, often from disputes or fraud, costing merchants the sale plus fees.
Churn
Churn is the rate at which subscription customers cancel or downgrade in a given period. Gross churn counts only departures; net churn subtracts expansion from the same cohort. Healthy B2B SaaS typically sees 0.5-1.5% monthly gross churn. SMB SaaS runs 2-5%. Anything north of 5% signals product-market-fit or pricing issues that compound into revenue death spirals if not addressed.
Click-Through Rate (CTR)
The percentage of people who click a link or ad out of those who saw it — a core measure of creative and targeting relevance.
Cohort Analysis
Grouping users by a shared start period to track retention, revenue, and behavior over time — revealing trends a single average hides.
Cold Outreach
Contacting prospects who have no prior relationship with you, typically via email or LinkedIn, to start a sales conversation.
Concurrency (Scraping)
The number of simultaneous requests or browser sessions a scraper runs, balancing speed against rate limits and detection risk.
Concurrent Connections
Concurrent connections (or concurrency) is the number of simultaneous requests a proxy plan supports at once. Higher concurrency means faster scraping — you can parallelize across many IPs simultaneously rather than waiting for each request to complete before firing the next. Concurrency limits are a key pricing differentiator among proxy providers. Entry plans might allow 10–50 concurrent connections; enterprise plans can run thousands. For large-scale scraping pipelines, practical throughput is usually concurrency × request speed — raising concurrency is often the fastest way to scale a working scraper.
Context Window
The context window is the maximum amount of text — measured in tokens — that an LLM can process in a single request, including both the prompt you send and the response it generates. Claude 3's context window is up to 200K tokens; GPT-4 Turbo supports 128K. The context window determines how much history, documentation, or data a model can "see" at once. Longer contexts enable richer RAG pipelines and multi-document synthesis but also increase latency and cost. Context window limitations drove the development of chunking strategies and vector-search retrieval systems as ways to work around early limits.
Contraction MRR
Contraction MRR is the monthly recurring revenue lost when existing customers downgrade their plan, reduce seats, or move to a lower tier — without cancelling entirely. It is a leading indicator of churn: accounts that downgrade often cancel in the following quarter. High contraction is an early warning sign that customers are trimming budgets, hitting usage limits they do not need, or finding value in only part of the product. Customer success teams use contraction signals to trigger proactive outreach before an account goes dark entirely.
Conversion Funnel
The stages a user moves through from awareness to purchase. Mapping it reveals where prospects drop off and where to optimize.
Conversion Pixel
A tiny tracking snippet that fires when a user completes an action, reporting the conversion back to an ad or analytics platform.
Conversion Rate
The percentage of visitors or users who complete a desired action — signup, purchase, or upgrade — a core funnel optimization metric.
Coupon Stacking
Applying more than one discount code or offer to a single purchase — usually restricted by merchants to one code per order.
Customer Health Score
A composite metric blending usage, support, and engagement signals to predict which customers are likely to renew, expand, or churn.
D
DOM (Document Object Model)
The DOM is the in-memory tree representation of an HTML document that the browser builds after parsing HTML and executing JavaScript. It is the live structure that JavaScript manipulates — and the structure that browser-based scrapers like Playwright and Puppeteer interact with. Static scrapers read raw HTML without executing JavaScript. Browser-based scrapers interact with the fully rendered DOM — necessary for JavaScript-heavy single-page apps where content only appears after client-side rendering. DOM-based scraping is slower and more resource-intensive but handles modern web apps that a raw HTTP scraper would see as nearly empty.
Data Parsing
Extracting structured fields from raw HTML or text — turning a scraped page into clean, usable rows of data.
Datacenter Proxy
A datacenter proxy routes traffic through an IP address hosted in a commercial hosting provider — AWS, Hetzner, OVH, DigitalOcean — rather than a residential ISP. Datacenter IPs are fast, cheap, and highly detectable. Use them for public-data scraping, CI/CD egress, geo-testing, and anything where the target site does not actively fingerprint IP reputation. Expect per-GB pricing in the $0.10-$0.50 range versus $2-$8 for residential.
Deep Linking (Affiliate)
Deep linking in affiliate marketing means generating a tracked link that sends users directly to a specific product page, landing page, or pricing page — rather than the advertiser's homepage. Deep links improve conversion by reducing friction between ad click and purchase intent. Most affiliate networks and in-house programs support deep linking via a URL parameter or a link-builder tool. For SaaS affiliate programs, linking directly to a pricing page or feature-specific landing page can meaningfully lift EPC compared to routing all traffic through a homepage that requires additional navigation before users see what they came for.
Drip Campaign
An automated series of pre-written emails sent on a schedule or trigger to nurture leads and move them toward conversion.
Dropshipping
A retail model where the store never holds inventory — suppliers ship products directly to customers after each order.
Dunning
The automated process of retrying failed subscription payments and emailing customers to recover revenue from expired or declined cards.
E
EPC (Earnings Per Click)
EPC is an affiliate metric measuring average revenue generated per 100 clicks sent to an offer. Formula: total commissions ÷ total clicks × 100. It is a publisher-side metric for comparing offer profitability across different commission rates and conversion rates. EPC helps affiliates decide how much to pay for traffic. A 20% commission on a $200 product with a 1% conversion rate yields $0.40 EPC — the maximum CPC bid that breaks even. Networks often publish advertiser EPC to help publishers prioritize promotional focus. EPC varies significantly by traffic source quality; the same offer can show 2× difference in EPC between email and paid search traffic.
Edge Computing
Running compute close to users at the network edge rather than a central data center, reducing latency for real-time apps.
Embeddings
Embeddings are numerical vector representations of text (or images, audio, or other data) that capture semantic meaning. Similar content produces vectors that are close together in high-dimensional space, enabling similarity search, clustering, and classification without keyword matching. Embeddings are the backbone of modern AI search, recommendation, and RAG systems. A query like "affordable residential proxies" embedded as a vector will surface semantically related results even if they share no exact words. Most embedding models output vectors of 768–3072 dimensions. Popular providers include OpenAI, Cohere, Google, and Cloudflare's built-in AI binding.
End-to-End Encryption
Encryption where only the communicating users can read the data; not even the service provider can access the plaintext.
Exit Node
The exit node is the final server or device in a proxy chain whose IP address the target website sees. In a residential proxy network, the exit node is the actual consumer device — a phone or desktop — routing your traffic through its connection to the internet. Exit node location determines the apparent geographic origin of your traffic. Understanding the full request path — your client → proxy infrastructure → exit node → target — matters for debugging latency issues: slow or overloaded exit nodes are often the bottleneck, not your bandwidth. Exit node health and speed vary significantly within large residential proxy pools.
Expansion MRR
Expansion MRR is the additional monthly recurring revenue generated from existing customers through upgrades, seat additions, add-on purchases, or moving to a higher pricing tier — without acquiring any new accounts. It is often the cheapest revenue to win because these customers already trust the product. Best-in-class SaaS teams aim for expansion MRR to outpace new-logo MRR within 2–3 years, at which point the business can grow even with flat new customer acquisition. Expansion MRR is tracked separately from new MRR to measure the health of the upsell and expansion motion.
F
Fine-Tuning
Fine-tuning is the process of continuing to train a pre-trained foundation model on a smaller, domain-specific dataset to specialize its behavior. Rather than training from scratch — which costs millions of dollars of compute — fine-tuning adapts an existing model for specific tasks at a fraction of the cost. Fine-tuning is used to make models follow specific response formats, adopt a particular tone, or develop expertise in a narrow domain such as medical documentation, legal drafting, or technical support. The risk: over-fine-tuning on narrow data can degrade general-purpose capabilities. RAG is often a better alternative when the goal is knowledge grounding rather than style or format specialization.
Foundation Model
A foundation model is a large AI model trained on broad, general data that can be adapted — via fine-tuning, prompting, or retrieval augmentation — for a wide range of downstream tasks. GPT-4, Claude, Llama, and Gemini are all foundation models. The "foundation" metaphor captures the idea that these models are starting points other applications build on, rather than training from scratch for every use case. Foundation model providers compete on benchmark scores, context length, safety, and price, while the application layer above them commoditizes rapidly as capabilities equalize.
Freemium
Freemium is a pricing model where a core set of features is permanently free, with additional functionality available on paid tiers. Unlike a free trial, freemium has no time limit — users can stay free indefinitely. The goal is to acquire a large user base that upsells over time. Freemium requires careful gate design: free must be valuable enough to attract real users, but constrained enough that growing teams need to upgrade. The median freemium-to-paid conversion rate is 2–5%; viral, team-based tools tend to convert higher because free users evangelize to colleagues who trigger team upgrades.
Fulfillment by Amazon (FBA)
Amazon’s service that stores, packs, and ships a seller’s products and handles returns and customer service, for a fee.
G
GDPR
The EU’s data-protection regulation governing how companies collect, store, and process personal data, with heavy fines for violations.
GRR (Gross Revenue Retention)
GRR measures the percentage of recurring revenue retained from existing customers — excluding any expansion. Where NRR can exceed 100% via upsell, GRR is capped at 100% because it only counts losses: churn and contraction. GRR is a purer signal of customer satisfaction than NRR. A company with high NRR but low GRR is papering over high churn with aggressive expansion. SaaS investors look at both together: GRR below 80% in enterprise or below 70% in SMB typically indicates a product or fit problem that upsell cannot fix.
Generative AI
Generative AI refers to AI systems that produce new content — text, images, audio, video, code — rather than just classifying or analyzing existing content. LLMs, image generators like Stable Diffusion and DALL·E, and audio synthesis models are all generative AI. The "generative" distinction matters for product design: these tools create artifacts that require human review and judgment, unlike a spam filter that returns a yes/no. Generative AI capabilities are expanding faster than governance norms, creating product opportunities and compliance risks for companies that deploy them in regulated industries.
Geo-Targeting (Proxy)
Geo-targeting in the proxy context means selecting IP addresses from a specific country, state/region, or city. Many web targets serve different content by geography — local prices, language versions, search results, streaming library availability, or ad creatives. Precise geo-targeting is critical for localized price monitoring, local SERP scraping, and ad verification. Country-level targeting is standard across all major providers; city-level targeting is available on premium plans and costs more due to smaller IP pools in specific locations. Some providers also offer ASN-level targeting to match traffic from specific ISPs.
Gross Margin
Revenue minus cost of goods sold, as a percentage. Healthy SaaS gross margins typically run 70–85% thanks to low marginal delivery cost.
Guardrails (AI)
Rules and filters that constrain AI behavior to keep outputs safe, on-topic, and compliant — blocking harmful or off-policy responses.
H
Hallucination (AI)
Hallucination is when an LLM generates confident, fluent text that is factually incorrect — inventing citations, people, statistics, or events that do not exist. It is a fundamental limitation of how LLMs work: they generate plausible continuations of text, not verified truth. Hallucination is the primary risk in high-stakes LLM deployments — legal, medical, financial. Mitigation strategies include RAG (grounding answers in retrieved documents), structured output validation, tool-calling to query live data, and human-in-the-loop review. Hallucination rates vary significantly by model, task type, and how well the prompt constrains the output space.
Headless Browser
A headless browser is a real browser (Chromium, Firefox, WebKit) run without a visible UI, controlled programmatically via a driver like Playwright, Puppeteer, or Selenium. Headless browsers are used for automated testing, scraping JavaScript-rendered sites, PDF generation, and SSR smoke tests. They are slow and memory-hungry compared to raw HTTP, but necessary whenever a site loads data via fetch/XHR after initial HTML.
Headless CMS
A content system that stores and delivers content via API, decoupled from the front end, so any device or framework can render it.
I
ICP (Ideal Customer Profile)
An ICP is a data-driven description of the company type most likely to buy, retain, and expand — defined by industry, headcount, tech stack, geography, and buying behavior. Not to be confused with a buyer persona (which describes a person), an ICP describes the account. A precise ICP sharpens every growth lever: which channels to invest in, what to say in SDR outreach, which deals to qualify in or out, and where to focus product roadmap investment. Most early-stage SaaS companies under-invest in ICP definition and end up serving segments with incompatible needs — high CAC, high churn, and low expansion as a result.
IP Rotation
IP rotation is the practice of cycling through a pool of IP addresses during web requests to avoid detection and rate limiting. Rotation can happen on every request, on a time interval, or when a block is detected by monitoring response codes. Effective IP rotation strategies match the target site's behavior: aggressive rotation on large anonymous scrapes, sticky sessions where the site expects consistent IPs (logged-in accounts, shopping carts), and geographic targeting when content differs by region. Poor rotation — using too few IPs or too-predictable patterns — is one of the top causes of scrape failures at scale.
IP Whitelisting
IP whitelisting (or allowlisting) is a security control that grants access only to requests from pre-approved IP addresses. Proxy providers offer static IPs or username-based authentication so customers can add proxy IPs to their target's allowlist without exposing credentials in every request header. For web scrapers accessing APIs legitimately, whitelisting your proxy provider's egress IPs eliminates per-request auth overhead and reduces latency. Some enterprise SaaS products require IP whitelisting as part of security compliance — meaning automated workflows using them need stable, predictable IP ranges rather than rotating pools.
ISP Proxy
An ISP proxy (also called a static residential proxy) is an IP address assigned by an internet service provider to a data center — combining the speed of datacenter infrastructure with the legitimacy of a residential IP. These IPs appear residential to target sites but are hosted on fast hardware. ISP proxies sit between datacenter and true residential proxies on the speed-vs-detection spectrum: faster than mobile or rotating residential proxies, but less likely to be blocked than standard datacenter IPs. They are a good fit for tasks requiring both speed and low block rates — price monitoring, account creation, ticketing, and SERP data collection.
Inference (AI)
Inference is the process of running a trained AI model on new input to generate a prediction or output. When you send a message to Claude or generate an image with DALL·E, the serving infrastructure is performing inference — running the model weights against your input in real time. Inference cost and latency are the dominant operational concerns for production AI products. Larger models are more capable but slower and more expensive to serve. Techniques like quantization, batching, KV caching, and speculative decoding optimize inference throughput. Inference providers include Anthropic, OpenAI, Google, and specialized hardware clouds like Together.ai and Groq.
J
JavaScript Rendering
JavaScript rendering refers to executing a page's JavaScript before extracting data from it. Many modern sites load content via JS after the initial HTML response — the raw HTML contains only a scaffold that scripts populate with actual content after the page loads. Handling JS rendering requires a headless browser (Playwright, Puppeteer, or a cloud rendering service like Browserless or ScraperAPI). Rendering is the most significant cost multiplier in scraping: a headless browser uses 10–100× more CPU and memory than a raw HTTP request and is 5–20× slower. Identifying which pages require JS rendering vs. which can be scraped statically is a critical pipeline design decision.
K
Knowledge Cutoff
The date after which an AI model has no training data, so it cannot know events or releases that happened later.
L
LLM (Large Language Model)
An LLM is a deep learning model trained on vast amounts of text data to understand and generate human language. GPT-4, Claude, Gemini, and Llama are all LLMs. They work by predicting the next token in a sequence, building fluent, contextually appropriate text one step at a time. LLMs are the foundation of most modern AI products — from chatbots and writing assistants to code generation and data extraction. Their capabilities and limitations are shaped by training data, model size, and alignment techniques like RLHF. Understanding what an LLM can and cannot do reliably is the first step to building on top of one.
LTV (Customer Lifetime Value)
LTV (also called CLV or Customer Lifetime Value) is the total revenue — or more usefully, gross profit — a customer generates from sign-up to churn. The simplest formula: ARPU ÷ monthly churn rate. More precise models weight by cohort and account for expansion revenue. LTV drives every growth decision: how much you can bid for ads, whether an enterprise sales team makes economic sense, and how aggressively you can discount to land logos. SaaS companies with strong expansion motions often find LTV inflates significantly in months 12–24 as upsell kicks in.
Last-Click Attribution
A model crediting the final touchpoint before conversion with the entire sale, simple but blind to earlier influences in the journey.
Latency (AI)
The delay between sending a prompt and receiving output. Lower latency matters for real-time chat, voice, and agent applications.
Lead Magnet
A free resource — ebook, template, tool — offered in exchange for contact details, used to capture and nurture leads.
Logo Churn
The percentage of customers (logos) lost in a period, regardless of their spend — distinct from revenue churn which weights by dollars.
Lookalike Audience
An ad-targeting segment built to resemble your best existing customers, helping platforms find new high-intent prospects at scale.
M
MRR (Monthly Recurring Revenue)
Monthly Recurring Revenue is the normalized monthly value of all active subscriptions. Annual plans are divided by 12. One-time fees, overages, and services are excluded. MRR is the canonical SaaS growth metric because it is predictable, comparable across pricing models, and directly drives runway and valuation math. Derived metrics include new MRR, expansion MRR, contraction MRR, churned MRR, and net new MRR.
Magic Number
A SaaS efficiency metric dividing new ARR by prior-period sales and marketing spend; above ~0.75 suggests efficient, scalable growth.
Merchant of Record
The legal entity responsible for selling to the customer, handling tax, compliance, and chargebacks — often a platform like Paddle or Lemon Squeezy.
Mixture of Experts (MoE)
A model architecture that routes each input to specialized sub-networks ("experts"), boosting capacity and efficiency without activating the whole model.
Mobile Proxy
A mobile proxy routes traffic through real mobile devices connected to carrier networks (3G/4G/5G), giving requests an IP address from a major telecoms provider like Verizon, T-Mobile, or Vodafone. Mobile IPs are the hardest to block because carriers assign them dynamically to millions of legitimate users. Mobile proxies command the highest prices in the proxy market — often 3–10× residential — because blocking a mobile carrier IP risks blocking real customers. They are essential for mobile-specific scraping, anti-fraud testing, ad verification on mobile platforms, and any target that actively distinguishes mobile from desktop traffic patterns.
Model Context Protocol (MCP)
An open standard that lets AI assistants securely connect to external tools, data sources, and APIs through a unified interface.
Multi-Tenancy
Multi-tenancy is a SaaS architecture where a single instance of the software serves multiple customers (tenants), with data isolation between them. Most SaaS products are multi-tenant — one application layer handles all customers, separating their data in the database rather than running separate servers for each. The opposite is single-tenancy: each customer gets a dedicated instance. Multi-tenancy is far more cost-efficient to operate, but requires rigorous data isolation to prevent cross-tenant data leakage. Security-sensitive enterprise customers sometimes demand single-tenant or "private cloud" deployments, typically at a significant price premium.
Multi-Touch Attribution
A model distributing conversion credit across every touchpoint in the customer journey, giving a fuller view of what drives sales.
Multimodal AI
AI that processes and combines multiple input types — text, images, audio, and video — in a single model, like GPT-4o or Gemini.
N
NRR (Net Revenue Retention)
NRR measures how much revenue you keep and grow from existing customers over a period — after accounting for expansion (upsells, seat adds), contraction (downgrades), and churn. A 110% NRR means your existing customer base paid 10% more this period than last, even after all losses. NRR above 100% is the holy grail of SaaS: growth without acquiring a single new customer. It is the primary metric investors use to distinguish a leaky bucket from a compounding flywheel. Best-in-class B2B SaaS typically runs 115–130% NRR. Below 90% is a retention crisis.
Net Dollar Retention (NDR)
Revenue retained and expanded from existing customers over a period, including upsells minus churn. Above 100% means growth without new logos.
Net Promoter Score (NPS)
A loyalty metric (-100 to 100) from asking how likely users are to recommend a product, scored on a 0–10 scale.
O
OAuth
An open standard that lets apps access a user’s data on another service via tokens, without sharing the user’s password.
P
PLG (Product-Led Growth)
PLG is a go-to-market strategy where the product itself is the primary driver of acquisition, activation, and expansion — rather than marketing campaigns or a sales team. Free trials, freemium tiers, and self-serve onboarding are the primary PLG mechanics. Users experience value before talking to anyone in sales. Successful PLG products generate leads from usage data: a team hitting 80% of their free quota is a warm sales signal without a single cold call. Slack, Notion, Figma, and Calendly are textbook examples. The trade-off: PLG requires the product to deliver value immediately, without hand-holding — which raises the bar for onboarding and activation.
Payment Gateway
The service that securely authorizes and processes online card payments between a store, the customer’s bank, and the merchant account.
Postback URL (S2S Tracking)
A postback URL (also called server-to-server or S2S tracking) is a server-side conversion notification: when a user converts, the advertiser's server pings the affiliate network's URL with conversion data. No cookies or pixels required — the data flows machine-to-machine. Postback tracking is more reliable than cookie-based tracking and immune to browser privacy restrictions. Most major affiliate networks support postback URLs for sophisticated publishers who need accurate server-side attribution — especially valuable for mobile apps, subscription billing where conversions happen weeks later, and cross-device purchase flows.
Product Qualified Lead (PQL)
A user whose in-product behavior (usage, feature adoption) signals readiness to buy — the PLG equivalent of a sales-qualified lead.
Prompt Engineering
Prompt engineering is the practice of crafting inputs to an LLM to reliably produce the desired output. It ranges from simple instruction phrasing to complex techniques like chain-of-thought prompting, few-shot examples, role-setting, and multi-step system prompt design. Good prompt engineering can dramatically change output quality without changing the model. In production AI systems, the system prompt is often the primary "code" that defines product behavior — making prompt engineering a core engineering discipline. Systematic prompt testing and version control are as important as any other part of the AI development workflow.
Prompt Injection
An attack where malicious instructions hidden in input trick an AI model into ignoring its rules or leaking data.
Proxy Authentication
Verifying access to a proxy via username/password or IP whitelisting, ensuring only authorized clients route traffic through it.
Proxy Pool
A proxy pool is the full inventory of IP addresses a proxy provider maintains for rotation and customer assignment. Pool size — often measured in millions of IPs for residential providers — determines how long a rotation strategy can run before repeating an IP against a given target. Not all pool IPs are equal: freshness (how recently tested), geographic distribution, and subnet diversity all affect scraping success rates. Premium proxy providers maintain "clean" pools by continuously testing IPs against common targets and retiring burned ones. Pool quality matters more than raw pool size for demanding scraping use cases.
Publisher & Advertiser
In affiliate marketing, the publisher is the affiliate — the website, newsletter, social account, or content creator driving traffic and earning commissions. The advertiser is the brand or company offering the product and paying commissions for conversions. Networks sit between the two, connecting publishers to advertiser offers and handling tracking and payments. Understanding this three-party structure clarifies who controls each lever: advertisers control offers, commission rates, creative assets, and cookie duration. Publishers control traffic quality, promotional placement, and audience targeting. Networks control tracking infrastructure, fraud protection, and payment terms. In-house affiliate programs cut out the network layer — increasing margin but requiring the advertiser to build and manage publisher relationships directly.
Q
Quantization (AI)
Reducing the numerical precision of a model’s weights to shrink size and speed up inference, with minimal accuracy loss.
Quick Ratio (SaaS)
New plus expansion MRR divided by churned plus contraction MRR. A ratio above 4 signals healthy, efficient revenue growth.
R
RAG (Retrieval-Augmented Generation)
Retrieval-Augmented Generation combines a vector store (or hybrid BM25 + vector) with a language model. Relevant documents are retrieved at query time and injected into the model prompt, letting the LLM ground its output in specific source material instead of relying purely on pretraining. RAG is the standard architecture for enterprise knowledge-base bots, support automation, and internal copilots in 2026. Key failure modes: chunking strategy, embedding quality, and retrieval recall.
RLHF (Reinforcement Learning from Human Feedback)
RLHF is the training technique that aligns LLMs to human preferences — making them helpful, harmless, and honest. Human raters score model outputs, those scores train a "reward model," and the LLM is then optimized with reinforcement learning to generate outputs that score highly. RLHF is responsible for the conversational, instruction-following behavior of modern chatbots. Without it, a raw language model generates text that is statistically plausible but not reliably useful or safe. The quality of RLHF data — and the values encoded in rating guidelines — shapes how a model responds to edge cases, sensitive topics, and ambiguous requests.
Rate Limiting
Rate limiting is a server-side control that caps the number of requests an IP address, API key, or user can make within a time window. Exceeding the limit typically returns HTTP 429 (Too Many Requests) and may trigger a temporary or permanent block. For scrapers, working within or around rate limits is the practical difference between a sustainable pipeline and a blocked one. Well-designed scraping systems add exponential back-off, request jitter, and per-domain throttling to stay under detection thresholds. Many APIs publish their rate limits explicitly in documentation; websites rarely do, requiring empirical testing to find safe request rates.
Residential Proxy
A residential proxy routes traffic through an IP address assigned to a real home ISP subscriber, making requests look like organic consumer traffic rather than server traffic. Residential proxies are used where a target site blocks or rate-limits datacenter IPs — price monitoring, ad verification, sneaker bots, travel fare scraping, and SEO rank tracking. They cost 10-50x more than datacenter proxies because the IP inventory is sourced from SDKs, VPNs, and affiliate networks paid per GB of shared bandwidth. Key quality signals: IP pool size, geo targeting granularity, sticky session support, and ASN diversity.
Retargeting
Showing ads to people who previously visited your site or app, re-engaging warm prospects who didn’t convert the first time.
Return on Ad Spend (ROAS)
Revenue generated for every dollar spent on advertising. A ROAS of 4 means $4 earned per $1 spent.
Rotating Proxy
A rotating proxy automatically assigns a new IP address from a pool for each request — or at set time intervals — making large-scale web requests appear to come from many different sources. This prevents target sites from blocking a single IP address after detecting too many requests. Rotating proxies are fundamental to web scraping at scale. Most residential proxy providers enable rotation by default. The size and quality of the rotation pool — number of IPs, geographic spread, freshness — determines how long you can scrape a target before encountering blocks or CAPTCHAs. Some use cases require sticky sessions instead of pure rotation.
Rule of 40
The Rule of 40 states that a SaaS company's revenue growth rate and profit margin (usually EBITDA or free cash flow margin) should add up to at least 40%. A company growing at 60% YoY can burn at −20% margin. One growing at 20% needs to run at 20%+ margin. It is a quick heuristic investors use to balance growth and efficiency — particularly useful for comparing companies at different stages. As growth rates slow naturally with scale, the margin component must rise to compensate. Companies consistently above 40 are considered efficiently run; consistently below 30 triggers questions about unit economics.
Runway
How many months a company can operate before cash runs out, calculated as current cash divided by monthly net burn rate.
S
SAML
An XML standard for exchanging authentication and authorization data between an identity provider and apps, powering enterprise SSO.
SKU (Stock Keeping Unit)
A unique code identifying a specific product variant for inventory tracking across catalogs, warehouses, and sales channels.
SLA (Service Level Agreement)
A contract guaranteeing service performance — typically uptime and support response times — with credits or penalties if missed.
SOC 2
An auditing standard certifying that a company securely manages customer data across security, availability, and confidentiality controls.
SOCKS5 Proxy
SOCKS5 is a general-purpose proxy protocol that operates at the network layer, routing any type of TCP or UDP traffic — HTTP, HTTPS, FTP, or custom protocols. Unlike HTTP proxies, which only handle web traffic, SOCKS5 proxies work with any application that supports the SOCKS protocol. SOCKS5 also supports authentication and can tunnel UDP traffic, making it suitable for use cases beyond web scraping: gaming, VoIP, torrenting, and custom client applications. Most proxy providers offer both HTTP and SOCKS5 endpoints from the same IP pool, giving you flexibility to choose based on your client library or use case.
SSO (Single Sign-On)
An authentication method letting users access multiple apps with one set of credentials, improving security and convenience.
Sandbox Environment
An isolated testing space that mirrors production, letting developers try changes and integrations safely without affecting live data.
Scraping API
A managed service that handles proxies, browsers, and anti-bot bypass, returning clean structured data from a target URL.
Seat-Based Pricing
Seat-based pricing charges per user or per license — typically a flat monthly fee per active user. It is the most common SaaS pricing model for collaboration and productivity tools because it is easy to understand, predict, and align with HR headcount data. The challenge: enterprise buyers actively minimize seats, creating a ceiling on expansion MRR. Many SaaS companies layer usage limits or feature gates on top of seats to unlock additional revenue vectors beyond simple headcount growth. Seat-based models work best when every user gets consistent, measurable value from the tool.
Semantic Search
Search that matches meaning rather than exact keywords, using vector embeddings to return conceptually relevant results.
Server-to-Server Tracking
Conversion tracking that sends data directly between servers (via postback), bypassing browser limits and ad blockers for accuracy.
Serverless
A cloud model where you run code without managing servers; the provider scales and bills only for actual execution time.
Sticky Session
A sticky session (sometimes called a session-based proxy) holds the same exit IP for a configurable duration — typically 1, 10, or 30 minutes — so a sequence of requests appears to come from the same user. Sticky sessions are required whenever a target site tracks login state, cart state, or browser fingerprint across requests. Rotating proxies break these workflows because each request looks like a different user.
Sub-Affiliate
A sub-affiliate is an affiliate recruited by another affiliate (the master affiliate or super affiliate) who earns a cut of the sub-affiliate's commissions without doing the end-customer acquisition themselves. It creates a two-tier affiliate structure. Sub-affiliate programs are used to incentivize high-volume affiliates to recruit and mentor smaller publishers. The master affiliate typically earns 5–15% of sub-affiliate commissions. Networks offering sub-affiliate tiers attract super-affiliates who see recruiting as a scalable revenue stream alongside their own direct promotion activity.
Subscription Billing
Automated recurring charges for ongoing access to a product, handling proration, dunning, upgrades, and renewals.
Synthetic Data
Artificially generated data that mimics real data’s patterns, used to train or test models without privacy or scarcity constraints.
T
Temperature (AI)
A setting controlling randomness in LLM output. Low temperature gives focused, deterministic answers; high temperature gives more creative, varied responses.
Time to Value (TTV)
How long it takes a new user to get meaningful value from a product. Shorter TTV improves activation, retention, and conversion.
Token (AI)
In LLMs, a token is the basic unit of text the model processes — roughly ¾ of a word in English. "SaaSTweaks" might split into 3 tokens; "the" is usually 1. Token count determines both context window size and API pricing, since most LLM APIs charge per input token and output token separately. Efficient token usage — compressing context, truncating conversation history, caching repeated system prompts — is a practical cost-optimization skill for teams building LLM applications. Long documents, verbose prompts, or deep conversational history all consume tokens quickly. Prompt caching (supported by Anthropic and OpenAI) dramatically reduces costs for repeated system prompts.
Tokens per Second (TPS)
A measure of how fast an LLM generates output, counting tokens produced each second — a key latency and throughput benchmark.
Total Contract Value (TCV)
The full value of a customer contract over its entire term, including recurring revenue plus any one-time and add-on fees.
Tracking Pixel
A tracking pixel is a 1×1 transparent image embedded on a web page or in an email that fires a request to a tracking server when loaded — recording an impression, page view, or conversion event. The pixel URL carries user identifiers, session data, and event metadata as query parameters. Pixels are the traditional client-side tracking mechanism for affiliate conversions and ad campaign measurement. They are increasingly unreliable due to browser ad blockers, iOS privacy changes (ITP), and third-party cookie deprecation. Server-side postbacks and Meta's Conversion API (CAPI) are gradually replacing pixels for conversion attribution in programs that need high accuracy.
Two-Factor Authentication (2FA)
A security layer requiring a second proof of identity — a code or device — beyond a password to log in.
U
UTM Parameters
Tags added to a URL to track campaign source, medium, and content in analytics, so you know which efforts drive traffic.
Uptime
The percentage of time a service is operational and available. "Five nines" (99.999%) means under six minutes of downtime per year.
Usage-Based Pricing (UBP)
Usage-based pricing charges customers based on what they consume — API calls, data volume, active seats, or compute hours — rather than a flat monthly fee. Also called consumption-based or metered pricing. It lowers the entry barrier because customers start small and pay more as they get more value. UBP aligns vendor and customer incentives cleanly: as customers succeed, revenue grows. Companies like Stripe, Twilio, and Snowflake use UBP as a core growth engine because customers naturally expand as their business scales. The trade-off is revenue volatility — usage can spike or drop significantly month to month.
User Agent
The user agent (UA) is a string sent in HTTP request headers that identifies the client software — browser type, version, and operating system. Websites use it to serve the right content format and as a basic bot-detection signal. Scrapers that send default HTTP library user agents (like "python-requests/2.28") are trivially detectable. Rotating realistic UA strings matching current browser versions and their exact header ordering is a baseline anti-bot countermeasure. Full browser fingerprinting goes beyond UA to include screen resolution, fonts, WebGL renderer, timing data, and hundreds of other signals — making a convincing browser impersonation significantly harder than swapping a header string.
V
Vector Database
A vector database stores and indexes high-dimensional embedding vectors, enabling fast similarity search across millions of records. Unlike a traditional database optimized for exact matches, a vector DB retrieves the nearest semantic neighbors to a query vector — finding the most relevant documents even when they share no keywords with the query. Vector databases are the retrieval layer in most RAG systems. When a user asks a question, their query is embedded and matched against pre-indexed document embeddings, retrieving the most relevant chunks to include in the LLM prompt. Popular options include Pinecone, Weaviate, Qdrant, pgvector (Postgres extension), and Cloudflare Vectorize.
Vector Embedding
A numerical representation of text, images, or data as points in high-dimensional space, enabling semantic search and similarity matching.
W
Web Crawling
Web crawling is the automated traversal of websites by following links to discover and index content. A crawler starts from a seed URL, parses all links on that page, follows them, and repeats — building a map of an entire site or web graph. Search engine bots like Googlebot are web crawlers. Crawling is distinct from scraping: a crawler discovers URLs, a scraper extracts data from them. For competitive intelligence or market research, crawling is typically the first phase — discovering all relevant URLs — before targeted scraping extracts specific data from each. Robots.txt files signal which paths crawlers should skip.
Web Scraping
Web scraping is the automated extraction of structured data from websites — product prices, review text, contact information, job listings, or any publicly visible content. A scraper issues HTTP requests, parses the returned HTML or JSON, and extracts target fields into a usable format. Modern web scraping is an arms race: websites deploy bot detection, CAPTCHAs, JavaScript challenges, and dynamic rendering to block automated access, while scrapers counter with proxies, browser automation, and behavioral mimicry. The legal landscape varies by jurisdiction and target site — always check a site's terms of service and applicable law before scraping at scale.
Webhook
An automated HTTP callback that sends real-time data to another app when an event occurs, instead of repeatedly polling for changes.
Webhook Retry
Automatic re-sending of a failed webhook delivery, ensuring downstream systems eventually receive the event despite transient errors.
White-Labeling
White-labeling is when a vendor allows customers or partners to rebrand the product as their own — replacing logos, colors, and domain with the buyer's brand identity while the underlying technology stays the vendor's. The end customer sees the partner's brand, not the original vendor. Common in SaaS platforms, white-labeling is a B2B2C distribution model: an agency or reseller packages your tool under their brand for their customers. It accelerates distribution without building partner marketing, but creates support complexity and can commoditize the core product if the vendor becomes invisible to end users.
Z
Zero Trust
A security model that trusts no user or device by default, verifying every access request regardless of network location.
Zero-Shot vs. Few-Shot Learning
Zero-shot means prompting an LLM to perform a task without providing any examples — relying entirely on the model's training. Few-shot means providing 2–5 input/output examples in the prompt before the actual task, giving the model a format to follow. Zero-shot is simpler and uses fewer tokens; few-shot often produces more consistent output on structured tasks like classification, extraction, or formatting. The choice between them is a practical engineering decision: test both with your specific task and model, measure output quality, and choose accordingly. Many production systems combine both — a zero-shot instruction with a few worked examples embedded in the system prompt.