Cold Email Deliverability Tools 2026: Testing, Monitoring, and Fixing Placement
By Puzzle Inbox Team · Jun 21, 2026 · 12 min read
The 2026 operator guide to cold email deliverability tools: inbox placement test platforms, blacklist checker monitoring, MXToolbox, Mailtester, Glockapps, Postmaster Tools.
The Deliverability Stack: What You Are Actually Measuring
Cold email deliverability is not one number. It is a stack of separate measurements, each requiring a different cold email deliverability tool, and each catching a different failure mode. Operators who think they have a "deliverability problem" almost always have four or five smaller problems that compound. The fastest way to fix placement is to instrument every layer and stop guessing.
At the top of the stack sits inbox placement: did the message land in Inbox, Promotions, Updates, Spam, or Quarantine? Below that sits domain reputation, then IP reputation, then authentication (SPF, DKIM, DMARC), then list quality, then content fingerprinting. Each layer has dedicated tools. An inbox placement test answers the first question. A blacklist checker answers the IP and domain question. DNS validators answer the authentication question. Postmaster Tools answers the reputation question. List verifiers answer the bounce question. Without all five, you are flying blind.
This guide walks through twelve tools across six categories, plus the meta-category of upstream infrastructure fixes that remove certain failure modes entirely. The goal is not to use every tool. The goal is to know which tool to reach for when a campaign starts underperforming, and to have a monitoring cadence that catches drift before it becomes a crisis. If you have not already audited your authentication setup, start at /free-tools/dns-checker and the SPF, DKIM, DMARC setup guide before reading further. Tools cannot fix records that do not exist.
Placement Test Tools: Where Did The Message Actually Land
An inbox placement test answers the single question that matters to a cold email operator: of one hundred messages sent right now, how many reached the primary tab, how many were filtered to Promotions or Updates, and how many hit Spam or Quarantine. Without this measurement, every other metric is academic. You can have perfect SPF, a clean IP, and a verified list, and still have eighty percent of mail sorted to Promotions because your subject line patterns triggered a category filter.
Glockapps
Glockapps is the heaviest hitter in the placement test category. You send a campaign to a list of seed addresses across Gmail, Outlook, Yahoo, AOL, iCloud, Zoho, and various smaller providers. Glockapps measures where each message landed, reports authentication results, flags blacklists hit, and graphs sender score over time. For Gmail specifically, Glockapps distinguishes between Inbox, Promotions, Updates, and Spam, which matters because Promotions placement is not the same outcome as Inbox placement for cold outreach. Use it before launching a new sequence, after any DNS change, and weekly for sequences over fifty messages per day per inbox.
Mail-Tester
Mailtester (mail-tester.com) is the fastest free placement signal available. You send one message to a generated address, you get a score out of ten with line-item feedback: SPF pass, DKIM pass, DMARC alignment, blacklist status, content score, HTML quality. It will not tell you what percentage of real recipients see the message in Inbox versus Spam, but it will catch authentication bugs, obvious spam content triggers, and basic reputation issues in under thirty seconds. Use Mail-Tester before every new template goes live. A score under eight points means stop and fix something.
Mailmeteor Seed Test and GMass Inbox Placement
Mailmeteor's seed test and GMass's inbox placement report are lower-volume but cheaper alternatives to Glockapps. They cover fewer providers but give you the same core data: where did the message land. Use them if you are sending under one hundred messages per day per inbox and Glockapps is overkill. For high-volume operators, the per-test cost matters less than the breadth of provider coverage, so Glockapps wins. For solo founders running ten warm inboxes, Mailmeteor seed tests give ninety percent of the signal at ten percent of the cost.
How To Run A Spam Folder Test Properly
A spam folder test only tells you something if it mirrors your real send. Send the exact subject line, body, signature, and link set you plan to use in production. Send from the inbox that will run the sequence. Send at the volume you will actually hit. A seed test of one message tells you about authentication. A seed test of fifty messages over an hour tells you about throttling and reputation. If you change anything between the test and production, the test result no longer applies.
Blacklist Monitoring: Catching Reputation Hits Before Volume Drops
Blacklists, also called blocklists or DNSBLs, are lists of IPs and domains that some receivers consult before accepting mail. Hitting Spamhaus SBL, Spamhaus DBL, SORBS, Barracuda, or SURBL will not always cause immediate placement collapse, but it correlates strongly with reputation problems, and major receivers like Gmail and Outlook do consult several of them. Operators need a passive blacklist monitor running continuously, not just an on-demand blacklist checker they remember to use when things go wrong.
MXToolbox
MXToolbox is the default blacklist checker the industry uses. The free tool at mxtoolbox.com/blacklists.aspx checks any IP or domain against roughly one hundred public blacklists in a few seconds. The paid Delivery Center product adds continuous monitoring, alerting, and historical tracking. For cold email operators on dedicated tenants, run MXToolbox manually after any deliverability scare, and consider the paid tier if you are managing more than five sending domains in parallel.
HetrixTools
HetrixTools is the lower-cost continuous monitoring option. You add your IPs and domains, HetrixTools checks them against one hundred plus RBLs every few minutes, and you get alerts via email, Slack, Discord, or webhook the moment something appears. The free tier covers small operators. The paid tier scales to agency volume. We recommend HetrixTools for any operator running more than two domains, because manual blacklist checks always lag the actual problem by days.
Puzzle's Free Blacklist Checker
For ad-hoc lookups without creating an account, use /free-tools/blacklist-checker. It queries the major DNSBLs in one pass and returns results in the browser. Bookmark it for the situation where a campaign suddenly drops and you need a fast read on whether reputation collapsed.
What To Do If You Get Listed
Getting on a blacklist is recoverable, not fatal. Identify which list, identify the trigger (volume spike, bounce spike, spam complaint, infected sender), fix the cause, then submit a delisting request through the list operator's portal. Spamhaus delistings often resolve within forty-eight hours if you provide cause and remediation. SORBS is slower. Barracuda is fast and forgiving. Do not request delisting before you have identified and fixed the underlying issue, or you will be relisted within hours and the second listing is harder to clear.
DNS Authentication Validators: SPF, DKIM, DMARC
Authentication failures are the single most common reason cold email lands in spam. SPF tells receivers which IPs are allowed to send for your domain. DKIM cryptographically signs your messages. DMARC tells receivers what to do when SPF or DKIM fails. If any of the three is broken, misaligned, or missing, major receivers will route you to spam regardless of content quality.
MXToolbox SPF, DKIM, DMARC Checker
MXToolbox runs free individual lookups for SPF (mxtoolbox.com/spf.aspx), DKIM (with the selector), and DMARC (mxtoolbox.com/dmarc.aspx). Each lookup returns the parsed record, validates syntax, and flags common errors: missing tags, conflicting records, exceeded DNS lookup limits, missing alignment policies. Run all three checkers on every sending domain. Re-run after any DNS change. The MXToolbox DKIM checker requires the selector, which you can find in the headers of any message you have sent: look for the d= and s= values in the DKIM-Signature header.
Puzzle's DNS Checker
/free-tools/dns-checker consolidates SPF, DKIM, DMARC, MX, and A record checks into a single view, designed for cold email operators who need a fast read on whether a domain is correctly configured for outbound. It catches the common cold email failure modes: SPF missing the Google or Microsoft include, DMARC at p=none with no aggregate reports configured, DKIM not published at the expected selector. Use it as the first step in any deliverability audit.
The DNS Lookup Limit Problem
SPF records are limited to ten DNS lookups. Each include: directive counts. If you are using Google Workspace plus a CRM plus a transactional service plus an email warmup tool, you can easily exceed ten lookups and your SPF will silently fail. MXToolbox flags this. The fix is SPF flattening, or removing services from the include chain. Cold email operators who use real Google Workspace or Microsoft 365 mailboxes from /buy-google-workspace-cold-email-inboxes typically have just one or two includes and never hit this limit. Operators stacking five warmup tools and three CRMs onto one domain hit it constantly.
DMARC Reporting With Postmark or Dmarcian
DMARC aggregate reports (rua=) are XML files receivers send to you summarizing authentication results across every sender claiming your domain. Reading raw XML is painful. Postmark's free DMARC monitoring and Dmarcian's free tier both parse these reports into dashboards showing exactly who is sending as your domain, what is passing, what is failing, and where attacks are coming from. Set rua=mailto: on your DMARC record, point it at a Postmark or Dmarcian inbox, and check it weekly.
Platform-Native Dashboards: The Data Only Google And Microsoft Have
Third-party tools estimate reputation. Platform-native dashboards report it directly. Google and Microsoft both publish free diagnostic tools for senders, and serious operators check them as routinely as they check their CRM.
Google Postmaster Tools
Postmaster Tools (postmaster.google.com) is the single most important deliverability data source if you send to Gmail addresses. After you verify ownership of a sending domain, Google reports domain reputation (high, medium, low, bad), IP reputation, spam rate (the percentage of your mail Gmail users mark as spam), authentication pass rates for SPF, DKIM, DMARC, encryption rate, and delivery error rates. Spam rate above 0.3 percent is a yellow flag. Above 0.5 percent and Gmail will start aggressively filtering you. Above 1 percent and you are effectively blocked from primary inbox.
Postmaster Tools requires meaningful volume to populate. If you send under five hundred messages per day to Gmail addresses across a domain, the dashboard will show "not enough data." This is why dedicated sending domains and consolidated sender identity matter: you want enough volume on one identifier to actually see your data. Verify Postmaster Tools on every cold email sending domain you operate.
Microsoft SNDS and JMRP
Microsoft's Smart Network Data Services (SNDS) and Junk Mail Reporting Program (JMRP) are the Outlook and Hotmail equivalents. SNDS reports the volume, complaint rate, and trap hits Microsoft sees from your IPs. JMRP forwards you a copy of every message a Microsoft user marks as junk. Both are free, both require enrollment at sendersupport.olc.protection.outlook.com, and both work at the IP level rather than the domain level. For operators sending from Microsoft 365 mailboxes via OAuth, your sending IPs are Microsoft's own, so SNDS is less directly actionable than Postmaster Tools, but JMRP still gives you per-recipient complaint signal.
Yahoo, Apple, And Other Platforms
Yahoo runs Sender Hub for postmaster signal. Apple does not publish a Postmaster equivalent and operates as a black box, which is why iCloud placement testing through Glockapps matters more than it does for Gmail. For the long tail of providers, you rely on placement testing and DMARC aggregate reports rather than native dashboards.
List Cleaning: Removing The Inputs That Tank Reputation
Bounces and spam traps are the fastest way to destroy a sender reputation that took months to build. A single send to a list with five percent invalid addresses can drop you from "high" to "bad" reputation in Postmaster Tools within forty-eight hours. List verification is non-negotiable for cold email.
ZeroBounce
ZeroBounce verifies email addresses against syntax rules, MX records, SMTP handshake, role-based detection, disposable domain lists, and known spam traps. Pricing is per-verification, typically a fraction of a cent at volume. ZeroBounce also has a scoring tier that flags "catch-all" addresses, which accept all mail at the MX level but may bounce internally. For cold email, treat catch-alls as risky and either skip them or send to them at a separate inbox to isolate any reputation damage.
NeverBounce
NeverBounce is the closest direct competitor to ZeroBounce. Similar accuracy, similar pricing, slightly different categorization scheme. Most operators pick one and stick with it. Either is fine. Both will catch the same eighty to ninety percent of bad addresses on a typical scraped or purchased list.
Bouncer
Bouncer is a European-focused alternative with GDPR-aware data handling, useful if you are operating from the EU or sending to EU lists and need processor agreements that name a European entity.
Puzzle's Email Verifier
For one-off checks without an account, use /free-tools/email-verifier. It handles single-address validation in the browser. For bulk verification, use ZeroBounce, NeverBounce, or Bouncer. Run every list before first send. Re-verify lists older than ninety days, because addresses decay at roughly two percent per month.
Spam Trigger Auditing
Content filters catch trigger phrases, suspicious link patterns, and HTML quality issues. Run every new template through /free-tools/spam-checker or SpamAssassin to catch the obvious offenders before they reach a receiver. The SpamAssassin score is not predictive on its own, but a score above five almost guarantees Gmail will route you to Promotions or Spam.
The Upstream Fix: Removing Failure Modes Before They Happen
Every tool above measures something that can go wrong. The faster path is to remove the variables that go wrong in the first place. Most cold email deliverability problems trace to three upstream choices: the sending infrastructure, the domain age and history, and the warmup process. Tools cannot fix bad upstream choices.
Real Mailboxes, Not Relays
The single largest hidden variable in cold email deliverability is the sending infrastructure. SMTP relay services, shared IP pools, and "cold email platforms" with their own sending IPs put you on infrastructure shared with hundreds or thousands of other senders. Your reputation is partially their reputation. When someone else on the shared pool gets blacklisted, your placement drops. When the pool IP hits a spam trap, your delivery slows.
Puzzle Inbox sells real Google Workspace and Microsoft 365 mailboxes on dedicated tenants. Sending happens on Google's and Microsoft's own infrastructure, the same MX servers that handle billion-dollar enterprise traffic. There is no shared IP pool to share blame with. The IP-side variable is removed from your deliverability equation, which means you are no longer running a blacklist monitor on infrastructure you do not control. See /how-it-works for the architecture.
OAuth, Not SMTP Or IMAP
Connection to your sending platform uses OAuth, not SMTP or IMAP. This matters for two reasons. First, OAuth-connected sends go through the provider's API and inherit the provider's full reputation handling, which SMTP relays bypass. Second, OAuth tokens do not trigger the "suspicious login" flags that password-based connections do, so accounts do not get suspended mid-sequence.
Pre-Warmed Versus Standard
Puzzle Inbox offers two product lines. Pre-Warmed inboxes ship on Puzzle's generic domains after a fourteen-day warmup process, so they are ready to send at meaningful volume from day one. Standard inboxes ship on your own domain in 24-72 hours and you handle warmup, which is the right path if branded sender identity matters more than ramp time. Both flavors deliver in 24-72 hours via WhatsApp or email. Both run on real tenants. Browse Google Workspace inboxes, Outlook 365 inboxes, or pricing.
What This Eliminates From Your Tool Stack
Using real mailboxes on dedicated tenants does not remove the need for Mail-Tester, Glockapps, MXToolbox, or Postmaster Tools. You still need to measure placement, monitor blacklists for your domain (Google's and Microsoft's IPs are not going to get blacklisted on your behalf, but your domain still can if you send poorly), and watch domain reputation. What it removes is the entire category of "shared IP pool reputation damage caused by someone else's mistakes," and the related category of "SMTP relay outages that take your entire sending offline." Those two failure modes are responsible for a large share of mysterious cold email deliverability crashes, and they are eliminated upstream rather than monitored downstream.
Monitoring Cadence: How Often To Check What
Tools are useless without a schedule. The right cadence depends on your sending volume, the maturity of your domains, and how exposed you are to a placement drop. Here is the operator default we recommend.
Daily Checks
- Reply rates and bounce rates inside your sending platform. A bounce rate climbing above two percent is a red flag.
- Spam complaint rate inside your sending platform. Anything above 0.1 percent for the day demands a pause.
- For agencies running multiple clients, automated alerting from HetrixTools or MXToolbox Delivery Center.
Weekly Checks
- Google Postmaster Tools for every sending domain.
- Microsoft SNDS and JMRP if you send to Outlook addresses.
- DMARC aggregate reports via Postmark or Dmarcian.
- A Glockapps placement test on your highest-volume sequence.
- Manual MXToolbox blacklist sweep for any domain with elevated complaint rates.
Per-Change Checks
- Mail-Tester on any new template before it goes into production.
- /free-tools/dns-checker after any DNS record change.
- Spam checker on any new template variation.
- ZeroBounce or NeverBounce on any new list before first send.
Monthly Audits
- Full domain reputation review across all sending domains.
- Sequence content audit: which messages drive the highest reply rates, which drive the highest complaint rates, kill the bottom decile.
- List hygiene pass: re-verify any list older than ninety days.
- Review the upstream infrastructure: are any inboxes degraded, are warmup numbers still hitting targets, do any domains need replacement.
When To Escalate
If Postmaster Tools shows reputation drop from "high" to "medium," pause sending on the affected domain and audit the prior seven days of campaigns. If it drops to "low" or "bad," stop sending entirely and treat it as a sixty-day recovery process. If you hit a major blacklist (Spamhaus SBL, DBL), stop, identify the cause, fix it, then submit a delisting request. Do not keep sending while you wait for delisting. For deeper recovery guidance see why cold emails land in spam and how to fix it and the email domain reputation playbook.
Tool Selection By Operator Type
Not every operator needs every tool. A rough mapping by stage:
Solo Founder Or Small Team Under Three Inboxes
Mail-Tester for content checks. /free-tools/blacklist-checker and /free-tools/dns-checker for ad-hoc audits. /free-tools/email-verifier for single-address checks. ZeroBounce for list cleaning. Postmaster Tools once you cross five hundred Gmail sends per day. Total tool spend: under fifty dollars per month.
Agency Or Five To Twenty Inboxes
All of the above, plus Glockapps for routine placement testing, HetrixTools for continuous blacklist monitoring, and Dmarcian for DMARC report parsing. SNDS enrollment for any client sending to Outlook. Total tool spend: two hundred to five hundred dollars per month, easily justified by avoiding one campaign-wide placement crash per quarter.
Volume Operators Over Twenty Inboxes
Everything above, plus MXToolbox Delivery Center for centralized monitoring, a managed DMARC platform with alerting, and contractual SLAs with your infrastructure provider. At this volume, every hour of degraded placement costs measurable revenue.
What Tools Cannot Do
Tools measure. They do not fix. Mail-Tester will not improve your subject lines. Glockapps will not write better copy. MXToolbox will not delist you. Postmaster Tools will not fix a spam complaint trend. The tool tells you what is wrong. The fix is your sending behavior, your list quality, your content, your domain age, and your infrastructure choices.
The operators who run the cleanest deliverability are the ones who use tools as instrumentation, not as a substitute for discipline. They send conservative volumes from real mailboxes on clean infrastructure, they verify every list, they write personalized opens, and they pause the moment a reputation signal turns yellow. The tool stack is the dashboard, not the engine.
If you want to remove the infrastructure variable from your equation, start at /pricing and pick the right inbox flavor for your campaign. Real Google Workspace and Microsoft 365 mailboxes, dedicated tenants, OAuth-connected, delivered in 24-72 hours. Then build the monitoring stack above around them. The tools will tell you when something is wrong. The infrastructure will keep most things from going wrong in the first place.
Related Reading
- Why Cold Emails Land In Spam And How To Fix It
- The Complete SPF, DKIM, And DMARC Setup Guide
- Email Domain Reputation: The Operator Playbook
- How Puzzle Inbox Delivers Real Workspace And 365 Mailboxes
- Puzzle Inbox Pricing And Inbox Flavors