Constella vs. Have I Been Pwned

When Credential Monitoring Isn't Enough

Have I Been Pwned built the category. It’s a respected, well-maintained service that has helped millions of individuals understand their breach exposure. But for security teams, fraud platforms, MDR providers, and identity monitoring products operating at enterprise scale, HIBP was never designed for what you’re trying to build. 

Constella’s Identity Intelligence API covers more breaches, more records, more identity attributes, and the entire infostealer threat surface that HIBP does not index at all. This page shows you exactly where the differences are, and why they matter. 

The Numbers at a Glance

3.6x 

More records than HIBP 

63B+ Constella records vs. 17.5B HIBP records 

49x

More breaches than HIBP 

23,500+ Constella breaches vs. 973 HIBP breaches 

81x 

More breaches ingested in 2025 alone 

Constella processed 81x more breach events than HIBP in the past year 

392% 

Constella breach growth since 2021 

HIBP grew 17% in the same period 

0 

Infostealer packages in HIBP 

HIBP does not index infostealer data. Constella processed 51.7M packages in 2025. 

300+

Identity fields Constella processes per record 

HIBP outputs email, breach name, and exposed field labels only 

What Have I Been Pwned Is

Have I Been Pwned (HIBP) is a free, publicly available breach notification service created by Troy Hunt in 2013. Its core function is simple: enter an email address and learn whether that address appears in any of the breaches HIBP has indexed. It is widely trusted, well-maintained, and genuinely useful for individual awareness. HIBP also offers a paid API that allows organizations to query breach data programmatically, a Pwned Passwords service for checking password exposure, and limited domain monitoring. For individual users, small teams, and basic breach notification use cases, HIBP delivers real value. What HIBP was not designed to do: power a fraud decisioning model, feed an MDR detection stack, drive an identity theft monitoring product at scale, detect session hijacking via infostealer cookie harvesting, or provide the N-dimensional identity intelligence that enterprise security and fraud programs require.

2-Dimensional Credential Monitoring vs. N-Dimensional Identity Intelligence

This is the fundamental architectural difference between the two approaches. 

HIBP: 2-Dimensional

HIBP’s data model is credential-centric. A breach record is essentially a vector of two dimensions: an email address and a password. That is what HIBP indexes, what its API returns, and what its notification system acts on. For password reset flows and basic breach awareness, two dimensions are sufficient. 

Inputs: email address or domain 

Outputs: breach name, date, record count, description, exposed field labels, logo, verified flag 

Identity attributes beyond email: not returned 

Infostealer session data: not indexed 

Hashed query support: k-anonymity for passwords only, email anonymization is Pro-tier 

Records per breach: typically email and password fields 

Constella: N-Dimensional

Constella processes identity as a multi-dimensional object. A single breach record at Constella can contain up to 300+ distinct identity fields: name, address, phone, national ID, financial data, device identifiers, geolocation, and more. Every field is normalized, verified, and queryable. 

Inputs: email, domain, phone, national ID, IP address, username, and 38+ additional PII attributes 

Outputs: full identity record with all available fields, breach attribution, Identity Exposure Score, breach type and classification, policy-driven response formatting, actionable recommendations 

Infostealer package data: 51.7M packages processed in 2025, including session cookies, VPN credentials, device metadata, and targeted URL capture 

Hashed data: full pipeline support for 100% hashed operations — no plaintext ever touches your infrastructure 

Multi-language support: 50+ languages, 125+ countries, EU data center available 

Identity Exposure Score: proprietary risk scoring per identity, enabling alert prioritization 

Data Scale: Where HIBP Ends and Constella Begins

HIBP indexes carefully curated, high-profile breaches. Its 973 breaches represent the most widely publicized incidents — Adobe, LinkedIn, Yahoo, and similar. That curation is intentional and makes HIBP’s data trustworthy for what it covers.  

The problem is what it doesn’t cover. Constella has indexed 23,500+ breaches — 49x more than HIBP. That gap is not noise. It is the long tail of smaller breaches, regional incidents, dark web dumps, combo lists, and the thousands of lower-profile exposures that don’t make headlines but do compromise real identities. HIBP covers approximately 2% of Constella’s total breach volume. 

A practical consequence: between 2021 and 2025, Constella ingested approximately 11.5x more records than HIBP — roughly 62 billion more records over the same five-year period. In 2025 alone, Constella processed approximately 19 billion more records than HIBP and 81x more breach events. 

Constella already covers approximately 82% of all breaches included in HIBP. The inverse is not true: HIBP covers less than 2% of Constella’s breach library. 

This matters operationally. When your fraud model, your identity monitoring product, or your MDR stack queries for a compromised identity and gets no match, the question becomes: is this person truly clean, or did the query not reach far enough? 

The Infostealer Gap: What HIBP Cannot See

This is not a scale difference. It is a capability gap. 

HIBP does not index infostealer data. Infostealers — Lumma Stealer, RedLine, Vidar, Raccoon, Stealc, and dozens of other active malware families — are the dominant attack vector in 2025 and 2026. They harvest active session cookies, VPN credentials, browser-saved passwords, SSH keys, cryptocurrency wallets, and device metadata from infected devices, and transmit that data to criminal markets within hours of infection. 

The reason infostealers matter to security programs is not just the credential harvest. It is the session cookie. A stolen session cookie from a Microsoft Entra ID authentication bypasses MFA entirely because the session was already authenticated. Attackers replay the cookie, not the credential. HIBP has no visibility into this attack chain at any stage. 

Constella processed 51.7 million infostealer packages in 2025 — a 72% year-over-year increase. Each package contains structured telemetry: the specific URLs where sessions were captured, the device hardware ID, the malware strain responsible, and the recency of the log. This is the intelligence that enables a security team to act before the stolen session is replayed. 

  • HIBP infostealer packages indexed in 2025: 0 
  • Constella infostealer packages processed in 2025: 51.7 million 
  • Constella alert window: hours from infection to marketplace detection 
  • HIBP alert window for infostealer threats: no capability 

API Capability Comparison

Capability Constella Have I Been Pwned 
Search API (breach by email) 
Yes
Yes
Monitoring API (real-time alerts)
Yes
Yes (domain-level, paid)
Catalog API (breach metadata)
Yes
Yes
ATO / Password exposure API
Yes
Yes (Pwned Passwords)
Paste / dark web API
Yes
Yes
Infostealer / session cookie API
Yes
No
Score API (Identity Exposure Score)
Yes
No
Surface / domain intelligence API
Yes
Limited (domain search)
Business / domain-level inputs
Yes
Yes (paid tier)
38+ PII attribute inputs
Yes
No (email and domain only)
Full hashed data pipeline (100%)
Yes
Partial (passwords only)
Policy-driven alert configuration
Yes
No
Identity Exposure Score per record
Yes
No
Breach type and classification
Yes
No
Actionable recommendations per alert
Yes
No
Multi-language alert delivery (50+)
Yes
No
EU data center available
Yes
No
Sub-200ms API latency
Yes
Rate-limited by plan tier
SLA-backed enterprise support
Yes
Community / documentation

Inputs and Outputs: What You Can Query, What You Get Back

Inputs: What You Can Search On

HIBP accepts email addresses and, in paid tiers, domains. That is the complete input set. 

Constella accepts email addresses, domains, phone numbers, national identity numbers, usernames, IP addresses, physical addresses, financial account identifiers, and 38 additional PII attributes. For fraud and identity use cases requiring entity resolution across partial data, this breadth of queryable input is not a feature — it is the architecture. 

Outputs: What You Get Back

HIBP returns: breach name, breach date, record count, description, data classes exposed (as labels, not values), logo, and verified flag. It does not return the actual exposed data values. 

Constella returns the full available record for the queried identity, normalized across all fields that appeared in the source breach or infostealer package. This includes hashes or clear text of exposed PII depending on your configured policy, the Identity Exposure Score, breach type and classification, breach attribution chain, device metadata for infostealer records, targeted URLs captured by the malware, and policy-driven recommendations formatted to your specification. 

The practical difference: a HIBP response tells you an identity was in a breach. A Constella response tells you what was exposed, how sensitive it was, how confident the attribution is, whether infostealer malware was involved, and what action your system or analyst should take. 

Data Verification: Basic vs. Comprehensive

HIBP applies a four-point verification check to breaches it accepts: public acknowledgment of the breach, a structure consistent with the claimed source, evidence of the attacker and their track record, and basic data integrity. This is honest, reasonable verification for a publicly accessible service. 

Constella’s verification pipeline applies 25+ automated checks to every ingested dataset, including source authentication, structural and linguistic integrity analysis, deduplication against the existing data lake, entity resolution across multiple identity attributes, legal liability evaluation, and data pedigree tracing. Every record in Constella’s lake is traceable to a confirmed breach or exposure event. 

This matters for alert quality. Higher verification standards mean lower false positive rates, which means your fraud model, your SOC alert queue, and your identity monitoring customers receive signals that are both accurate and actionable. 

One measurable consequence: in comparative testing using the same provisioned data, Constella-only breaches produced approximately a 50% unique hit rate, while breaches that appeared in both HIBP and Constella produced approximately a 30% hit rate. The Constella-exclusive breach coverage — the breaches HIBP has not indexed — delivers meaningfully higher match rates against real exposed identity populations. 

OEM and Partner Integration: Customization HIBP Cannot Offer

If you are building an identity theft monitoring product, a fraud decisioning service, an MDR offering, or any platform that requires identity intelligence as a component, HIBP’s API provides a fixed output: breach metadata and a binary match signal. 

Constella’s API is built for integration at the product level. This includes:  

  • Custom alert policy configuration: define your own alert descriptions, confidence thresholds, breach type filters, and delivery formats for each customer or product line 
  • Multi-language alert delivery: alerts generated in 30+ languages for international consumer monitoring products 
  • Identity Exposure Score: a proprietary risk score attached to each alert that enables your system to prioritize and filter without building scoring logic internally 
  • 100% hashed data pipeline: operate without any plaintext identity data touching your infrastructure, satisfying GDPR, CCPA, and equivalent data protection requirements out of the box 
  • Monitoring API: continuous background monitoring with webhook or batch alert delivery, not just query-response 
  • Catalog API: programmatic access to breach metadata for building breach disclosure workflows 
  • SLA-backed support with dedicated account management for enterprise and OEM partners 

The Constella API was purpose-built for OEM integration. HIBP’s API was built for direct access. The architectural difference becomes visible at scale. 

Who Each Solution Is Right For

HIBP is a strong fit for:

Individual users checking personal email breach exposure

Small security teams doing one-off breach lookups

Developers building simple breach awareness features into consumer applications

Organizations needing a low-cost, easy-to-implement domain monitoring notification

Proof-of-concept projects that need breach data without procurement overhead

Constella is built for:

Identity theft monitoring providers building consumer or enterprise products on top of breach and infostealer intelligence 

MDR and XDR platforms that need external identity risk signals to detect pre-ransomware credential exposure 

Fraud and risk teams at financial institutions, FinTechs, and e-commerce platforms that need N-dimensional identity data for fraud model inputs 

MSSPs and security service providers building premium identity monitoring service tiers 

Enterprise security teams using Hunter+ DRP for workforce credential monitoring, executive protection, and brand monitoring 

Law enforcement and OSINT investigators needing identity attribution across breach records and infostealer data 

Any organization operating in a regulated environment that requires 100% hashed data handling and EU data residency 

FAQ: Deep Dives into Methodology

Does Constella cover the same breaches as HIBP?

Constella already covers approximately 82% of the breaches in HIBP’s index. The remaining roughly 18% consists primarily of exclusive, selectively shared datasets — often partial email-only disclosures used by threat actors to establish credibility, or researcher-published notices. Constella covers these selectively based on signal quality and attribution confidence. HIBP covers less than 2% of Constella’s total breach library. 

Does HIBP have infostealer data?

No. HIBP does not index infostealer packages. This is a capability gap, not a scale gap. Infostealer data — session cookies, VPN credentials, device metadata, targeted application URLs — represents the primary attack vector for ransomware initial access and MFA bypass in 2025 and 2026. Constella processed 51.7 million infostealer packages in 2025. 

Can I use Constella if I already integrate HIBP?

Yes. Constella’s API is designed to be additive. Many organizations start with HIBP for basic breach notification and layer Constella’s API on top for infostealer detection, the additional 18x breach coverage that HIBP does not provide, N-dimensional identity attribute matching, and the monitoring and scoring capabilities HIBP’s API does not offer.

How does Constella handle privacy and data compliance?

Constella supports 100% hashed data operations across the full query and alert pipeline. No plaintext identity data needs to transit your infrastructure. This satisfies GDPR, CCPA, and equivalent data protection frameworks by design. Constella also maintains an EU data center for organizations with data residency requirements. HIBP supports hashed queries for passwords via k-anonymity but does not offer full hashed pipeline operation. 

What is the Identity Exposure Score?

The Identity Exposure Score is a proprietary risk metric Constella attaches to each identity alert that quantifies the severity and sensitivity of an exposure based on the breach source, data types exposed, infostealer involvement, freshness of the data, and other weighted factors. It enables downstream systems to prioritize alerts, trigger different response workflows, and avoid treating a partial email exposure from 2018 the same as a fresh infostealer log containing active session cookies. 

How quickly does Constella detect new breach and infostealer data?

Constella’s breach ingestion scales to 30,000 breach events per week and processes infostealer packages as they appear on monitored criminal markets. In documented cases, infostealer log detection has occurred within hours of infection — enabling credential rotation and session invalidation before the stolen data can be used for account takeover or ransomware staging. 

See the Difference in Your Environment

The best way to understand the gap between credential monitoring and identity intelligence is to run both against your own domain or customer population. Contact us to discuss a proof of value against your actual use case.