How this works (plain English)

Last updated: 2026-04-02

This tool helps you check whether your private AI chat content may have ended up in public training datasets. We built it for everyone, not just technical users.

What happens when you search

  1. You type a query (like an email, phone number, name, or unusual phrase).
  2. We send that query to public HuggingFace dataset search endpoints.
  3. We check for matches in known high-risk datasets.
  4. We show you redacted previews and match confidence.
  5. We immediately discard your query and results after the response is returned.
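For the technically curious, the steps above can be sketched in a few lines of Python. This is a minimal illustration, not our actual implementation: it assumes the public Hugging Face datasets-server full-text search endpoint (`https://datasets-server.huggingface.co/search`), and the `redact_preview` helper and its masking style are hypothetical examples of how a redacted snippet could be produced.

```python
from urllib.parse import urlencode

# Public Hugging Face full-text search endpoint (assumed for this sketch).
SEARCH_BASE = "https://datasets-server.huggingface.co/search"

def build_search_url(dataset: str, query: str, config: str = "default",
                     split: str = "train", length: int = 10) -> str:
    """Step 2: build a search request URL for one public dataset."""
    params = {"dataset": dataset, "config": config, "split": split,
              "query": query, "offset": 0, "length": length}
    return f"{SEARCH_BASE}?{urlencode(params)}"

def redact_preview(text: str, query: str, context: int = 12) -> str:
    """Step 4 (hypothetical helper): show a short window around the match,
    with the matched query itself masked out."""
    i = text.lower().find(query.lower())
    if i == -1:
        return ""  # no match in this row
    start = max(0, i - context)
    end = min(len(text), i + len(query) + context)
    window = text[start:end]
    # Mask the matched span so the sensitive value is never displayed.
    masked = (window[: i - start]
              + "█" * len(query)
              + window[i - start + len(query):])
    prefix = "…" if start > 0 else ""
    suffix = "…" if end < len(text) else ""
    return prefix + masked + suffix
```

In a full client you would fetch `build_search_url(...)` for each dataset in the registry, run every returned row through `redact_preview`, display the redacted snippets, and then drop the query and raw results (step 5). Nothing here persists anything to disk.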

What we are

What we are not

What we store

We store public dataset registry metadata (dataset names and links) and basic service health metrics (uptime, latency, and error rates). We never store raw user query content or raw match content.

What your result means

What to do if you find a match

  1. Use the "Report to dataset host" button.
  2. File an FTC complaint if you want a regulatory paper trail.
  3. Review your state privacy rights.

Short version

We help you check public datasets. We do not keep your sensitive search data. We show only redacted snippets. We give you next steps.

We'll also help you figure out what to do next. Why are we doing this? Because someone needs to, and at HPL Company we believe people like us broke this. It's our responsibility to help you fix it. Have questions? Need more help? Did we miss something? Let us know.

Questions or corrections: hello@hplcompany.com