How the index works.
What we index
Any public website we can reach. A site is never removed because enrichment failed — if we can confirm the domain exists, it stays in the index with whatever data we have, even if that's just a favicon and a hostname.
Where domains come from
Domains are discovered from open, public sources only: the Tranco top-1M list, Hacker News, GitHub trending and curated awesome-lists, public RSS feeds, public web directories, and the open web index (Common Crawl).
We never scrape private platforms, bypass paywalls, or use data that requires authentication.
How we enrich a listing
Each new domain is fetched with a plain HTTP request to read its public metadata (title, description, favicon, OpenGraph image). Sites that require JavaScript to render fall back to a headless renderer. Once we have text content, an AI pass infers a category, a region, and a brand color.
None of this changes whether a site is listed. Enrichment failure only changes how the card looks — never whether it appears.
How we rank
Each listing accumulates signals over time and rolls up into four scores: Hot (current activity), New (first-detection recency), Rising (acceleration vs. baseline), and Overall. Tranco rank is a heavy input on Overall (~50%), with outbound clicks (~25%) and freshness signals (~15%) adding to it. All weights decay with time.
The default feed mixes Hot, Rising, and New entries and de-duplicates by domain so a single site never dominates the page.
Estimated monthly visits
The visit number on each card is an estimate, not measured traffic. It's derived from the site's Tranco rank using a power-law approximation calibrated to publicly known traffic data. Treat it as an order-of-magnitude signal, not a precise figure.
Quality and safety filters
A blocklist removes adult, gambling, and link-shortener domains. A quality and spam score filters out parked pages and obvious low-content junk. A health checker only marks a site offline after ten consecutive failed probes — short outages are never enough to hide a listing.
What we do not do
No user accounts, profiles, votes, comments, follows, or social features. No fake engagement counts. No paid placement. No AI-generated reviews or editorial summaries — descriptions come from each site's own public metadata.
Removal
If you operate a site listed here and would like it removed or corrected, see the removal page.
