Frequently asked questions.

Discovery & indexing

What is NeighborhoodTreasure?

NeighborhoodTreasure is an automated, public index of the live web. We discover websites from open signal sources, enrich them with public metadata, rank them by sustained traffic and freshness, and present the result as a continuously updated feed. There are no accounts, no votes, and no paid placement.

How do you discover new websites?

Domains come from open public sources only: the Tranco top-1M list, Hacker News, GitHub trending and curated awesome-lists, public RSS feeds, public web directories, and the open Common Crawl web index. We never scrape private platforms, bypass paywalls, or use any data that requires authentication.

How often is the index updated?

Continuously. Ingestion cron jobs run at short intervals throughout the day to pick up new domains and refresh signals; ranking, screenshots, and metadata each refresh on their own schedule. The Hot, New, and Rising feeds typically reflect activity from the last few hours.

Why is a particular website not listed?

There are three possible reasons: we have not yet discovered it through our public sources, the domain failed a basic liveness check, or it has been removed (by owner request, for confirmed malware, or under a legal obligation). A site is never delisted because enrichment failed. If we can confirm the domain exists, it stays in the index, even if all we have is a favicon and a hostname.

Do you crawl or scrape websites?

We make plain HTTP requests to read each site's public metadata (title, description, favicon, OpenGraph image), and fall back to a headless renderer for sites that require JavaScript. We respect robots.txt directives that apply to general crawlers and we do not bypass any access controls.
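
For readers who want to see what a robots.txt check looks like in practice, here is a minimal sketch using Python's standard library. The function name, the sample rules, and the user-agent string "ExampleIndexBot" are illustrative assumptions, not our actual crawler configuration.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, used only for illustration.
SAMPLE_ROBOTS_TXT = """User-agent: *
Disallow: /private/
"""

def may_fetch(robots_txt: str, url: str, user_agent: str = "ExampleIndexBot") -> bool:
    """Return True if the given robots.txt rules allow user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes the file's lines, so no network request is made here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)
```

With the sample rules above, a homepage URL would be allowed and anything under /private/ would be skipped.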

How do you decide a site's category and region?

After fetching public text content, an AI pass infers a category, a region, and a brand color from what's on the page. Heuristics handle obvious cases (TLDs, language, well-known platforms); the AI handles the rest. Misclassifications can be reported via the contact page.

Ranking & scores

How is the Hot score calculated?

Hot reflects current activity over a short rolling window. It combines outbound clicks logged on this site, fresh inbound mentions on Hacker News and GitHub, and engagement velocity. The score decays quickly — a site falls out of Hot within a few days unless new signals keep arriving.
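
One way to picture this kind of fast decay is a half-life applied to every signal. The sketch below is only an illustration: the 24-hour half-life, the equal weighting of all signals, and the function names are assumptions, not the production formula.

```python
# Assumed half-life; the real decay rate is not stated in this FAQ.
HALF_LIFE_HOURS = 24.0

def decayed_weight(age_hours: float, half_life_hours: float = HALF_LIFE_HOURS) -> float:
    """Weight of one signal (a click or a mention) that is age_hours old."""
    return 0.5 ** (age_hours / half_life_hours)

def hot_score(signal_ages_hours: list[float]) -> float:
    """Sum of decayed weights over all recent signals for a site."""
    return sum(decayed_weight(age) for age in signal_ages_hours)
```

Under these assumptions a signal that is three days old counts for 0.5 ** 3 = 12.5% of a fresh one, which is why a site drops out of Hot unless new signals keep arriving.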

How is the New score calculated?

New is weighted entirely by first-detection recency. A domain we discovered today ranks above one we discovered yesterday. After about 14 days the New score effectively reaches zero and the site moves into the long-term feeds.
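
A recency-only score can be sketched as follows. The 14-day window comes from the answer above; the linear shape of the decay and the function name are assumptions made for illustration, and the real curve may differ.

```python
from datetime import datetime, timedelta, timezone

# From the FAQ: the New score is effectively zero after about 14 days.
NEW_WINDOW_DAYS = 14.0

def new_score(first_seen: datetime, now: datetime) -> float:
    """Score in [0, 1] that falls linearly (assumed shape) from 1 to 0 over the window."""
    age_days = (now - first_seen).total_seconds() / 86400.0
    # Clamp so very old domains score 0 and clock skew never pushes a score above 1.
    return min(1.0, max(0.0, 1.0 - age_days / NEW_WINDOW_DAYS))
```

Because the score depends only on the first-detection time, a domain discovered today always ranks above one discovered yesterday.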

How is the Rising score calculated?

Rising measures acceleration relative to a site's own baseline. A small site with a sudden 10× jump in signal can outrank a much larger site with steady traffic. It's designed to surface things on the way up, not things that are already large.
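
The idea of measuring a site against its own baseline can be sketched as a simple growth ratio. The function name, the smoothing constant, and the example numbers are hypothetical; the actual Rising formula is not spelled out here.

```python
def rising_score(recent: float, baseline: float, smoothing: float = 1.0) -> float:
    """Growth multiple of recent signal over the site's own baseline.

    The smoothing term (an assumed value) avoids division by zero for
    sites that had no signal at all in their baseline period.
    """
    return (recent + smoothing) / (baseline + smoothing)

# Hypothetical inputs: a small site that jumps about 10x, and a large steady site.
small_site = rising_score(recent=99, baseline=9)
large_site = rising_score(recent=10_999, baseline=9_999)
```

In this sketch the small site scores 10.0 and the large site about 1.1, so the small site ranks higher on Rising even though its absolute traffic is far lower.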

How is the Overall score calculated?

Roughly 50% Tranco global popularity, 25% outbound clicks logged on this site, 15% freshness signals, and 10% quality and category coverage. All inputs decay with time so the index doesn't ossify around what was popular a year ago.
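
As a rough illustration, the blend above can be written as a weighted sum. The weights are the approximate percentages from the answer; the component names, the assumption that each input is already normalized to the range 0 to 1, and the function name are ours for the purpose of this sketch.

```python
# Approximate weights from the FAQ; they sum to 1.0.
WEIGHTS = {
    "tranco": 0.50,     # Tranco global popularity
    "clicks": 0.25,     # outbound clicks logged on this site
    "freshness": 0.15,  # freshness signals
    "quality": 0.10,    # quality and category coverage
}

def overall_score(components: dict[str, float]) -> float:
    """Weighted sum of components, each assumed to be normalized to [0, 1]."""
    return sum(WEIGHTS[name] * components[name] for name in WEIGHTS)

# Hypothetical site: very popular, moderately clicked, not very fresh.
example = overall_score({"tranco": 0.9, "clicks": 0.4, "freshness": 0.2, "quality": 0.5})
```

Time decay is not shown here; in this sketch it would be applied to each component before the weighted sum is taken.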

Where does the estimated monthly-visits number come from?

It's an estimate, not a measurement. We derive it from the site's Tranco rank using a power-law curve calibrated against publicly known traffic data. Treat it as an order-of-magnitude signal: a site ranked around 100 sees billions of visits per month, while a site ranked around 100,000 sees hundreds of thousands.
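
To show what a power-law estimate looks like, the sketch below fits a curve of the form visits = scale × rank^(−alpha) through two anchor points. The anchor values (2 billion visits at rank 100, 300,000 at rank 100,000) are assumptions picked only to match the orders of magnitude quoted above; they are not our calibration data, and the function names are hypothetical.

```python
import math

# Assumed anchor points as (Tranco rank, monthly visits); illustrative only.
ANCHOR_HIGH = (100, 2e9)
ANCHOR_LOW = (100_000, 3e5)

def fit_power_law(p1: tuple[float, float], p2: tuple[float, float]) -> tuple[float, float]:
    """Return (scale, alpha) so that visits = scale * rank ** -alpha passes through both points."""
    (r1, v1), (r2, v2) = p1, p2
    alpha = math.log(v1 / v2) / math.log(r2 / r1)
    scale = v1 * r1 ** alpha
    return scale, alpha

SCALE, ALPHA = fit_power_law(ANCHOR_HIGH, ANCHOR_LOW)

def estimated_monthly_visits(rank: int) -> float:
    """Order-of-magnitude estimate of monthly visits for a Tranco rank."""
    return SCALE * rank ** -ALPHA
```

Any rank between the anchors falls on the same straight line in log-log space, which is why the result should be read as an order of magnitude rather than a precise count.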

Can I pay to rank higher or get featured?

No. There is no paid placement, no sponsored slot, no submission form, and no editorial override. Every rank is produced by the formulas above applied to public signal data.

Do you remove sites for low quality or low traffic?

No. Coverage is the goal. A site never disappears because its score is low — it just sits further down the feed. The only reasons a site is removed are owner removal requests, confirmed malware, or legal obligation.

For site owners

I own a site listed here — how do I update its description, category, or screenshot?

Most listings refresh automatically when we re-fetch your site, so updating the metadata on your homepage (title tag, meta description, OpenGraph image) is the fastest fix. For a manual refresh or a category correction, use the contact page and include the domain.

How do I request removal of my site?

Use the removal page. Verified domain owners can have their listing taken down — we do not require a legal threat or justification. The domain will be excluded from future ingestion as well.

Why does my site's screenshot look outdated or broken?

Screenshots are captured periodically, not on every visit, and some sites block the screenshot renderer with a bot challenge. If the image is stale or missing, request a refresh via the contact page or update your OpenGraph image. When an OpenGraph image is available, we use it in preference to a fresh screenshot.

Does being listed imply endorsement?

No. Listings are automated and based on public signals. Inclusion is not a recommendation, an endorsement, or any kind of relationship between NeighborhoodTreasure and the listed site.

Privacy & data

Do I need an account to use the site?

No. There are no accounts, no logins, and no profiles. Everything you can do here works without signing in.

Do you track me? What analytics do you collect?

We log aggregate, non-identifying signals: outbound link clicks, impressions, share events, and the visitor's country (derived from the request, not from any persistent identifier). We do not use third-party analytics trackers, advertising cookies, or fingerprinting.

Do you sell or share data with third parties?

No. We do not sell, rent, or share visitor data. The aggregate signals we collect are used only to rank and improve the index.

What gets logged when I click an outbound link or share a card?

An outbound click logs the target website, the surface it was clicked from (Hot, New, etc.), and the visitor's country. A share event logs the same plus the channel (X, Facebook, copy link, etc.). No personal identifiers are stored and the events are rate-limited and de-duplicated.

Sources & methodology

What public sources do you use?

Tranco (a research-grade aggregate of public domain rankings), Hacker News, GitHub trending and curated awesome-lists, public RSS feeds from technology and design publications, the Common Crawl open web index, and a small set of public web directories. The full list is on the Data sources page.

Do you use AI? Which models, and for what?

Yes. We use OpenAI's gpt-4o-mini for two narrow tasks: classifying a site into a category and inferring a region from its public text content. The AI never decides whether a site is listed or how it ranks — it only labels. There is a hard daily budget cap to keep costs predictable.

Is the data open? Can I get an export or API?

There is no public API or bulk export today. The site itself, including category, region, and cross-cut pages, is fully crawlable, and an XML sitemap is published at /sitemap.xml. If you have a research use case, get in touch via the contact page.

Sharing, business & legal

Can I share or embed a listing on social media or my own site?

Yes. Every card has a share button (X, Facebook, LinkedIn, Reddit, WhatsApp, copy link) that points to the listing's detail page on this site. You're also welcome to deep-link directly to /site/{domain} from anywhere.

Who runs NeighborhoodTreasure?

NeighborhoodTreasure is operated by StarNest LLC. There is no editorial team and no community moderation layer — the index is fully automated.

How do I contact you or report an issue?

Use the contact page for general questions, classification corrections, or technical issues. Use the removal page for delisting requests. We respond to legitimate requests promptly.

Are you affiliated with any of the sites you list?

No. We have no commercial, editorial, or partnership relationship with the sites in the index. Listings are purely the output of an automated pipeline over public signals.

Still need something? Contact us, request removal of a site, or read the full methodology.