Methodology

A technical look at how raw seller data becomes the structured rankings on Picky.expert. If you're new here, start with How We Rank for the plain-language version.

1. Ingestion

We continuously pull product listings and offers from retailer sources. Each raw row is stored verbatim with a timestamp and the source URL, so any downstream claim can be traced back.

2. Normalization & deduplication

The same product is often listed under dozens of titles ("Apple iPhone 15 128GB Black Unlocked", "iPhone 15 — 128 GB"). We normalize names with an AI model constrained to the format Brand + Model, then auto-merge entries with matching normalized names as variants. A canonical record is created; the other listings become variants attached to it.

3. Attribute extraction

Specs are extracted from listing text and manufacturer pages by structured prompts that return JSON conforming to a per-category schema. Extracted values are then validated against allowed types and ranges; out-of-bounds values are rejected, not silently coerced.

4. Taxonomy maintenance

Product categories and tags are kept consolidated. Synonyms (e.g., "mobile phone", "smartphone") collapse to a single canonical value with the rest stored as aliases that 301-redirect. Action-style tags ("comparison", "troubleshooting") are blocked at extraction time so the taxonomy stays focused on product entities.

5. Refresh cadence

Offer prices: refreshed on a rolling cycle, typically daily.
Product attributes: refreshed when source listings change.
Ratings: re-pulled on the same cycle as offers.
Taxonomy: maintenance pass runs continuously in the background.

6. Human review checkpoints

AI is fast but not infallible. Human reviewers spot-check normalized names, validate new categories before they become landing pages, and review every community-submitted edit before it merges into the canonical record.

7. Known limitations

Coverage depth varies by category — categories with fewer listed products produce fewer comparison pages.
Prices reflect the last refresh, not the live retailer price at click time. Always confirm on the seller's site before purchasing.
Ratings inherit any bias present in the retailer source they came from.

Find an error in the methodology or its output? Tell us on the contact page.