Methodology
A technical look at how raw seller data becomes the structured rankings on Picky.expert. If you're new here, start with How We Rank for the plain-language version.
1. Ingestion
We continuously pull product listings and offers from retailer sources. Each raw row is stored verbatim with a timestamp and the source URL, so any downstream claim can be traced back.
2. Normalization & deduplication
The same product is often listed under dozens of titles ("Apple iPhone 15 128GB Black Unlocked", "iPhone 15 — 128 GB"). We normalize names with an AI model constrained to the format Brand + Model, then auto-merge entries with matching normalized names as variants. A canonical record is created; the other listings become variants attached to it.
3. Attribute extraction
Specs are extracted from listing text and manufacturer pages by structured prompts that return JSON conforming to a per-category schema. Extracted values are then validated against allowed types and ranges; out-of-bounds values are rejected, not silently coerced.
4. Taxonomy maintenance
Product categories and tags are kept consolidated. Synonyms (e.g., "mobile phone", "smartphone") collapse to a single canonical value with the rest stored as aliases that 301-redirect. Action-style tags ("comparison", "troubleshooting") are blocked at extraction time so the taxonomy stays focused on product entities.
5. Refresh cadence
- Offer prices: refreshed on a rolling cycle, typically daily.
- Product attributes: refreshed when source listings change.
- Ratings: re-pulled on the same cycle as offers.
- Taxonomy: maintenance pass runs continuously in the background.
6. Human review checkpoints
AI is fast but not infallible. Human reviewers spot-check normalized names, validate new categories before they become landing pages, and review every community-submitted edit before it merges into the canonical record.
7. Known limitations
- Coverage depth varies by category — categories with fewer listed products produce fewer comparison pages.
- Prices reflect the last refresh, not the live retailer price at click time. Always confirm on the seller's site before purchasing.
- Ratings inherit any bias present in the retailer source they came from.
Find an error in the methodology or its output? Tell us on the contact page.