fix(data): filter non-Latin city names + score range clamp (Phase F)

- stg_population_geonames: reject CJK/Cyrillic/Arabic city names via regex
  (fixes "Seelow" showing Japanese characters on map)
- dim_locations: filter empty location names after trim
- location_profiles: defensive LEAST/GREATEST clamp on both scores (0-100)

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@@ -48,6 +48,7 @@ locations AS (
         ref_year
     FROM staging.stg_population_geonames
     WHERE lat IS NOT NULL AND lon IS NOT NULL
+      AND LENGTH(TRIM(city_name)) > 0
 ),
 -- ── EU NUTS-2 income via spatial join ──────────────────────────────────────
 -- Each EU location's (lon, lat) is matched against NUTS-2 boundary polygons.
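
The hunk above covers only the dim_locations change; the regex filter in stg_population_geonames and the score clamp in location_profiles are described in the commit message but not shown here. As a rough illustration of what those two changes might look like, here is a minimal Python sketch. It is not the commit's SQL: the Unicode ranges, the inclusion of Hangul under "CJK", and the helper names `keep_city_name` and `clamp_score` are all assumptions made for this example.

```python
import re

# Assumed script ranges; the commit's actual regex is not shown in this diff.
NON_LATIN = re.compile(
    "["
    "\u0400-\u04FF"  # Cyrillic
    "\u0600-\u06FF"  # Arabic
    "\u3040-\u30FF"  # Hiragana and Katakana
    "\u4E00-\u9FFF"  # CJK Unified Ideographs
    "\uAC00-\uD7AF"  # Hangul syllables (assumed to count as "CJK")
    "]"
)


def keep_city_name(name):
    """Return True if the name is non-empty after trimming and
    contains no character from the rejected scripts."""
    if name is None or len(name.strip()) == 0:
        return False
    return NON_LATIN.search(name) is None


def clamp_score(score, lo=0.0, hi=100.0):
    """Python counterpart of LEAST(GREATEST(score, lo), hi)."""
    return min(max(score, lo), hi)
```

Under these assumptions, Latin names with diacritics such as "München" pass the filter, while a name containing any Katakana, Cyrillic, or Arabic character is rejected outright rather than transliterated.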