1.Data Acquisition-Blog-Factory Audit China & Supplier Verification

1.Data Acquisition

Blog | February 19, 2026

To verify factory maps using satellite data, follow this structured approach combining geospatial analysis, machine learning, and manual validation:

Satellite Imagery: Obtain high-resolution imagery (≤1m/pixel) from sources like:
- Commercial: Maxar, Planet Labs, Airbus
- Open: Sentinel-2 (10m), Landsat (30m), Google Earth Engine
Factory Map Data: Use vector formats (GeoJSON, Shapefile) with factory coordinates/polygons.
Ancillary Data: DEMs (elevation), road networks, land-use classifications.

Preprocessing

Image Enhancement:
- Apply pansharpening (if using multispectral + panchromatic bands).
- Use atmospheric correction (e.g., DOS1) and cloud masking.
Coordinate Alignment:
- Reproject all data to a common CRS (e.g., UTM).
- Georeference satellite imagery using GCPs or RPCs.
Map Validation:
- Clean map data (remove duplicates, fix topology errors).
- Ensure timestamps align (use imagery close to map creation date).

Automated Verification with AI/ML

Building Detection:
- Train a model (U-Net, Mask R-CNN) on labeled satellite data to detect factory buildings.
- Use transfer learning with pre-trained models (e.g., DeepGlobe, xView).
Feature Extraction:
- Structural Features: Identify large rectangular buildings, parking lots, storage tanks.
- Spectral Signatures: Analyze NDVI (vegetation absence), NDBI (built-up areas).
- Contextual Clues: Proximity to roads/railways, zoning data.

Comparison Workflow:

# Pseudocode for matching factories
for factory in map_data:
    satellite_region = crop_satellite_image(factory.bbox, buffer=50m)
    detected_buildings = building_detection_model(satellite_region)
    if not detected_buildings:
        mark_as_false_positive(factory)
    else:
        check_overlap(factory.polygon, detected_buildings)

Quantitative Metrics

Spatial Accuracy:
- Calculate Hausdorff Distance between mapped and detected factory polygons.
- Use Intersection over Union (IoU) for building-level matching.
Statistical Measures:
- Precision/Recall: Precision = TP / (TP + FP), Recall = TP / (TP + FN)
- F1-Score: Harmonic mean of precision/recall.
Error Categorization:
- False Positives: Mapped factories with no satellite evidence.
- False Negatives: Undetected factories in imagery.
- Location Shifts: Distance discrepancies (e.g., >30m).

Manual Validation

Visual Inspection:
- Use GIS tools (QGIS, ArcGIS) to overlay maps on imagery.
- Flag ambiguous cases for expert review.
Ground Truthing:
- Validate discrepancies with field surveys or drone imagery.
- Cross-reference with OSM or OpenAddresses.

Iterative Refinement

Model Improvement:
- Retrain ML models with false positives/negatives as new training data.
- Augment data with rotated/flipped factory images.
Threshold Adjustment:
Tune IoU/distance thresholds based on region-specific factors (e.g., urban vs. rural).

Tools & Platforms

Processing: Google Earth Engine, ArcGIS Pro, QGIS, Python (GDAL, Rasterio, GeoPandas).
ML Frameworks: TensorFlow, PyTorch, Detectron2.
Cloud Platforms: AWS SageMaker, Google Vertex AI.

Example Workflow

Input: Factory map (GeoJSON) + Sentinel-2 imagery.
Preprocessing: Reproject to EPSG:32633, apply cloud masking.
AI Detection: Run U-Net to segment factory buildings.
Comparison: Compute IoU for each mapped factory vs. detected polygons.
Output: Report with:
- Precision: 85%
- Recall: 78%
- F1-Score: 81%
- Top 5 false positives (e.g., mapped locations in forests).

Challenges & Mitigations

Occlusion: Use SAR data (e.g., Sentinel-1) for cloudy regions.
Scale Variations: Employ multi-scale detection (sliding windows).
Temporal Changes: Prioritize recent imagery; flag outdated maps.
Data Gaps: Blend open-source (Sentinel) + commercial (Maxar) data.

By combining automated AI-driven analysis with human validation, you can efficiently verify factory maps with high accuracy, ensuring reliability for applications like urban planning, supply chain audits, or environmental monitoring.

Previous: Why Location Verification Matters for Compliance:A Critical Business Imperative

Next: 1.Protection of Trade Secrets Intellectual Property: