How to Run the End-to-End Pipeline

This guide explains how to orchestrate the full GeoLift workflow using the installed geolift pipeline command.

Prerequisites

  • A prepared dataset (CSV format)
  • A configuration YAML file
  • GeoLift installed

Running the Pipeline

Packaged-install example:

geolift pipeline --config /path/to/geolift_analysis_config.yaml

Source-checkout example with the shipped demo configs:

geolift pipeline --config data-config/geolift_analysis_config.yaml

Pipeline Config Behaviour

  • --config must point to one canonical stage YAML file
  • in a source checkout, GeoLift resolves the sibling canonical YAMLs from the same directory
  • GeoLift does not use a separate pipeline_config.yaml
  • built artifacts do not include the repo’s data-config/; packaged users should supply their own YAML path instead

Stage Selection

You can run partial pipelines by skipping specific stages:

  • --skip-power: Skips the power analysis stage
  • --skip-donor: Skips the donor evaluation stage
  • --only-inference: Runs only the final inference stage

Report Generation

Control report generation with:

  • --report (enabled by default)
  • --no-report

Output Directory

Specify where results should be saved:

geolift pipeline \
  --config data-config/geolift_analysis_config.yaml \
  --output-dir results/campaign_a

This will generate the following structure:

  • results/campaign_a/multicell_power_analysis/
  • results/campaign_a/multicell_donor_eval/
  • results/campaign_a/multicell_geolift_analysis/
  • results/campaign_a/geolift_pipeline_report.md
  • results/campaign_a/geolift_pipeline_report.html

For a source-tree walkthrough using the shipped demo configs, start with Quick Start.