dataorc-utils — Config
Lightweight configuration for Data Lake pipelines.
| Topic | Link |
|---|---|
| Immutable config & path pattern | CorePipelineConfig |
| Environment integration | PipelineParameterManager |
| Validation helpers | Defaults and validation |
Quick start
```python
from dataorc_utils.config import PipelineParameterManager

mgr = PipelineParameterManager()
infra = mgr.prepare_infrastructure([
    "datalake_name",
    "datalake_container_name",
])
cfg = mgr.build_core_config(
    infra,
    domain="sales",
    product="orders",
    table_name="order_lines",
)
print(cfg.get_lake_path("bronze"))
print(cfg.get_work_path("bronze"))

# Access infrastructure variables
datalake = cfg.env_vars["datalake_name"]
# Or: infra.variables["datalake_name"]
```
Note: if `env` is not set in the environment, `prepare_infrastructure()` will default it to "dev".
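
The fallback described in the note can be pictured with a small standalone sketch. This is not the library's implementation: `resolve_env` is a hypothetical helper, and both the lookup key `env` and the use of a plain mapping are assumptions made for illustration only.

```python
import os


def resolve_env(environ=None, default="dev"):
    """Hypothetical helper: return the `env` value, or `default` if it is unset.

    `environ` is any mapping of variable names to values; when omitted,
    the process environment (os.environ) is used.
    """
    source = os.environ if environ is None else environ
    # Fall back to the default only when the key is absent.
    return source.get("env", default)


# Usage: an empty mapping stands in for an environment where `env` is not set.
unset_result = resolve_env({})
explicit_result = resolve_env({"env": "prod"})
```

Passing the mapping explicitly keeps the sketch easy to test without modifying the real process environment.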