Python Support Library

Conductor includes a Python library that provides utilities for tasks that are implemented as Python scripts. For example, you might use a Python script to process experiment results, transform data, or create figures. As long as you have installed Conductor, you can access the Python library by importing conductor.lib.

import conductor.lib as cond

Note

The functions in this library are meant to be called from scripts that are launched by Conductor. When called from a script that Conductor did not launch, they raise RuntimeError. Functions whose descriptions state otherwise, such as in_output_dir(), are exceptions.
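Because these functions raise RuntimeError outside of Conductor, a script that should also run standalone can catch the exception and fall back to a default. The sketch below is illustrative only: `output_dir_or_default` and its `default` parameter are hypothetical names, not part of Conductor's library, and the lookup function is passed in as an argument so the helper makes no assumption about how the library is imported.

```python
import pathlib
from typing import Callable, Union


def output_dir_or_default(
    get_output_path: Callable[[], pathlib.Path],
    default: Union[str, pathlib.Path] = ".",
) -> pathlib.Path:
    """Hypothetical helper: return the task's output path, or `default`
    when the lookup raises RuntimeError (not launched by Conductor)."""
    try:
        return pathlib.Path(get_output_path())
    except RuntimeError:
        # Not running under Conductor: fall back to a caller-chosen directory.
        return pathlib.Path(default)
```

In a task script, this would be called as `output_dir_or_default(cond.get_output_path)`.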

Path Utilities

`get_deps_paths()`

def get_deps_paths() -> List[pathlib.Path]

Returns a list of this task's dependencies' output paths. If the task has no dependencies, the returned list will be empty.
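As a rough sketch of consuming that list, the helper below collects matching files from each dependency's output directory and naturally yields an empty result when there are no dependencies. `find_dep_files` and its `pattern` parameter are hypothetical names introduced for illustration; the sketch also assumes each returned path is a directory.

```python
import pathlib
from typing import Iterable, List


def find_dep_files(
    dep_paths: Iterable[pathlib.Path], pattern: str = "*.csv"
) -> List[pathlib.Path]:
    """Hypothetical helper: collect files matching `pattern` from each
    dependency's output directory, in the order the paths were given."""
    found: List[pathlib.Path] = []
    for dep_path in dep_paths:
        # sorted() gives a deterministic order within each dependency.
        found.extend(sorted(dep_path.glob(pattern)))
    return found
```

In a task script, this would be called as `find_dep_files(cond.get_deps_paths())`.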

`get_output_path()`

def get_output_path() -> pathlib.Path

Returns the path where this task should write its output files.

Usage Example

The following snippets show how to use Conductor's Python library to retrieve the output paths of a task's dependencies as well as the task's own output path.

COND
run_experiment(
  name="benchmark_1",
  run="./run_benchmark_1.sh",
)

run_experiment(
  name="benchmark_2",
  run="./run_benchmark_2.sh",
)

run_command(
  name="combine",
  run="python combine_results.py",
  deps=[
    ":benchmark_1",
    ":benchmark_2",
  ],
)

combine_results.py
import conductor.lib as cond
import pandas as pd


def main():
    """
    Combines the CSV files written by this task's dependencies.
    """
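    results = []
    # Each entry is the output directory of one dependency listed in `deps`.
    deps = cond.get_deps_paths()

    for dep_path in deps:
        # Assumes each benchmark wrote measurements.csv into its output directory.
        results.append(pd.read_csv(dep_path / "measurements.csv"))
    combined = pd.concat(results, ignore_index=True)

    # Write the combined table into this task's own output directory.
    out_dir = cond.get_output_path()
    combined.to_csv(out_dir / "combined.csv", index=False)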


if __name__ == "__main__":
    main()

`in_output_dir()`

def in_output_dir(file_path: pathlib.Path | str) -> pathlib.Path

If the current script is being run by Conductor, this function returns file_path relocated under the directory where the task's outputs should be stored. Otherwise, it returns file_path unchanged (but as a pathlib.Path).

This function is intended for scripts that may also be run independently of Conductor. Note that file_path should be a relative path.
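A minimal sketch of that pattern follows. The `resolve_output` wrapper and the file name `throughput.pdf` are hypothetical, and the ImportError fallback is an extra assumption of this sketch (that the script may also run where Conductor is not installed); it is not something the library itself provides.

```python
import pathlib
from typing import Union

try:
    import conductor.lib as cond
except ImportError:
    # Assumption for this sketch: the script may run on a machine
    # where Conductor is not installed at all.
    cond = None


def resolve_output(file_path: Union[str, pathlib.Path]) -> pathlib.Path:
    """Hypothetical wrapper around in_output_dir() for a relative path."""
    if cond is None:
        return pathlib.Path(file_path)
    # Under Conductor this lands in the task's output directory;
    # otherwise the path comes back unchanged as a pathlib.Path.
    return cond.in_output_dir(file_path)


figure_path = resolve_output("throughput.pdf")
```

The script can then write to `figure_path` without checking how it was launched.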

`where()`

def where(
    identifier: str,
    relative_to_project_root: bool = False,
    non_existent_ok: bool = False,
) -> Optional[pathlib.Path]

Returns the output location path of the given task identifier. This function will only work when executed from inside a Conductor project (i.e., in a path that is under the project root). This function is useful when retrieving experimental results in scripts or notebooks.

If this function returns None, it indicates no output location is available (e.g., the task has not run before).

If relative_to_project_root is set to True, this will return a path relative to the project root. Otherwise, it returns an absolute path.

If non_existent_ok is set to True, this will return the task's output path even if the path does not yet exist.
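Since where() can return None, callers generally need to check the result before building file paths from it. The sketch below shows one way to do so; `results_file` is a hypothetical helper, and the lookup function is passed in as an argument so the helper can be exercised outside a Conductor project.

```python
import pathlib
from typing import Callable, Optional


def results_file(
    where: Callable[[str], Optional[pathlib.Path]],
    identifier: str,
    file_name: str,
) -> pathlib.Path:
    """Hypothetical helper: path to `file_name` inside a task's output location."""
    out_dir = where(identifier)
    if out_dir is None:
        # No output location is available (e.g., the task has not run before).
        raise FileNotFoundError(
            f"No output location for task {identifier!r}; has it been run?"
        )
    return out_dir / file_name
```

In a script or notebook inside a Conductor project, this would be called as `results_file(cond.where, identifier, "measurements.csv")`, where `identifier` is the identifier of a task in your project.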
