Hi! I’m considering adopting Kedro for a project I’m working on, but seem to have arrived at a limitation for my use case.
I would like to run the same pipeline on different sets of input data. Say, I want to run a pipeline for country A and country B. The country is defined in the SQL query as follows:
SELECT * FROM input_data WHERE country = {country}
Is there a way I could define the country as a parameter in the CLI? e.g kedro run --params country=a
. In essence, use params for data catalogs (not just for nodes).
Also, is there a way in which I could load the processed data by specifying the country? Sort of like a tag, e.g. country_data = io.load("processed_data:country_a")
Thank you!