You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The manual should be fully operational, with all outdated information removed to prevent confusion.
Future of the kedro-airflow-k8s plugin: There needs to be a decision on whether to continue recommending the kedro-airflow-k8s plugin, especially since it's noted to be compatible with kedro versions less than 0.18.
My suggestion is to move away from recommending the astro-airflow-iris starter, considering it's outdated and not specifically required for running Kedro projects on Airflow with Astronomer. It may be clearer for users to start with the standard spaceflights-pandas starter available through the kedro new command with options [1-5], ensuring a more streamlined and up-to-date starting point.
Current Strategy:
The general strategy to deploy a Kedro pipeline on Apache Airflow is to run every Kedro node as an Airflow task while the whole pipeline is converted into a DAG for orchestration purpose. This approach mirrors the principles of running Kedro in a distributed environment.
should be discussed and confirmed, as it has several drawbacks regarding the approach of running a new Kedro session for each node:
it can be time consuming
it does not accommodate Memory datasets, necessitating their specification and storage within the dataCatalog. This limitation should be explicitly addressed in the docs to ensure clarity for users.
If strategy will be confirmed it might be advantageous to enhance the kedro-airflow plugin with to not only generate the DAG but also create an Airflow configuration folder, including a tailored version of config.yml that incorporates all MemoryDatasets.
Given the known issues with default logging via the Rich library, the manual should include a section advising on switching from Rich to Console logging, complete with detailed instructions to facilitate this change and ensure DAG operability.
Guidance on automatically transferring files from the Airflow container back to the user's local folder maybe benefit users.
Incorporating a section that outlines steps for deploying kedro Airflow project to cloud services such as AWS, Azure, and Google Cloud, AstroCloud.
The text was updated successfully, but these errors were encountered:
The outcome of this ticket is to fix 1,2,3,5 and 6. The idea is to improve the existing documentation and make it run properly. There other points mentioned here will be tackled in separate tickets.
Comments 1, 3, 5, and 6 have been addressed in kedro-org/kedro#3792. Follow-up tickets have been created for comments 2 and 7 at #652 and #651 respectively. Regarding the changes in deployment strategy proposed by comment 4, I currently do not have any ideas for modifications. However, we should discuss the deployment strategy during a Technical Design session to share knowledge, explore possibilities, and confirm the current strategy.
Description
The Airflow Deployment Manual appears to require updates:
kedro-airflow-k8s
plugin: There needs to be a decision on whether to continue recommending the kedro-airflow-k8s plugin, especially since it's noted to be compatible with kedro versions less than 0.18.astro-airflow-iris
starter, considering it's outdated and not specifically required for running Kedro projects on Airflow with Astronomer. It may be clearer for users to start with the standardspaceflights-pandas
starter available through thekedro new
command with options [1-5], ensuring a more streamlined and up-to-date starting point.should be discussed and confirmed, as it has several drawbacks regarding the approach of running a new Kedro session for each node:
If strategy will be confirmed it might be advantageous to enhance the
kedro-airflow
plugin with to not only generate the DAG but also create an Airflow configuration folder, including a tailored version ofconfig.yml
that incorporates all MemoryDatasets.The text was updated successfully, but these errors were encountered: