Discussion about this post

User's avatar
Omar's avatar

Bit late to the post. We use airflow these days to trigger a databricks dbt job to run the marts.

Each dag in airflow triggers one mart, so we have 400+ marts.

Biggest downside is that we are using airflow datasets as a scheduler and it is extremely buggy. We are stuck in 2.10.x and won’t be upgrading anytime soon due to migrating to dbks

My question is: have you ever had issues with dataset scheduling in airflow?

Dennys's avatar

what if you have more than 400 DBT models? is the DBTDag still the best option?

Is better to have 400 tasks instead of just 3?

5 more comments...

No posts

Ready for more?