Use the @py_job and @parallelized_job decorators for complex transformations in Python
@sql_job
is great for SQL-based transformations, some logic is too complex or better expressed in Python. The @py_job
and @parallelized_job
decorators are designed for these scenarios.
False
. Whether or not the function will work on the entire data or return it as a generator.
dft plan
, Datafruit checks if all referenced tables and columns exist in the current schema.table_name
or query
parametersdft plan
before applying changes@parallelized_job
handles the parallel execution across your entire dataset using Ray.
@parallelized_job
is to insert data row by row