Summary
Data engineers may become obsolete due to the new Genie code in Databricks, which generates PySpark code.
Genie code in Databricks transforms the role of data engineers
Databricks has launched Genie code, which automatically generates PySpark code from a source-to-target mapping (STTM) using feedback queries. This lets users create notebooks faster, a task that traditionally costs data engineers significant time and effort, although the generated code is only around 50% accurate.
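To make the idea of a source-to-target mapping concrete, here is a minimal sketch of how mapping rows could be turned into PySpark source text. It is not how Genie code works internally; it uses plain string templating rather than AI. The mapping rows, the table names, and the `render_pyspark` helper are all hypothetical and chosen only for illustration.

```python
# Hypothetical STTM rows: column names and casts are invented for this example.
STTM = [
    {"source": "cust_id", "target": "customer_id", "cast": "bigint"},
    {"source": "fname", "target": "first_name", "cast": None},
    {"source": "signup_dt", "target": "signup_date", "cast": "date"},
]


def render_pyspark(mapping, source_table, target_table):
    """Render PySpark source text from mapping rows (simple templating, no AI)."""
    lines = [
        "from pyspark.sql import functions as F",
        "",
        f'df = spark.table("{source_table}")',
        "df = df.select(",
    ]
    for row in mapping:
        # Each row becomes one select expression: column, optional cast, alias.
        expr = f'F.col("{row["source"]}")'
        if row["cast"]:
            expr += f'.cast("{row["cast"]}")'
        lines.append(f'    {expr}.alias("{row["target"]}"),')
    lines.append(")")
    lines.append(f'df.write.mode("overwrite").saveAsTable("{target_table}")')
    return "\n".join(lines)


# Table names are assumptions, not taken from the article.
generated = render_pyspark(STTM, "raw.customers", "curated.customers")
print(generated)
```

The printed text is the kind of notebook cell a data engineer would otherwise write by hand from the mapping. With a reported accuracy of around 50%, generated code of this kind would still need review before it runs against real tables.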
Why this matters for BI professionals
This innovation may lead to a shift in the data engineering sector as the need for traditional data engineers diminishes. Competitors like Google BigQuery and Snowflake will need to respond to these changes, especially as automation becomes increasingly commonplace in data analysis. The trend toward AI-driven solutions is accelerating and could drastically change how teams operate, shifting the focus toward strategic decision-making rather than execution tasks.
Concrete takeaway
BI professionals should monitor the rise of automation in the data engineering space and be prepared to adapt their skills to new technologies like Genie code to remain relevant.