News

With data lineage for Unity Catalog, data teams can see all the downstream consumers impacted by data changes – applications, dashboards, machine learning models or data sets, etc. – and ...
Netflix's data-science team has open-sourced its Metaflow Python library, a key part of the 'human-centered' machine-learning infrastructure it uses for building and deploying data-science workflows.
Databricks Inc. today is adding data lineage features to its Unity Catalog governance platform, a move that it says significantly expands data governance capabilities on the hybrid data warehouse ...
However, there has been a lack of open source Python library for developers designed around AI workflows. dltHun (short for data load tool), a Berlin-based startup, thinks it may have the solution.
Key features of Unity Catalog include automated run-time lineage to capture all lineage generated in Databricks, providing more accuracy and efficiency versus manually tagging data.