Datafold Blog
Our thoughts and insight in the world of data.
Our thoughts and insight in the world of data.
Open source data-diff automates data quality checks for data replication and migration.
Read More
Read More
Open source data-diff automates data quality checks for data replication and migration.
Read More
Data diffing is the process of comparing two datasets. See various ways to compare data at different levels of complexity.
Read More
Read More
Data diffing is the process of comparing two datasets. See various ways to compare data at different levels of complexity.
Read More
Learn best practices for how to write and manage dbt tests in your organization.
Read More
Read More
Learn best practices for how to write and manage dbt tests in your organization.
Read More
Datafold has launched new pricing to make data quality more accessible for analytics engineers and data engineers.
Read More
Read More
Datafold has launched new pricing to make data quality more accessible for analytics engineers and data engineers.
Read More
It's official Datafold is now SOC2 Type II compliant. We follow a security by design approach to our software development process and are focused on keeping our customers' data safe.
Read More
Read More
It's official Datafold is now SOC2 Type II compliant. We follow a security by design approach to our software development process and are focused on keeping our customers' data safe.
Read More
Datafold has partnered with dbt Labs and has launched an integration with dbt to deliver column-level lineage, data diff, and shareable impact reports for analytics engineers.
Read More
Read More
Datafold has partnered with dbt Labs and has launched an integration with dbt to deliver column-level lineage, data diff, and shareable impact reports for analytics engineers.
Read More
2021 was a big year for Datafold. We reflect on top feature updates, blogs, and major company announcements from the past year.
Read More
Read More
2021 was a big year for Datafold. We reflect on top feature updates, blogs, and major company announcements from the past year.
Read More
Get an overview of the Data Quality Meetup #6. With speakers from Yelp, Patreon, Convoy, and Lightdash, the event included lightning rounds on data quality best practices and approaches from leading data-driven companies.
Read More
Read More
Get an overview of the Data Quality Meetup #6. With speakers from Yelp, Patreon, Convoy, and Lightdash, the event included lightning rounds on data quality best practices and approaches from leading data-driven companies.
Read More
Datafold Founder and CEO, Gleb Mezhanskiy, shares what prompted Datafold's creation, how it has grown, and plans for the future.
Read More
Read More
Datafold Founder and CEO, Gleb Mezhanskiy, shares what prompted Datafold's creation, how it has grown, and plans for the future.
Read More
What should you be looking for when doing data QA with Data Diff? There are three core checks that can help prevent surprises in production dashboards, and this blog walks you through what you're looking for in each step.
Read More
Read More
What should you be looking for when doing data QA with Data Diff? There are three core checks that can help prevent surprises in production dashboards, and this blog walks you through what you're looking for in each step.
Read More
There are plenty of rules around PII, but you can stay on top of where your sensitive data is flowing in your pipelines with column-level lineage.
Read More
Read More
There are plenty of rules around PII, but you can stay on top of where your sensitive data is flowing in your pipelines with column-level lineage.
Read More
Bad data cost Samsung and Uber ridiculous sums of money with issues that could have been averted if they had been invested in data quality management. Read about their mistakes, and see how you could avoid doing the same.
Read More
Read More
Bad data cost Samsung and Uber ridiculous sums of money with issues that could have been averted if they had been invested in data quality management. Read about their mistakes, and see how you could avoid doing the same.
Read More
If you want column-level lineage but you prefer tools like Amundsen or Data Hub, Datafold's GraphQL API lets you bring your metadata with you.
Read More
Read More
If you want column-level lineage but you prefer tools like Amundsen or Data Hub, Datafold's GraphQL API lets you bring your metadata with you.
Read More
Without proactive data quality management, mistakes will happen. What you do can help improve your data quality in the future. Data quality post-mortems are a valuable tool for building improved processes and systems, plus rebuilding stakeholder trust.
Read More
Read More
Without proactive data quality management, mistakes will happen. What you do can help improve your data quality in the future. Data quality post-mortems are a valuable tool for building improved processes and systems, plus rebuilding stakeholder trust.
Read More
It can be hard to even answer the question "is our data in good shape?" but these teams have gone on a journey towards improved data quality management. Here's how.
Read More
Read More
It can be hard to even answer the question "is our data in good shape?" but these teams have gone on a journey towards improved data quality management. Here's how.
Read More
Doordash, Truebill, Appfolio, Evidently.ai, and Narrator share valuable insights at the fifth Data Quality Meetup hosted by Datafold.
Read More
Read More
Doordash, Truebill, Appfolio, Evidently.ai, and Narrator share valuable insights at the fifth Data Quality Meetup hosted by Datafold.
Read More
SOC 2 compliance is a major step on our security journey. Here are some lessons we learned, as well as what Datafold's compliance means for your business.
Read More
Read More
SOC 2 compliance is a major step on our security journey. Here are some lessons we learned, as well as what Datafold's compliance means for your business.
Read More
In July 2021, Datafold co-founder and CEO Gleb Mezhanskiy went on the Data Engineering Podcast to share his thoughts about a proactive approach to data quality management.
Read More
Read More
In July 2021, Datafold co-founder and CEO Gleb Mezhanskiy went on the Data Engineering Podcast to share his thoughts about a proactive approach to data quality management.
Read More
If you're looking to build the ideal modern data stack for analytics using only open-source options, this is the blog for you. Find all the best open-source alternatives to your favorite paid tools.
Read More
Read More
If you're looking to build the ideal modern data stack for analytics using only open-source options, this is the blog for you. Find all the best open-source alternatives to your favorite paid tools.
Read More
Data quality is increasingly a top KPI for data teams, even as multiple sources of data are making it harder to maintain data quality and reliability. These tools can facilitate quality data at every step.
Read More
Read More
Data quality is increasingly a top KPI for data teams, even as multiple sources of data are making it harder to maintain data quality and reliability. These tools can facilitate quality data at every step.
Read More
Lightdash is an open-source alternative to Looker that natively integrates with dbt. It may not be as mature as other open-source products like Metabase, Querybook, or Superset, but it is different in a few essential ways.
Read More
Read More
Lightdash is an open-source alternative to Looker that natively integrates with dbt. It may not be as mature as other open-source products like Metabase, Querybook, or Superset, but it is different in a few essential ways.
Read More
Learn what steps your team needs to take to improve data quality and get the most out of your data.
Read More
Read More
Learn what steps your team needs to take to improve data quality and get the most out of your data.
Read More
Data quality is always evolving, so where is it in 2021? We asked and you answered - here are the results.
Read More
Read More
Data quality is always evolving, so where is it in 2021? We asked and you answered - here are the results.
Read More
Learn what steps your team needs to take to improve data quality and get the most out of your data.
Read More
Read More
Learn what steps your team needs to take to improve data quality and get the most out of your data.
Read More
Lyft vs. Shopify in testing ETL at scale, using fake data to align your stakeholders, and how to avoid nuclear meltdowns in your data platform.
Read More
Read More
Lyft vs. Shopify in testing ETL at scale, using fake data to align your stakeholders, and how to avoid nuclear meltdowns in your data platform.
Read More
Good Data: How Spotify, Shopify & Lyft approach data quality
Read More
Read More
Good Data: How Spotify, Shopify & Lyft approach data quality
Read More
Why implement regression testing for ETL code changes, how to align data producers and consumers, and what Data teams at Carta, Thumbtack, Shopify & Clari do to solve data quality.
Read More
Read More
Why implement regression testing for ETL code changes, how to align data producers and consumers, and what Data teams at Carta, Thumbtack, Shopify & Clari do to solve data quality.
Read More
Take your ETL workflow to the next level with Datafold and dbt integration that automates data testing and provides column-level data lineage
Read More
Read More
Take your ETL workflow to the next level with Datafold and dbt integration that automates data testing and provides column-level data lineage
Read More
The more people that are looking at the data, and the more apps that are using the data, the faster data quality issues will be identified and resolved.
Read More
Read More
The more people that are looking at the data, and the more apps that are using the data, the faster data quality issues will be identified and resolved.
Read More
On the second Data Quality Meetup, we discussed three types of data testing and when to apply them, new-generation ETL frameworks and ROI of open-source data catalogs.
Read More
Read More
On the second Data Quality Meetup, we discussed three types of data testing and when to apply them, new-generation ETL frameworks and ROI of open-source data catalogs.
Read More
Over the past 10 years, we've seen a great advancement in technologies and tools for analytics and machine learning: with today’s modern analytics stack, we have fast and scalable data warehouses, dirt-cheap data storage, capable ETL orchestrators, and powerful BI tools.
Read More
Read More
Over the past 10 years, we've seen a great advancement in technologies and tools for analytics and machine learning: with today’s modern analytics stack, we have fast and scalable data warehouses, dirt-cheap data storage, capable ETL orchestrators, and powerful BI tools.
Read More
Unlocking the next level with most popular ETL orchestrator
Read More
Read More
Unlocking the next level with most popular ETL orchestrator
Read More
Put a comma in the right place
Read More
Read More
Put a comma in the right place
Read More
Objective criteria and subjective advice when choosing a data warehouse for analytics.
Read More
Read More
Objective criteria and subjective advice when choosing a data warehouse for analytics.
Read More
To get Datafold to integrate seamlessly with your data stack we need to have a quick onboarding call to get everything configured properly