rw-book-cover

Metadata

Highlights

These days, as the volume of data collected by companies grows exponentially, we’re all realizing that more data is not always better (View Highlight)

In fact, more data, especially if you can’t rely on its quality, can hinder a company by slowing down decision-making or causing poor decisions. (View Highlight)

Seemingly basic metrics like “Active Listings” relied on a web of upstream dependencies. Conducting meaningful data work required significant institutional knowledge to overcome hidden caveats in our data (View Highlight)

While this made sense for our most critical data, pursuing such rigorous standards at scale presented challenges (View Highlight)

Data producers are the ones that need to be certified

Data producers were requesting a lighter-weight process that could provide some of the quality guardrails of Midas, but with less rigor and time investment (View Highlight)

Accuracy, Reliability (Timeliness), Stewardship, and Usability (View Highlight)

We could also weigh each dimension according to our perception of its importance in determining quality (View Highlight)

(View Highlight)

(View Highlight)

we had to surface data quality information directly in the venue where data users already discovered and explored data. (View Highlight)

Luckily, we had two existing tools that would make this much easier: Dataportal (Airbnb’s data catalog and exploration UI), and the Unified Metadata Service (UMS (View Highlight)