Big data technologies enable businesses to quickly collect massive volumes and varieties of data, but because these technologies don’t associate metadata with files, businesses have difficulty determining what data they have.
This wasn’t a problem historically, because data governance was built into the design of data stores, but in a big data environment the missing metadata makes governance difficult at best. CapTech has developed a strategy that addresses this problem. One method within that strategy uses application programming interfaces (APIs) to associate metadata with files as they are brought into a big data platform. In addition to providing metadata, these APIs deliver a level of lineage, or traceability, and support data security and data quality, which are key components of data governance.
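To make the idea of attaching metadata at ingestion concrete, here is a minimal sketch in Python. It is not CapTech's API, which this summary does not describe. The function name `ingest_with_metadata`, the metadata fields, and the choice of a JSON sidecar file are all assumptions made for illustration; a real platform would more likely write to a metadata catalog.

```python
import hashlib
import json
import shutil
from datetime import datetime, timezone
from pathlib import Path


def ingest_with_metadata(source: Path, landing_dir: Path,
                         owner: str, classification: str) -> dict:
    """Copy a file into a landing directory and write a sidecar metadata record.

    Hypothetical helper: the field names below are illustrative assumptions,
    not a standard or a vendor schema.
    """
    landing_dir.mkdir(parents=True, exist_ok=True)
    target = landing_dir / source.name
    shutil.copy2(source, target)

    data = target.read_bytes()
    metadata = {
        "file_name": target.name,
        # Lineage: where the file came from and when it arrived.
        "source_path": str(source.resolve()),
        "ingested_at": datetime.now(timezone.utc).isoformat(),
        # Quality: size and checksum allow later integrity checks.
        "size_bytes": len(data),
        "sha256": hashlib.sha256(data).hexdigest(),
        # Security: ownership and a sensitivity label for access decisions.
        "owner": owner,
        "classification": classification,
    }

    # Store the record next to the data file, e.g. orders.csv.meta.json.
    sidecar = target.with_name(target.name + ".meta.json")
    sidecar.write_text(json.dumps(metadata, indent=2))
    return metadata
```

Calling `ingest_with_metadata(Path("orders.csv"), Path("landing"), "sales", "internal")` would copy the file and leave a `landing/orders.csv.meta.json` record beside it, so a later reader can tell what the file is, where it came from, and who is responsible for it.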
Click here to download the full white paper, "A Strategy for Establishing Data Governance in the Big Data World."