As we continue to add exponential amount of unstructured data from content creation and elsewhere (audio, image, video, 3D VR and AR etc.) there will soon to manage and utilize these unstructured data in a scalable way.

Unstructured data is information that does not fit into a predefined data model or schema, or in essence, does not fit into the traditional database (I mean the usage of the word ‘traditional’ is on a relative basis, given how fast industries are innovating today versus decades ago given the proliferation of information, global market access, low cost of experimenting and validating your ideas with low-code/no-code and the likes and more).

Typically text-heavy, such as form responses and social media conversations, unstructured data also encompasses images, video, and audio. Industry-specific file types such as VCF (genomics), KDF (semiconductors), or HDF5 (aeronautics) are included in this category

The new iOS 15 that came out earlier this Monday (9/20/2021) actually have the ability to read UNSTRUCTURED data INSIDE an image and video – in essence, you can copy and paste a text INSIDE an image or video. This (major) software update got me thinking on how data warehousing is going to be done in the future.

Snowflake, one of the leading data warehousing enterprise, just launched unstructured data Support in Sept 2021, 2 days before Apple announced their iOS 15, which also offers unstructured data read.

https://docs.snowflake.com/en/user-guide/unstructured-intro.html

There are several practical use cases for unstructured data analyses and subsequently, data warehousing of these data including comprehending data inside photo and videos and more (for example, information on a PDF/Docusign contract, photo of a drug subscription) or even run analyses on the actual content of a YouTube or Vimeo video and linking in advertisement data related to an series of items that consumers see inside the video to facilitate an even more tight-knit attribution approach to advertisement strategy.

I think as the market for content creators (not just in the traditional YouTube or TikTok content creator sense, but more in general, including niche communities like the Finance or Biotech Substack or Twitter Newsletter or Medium, continues to grow, there will inevitable be more unstructured data that will be recorded, managed, reconciled and utilized.

Leave a comment