Ever since completing two posts, sharing resources from the #CityChain17 blockchain conference, I’ve been ruminating on some fundamental questions for blockchain.
These are questions relevant to data and analytics leaders, especially if blockchain technologies are to fulfil their potential of moving beyond pilot stage.
Each deals with a different aspect of the key challenges (facing data and analytics leaders today) and asks, could blockchain help achieve this? My three questions cover: ways to amalgamate disparate data sources; performing meaningful analytics on new data structures; and facing into one of the key challenges presented by GDPR.
As I am no blockchain expert and robust thinking on this topic is still at a relatively early stage (just as most ‘use cases‘ are still pilots), so I have looked to others.
In this post, I share three other articles reflecting on the three questions for blockchain (that I pose below). Each demonstrates some real expertise and sensible thinking, at least about the work still needed. I hope you find them useful.
Q1: Could blockchain help address the problem of amalgamating the disparate data sources needed for analytics?
Here it is worth reflecting on both the similarities of blockchain and database technologies (as Gideon Greenspan shared here) and also what database theory has learned over recent decades of practice.
Despite not having the answer (yet), in his article, Dr Barry Devlin explains the potential of blockchain in the context of how the management of historical data has developed. Understanding the pros and cons of different approaches helps highlight both the potential and some of the remaining challenges for using blockchain technology here.
Dr Devlin’s experience since the early days of data warehouses means he is well worth hearing (to balance the enthusiasm of your blockchain developers and sales people): Historical Data: From Data Warehouse to Immutable Blockchain.
So, we might conclude that there is real potential here, but still unproven. I hope some businesses take upon that challenge, as the metadata benefits of blockchain timestamps, transparent provenance and immutability could be significant.
But, if blockchain solutions could replace some data warehouses, federated solutions or provide indexing – will analysts be able to use it?
Q2: How could analysts or data scientists interrogate data held in blockchains, to gain insights?
As I raised during a Q&A session at #CityChain17, there does appear to be much opportunity for collaboration between Data Science teams and blockchain developers. However, despite the understanding of the similarities between blockchains & databases, most use cases are focussed on operational systems, rather than solutions that analysts could interrogate to provide analysis or insights.
This must be a relevant topic for IT leaders at the moment, as I note Gartner has recently published new research on this question.
So, I was pleased to see this post, from Benedikt Koehler, on how to query the blockchain using R. It is interesting to see that he also references a paper using Graph Analysis, a technique I’m hearing mentioned more and more by analysts – so we will return to that in a future post.
Anyway, given the increasing use of R by analysts (and especially data scientists), I hope you find the coding examples and outputs shared by Benedikt to be helpful. Do let me know if you have an alternative proven approach to query blockchain data: Querying the Bitcoin blockchain with R.
Finally, we should return to the viability of blockchains holding your customer data in a post-GDPR world.
Q3: If blockchain is immutable (insert only), then how will it support GDPR requirements for ‘right to erasure’?
As we shared in an earlier post on the challenges presented by GDPR, one of them is the ‘right to be forgotten’ or ‘right to erasure‘. It was notable that despite all the useful information shared at #CityChain17, this challenge was never raised or answered. Indeed the intrinsic immutability of blockchains was lauded as one of the key benefits. But, if it can’t be changed, how can a business comply with customer’s request for erasure?
In this useful short article, Steven Farmer (legal counsel at Pillsbury Winthrop Shaw Pittman), outlines the problem and suggests a couple of possible mitigations. It will be interesting to see if such workarounds do reduce the risk of court action. They seem sensible within the constraints of current blockchain technology, but I’d prefer to see them proven as accepted by case-law first.
It's well worth reading Steven’s clear and simple explanations: Blockchain technologies and the EU 'right to be forgotten' – an insurmountable tension?
Any more questions for the blockchain?
I hope the above resources were worth sharing. Given the number of visitors to our recent blockchain posts, it seems this is a topic of real interest to our readers.
If you are a data or analytics leader, considering blockchain, what other questions for blockchain do you have?
Feel free to raise your questions in our comments section below and I will curate the best answers or even interview an expert for a future post.