Control over patient data: a decentralized perspective

Ruben Verborgh, Ghent Universityimec

AIMS Journal Club, 5 October 2020

Control over patient data: a decentralized perspective

Ruben Verborgh

Ghent University – imec

The history of computing
started out centralized.

Even when the Internet became popular,
data and CPU remained decentralized.

The software-as-a-service model brought
storage and processing to the cloud.

A history of (de-)centralization
is driven by more than just technology.

Decentralization was an assumption
when the Web was designed.

The Web’s novelty was universality.

The Web strives to be universal
through independence of many factors.

Our data has become centralized
in a handful of Web platforms.

In 2017, Tim Berners-Lee listed
three challenges for the Web.

All three indicate a loss of control/agency.

Within the walled gardens of social media,
you have to move either data or people.

© David Simonds

Tim Berners-Lee is spearheading Solid
as an ecosystem to take back control.

Solid aims to restore choice
by separating data from apps.

For every piece of data an actor produces, they can choose where to store it.

They can grant apps and people access
to very specific parts of their data.

Separating app and storage competition
stimulates sustainable innovation.

Solid is not a platform to replace others,
but a way of building for the Web.

A Solid server acts as a data pod
that stores and guards your data.

A data pod can contain any data
you create or need online.

Solid clients are browser or native apps
that read from or write to your data pod.

Any app you can envision,
you can build with Solid.

Client–server communication is governed
by the Solid specifications.

Several open-source implementations
of servers, apps, and libraries exist.

Interoperability challenges in Solid
are solved through Linked Data in RDF.

With JSON-LD, every piece of data
can link to any other piece of data.

{
  "@context":  "https://www.w3.org/ns/activitystreams",
  "id":        "#ruben-likes-ugent",
  "type":      "Like",
  "actor":     "https://ruben.verborgh.org/profile/#me",
  "object":    "https://www.ugent.be/#this",
  "published": "2019-04-25T08:00:00Z"
}

Data shapes and their semantics
enable layered compatibility.

{
  "@context":  "https://www.w3.org/ns/activitystreams",
  "id":        "#ruben-likes-ugent",
  "type":      "Like",
  "actor":     "https://ruben.verborgh.org/profile/#me",
  "object":    "https://www.ugent.be/#this",
  "published": "2019-04-25T08:00:00Z"
}

Different source data
can be concatenated.

{
  "@context":  "https://www.w3.org/ns/activitystreams",
  "@graph": [{
    "type":      "Like",
    "actor":     "https://ruben.verborgh.org/profile/#me",
    "object":    "https://www.ugent.be/#this",
    "published": "2019-04-25T08:00:00Z"
  },{
    "type":      "Like",
    "actor":     "https://example.org/people/silvia#me",
    "object":    "https://www.ugent.be/#this",
    "published": "2019-04-25T08:05:00Z"
  }]
}
©2014 tHeDiGiTaLdRoPoUt

The Paradox of Freedom:
you can only be free if you follow rules.

We need to identify those rules
we all need to agree on.

Lessons learned from aggregating hundreds of datasets
are highly useful to inform the discussion.

Decentralization needs replication
for realistic performance.

In addition to technological changes,
we need a shift of mindset.

Current networks are centered
around the aggregator.

We need to create network flows
to and from the aggregator.

The individual network nodes
need to become the source of truth.

Aggregators need to become part
of a larger network.

Aggregators serve as a crucial
but transparent layer in the network.

Aggregators’ main responsibility becomes
fostering a network between nodes.

The best way to predict
the future is to invent it.

Alan Kay

The best way to invent
the future is to predict it.

John Perry Barlow

Control over patient data: a decentralized perspective

@RubenVerborgh

ruben.verborgh.org