Bigger than Big Data:
Decentralized personal data on the Web

Ruben Verborgh

Big Data in Media workshop at the European Big Data Value Forum, 14 November 2018

Ghent University – imec

Big Data is a finite competition,
where one winner ultimately
harvests the most data.

Most of us have already lost.
But is harvesting really our game?

The Solid ecosystem enables people to use the apps they need, while
storing their data wherever they want.

People own their data, and share it
with the apps and people they choose.

People choose where they store
every single piece of data they produce.

They can grant apps and people access
to very specific parts of their data.
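
As a minimal sketch of such a grant, assuming Solid's Web Access Control vocabulary (acl:) and a hypothetical resource https://example.org/ruben/likes that is not part of the original slides, an access rule giving one person read-only access to one resource could look like this:

```turtle
PREFIX acl: <http://www.w3.org/ns/auth/acl#>

# Hypothetical rule: the resource URL and the rule name are illustrative only.
# It lets Bart read one specific resource, and nothing else.
<#bart-can-read-likes> a acl:Authorization;
  acl:agent    <https://example.org/bart/#me>;
  acl:accessTo <https://example.org/ruben/likes>;
  acl:mode     acl:Read.
```

Because the rule names a single resource and a single mode, every other resource stays private unless another rule says otherwise.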

Separating app and storage competition
drives permissionless innovation.

The traditional way of building apps
does not work well with decentralization.

Building apps over decentralized data
requires different app techniques.

Solid is an ecosystem of data and apps
that work seamlessly together.

Linked Data in the RDF model
solves crucial challenges for Solid.

Through URLs and RDF, every piece of data
can link to any other piece of data.

PREFIX as: <https://www.w3.org/ns/activitystreams#>
PREFIX xsd: <http://www.w3.org/2001/XMLSchema#>
<#ruben-likes-ebdvf2018> a as:Like;
  as:actor  <https://ruben.verborgh.org/profile/#me>;
  as:object <https://www.european-big-data-value-forum.eu/#this>;
  as:published "2018-11-14T14:00:00Z"^^xsd:dateTime.

Shapes (and hopefully soon semantics)
enable layered compatibility.

PREFIX as: <https://www.w3.org/ns/activitystreams#>
PREFIX xsd: <http://www.w3.org/2001/XMLSchema#>
<#ruben-likes-ebdvf2018> a as:Like;
  as:actor  <https://ruben.verborgh.org/profile/#me>;
  as:object <https://www.european-big-data-value-forum.eu/#this>;
  as:published "2018-11-14T14:00:00Z"^^xsd:dateTime.
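
One way to picture a shape for the data above is the following sketch in SHACL. The slides do not say which shape language is meant, so SHACL is an assumption here, and the name <#LikeShape> is hypothetical.

```turtle
PREFIX sh:  <http://www.w3.org/ns/shacl#>
PREFIX as:  <https://www.w3.org/ns/activitystreams#>
PREFIX xsd: <http://www.w3.org/2001/XMLSchema#>

# Hypothetical shape: the minimum an app might expect of every as:Like.
<#LikeShape> a sh:NodeShape;
  sh:targetClass as:Like;
  sh:property [
    # Every like needs at least one actor, identified by a URL.
    sh:path as:actor;
    sh:nodeKind sh:IRI;
    sh:minCount 1
  ], [
    # Every like needs at least one liked object, identified by a URL.
    sh:path as:object;
    sh:nodeKind sh:IRI;
    sh:minCount 1
  ], [
    # A publication time is optional, but must be an xsd:dateTime if present.
    sh:path as:published;
    sh:datatype xsd:dateTime;
    sh:maxCount 1
  ].
```

The shape is not closed, so data may carry extra properties that this app ignores and another app understands, which is one reading of layered compatibility.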

Different source data can be concatenated
(but let’s track provenance).

PREFIX as: <https://www.w3.org/ns/activitystreams#>
PREFIX xsd: <http://www.w3.org/2001/XMLSchema#>
<#ruben-likes-ebdvf2018> a as:Like;
  as:actor  <https://ruben.verborgh.org/profile/#me>;
  as:object <https://www.european-big-data-value-forum.eu/#this>;
  as:published "2018-11-14T14:00:00Z"^^xsd:dateTime.
<#bart-likes-ebdvf2018> a as:Like;
  as:actor  <https://example.org/bart/#me>;
  as:object <https://www.european-big-data-value-forum.eu/#this>;
  as:published "2018-11-14T14:05:00Z"^^xsd:dateTime.
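
A minimal sketch of tracking provenance after such a merge, assuming the W3C PROV vocabulary (prov:) and two hypothetical source documents on example.org that are not part of the original slides:

```turtle
PREFIX prov: <http://www.w3.org/ns/prov#>

# Hypothetical source documents; absolute URLs are used because relative ones
# such as <#ruben-likes-ebdvf2018> would resolve against the merged document.
<https://example.org/ruben/likes#ruben-likes-ebdvf2018>
  prov:wasDerivedFrom <https://example.org/ruben/likes>.

<https://example.org/bart/likes#bart-likes-ebdvf2018>
  prov:wasDerivedFrom <https://example.org/bart/likes>.
```

This records provenance per resource. Recording it per statement would need named graphs, which plain Turtle does not offer.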

Will we lose access to the customer data
we need for analytics?

Can we trust data that comes
from consumers?

What should we do if
we are not collecting data anymore?

There will not be less data.
There will be more.
