What we’re now doing is we can publish the schema, that is to say the shape of the data. We maybe publish synthetic datasets, that is to say it doesn’t really contain any personal info because they’re all fake. Given those random data, the data scientist can do a lot meaningful statistic algorithms, but still for the data operate your statistic agency to run and then to publish. Then, it could be comparable across jurisdictions.

Keyboard shortcuts

j previous speech k next speech