It remains controversial, because according to our current privacy law, each collecting agency's governing ministry serves as its data protection agency.
Right, exactly. Then the outcome, when you calculate the average, the mean, whatever linear regression you want to run with it, is just not very useful. It is privacy-protecting, though. That’s the only case so far where we’ve used this standard.
If I am alone in a basic statistical area, the size of a county or a township, and I’m the only one earning above a certain amount of money, then my data is going to be removed from the data set.
To protect privacy sufficiently, they used k-anonymity, which is a crude way to anonymize this data. It requires that no one be distinguishable within a group of fewer than, say, 25 people or so.
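A minimal sketch of that kind of k-anonymity suppression, assuming a toy list of records and illustrative field names (the real CNS 29100 process is much more involved; a small k is used here just to keep the example short):

```python
from collections import Counter

def k_anonymize(records, quasi_keys, k=25):
    """Suppress any record whose combination of quasi-identifiers
    appears in fewer than k records, so nobody is distinguishable
    within a group smaller than k."""
    counts = Counter(tuple(r[key] for key in quasi_keys) for r in records)
    return [r for r in records
            if counts[tuple(r[key] for key in quasi_keys)] >= k]

# Toy data: with k=2, the single high-income record in area "B"
# is removed from the data set, just like the lone high earner
# in a basic statistical area.
records = [
    {"area": "A", "income_band": "mid"},
    {"area": "A", "income_band": "mid"},
    {"area": "B", "income_band": "high"},
]
print(k_anonymize(records, ["area", "income_band"], k=2))
```

This only does suppression; real de-identification pipelines also generalize values (e.g. widening income bands) to keep more of the data usable.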
In practice, it is actually very difficult. So far, the only case in Taiwan that has completed this CNS 29100 process is one that outlines personal income across all the different areas, so that you can see how the average resident’s income changes year after year. It’s obviously ...
That’s exactly right. After processing, if it is not personal data anymore, then it’s free to just hand it over to the open data platform or some other agency.
Of course, statistics can be sold and commercially used just like any other open data. The thing is that the court has ruled that the unit collecting the personal data must be the same one that processes it. It can’t turn it over to some other company ...
If it is just statistics, then the privacy law doesn’t govern it. Right?
Beyond that, what exactly is this "statistical use" when compared to raw personal data? There was a CNS standard, the one you just mentioned, CNS 29100, which says that after a certain de-identification process, we can use the results for statistical use, because the privacy impact is minimal, instead of ...
There’s no such clause in the Taiwan counterpart. Instead, we have crime prevention in the same position. This says something about the values that the legislators care about.
The law protects a few uses. Some are pretty common, such as the research use I mentioned; in many EU countries, there’s a special clause for historians to protect archival work or the interpretation of history.
Alternatively, it must be for the public good, but it may also be used in a statistical way, not as raw data.
That’s exactly right. The law is pretty clear: you can only use it for academic research purposes. In a lot of EU countries, the enactment is similar to Taiwan’s, saying that it must be used for the public good and for research.
The other contested point is the so-called uninformed use outside the original purpose of collection.
There are a few issues in the new privacy law for us. The old contested issue, when we had that debate, was on health and other sensitive information. There was a section in the privacy law that required much stricter data protection measures. Criteria for this ...
OK. We do have a data protection law here. It was largely modeled after the previous EU data privacy law, so it is inherently pretty compatible with the newer directive shaped by the Article 29 Working Party.
I’ll be recording this. Is this OK with you?
...is compatible with everything.
We do CC0 here.
Yup.
He likes and he retweets. Now I know this.
Everybody can Tweet at @rufuspollock.
You want a curve like a flipped power-law graph.
Got it.
The hard part is to tie this training budget to the cloud usage numbers. Currently in the Digital Nation Plan, these are completely different funds: one is for the enrichment of the community, and the other is for reducing licensing costs. We’re doing both, but not as the same project.
Then we want this many people empowered over four years. That’s part of the Digital Nation Plan. It’s in it already.
You see the lines of code that use Docker, for example. Then we say, "OK, it’s important to have a local Docker community that is able to maintain this kind of thing." Then we grade people into three, maybe four levels, like being able to install and use it, maybe ...
The other easy part is to get the local talent and everybody trained in specific technologies...we count them as common things in the stacks, like TensorFlow, OpenStack, maybe Docker or something. Then we say, "OK, so for these critical parts of the infrastructure, since all the procurements..."
In any case, what I’m saying is that we are, as part of the Digital Nation Plan, developing this automated assessment tool, so we can get some useful numbers out of it: a breakdown of all the licenses, line counts, and so on.
Saying "open software" doesn’t cover the whole of it, the non-software parts.
I’m aware of that. I’m just saying there’s a non-code part, too.
Yeah. Technically it’s "free culture license" if we are talking in a CC way.
Well, even if it’s CC‑ND, we want to know its license.
There’s a part in the Digital Nation Plan that develops automated tools to look at all the source code and binaries that a bidding vendor submits, and then try to figure out, first, how much of it is open-licensed, how much of it is Creative Commons licensed, which may ...
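As a rough illustration of what such an automated assessment could look like, here is a sketch that walks a submitted source tree, looks for SPDX license identifiers (a real, widely used machine-readable tagging convention), and reports a per-license line count. The function name and the per-file counting approach are assumptions for illustration, not the actual tool:

```python
import os
import re
from collections import Counter

# Short-form SPDX tags, e.g. "SPDX-License-Identifier: MIT", are a
# common machine-readable way to declare a source file's license.
SPDX_RE = re.compile(r"SPDX-License-Identifier:\s*([\w.+-]+)")

def license_breakdown(root):
    """Walk a source tree; attribute each file's line count to the
    license declared in its SPDX tag, or 'unknown' if none found."""
    lines_per_license = Counter()
    for dirpath, _, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            try:
                with open(path, encoding="utf-8", errors="ignore") as f:
                    text = f.read()
            except OSError:
                continue  # skip unreadable files
            match = SPDX_RE.search(text)
            license_id = match.group(1) if match else "unknown"
            lines_per_license[license_id] += text.count("\n")
    return lines_per_license
```

Note this only covers source text; the binaries mentioned above would need separate tooling, such as binary fingerprinting against known open-source packages.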
We can measure that for cloud procurements.
Even if we don’t manage to convince vendors to convert the entire lower-stack binaries to open software, you can still inspect pretty much everything in a running system to figure out how it’s working.
As you are probably aware, if they use Oracle and the Oracle Access Manager’s stored procedures, a lot of the procedures are plain text, not binaries.
We do it not for data localization purposes, but really for know‑how localization, so that people can locally inspect what’s going on.
That actually rules out pretty much everybody who relies on this sticky lock-in. Even if they say it’s proprietary software, it still has to run on the data center here.
However, the cloud part is my main target of this procurement change. First, it’s software-as-a-service, but we’re now insisting it run on local infrastructure.
We haven’t solved that.
Yes. That means the new vendor must still know Oracle to win the bid. Even if they get a whole open-source, open-data system, they must know Oracle.
It just means you can swap out the top-layer application vendors.
It’s true. We don’t have a good story here.
Then it ends up paying a lot for Oracle licenses, which is your classic example.
You can say, "You must use PostgreSQL," but you must know you want PostgreSQL going in. For many government agencies, this is simply out of their consideration, so they just say, "OK, the web application source code must be open," or something.
We don’t have a good story for the latter; this is what all of us are painfully aware of.
This is where cloud procurement and spec (or agile) procurement differ.
There are 14 days left. We can’t really do anything before then. I’m sorry that it still says OpenAPI, but it will say "common API standards" at some point.
Right, and then we put it up for 60 days of public consultation. I think it’s drawing to a close now. Let’s look at the actual comments. It’s very tricky.