Tuesday, February 28, 2017

Is deep learning about what you train on?

In simple terms, deep learning is about using a series of artificial neural networks in multiple layers to form representations of data. The math behind it is a bit more complex, but it is more or less about adjusting the weights and biases linking the various neurons in the network so that for a given input, the calculated result gives a certain output.

Usually, we use a set of random initial values for the weights and biases, and through training (basically, feeding in many many sets of inputs, comparing the outputs of the network with the expected outputs, and adjusting the weights and biases if the two are not the same), arrive at a network that has a set of weights and biases that can more or less given us an expected output when we feed it an input.

The training data used to train the network is usually a very large set of inputs and their corresponding outputs (aka correct answer). And most of the time, there is no order to this set of data; it is like a huge random collection with no fixed order.

But if you think about it, if these artificial neural networks are trying to simulate our brains, with each layer learning an abstraction of the data, then maybe we should be training these networks in a similar way to how we train our brains.

What I am suggesting is this: start off with a set of simple data (simple pictures that you find in children's books, simple sentences or words, etc.) and allow the network to make big adjustments to the weights and biases as it trains on this set of data. This is like how children learn basic concepts with easily influenced minds.

Next is to slower increase the complexity of the training data, while reducing the amount of changes (learning rate) that can be made to the weights and biases. Eventually, we should be training the network with real-world data. The whole idea is to train the network as we would train our own brains, in increasing level of complexity.

The tedious part here is to segregate existing training data into differing levels of complexity. It may be easier to just create new datasets, but datasets are usually huge and therefore time-consuming to generate anew. But with Amazon Mechanical Turk, it may not be that difficult after all. Maybe some researcher somewhere can pick up on this idea. Hopefully, I myself will be able to work on this idea to test it out (but first, I need to get a better computer, hopefully one with TITAN X).

Wednesday, February 15, 2017

Promoting my blogs

Ever since I started my blogs in 2006, I haven't really been actively promoting them. Instead, my belief was that if my readers find the contents useful, they will share them with others.

But I also started blogging so that I can share my thoughts and experiences with a wider audience. Which also means that I am not meeting my aim if useful contents are not reaching out to people.

So I have decided that I will be a bit more proactive in promoting what I have written (or will write) in my blogs, so that more can benefit from these thoughts and experiences. Of course, I will try to be as selective as I can, so as to avoid spamming people. We all know that it is a thin line that separates advertising from spam.

Saturday, February 11, 2017

On the reopening of the case of Pedra Branca's sovereignty

Reblogged from my Facebook post:

I just went through the application documents on ICJ site, the three documents submitted are not likely to be significant... in fact, I think it is quite a joke.

Document 1: The reference to Horsburgh Light is part of an entire string of references to denote the proposed international passage corridor through the Singapore Strait. From one end to the other. And Horsburgh Light is mentioned because it is used to denote the South China Sea end of the strait. It is hard to see how this links to thinking about sovereignty, but I guess not everyone can understand the Queen's English which was used in the telegram. (sarcasm fully intended)

Document 2: Quite a joke. The entire document does not give specific latitude/longitude position of the ship in question. "Near Horsburgh Light" can mean different things to different people. Near can be 1m, 100m, 1km, 1nm, 5nm, and in any of the 360 degrees of the compass. And all who ply that part of the sea knows the distance between Horsburgh Light to the Malaysian coast is not that far in the first place. Exactly where was Labuan Haji?

Document 3: Seems to be a document on curfews and no fishing zones to protect mainland Singapore from infiltration from Indonesian side. This is during the period when we were seeing Communist activities, and just prior to Konfrontasi. I don't think security forces on Singapore were worried about Horsburgh Light being infiltrated as much as the entire coastline that we have facing south. By the way, the map does not show Pedra Branca in the first place. Maybe it was in another part of the document... we won't know unless we have the whole document in full, instead of just a single page that shows the main island of Singapore.

Thursday, February 02, 2017

Backing up decisions with facts

Since Trump has taken office, an amazing number of presidential orders have been issued within a record period of time.

It is great to have a person who can make decisions and act on them, but one must also remember that any decision made by a person in office needs to be backed up with facts and supporting data, usually obtained through thorough research and background checks.

Orders need to be both legal and lawful. Legal means they are issued in accordance with the existing legal framework. Lawful means they are in accordance with existing laws, rules, and regulations, as well as the gist behind those laws, rules, and regulations. This is because not everything about how a law came into being makes it into the text, but every law in place has something that it was meant to protect, some of it explicitly stated, some implicitly implied, and some simply left to the good judgment of people executing the laws.

So an order to "kill Person A" can be legal, if it was drafted and approved by the correct people within the legal framework. But it is not lawful, if the law says it is illegal to kill someone.

At the amazing rate orders are being churned out, my only (and biggest) worry is whether enough research and background checks have been done to gather the facts and supporting data that led to the decision. Because while the orders may be legal, insufficient supporting information to back up those decisions may well render them unlawful.