Just to make it more explicit what we are doing (this isn't necessary), we're going to write out EXACTLY what the function we are minimizing is, so that you can see it is quadratic and so has a unique minimum.

First, dist2data is the distance from the function h to our 5 points (ie, the sum of the squares of the vertical distance h(x)-y at these 5 points).