Simple Polynomial Gradient Descent algorithm not working

Question

I am trying to implement a simple 2nd order polynomial gradient descent algorithm in Java. It is not converging and becomes unstable. How do I fix it?

public class PolyGradientDescent {
public static double getValue(double input) {
    return 3 * input * input - 4 * input + 3.5;
}
public static void fit() {
    double x0 = Math.random();
    double x1 = Math.random();
    double x2 = Math.random();
    double size = 15;
    double learningrate = 0.0001;
for(int i = 0; i &lt; 400; i++) {
    double partial_x2 = 0;
    double partial_x1 = 0;
    double partial_x0 = 0;
    for(double x = 0; x &lt; size+0.001; x++) {
        double xx = x * x;
        double y_predict = xx * x2 + x * x1 + x0;
        double delta = getValue(x) - y_predict;
        partial_x2 += xx * delta;
        partial_x1 += x * delta;
        partial_x0 += delta;
    }
    x0 = x0 + (2 / size) * partial_x0 * learningrate;
    x1 = x1 + (2 / size) * partial_x1 * learningrate;
    x2 = x2 + (2 / size) * partial_x2 * learningrate;
    System.out.println(x0 + &quot;\t&quot; + x1 + &quot;\t&quot; + x2 + &quot;\t&quot; + &quot;\t&quot; + partial_x2 + &quot;\t&quot; + partial_x1 + &quot;\t&quot; + partial_x0);
}
for(double x = 0; x &lt; size+0.001; x++) {
    System.out.println(&quot;Y: &quot; + getValue(x) + &quot;, Y_Predict: &quot; + (x2 * x * x + x1 * x + x0));
}

}
public static void main(String[] args) {
    fit();
}
}

score 1 · Accepted Answer · answered Jan 21 '22 at 01:13

I tested your code in python and it works just fine, when I decrease the learning rate (divided by 100) by a bit more (and the epochs multiplied by 100).

I also changed the way the derivative was calculated to make it more mathematically correct :)

import random
def getValue(x):
    return 3 * x * x - 4 * x + 3.5
def fit():
    x0 = random.randrange(-100, 101) / 100
    x1 = random.randrange(-100, 101) / 100
    x2 = random.randrange(-100, 101) / 100
    size = 15
    learningrate = 0.000001
for i in range(40000):
    partial_x2 = 0
    partial_x1 = 0
    partial_x0 = 0
    for x in range(16):
        xx = x * x
        y_predict = xx * x2 + x * x1 + x0
        delta = getValue(x) - y_predict

        # for the partial derivatives, I pulled the sign and the 2 into this step, and also devided the term later by -2, because this would be the true derivative
        partial_x2 -= 2 * xx * delta
        partial_x1 -= 2 * x * delta
        partial_x0 -= 2 * delta

    x0 = x0 - (1 / size) * partial_x0 * learningrate
    x1 = x1 - (1 / size) * partial_x1 * learningrate
    x2 = x2 - (1 / size) * partial_x2 * learningrate

for x in range(16):
    print(&quot;Y: &quot; + str(getValue(x)) + &quot;, Y_Predict: &quot; + str(x2 * x * x + x1 * x + x0))


fit()

Simple Polynomial Gradient Descent algorithm not working

1 Answers1