Optimizer Practice

Colab Setup

1
%config InlineBackend.figure_format='retina'

Sets the output resolution for matplotlib and similar to retina.

Noise

1
n_data = 10000
2
x_numpy = -3+6*np.random.rand(n_data,1)
3
# y_numpy = np.exp(-(x_numpy**2))*np.cos(10*x_numpy)
4
y_numpy = np.exp(-(x_numpy**2))*np.cos(10*x_numpy) + 3e-2*np.random.randn(n_data,1)
5
plt.figure(figsize=(8,5))
6
plt.plot(x_numpy,y_numpy,'r.',ms=2)
7
plt.show()
8
x_torch = torch.Tensor(x_numpy).to(device)
9
y_torch = torch.Tensor(y_numpy).to(device)
10
print ("Done.")

![](/assets/images/Optimizer 실습/f9fa06fa-733a-458b-8c5c-4cf9aba778bb-image.png) The graph above shows the originally intended function. Let’s add noise to it. Noise is implemented by multiplying np.random.randn() by a small real value of 3e-2, as shown in the code above. ![](/assets/images/Optimizer 실습/95e2a7f4-1755-447c-929d-e3bdb570a047-image.png)

Optimizer Comparison

![](/assets/images/Optimizer 실습/ec72c958-f2cd-45c4-a3d6-f6065c0fb9b3-image.png) ![](/assets/images/Optimizer 실습/851dd821-a73e-4f8d-aede-9a0960ae7c9c-image.png) ![](/assets/images/Optimizer 실습/fce0de37-8504-4b73-936a-7d5a347684f3-image.png)

Graphs showing how well each optimizer’s model approximates the function at epochs 500, 3500, and 9999. GT is the target function to approximate.

Adam was already approximating from the start. Very fast. SGD and Momentum look similar.