Introduction to Deep Learning with PyTorch
Chapter 4: PyTorch for Automatic Gradient Descent
Other Gradient Descent Optimisation Algorithms
Remember how we defined our optimiser?
learning_rate = 0.2
optimiser = torch.optim.SGD(params=list_parameters, lr=learning_rate)
PyTorch actually provides many optimisers beyond SGD.
Many of them converge faster or more reliably than plain SGD.
Among the most popular are Adam and RMSProp.
How these optimisers work internally is beyond the scope of this course.
Adam
optimiser = torch.optim.Adam(params=list_parameters)
RMSProp
optimiser = torch.optim.RMSprop(params=list_parameters)
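Note that we did not pass a learning rate to Adam or RMSprop: both come with default values (0.001 and 0.01, respectively), and you can still override them with the lr argument, just as we did for SGD.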
Exercise:
Try replacing SGD with these optimisers in your code (a sketch of the swap is shown below).
Do they produce better results?
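To make the exercise concrete, here is a minimal sketch of a training loop in which the optimiser can be swapped with a single line. The linear model, the synthetic data and the number of epochs are made up for illustration and stand in for whatever you built earlier in the course.

import torch

# Hypothetical stand-in for the model and data built earlier in the course
torch.manual_seed(0)
x = torch.randn(100, 3)
y = x @ torch.tensor([[2.0], [-1.0], [0.5]]) + 0.1 * torch.randn(100, 1)

model = torch.nn.Linear(3, 1)
loss_function = torch.nn.MSELoss()

# Swap the optimiser here; all three share the same interface
optimiser = torch.optim.Adam(params=model.parameters())
# optimiser = torch.optim.SGD(params=model.parameters(), lr=0.2)
# optimiser = torch.optim.RMSprop(params=model.parameters())

for epoch in range(200):
    optimiser.zero_grad()              # reset gradients from the previous step
    loss = loss_function(model(x), y)  # forward pass and loss
    loss.backward()                    # backward pass: compute gradients
    optimiser.step()                   # update the parameters

print(f"final loss: {loss.item():.4f}")

Because the optimisers differ only in how they use the gradients to update the parameters, the line that constructs the optimiser is the only one that changes between runs.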