File:Plot of Loss Functions.png

Summary

Description
English: All machine learning algorithms use a loss, cost, utility or reward function to

encode the learning objective and oversee the learning process. This function that supervises learning is a frequently unrecognized hyperparameter that determines how incorrect outputs are penalized and can be tuned to improve performance. This paper shows that training speed and final accuracy of neural networks can significantly depend on the loss function used to train neural networks. In particular derivative values can be significantly different with different loss functions leading to significantly different performance after gradient descent based Backpropagation (BP) training. This paper explores the effect on performance of new loss functions that are more ”liberal” or ”strict” compared to the popular Cross-entropy loss in penalizing incorrect outputs. Eight new loss functions are proposed and a comparison of performance with different loss functions is presented. The new loss functions presented in this paper are shown to outperform Cross-entropy loss on computer vision and NLP benchmarks.

https://arxiv.org/abs/2303.09935
Date
Source Own work
Author Mathew Mithra Noel

Licensing

I, the copyright holder of this work, hereby publish it under the following license:
w:en:Creative Commons
attribution share alike
This file is licensed under the Creative Commons Attribution-Share Alike 4.0 International license.
You are free:
  • to share – to copy, distribute and transmit the work
  • to remix – to adapt the work
Under the following conditions:
  • attribution – You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
  • share alike – If you remix, transform, or build upon the material, you must distribute your contributions under the same or compatible license as the original.
Category:CC-BY-SA-4.0#Plot%20of%20Loss%20Functions.pngCategory:Self-published work
Category:Artificial neural networks Category:Loss functions Category:Cost functions Category:Objective functions Category:Machine learning
Category:Artificial neural networks Category:CC-BY-SA-4.0 Category:Cost functions Category:Loss functions Category:Machine learning Category:Objective functions Category:Self-published work