Loss Function Theory 101
David Wölfle
Well, you probably know that mean squared error is a good default choice for training machine learning models on regression tasks while cross entropy is commonly recommended for classification. (No worries if not, you will be able to understand the talk nevertheless.) But, do you also know why this is the case? Why these loss functions can be used for machine learning at all, and when you should consider an alternative? No? Great! This talk is for you.
The talk begins with a recap on the definition of loss functions in the context of gradient descent, followed by a short introduction to maximum likelihood estimation using a practical example. As the talk proceeds, it is shown how to derive mean squared error and cross correlation from Gaussian- and multinoulli distributions, which allows us to identify potential limitations of the two loss functions. The remaining time is used to relate these findings into the broader context of machine learning, with especial regard to neuronal networks.
The talk should be useful for those getting started with machine learning, as well as for practitioners that desire a theory recap. After the talk you should have a general intuition what actually happens behind the scenes while you train your machine learning model with mean squared error or cross correlation.
David Wölfle
Affiliation: FZI Research Center for Information Technology
David Wölfle studied mechanical engineering in Karlsruhe. During his master studies he focused on wind energy technology and researched data sources for wind resource estimation. After graduating as Master of Science from Flensburg University of applied Applied Sciences in 2015, David worked as a R&D scientist at EWC Weather Consult GmbH (now UBIMET GmbH), where he designed and implemented software components for the estimation and predication of renewable energy power production. In 2016 David has been promoted to a team manager at EWC Weather Consult where he was responsible for the software engineering within the product development and as well as the design and execution of the project management. Besides these duties, he also developed innovative methods for estimating power production of airborne wind energy converters, using high-resolution meteorological data and machine learning methods.
Since early 2018 David works as a research scientist at FZI Research Center for Information Technology in the field of smart energy. His research focuses thereby on self-learning energy management systems using reinforcement learning techniques.
Besides these duties, David likes to give talks about renewable energy and machine learning topics, e.g. at PyConDE 2018: https://www.youtube.com/watch?v=2KfyvrEn8p8