eremo

Multivariate Normal Distributions – III – Variance-Covariance Matrix and a distance measure for vectors of non-degenerate distributions

by eremo
24 Jul 202424 Jul 2024
Multivariate Normal Distributions

In previous posts of this series I have motivated the functional form of the probability density of a so called “non-degenerate Multivariate Normal Distribution“. In this post we will have a closer look at the matrix Σ that controls the probability density function [pdf] of such a distribution. We will show that it actually is the covariance matrix of the… Read More »Multivariate Normal Distributions – III – Variance-Covariance Matrix and a distance measure for vectors of non-degenerate distributions

Multivariate Normal Distributions – II – Linear transformation of a random vector with independent standardized normal components

by eremo
21 Jul 202424 Jul 2024
Multivariate Normal Distributions

In Machine Learning we typically deal with huge, but finite vector distributions defined in the ℝn. At least in certain regions of the ℝn these distributions may approximate an underlying continuous distribution. In the first post of this series we worked with a special type of continuous vector distribution based on independent 1-dimensional standardized normal distributions for the vector components.… Read More »Multivariate Normal Distributions – II – Linear transformation of a random vector with independent standardized normal components

Multivariate Normal Distributions – I – Basics and a random vector of independent Gaussians

by eremo
19 Jul 202424 Jul 2024
Multivariate Normal Distributions

This post series is about mathematical aspects of so called “Multivariate Normal Distributions“. In the literature two abbreviations are common: MNDs or MVNs. I will use both synonymously. To get an easy access I want to introduce MNDs as the result of a linear transformations applied to random vectors whose components can be described by independent 1-dimensional normal distributions. Afterward… Read More »Multivariate Normal Distributions – I – Basics and a random vector of independent Gaussians

Short ResNet training on CIFAR10 over 21 epochs

AdamW for a ResNet56v2 – V – weight decay and cosine shaped schedule of the learning rate

by eremo
14 Jul 202415 Jul 2024
ResNets

In this post series we try to find methods to reduce the number of epochs for the training of ResNets on image datasets. Our test case is CIFAR10. In this post we will test a modified cosine shaped schedule for a systematic and fast reduction of the learning rate LR. This supplements the approaches described in previous posts of this… Read More »AdamW for a ResNet56v2 – V – weight decay and cosine shaped schedule of the learning rate