Machine Learning @ anracon – ML topics and related math

PyTorch / datasets / dataloader / data transfer to GPU – III – prepared tensor datasets and preloading to GPU

by eremo
26 Mar 202527 Mar 2025
PyTorch datasets

In this post series we have a look at PyTorch dataloaders and Torchvision image datasets (downloaded via PyTorch modules). PyTorch DataLoaders retrieve batches of dataset elements and transfer them to neural networks [NN] on a computation device – e.g. a CUDA driven graphics card. A central dataset functions applies defined transformation operations to its elements. We analyze the impact of… Read More »PyTorch / datasets / dataloader / data transfer to GPU – III – prepared tensor datasets and preloading to GPU

Runtime vs.number of dataloader workers and batch size

PyTorch / datasets / dataloader / data transfer to GPU – II – dataloader too slow on CPU?

by eremo
17 Mar 202527 Mar 2025
PyTorch datasets

Editorial hint: This post has been revised and changed in parts on March, 18th/19th/22nd, 2025, after some new tests and insights. The changes did not concern the result data of the performed experiments, but their interpretation. In the last post of this mini-series we saw that some Torchvision datasets have a directly accessible property “data“. It contains image data in… Read More »PyTorch / datasets / dataloader / data transfer to GPU – II – dataloader too slow on CPU?

Examples images from the FashionMnist dataset

PyTorch / datasets / dataloader / data transfer to GPU – I – properties of some torchvision datasets

by eremo
13 Mar 202527 Mar 2025
PyTorch datasets

For an old fan of Tensorflow2 it is somewhat satisfactory to notice that some problems also exist in analogous form in a PyTorch environment. Anyone who has worked with visual data knows that one needs to modify, augment and transform the image data and then load them from some storage under CPU control to the GPU’s VRAM before or during… Read More »PyTorch / datasets / dataloader / data transfer to GPU – I – properties of some torchvision datasets

Two CUDA/cudnn versions with Pytorch and Tensorflow in one virtual Python environment

by eremo
4 Mar 202528 Mar 2025
General ML-topics, Nvidia / CUDA

One of the problems I recently ran into was the coexistence of Tensorflow2 [TF2] and PyTorch in the very same virtual Python environment. I just wanted to make experiments to compare the performance of some Keras-based models with the TF2-backend on one side and, on the other side, with the PyTorch-backend. My trouble resulted from a mismatch of two CUDA/cudnn… Read More »Two CUDA/cudnn versions with Pytorch and Tensorflow in one virtual Python environment