Rethinking Depthwise Separable Convolutions in PyTorch

July 19, 2022 — Written by Marc Päpper — ⏰ 6 min read

This is a follow-up to my previous post of Depthwise Separable Convolutions in PyTorch. This article is based on the nice CVPR paper titled “Rethinking Depthwise Separable Convolutions: How Intra-Kernel Correlations Lead to Improved MobileNets” by Haase and Amthor. Previously I took a look at depthwise separable convolutions which are a drop-in replacement for standard convolutions, but focused on computational and parameter-based efficiency. Basically, you can gain similar results with a lot less parameters and FLOPs, so they are used in MobileNet style architectures.

Read more →

DINO - Emerging properties in self-supervised vision transformers

March 13, 2022 — Written by Marc Päpper — ⏰ 8 min read

#machinelearning #deeplearning #paper #transformer #computervision

Today’s paper: Emerging properties in self-supervised vision transformers by Mathilde Caron et al. Let’s get the dinosaur out of the room: the name DINO refers to self-distillation with no labels. The self-distillation part refers to self-supervised learning in a student-teacher setup as is often seen for distillation. However, the catch is that in contrast to normal distillation setups where a previously trained teacher network is training a student network, here they work without labels and without pre-training the teacher.

Read more →

Rethinking Batch in BatchNorm

February 28, 2022 — Written by Marc Päpper — ⏰ 6 min read

#machinelearning #deeplearning #paper #batchnorm

Today’s paper: Rethinking ‘Batch’ in BatchNorm by Wu & Johnson BatchNorm is a critical building block in modern convolutional neural networks. Its unique property of operating on “batches” instead of individual samples introduces significantly different behaviors from most other operations in deep learning. As a result, it leads to many hidden caveats that can negatively impact model’s performance in subtle ways. This is a citation from the paper’s abstract and the emphasis is mine which caught my attention.

Read more →

Hyperparameter tuning on numerai data with PyTorch Lightning and weights & biases

December 05, 2021 — Written by Marc Päpper — ⏰ 10 min read

#python #pytorch #machinelearning #deeplearning #hyperparameters #lightning #wandb

To compare the previously described approach of hyperparameter tuning using fastai and wandb, today we’ll see how to tackle the same approach, but using PyTorch Lightning instead of fastai. The goal is to have an automated hyperparameter tuning pipeline running on the Numerai data set. What is Numerai? Numerai is a hedge fund which trades stocks in a market neutral fashion. That means that they try to make money without having a lot of risk for their customers.

Read more →

Hyperparameter tuning on numerai data with fastai and weights & biases

November 27, 2021 — Written by Marc Päpper — ⏰ 8 min read

#python #pytorch #machinelearning #deeplearning #hyperparameters #fastai #wandb

Today we will try to tackle the Numerai tournament using the fastai deep learning library. However, as the results likely depend on many different hyperparameters, let’s take advantage of the weights and biases library and their sweeps API. Sweeps are hyperparameter runs which test out different combinations of your model’s hyperparameters. What is Numerai? Numerai is a hedge fund which trades stocks in a market neutral fashion. That means that they try to make money without having a lot of risk for their customers.

Read more →

Rethinking Depthwise Separable Convolutions in PyTorch

DINO - Emerging properties in self-supervised vision transformers

Rethinking Batch in BatchNorm

Hyperparameter tuning on numerai data with PyTorch Lightning and weights & biases

Hyperparameter tuning on numerai data with fastai and weights & biases

I help you listen through the noise in machine learning: