DINO as a von Mises-Fisher mixture model

Govindarajan, Hariprasath; Sidén, Per; Roll, Jacob; Lindsten, Fredrik

Computer Science > Machine Learning

arXiv:2405.10939 (cs)

[Submitted on 17 May 2024]

Title:DINO as a von Mises-Fisher mixture model

Authors:Hariprasath Govindarajan, Per Sidén, Jacob Roll, Fredrik Lindsten

View PDF HTML (experimental)

Abstract:Self-distillation methods using Siamese networks are popular for self-supervised pre-training. DINO is one such method based on a cross-entropy loss between $K$-dimensional probability vectors, obtained by applying a softmax function to the dot product between representations and learnt prototypes. Given the fact that the learned representations are $L^2$-normalized, we show that DINO and its derivatives, such as iBOT, can be interpreted as a mixture model of von Mises-Fisher components. With this interpretation, DINO assumes equal precision for all components when the prototypes are also $L^2$-normalized. Using this insight we propose DINO-vMF, that adds appropriate normalization constants when computing the cluster assignment probabilities. Unlike DINO, DINO-vMF is stable also for the larger ViT-Base model with unnormalized prototypes. We show that the added flexibility of the mixture model is beneficial in terms of better image representations. The DINO-vMF pre-trained model consistently performs better than DINO on a range of downstream tasks. We obtain similar improvements for iBOT-vMF vs iBOT and thereby show the relevance of our proposed modification also for other methods derived from DINO.

Comments:	Accepted to ICLR 2023
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2405.10939 [cs.LG]
	(or arXiv:2405.10939v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2405.10939

Submission history

From: Hariprasath Govindarajan [view email]
[v1] Fri, 17 May 2024 17:49:45 UTC (232 KB)

Computer Science > Machine Learning

Title:DINO as a von Mises-Fisher mixture model

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:DINO as a von Mises-Fisher mixture model

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators