Learning to approximate a Bregman divergence

Siahkamari, Ali; Xia, Xide; Saligrama, Venkatesh; Castanon, David; Kulis, Brian

Learning to approximate a Bregman divergence

Files

NeurIPS-2020-learning-to-approximate-a-bregman-divergence-Paper.pdf(1.32 MB)

Published version

Date

2020

Authors

Siahkamari, Ali

Xia, Xide

Saligrama, Venkatesh

Castanon, David

Kulis, Brian

Version

Published version

URI

https://hdl.handle.net/2144/42993

Citation

Ali Siahkamari, Xide Xia, Venkatesh Saligrama, David Casta nón, Brian Kulis. 2020. "Learning to Approximate a Bregman Divergence." Advances in Neural Information Processing Systems. https://papers.nips.cc/paper/2020/hash/24bcb4d0caa4120575bb45c8a156b651-Abstract.html

Abstract

Bregman divergences generalize measures such as the squared Euclidean distance and the KL divergence, and arise throughout many areas of machine learning. In this paper, we focus on the problem of approximating an arbitrary Bregman divergence from supervision, and we provide a well-principled approach to analyzing such approximations. We develop a formulation and algorithm for learning arbitrary Bregman divergences based on approximating their underlying convex generating function via a piecewise linear function. We provide theoretical approximation bounds using our parameterization and show that the generalization error Op(m^-1/2) for metric learning using our framework matches the known generalization error in the strictly less general Mahalanobis metric learning setting. We further demonstrate empirically that our method performs well in comparison to existing metric learning methods, particularly for clustering and ranking problems.

Collections

BU Open Access Articles
ENG: Electrical and Computer Engineering: Scholarly Papers

Full item page