Rare geometries: revealing rare categories via dimension-driven statistics

2019-01-29

Henry Kvinge, Elin Farnell

arXiv_CV

arXiv_CV Detection

Abstract
Abstract (translated by Google)
URL
PDF

Abstract

In many situations, the classes of data points of primary interest also happen to be those that are least numerous. A well-known example is detection of fraudulent transactions among the collection of all transactions, the majority of which are legitimate. These types of problems fall under the label of `rare category detection’. One challenging aspect of these problems is that a rare class may not be easily separable from the majority class (at least in terms of available features). Statistics related to the geometry of the rare class (such as its intrinsic dimension) can be significantly different from those for the majority class, reflecting the different dynamics driving variation in the different classes. In this paper we present a new supervised learning algorithm that uses a dimension-driven statistic, called the $\kappa$-profile, to classify unlabeled points as likely to belong to a rare class.

Abstract (translated by Google)

URL

http://arxiv.org/abs/1901.10585

PDF

http://arxiv.org/pdf/1901.10585

Rare geometries: revealing rare categories via dimension-driven statistics

Abstract

Abstract (translated by Google)

URL

PDF

Similar Posts

Comments