[mlpack-git] [mlpack] DET has only one leaf prior to pruning every time (#515)

Ryan Curtin notifications at github.com
Wed Feb 10 15:27:16 EST 2016


Diagnosis: the log negative error of a DET is defined as

R(t) = log(|t|^2 / (N^2 V_t)).

At the first level of this tree, the volume of the node is the entire volume spanned by the data.  i.e. V = the width of every dimension multiplied together.  But some dimensions have width 0 in this dataset, so, V = 0 and R(t) = inf.

I don't yet know how I want to handle this problem for the mlpack code; I need to review the paper and maybe send Pari an email or something depending on what I can come up with.

A quick solution is to add tiny bits of noise to your data points, or to drop any dimensions that have zero range (i.e. where all of the rows have 0 in that dimension).

I'll keep digging and let you know what I think of.

---
Reply to this email directly or view it on GitHub:
https://github.com/mlpack/mlpack/issues/515#issuecomment-182569695
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mailman.cc.gatech.edu/pipermail/mlpack-git/attachments/20160210/676b2e65/attachment.html>


More information about the mlpack-git mailing list