<p>The speedup looks reasonable to me; I expect that most users will use 10-fold cross-validation, so the speedup isn't amazing but it certainly is a net improvement. Two last minor questions before I merge this:</p>
<ul>
<li>Did you try any other parallelization strategies? i.e. the for loops inside of SplitNode(), or something like that.</li>
<li>What's the overhead for the OpenMP-ized code using only one core (do you happen to know)? No need for extensive benchmarks; I just want to make sure that OpenMP isn't incurring huge overhead even in the singlethreaded case.</li>
</ul>
<p style="font-size:small;-webkit-text-size-adjust:none;color:#666;">—<br>Reply to this email directly or <a href="https://github.com/mlpack/mlpack/pull/438#issuecomment-103537525">view it on GitHub</a>.<img alt="" height="1" src="https://github.com/notifications/beacon/AJ4bFDaSrk2cQGE1Za6EfDxVZ04jqqNuks5oK0c7gaJpZM4Ea3Qy.gif" width="1" /></p>
<div itemscope itemtype="http://schema.org/EmailMessage">
<div itemprop="action" itemscope itemtype="http://schema.org/ViewAction">
<link itemprop="url" href="https://github.com/mlpack/mlpack/pull/438#issuecomment-103537525"></link>
<meta itemprop="name" content="View Pull Request"></meta>
</div>
<meta itemprop="description" content="View this Pull Request on GitHub"></meta>
</div>