<blockquote>
<p>this will happen very early on: I don't think there are many datasets that have 1M+ lines of valid numbers then suddenly a "hello".</p>
</blockquote>

<p>I guess you are right. OK, I will implement this strategy. Until I finish a faster, more memory-efficient version, could we treat this pull request as a temporary solution?</p>
