<p>Actually, the errors are only getting closer: they now differ by a factor of about 10, down from about 1000. I checked again, and the paper clearly states that the sigmoid function is used in both layers, so I don't know what to make of this.</p>

<p style="font-size:small;-webkit-text-size-adjust:none;color:#666;"><a href="https://github.com/mlpack/mlpack/issues/414#issuecomment-212316249">View it on GitHub</a></p>