Add reference implementation for bn_[mul|sqr]_mont, new candidates for