This question was answered in a private message but I paste answer here as well;
In portrait mode we are using dual sensor hardware depth detection, to help with determin the depth data in the scene.
In beauty mode, we are using a single lens depth detection as we also apply the beautification filters.
In dual sensor hardware depth detection, we cannot apply a software zoom in the way it is being implemented.
Hope this answers your question.