Not only that, but the standard deviation of the residuals is getting smaller, too.
As you fit more variables, you are explaining more of the data. The model fits the data better, which means that the residuals are getting closer to the regression surface.
If you have one regressor, there might be observations that are far from the model. These "outliers" show up in the residual histogram as being far from the zero. Thus the histogram does not look bell-shaped. As you add more regressors, there are fewer outliers and the surface passes close to all the points. The histogram of residuals will be very bell-shaped and narrow (small standard deviation).