If you are right that the stack size can fail to be divisible by 4, then this is a disaster. It would mean that an int pushed onto the stack later will not be aligned on a 4-byte boundary. The Intel architecture permits misaligned accesses, but they can slow you down considerably. So, are you confused, or is this really being done?
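To make the concern concrete, here is a minimal sketch in C of the arithmetic involved. The helper names `is_aligned` and `push_int_addr` are hypothetical, and the sketch assumes a 4-byte int and a stack that grows downward, as on 32-bit Intel.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: returns 1 if addr is a multiple of alignment,
   0 otherwise. alignment must be a nonzero power of two. */
static int is_aligned(uintptr_t addr, size_t alignment)
{
    assert(alignment != 0 && (alignment & (alignment - 1)) == 0);
    return (addr & (alignment - 1)) == 0;
}

/* Hypothetical helper: the address a 4-byte int would occupy after
   being pushed, assuming the stack grows downward from sp. */
static uintptr_t push_int_addr(uintptr_t sp)
{
    return sp - 4;
}
```

If the stack pointer starts on a 4-byte boundary and every adjustment is a multiple of 4, `is_aligned(push_int_addr(sp), 4)` stays true. A single adjustment that is not a multiple of 4 makes it false for every later push until something corrects it.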