The same main reason Intel and AMD lowered processor core-voltage as years went by - Less heat dissipation for the same workload.
Most people have opposite problem to yours, they don't understand why over-volting worked.
---
Die size shrink required proportional drop in voltage to avoid reaching voltage break down threshold for the process technology.
No, depending on the designs, for some more volts simply don't work for higher clock speed.
under-volting created less heat allowing for higher clocks, why did over-volting also worked for higher overclocking?
---
Don't rely upon belief, try education and intelligence instead.
Air has nominal dielectric rating of 10KV per inch, the voltage break down value is much lower for smaller air-gap distance. The same applied to any material including all typical semi-conductors.
---
No, not at all. Increasing voltage which is increasing logic threshold level - AKA "increasing the signal to noise ratio" but it also increased the time between valid logic levels... eg, negating signal to noise advantage from the extra time taken between rail to rail voltage.
---
"Electron Migration" is about failure and reliability, not about achieving higher clock speeds. Lower and higher "Electron Migration" rates will not make clock speed lower or higher, only how fast it would have failed.
---
Go ahead, answer my very first question directly to you and the rest take care of itself.
"under-volting created less heat allowing for higher clocks, why did over-volting also worked for higher overclocking?
"
There're more than one reason, you can give me just one of them, it would be suffice.