Aarch64 backend optimization

Hi All,
I am trying to optimize the Aarch64 backend to produce more efficient code

What possible optimizations are still pending that would make Aarch64 compiler much better than gcc?

cheers,

Manjunath DN