Performance analysis for TSVC

I found that I had missed some loops that Clang vectorizes but Flang does not.

There are three issues: one is related to integer overflow, and the other two are related to strided accesses. One of the commenters told me that some of these issues could be resolved by the polyhedral model, but I’m not sure whether that is feasible because I’m not familiar with it.
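
For readers unfamiliar with the terms, here is a minimal C sketch of a loop that combines both ingredients. It is a hypothetical kernel (the name `strided_scale` and its parameters are my own, not one of the TSVC loops), and it only illustrates what "strided access" and "integer overflow in the index" mean, not the exact cause of the Flang failures.

```c
/* Hypothetical kernel for illustration only; not taken from TSVC. */

/* Reads src with a non-unit stride: consecutive iterations touch
 * elements that are `stride` apart, so a vectorizer needs a gather
 * or a strided load instead of one contiguous vector load. */
void strided_scale(float *dst, const float *src, int n, int stride, float k)
{
    for (int i = 0; i < n; i++) {
        /* The index i * stride is computed in `int`. In C, signed
         * overflow is undefined behavior, so a compiler may assume
         * the product never wraps when it analyzes the access. */
        dst[i] = k * src[i * stride];
    }
}
```

Called with `n = 4` and `stride = 2`, the function reads `src[0]`, `src[2]`, `src[4]`, and `src[6]`.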

In addition, vectorization with masks didn’t seem to work when I rewrote explicit-shape arrays as deferred-shape arrays.
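
By "vectorization with masks" I mean loops whose body is guarded by a per-element condition, which a vectorizer typically turns into a mask over the vector lanes. A minimal C sketch of that loop shape follows. The function `conditional_add` is hypothetical and not a TSVC loop, and since deferred-shape arrays are a Fortran concept, the sketch only shows the conditional pattern, not the array-declaration difference.

```c
/* Hypothetical kernel for illustration only; not taken from TSVC. */

/* Updates a[i] only where b[i] is positive. To vectorize this, the
 * compiler evaluates the comparison for a whole vector of elements
 * and uses the result as a mask for the update (if-conversion). */
void conditional_add(float *a, const float *b, int n)
{
    for (int i = 0; i < n; i++) {
        if (b[i] > 0.0f) {
            a[i] += b[i];
        }
    }
}
```

Elements of `a` whose corresponding `b` value is zero or negative are left unchanged.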