* Specialize the op lowering logic for elementwise operations * Fix clang-format error. * Update tests for LSTM since LSTM uses element-wise ops Co-authored-by: Tian Jin <tjingrant@gmail.com>