Monday, April 14, 2008

MySQL Performance - eliminating ORDER BY function

MySQL Performance - eliminating ORDER BY function

Posted By peter On October 17, 2007 @ 5:24 am In optimizer | 5 Comments

One of the first rules you would learn about MySQL Performance Optimization is to avoid using functions when comparing constants or order by. Ie use indexed_col=N is good. function(indexed_col)=N is bad because MySQL Typically will be unable to use index on the column even if function is very simple such as arithmetic operation. Same can apply to order by, if you would like that to use the index for sorting. There are however some interesting exception.

Compare those two queries for example. If you look only at ORDER BY clause you would see first query which sorts by function is able to avoid order by while second which uses direct column value needs to do the filesort:

SQL:
  1. mysql> EXPLAIN SELECT * FROM tst WHERE i=5 AND date(d)=date(now()) ORDER BY date(d) \G
  2. *************************** 1. row ***************************
  3. id: 1
  4. select_type: SIMPLE
  5. TABLE: tst
  6. type: ref
  7. possible_keys: i
  8. KEY: i
  9. key_len: 5
  10. ref: const
  11. rows: 10
  12. Extra: USING WHERE
  13. 1 row IN SET (0.00 sec)
  14. mysql> EXPLAIN SELECT * FROM tst WHERE i=5 AND date(d)=date(now()) ORDER BY d \G
  15. *************************** 1. row ***************************
  16. id: 1
  17. select_type: SIMPLE
  18. TABLE: tst
  19. type: ref
  20. possible_keys: i
  21. KEY: i
  22. key_len: 5
  23. ref: const
  24. rows: 10
  25. Extra: USING WHERE; USING filesort
  26. 1 row IN SET (0.00 sec)

If you take a closer look to WHERE clause you will find the reason - date(d) equals to date(now()) which is constant which means we're sorting by constant and so sort phase can be skipped all together.

Note in this case MySQL Optimizer is rather smart and is able to do this even if we have function in ORDER BY and exactly the same function is equals to constant by WHERE clause. If course it works for direct constants as well.

However if functions are different MySQL is not able to do this optimization even in cases when this would be possible:

SQL:
  1. mysql> EXPLAIN SELECT * FROM tst WHERE i=5 AND date(d)=date(now()) ORDER BY unix_timestamp(date(d)) \G
  2. *************************** 1. row ***************************
  3. id: 1
  4. select_type: SIMPLE
  5. TABLE: tst
  6. type: ref
  7. possible_keys: i
  8. KEY: i
  9. key_len: 5
  10. ref: const
  11. rows: 10
  12. Extra: USING WHERE; USING filesort
  13. 1 row IN SET (0.00 sec)

0 comments: