mysql關鍵字 exists 與 in

在mysql中我們經常會使用到子查詢，而子查詢中裡面就會使用到in與exists。下面我們就來分析一下這兩個關鍵字。

1.1 exists

在sql語句中出現exists關鍵字的時候，它先會對外表進行迴圈查詢並且查詢都會檢視exists條件語句是否符合條件。當exists裡的條件語句能夠返回記錄行時，條件就為真，就會返回當前資料。反之如果exists裡的條件語句不能返回記錄行，則當前迴圈到的這條資料就會被丟棄。exists的條件就是乙個過濾條件，當能返回結果集則為true，不能返回結果集則為false;

如下：

select * from
user
where
exists (select
1);

對user表的記錄逐條取出，由於子條件中的select 1永遠能返回記錄行，那麼user表的所有記錄都將被加入結果集，所以與 select * from user;是一樣的。

又如下

select * from
user
where
exists (select * from
user
where userid = 0);

可以知道對user表進行loop時，檢查條件語句(select * from user where userid = 0),由於userid永遠不為0，所以條件語句永遠返回空集，條件永遠為false，那麼user表的所有記錄都將被丟棄。

not exists與exists相反，也就是當exists條件有結果集返回時，loop到的記錄將被丟棄，否則將loop到的記錄加入結果集。

總的來說，如果a表有n條記錄，那麼exists查詢就是將這n條記錄逐條取出，然後判斷n遍exists條件

1.2、in

in查詢相當於多個or條件的疊加，這個比較好理解，比如下面的查詢

select * from
user
where userid in (1, 2, 3);

等效於

select * from
user
where userid = 1
or userid = 2
or userid = 3;

not in與in相反，例如：

select * from
user
where userid not
in (1, 2, 3);

等效於

select * from
user
where userid != 1
and userid != 2
and userid != 3;

總的來說，in查詢就是先將子查詢條件的記錄全都查出來，假設結果集為b，共有m條記錄，然後在將子查詢條件的結果集分解成m個，再進行m次查詢

值得一提的是，in查詢的子條件返回結果必須只有乙個字段，例如：

select * from
user
where userid in (select id from b);

而不能是

select * from
user
where userid in (select id, age from b);

而exists就沒有這個限制

我們先來考慮如下兩個sql語句：

1：select * from a where
exists (select * from b where b.id = a.id);
2：select * from a where a.id in (select id from b);

查詢1.可以轉化以下偽**，便於理解

for ($i = 0; $i
< count(a); $i++) 
return
$result;

大概就是這麼個意思，其實可以看到,查詢1主要是用到了b表的索引，a表如何對查詢的效率影響應該不大。

對於查詢2，假設b表的所有id為1,2,3,查詢2可以轉換為

or a.id = 2

or a.id = 3;

這個好理解了，這裡主要是用到了a的索引，b表如何對查詢影響不大。

下面再看not exists 和 not in

1. select * from a where
notexists (select * from b where b.id = a.id);
2. select * from a where a.id not
in (select id from b);

看查詢1，還是和上面一樣，用了b的索引,而對於查詢2，可以轉化成如下語句。

and a.id != 2

and a.id != 3;

可以知道not in是個範圍查詢，這種!=的範圍查詢無法使用任何索引,等於說a表的每條記錄，都要在b表裡遍歷一次，檢視b表裡是否存在這條記錄

故not exists比not in效率高

mysql中的in語句是把外表和內錶作hash 連線，而exists語句是對外表作loop迴圈，每次loop迴圈再對內表進行查詢。一直大家都認為exists比in語句的效率要高，這種說法其實是不準確的。這個是要區分環境的:

例如：表a（小表），表b（大表）

case 1:

select * from a where cc in (select cc from b) # 效率低，用到了a表上cc列的索引； select * from a where exists(select cc from b where cc=a.cc) # 效率高，用到了b表上cc列的索引。

相反的

case 2：

select * from b where cc in (select cc from a) # 效率高，用到了b表上cc列的索引； select * from b where exists(select cc from a where cc=b.cc)

#效率低，用到了a表上cc列的索引。

not in 和not exists如果查詢語句使用了not in 那麼內外表都進行全表掃瞄，沒有用到索引；而not extsts 的子查詢依然能用到表上的索引。所以無論那個表大，用not exists都比not in要快。

in執行順序：先執行in中的子查詢，作為我們最外層迴圈，主查詢作為內層迴圈

exists: 主查詢作為最外層迴圈，子查詢作為最內層迴圈

而根據我們的時間複雜度，最外層迴圈小於內層迴圈的時候，使用的時間相對較少

結論：永遠小表驅動大表是最優的選擇方式

mysql關鍵字 exists 與 in

SQL中IN與EXISTS關鍵字

exists關鍵字在 mysql 中的用法

exists關鍵字的通俗理解以及與in的比較

mysql關鍵字 exists 與 in

SQL中IN與EXISTS關鍵字

exists關鍵字在 mysql 中的用法

exists關鍵字的通俗理解以及與in的比較

相關推薦