T-SQL WITH AS: The WITH Keyword in T-SQL

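Logically, the SELECT clause is one of the last steps processed in a SQL statement: it is evaluated after FROM, WHERE, GROUP BY, and HAVING. As a result, the following query fails: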

SELECT
 YEAR(OrderDate) AS OrderYear,
 COUNT(DISTINCT CustomerID) AS NumCusts
FROM dbo.Orders
GROUP BY OrderYear;      

GROUP BY is processed before SELECT, so at that point the OrderYear column alias does not exist yet.

To make the query work, move the expression into a derived table:

SELECT OrderYear, COUNT(DISTINCT CustomerID) AS NumCusts
FROM (SELECT YEAR(OrderDate) AS OrderYear, CustomerID
   FROM dbo.Orders) AS D
GROUP BY OrderYear;      

There is also a less common form, in which the column aliases are listed after the derived table's name:

SELECT OrderYear, COUNT(DISTINCT CustomerID) AS NumCusts
FROM (SELECT YEAR(OrderDate), CustomerID
   FROM dbo.Orders) AS D(OrderYear, CustomerID)
GROUP BY OrderYear;      

The author is very fond of this form because he finds it clearer, more explicit, and easier to maintain.

Using arguments in the query to produce a targeted set of results: there is not much to say about this technique.

Nesting: nested queries are logically processed from the innermost query outward.
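As a rough illustration of inside-out processing, the sketch below nests two derived tables against the same dbo.Orders table used above. The names D1 and D2 and the threshold of 70 are chosen only for this example.

```sql
-- Innermost query (D1) runs first logically: it creates the OrderYear alias.
-- The middle query (D2) groups by that alias and creates NumCusts.
-- The outermost query can then filter on NumCusts.
SELECT OrderYear, NumCusts
FROM (SELECT OrderYear, COUNT(DISTINCT CustomerID) AS NumCusts
      FROM (SELECT YEAR(OrderDate) AS OrderYear, CustomerID
            FROM dbo.Orders) AS D1
      GROUP BY OrderYear) AS D2
WHERE NumCusts > 70;
```

Each level exists only so that the next level out can refer to an alias by name, which is why deeply nested derived tables become hard to read.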

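Multiple references: a SQL statement may need to query the same table expression several times and join the results. For example, to compare each year's number of customers with the previous year's, the query has to JOIN two instances of the same derived table, and with derived tables this repetition cannot be avoided.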
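A minimal sketch of what that repetition looks like with derived tables, assuming the same dbo.Orders table; the aliases Cur and Prv are illustrative:

```sql
-- The same derived table has to be written out twice,
-- once for the current year and once for the previous year.
SELECT Cur.OrderYear,
  Cur.NumCusts AS CurNumCusts, Prv.NumCusts AS PrvNumCusts,
  Cur.NumCusts - Prv.NumCusts AS Growth
FROM (SELECT YEAR(OrderDate) AS OrderYear,
        COUNT(DISTINCT CustomerID) AS NumCusts
      FROM dbo.Orders
      GROUP BY YEAR(OrderDate)) AS Cur
  LEFT OUTER JOIN
     (SELECT YEAR(OrderDate) AS OrderYear,
        COUNT(DISTINCT CustomerID) AS NumCusts
      FROM dbo.Orders
      GROUP BY YEAR(OrderDate)) AS Prv
    ON Cur.OrderYear = Prv.OrderYear + 1;
```

Any change to the yearly count logic has to be made in both copies, which is the maintenance problem that a table expression defined once and referenced twice removes.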

Common Table Expressions (CTE)

The CTE is a new kind of table expression introduced in SQL Server 2005.

Its general form is:

WITH cte_name
AS
(
 cte_query
)
outer_query_referring_to_cte_name;

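Note: the WITH keyword is already used for other purposes in T-SQL (table hints, for example). To avoid ambiguity, when a CTE is not the first statement in a batch, the statement that precedes it must be terminated with a semicolon (";").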
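The semicolon requirement can be sketched as follows. The variable @Year and the value 1997 are assumptions made for illustration; the table is the same dbo.Orders used throughout.

```sql
DECLARE @Year AS INT;
SET @Year = 1997;  -- this statement must end with ";" because a CTE follows

WITH C AS
(
  SELECT YEAR(OrderDate) AS OrderYear, CustomerID
  FROM dbo.Orders
)
SELECT OrderYear, COUNT(DISTINCT CustomerID) AS NumCusts
FROM C
WHERE OrderYear = @Year
GROUP BY OrderYear;
```

If the semicolon after the SET statement is left out, SQL Server rejects the batch with a syntax error near WITH. Terminating every statement with a semicolon avoids the problem altogether.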

CTE Example 1 (result column aliases)

WITH C AS
(
 SELECT YEAR(OrderDate) AS OrderYear, CustomerID
 FROM dbo.Orders
)
SELECT OrderYear, COUNT(DISTINCT CustomerID) AS NumCusts
FROM C
GROUP BY OrderYear;      

The author, however, recommends this form instead:

WITH C(OrderYear, CustomerID) AS
(
 SELECT YEAR(OrderDate), CustomerID
 FROM dbo.Orders
)
SELECT OrderYear, COUNT(DISTINCT CustomerID) AS NumCusts
FROM C
GROUP BY OrderYear;      

CTE Example 2 (multiple CTEs)

WITH C1 AS
(
 SELECT YEAR(OrderDate) AS OrderYear, CustomerID
 FROM dbo.Orders
),
C2 AS
(
 SELECT OrderYear, COUNT(DISTINCT CustomerID) AS NumCusts
 FROM C1
 GROUP BY OrderYear
)
SELECT OrderYear, NumCusts
FROM C2
WHERE NumCusts > 70;      

CTE Example 3 (multiple references)

WITH YearlyCount AS
(
 SELECT YEAR(OrderDate) AS OrderYear,
  COUNT(DISTINCT CustomerID) AS NumCusts
 FROM dbo.Orders
 GROUP BY YEAR(OrderDate)
)
SELECT Cur.OrderYear,
 Cur.NumCusts AS CurNumCusts, Prv.NumCusts AS PrvNumCusts,
 Cur.NumCusts - Prv.NumCusts AS Growth
FROM YearlyCount AS Cur
 LEFT OUTER JOIN YearlyCount AS Prv
  ON Cur.OrderYear = Prv.OrderYear + 1;      

CTE Example 4 (modifying data)

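1. Build a new table, dbo.CustomersDups, from a query against the Customers table. The query cross-joins Customers with itself, so every customer appears many times: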

IF OBJECT_ID('dbo.CustomersDups') IS NOT NULL
 DROP TABLE dbo.CustomersDups;
GO

WITH CrossCustomers AS
(
 SELECT 1 AS c, C1.*
 FROM dbo.Customers AS C1, dbo.Customers AS C2
)
SELECT ROW_NUMBER() OVER(ORDER BY c) AS KeyCol,
 CustomerID, CompanyName, ContactName, ContactTitle, Address,
 City, Region, PostalCode, Country, Phone, Fax
INTO dbo.CustomersDups
FROM CrossCustomers;
      

2. Use a CTE to remove the duplicates, keeping only the row with the largest KeyCol for each CustomerID in the CustomersDups table.

WITH JustDups AS
(
 SELECT * FROM dbo.CustomersDups AS C1
 WHERE KeyCol <
  (SELECT MAX(KeyCol) FROM dbo.CustomersDups AS C2
   WHERE C2.CustomerID = C1.CustomerID)
)
DELETE FROM JustDups;      
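For comparison, here is a different way to express the same cleanup. It is not the author's version. It numbers the rows of each customer with ROW_NUMBER and deletes everything except the first. The CTE name Numbered and the column rn are made up for this sketch.

```sql
WITH Numbered AS
(
  SELECT KeyCol, CustomerID,
    -- rn = 1 marks the row with the largest KeyCol for each customer
    ROW_NUMBER() OVER(PARTITION BY CustomerID
                      ORDER BY KeyCol DESC) AS rn
  FROM dbo.CustomersDups
)
DELETE FROM Numbered
WHERE rn > 1;
```

Both forms rely on the same property: a DELETE issued against a CTE that is based on a single table removes the corresponding rows from that underlying table.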

CTE Example 5 (container objects)

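CTEs can be placed inside container objects such as views and UDFs. This provides encapsulation, which helps with modular development. The author adds a reminder that CTEs cannot be nested directly, but that nesting can be achieved indirectly by encapsulating a CTE in a container object and querying that container from an outer CTE.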

The author also notes that using CTEs in views and UDFs is of little value.

Here is an example:

CREATE VIEW dbo.VYearCnt
AS
WITH YearCnt AS
(
 SELECT YEAR(OrderDate) AS OrderYear,
  COUNT(DISTINCT CustomerID) AS NumCusts
 FROM dbo.Orders
 GROUP BY YEAR(OrderDate)
)
SELECT * FROM YearCnt;      
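The indirect nesting described above might look like the following sketch. It assumes the view dbo.VYearCnt has already been created in its own batch. The outer CTE name BusyYears is hypothetical, and the threshold of 70 is borrowed from Example 2.

```sql
-- The view hides the inner CTE (YearCnt); the outer CTE queries the view,
-- so one CTE is effectively nested inside another.
WITH BusyYears AS
(
  SELECT OrderYear, NumCusts
  FROM dbo.VYearCnt
  WHERE NumCusts > 70
)
SELECT OrderYear, NumCusts
FROM BusyYears
ORDER BY OrderYear;
```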

CTE Example 6 (recursive CTEs)

The author uses an example to explain recursive CTEs, another feature new in SQL Server 2005.

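Given an EmployeeID, return that employee's details together with the details of all subordinates at every level. (The hierarchy is defined by the EmployeeID and ReportsTo attributes.) The result contains the columns EmployeeID, ReportsTo, FirstName, and LastName.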

The author first gives the optimal index for this task:

CREATE UNIQUE INDEX idx_mgr_emp_ifname_ilname
 ON dbo.Employees(ReportsTo, EmployeeID)
 INCLUDE(FirstName, LastName);      

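The author's explanation: this index allows the direct subordinates of each manager to be fetched with a single seek plus a partial scan. FirstName and LastName are added with INCLUDE so that the index covers the query.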

Side note: what is an index with included columns?

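Included columns are a new feature in SQL Server 2005. They do not affect the physical ordering of the index rows; they are simply attached to the index rows at the leaf level as extra payload. The point of attaching them is that a query can get these additional values by reading the index alone, without a separate lookup into the base table.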
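As a small illustration, the query below should be answerable entirely from the index idx_mgr_emp_ifname_ilname defined above, assuming that index exists. The manager ID 2 is an arbitrary value picked for this sketch.

```sql
-- ReportsTo is the leading key column, so the filter can use an index seek.
-- EmployeeID is a key column and FirstName/LastName are included columns,
-- so every referenced column is present in the index.
SELECT EmployeeID, FirstName, LastName
FROM dbo.Employees
WHERE ReportsTo = 2;
```

If the query also selected a column that is neither a key nor an included column, the index would no longer cover it and SQL Server would need to fetch that column from the base table.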

Back to the author's example, here is the recursive code:

WITH EmpsCTE AS
(
 SELECT EmployeeID, ReportsTo, FirstName, LastName
 FROM dbo.Employees
 WHERE EmployeeID = 5
 UNION ALL

 SELECT EMP.EmployeeID, EMP.ReportsTo, EMP.FirstName, EMP.LastName
 FROM EmpsCTE AS MGR
  JOIN dbo.Employees AS EMP
   ON EMP.ReportsTo = MGR.EmployeeID
)
SELECT * FROM EmpsCTE;      

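How to read it: a recursive CTE contains at least two queries. The first query in the CTE body is the anchor member. It simply returns a valid table and serves as the starting point of the recursion. In the example above, the anchor member returns a single row, the one with EmployeeID = 5. The second query is the recursive member. It refers to the CTE's own name, which stands for the result of the previous step, and the recursion ends when the recursive member returns an empty set, that is, when no further subordinates are found.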
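To make the step-by-step nature of the recursion visible, the sketch below adds a level counter to the same query. The column name lvl is an assumption introduced here and is not part of the example above.

```sql
WITH EmpsCTE AS
(
  -- Anchor member: the starting employee, at level 0
  SELECT EmployeeID, ReportsTo, FirstName, LastName, 0 AS lvl
  FROM dbo.Employees
  WHERE EmployeeID = 5

  UNION ALL

  -- Recursive member: direct subordinates of the previous level's rows
  SELECT EMP.EmployeeID, EMP.ReportsTo, EMP.FirstName, EMP.LastName,
    MGR.lvl + 1
  FROM EmpsCTE AS MGR
    JOIN dbo.Employees AS EMP
      ON EMP.ReportsTo = MGR.EmployeeID
)
SELECT EmployeeID, ReportsTo, FirstName, LastName, lvl
FROM EmpsCTE
ORDER BY lvl, EmployeeID;
```

Rows with lvl = 0 come from the anchor member, rows with lvl = 1 from the first invocation of the recursive member, and so on until an invocation returns no rows.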

If you are worried that the recursion might loop forever, you can cap it with the following form:

WITH cte_name AS (cte_body) outer_query OPTION (MAXRECURSION n);

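The default value of n is 100. With n = 0 there is no limit. If the limit is exceeded, the statement is terminated with an error.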
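Applied to the employee hierarchy query, the hint could be used like this. The limit of 10 is an arbitrary value chosen for the sketch.

```sql
WITH EmpsCTE AS
(
  SELECT EmployeeID, ReportsTo, FirstName, LastName
  FROM dbo.Employees
  WHERE EmployeeID = 5

  UNION ALL

  SELECT EMP.EmployeeID, EMP.ReportsTo, EMP.FirstName, EMP.LastName
  FROM EmpsCTE AS MGR
    JOIN dbo.Employees AS EMP
      ON EMP.ReportsTo = MGR.EmployeeID
)
SELECT * FROM EmpsCTE
OPTION (MAXRECURSION 10);  -- fail if more than 10 recursive steps are needed
```

A small limit like this acts as a safety net against cyclic data, for example an employee who ends up reporting, directly or indirectly, to one of their own subordinates.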