标签:
Pivoting是一项可以把行旋转为列的技术。在执行Pivoting的过程中可能会使用到聚合。Pivoting技术应用非常广泛。下面讨论的都是静态的Pivoting查询,即用户需要提前知道旋转的属性和列的值。对于动态Pivoting,需要动态地构造字符串。
CREATE TABLE t( id INT, attribute VARCHAR(10), value VARCHAR(20), PRIMARY KEY(id,attribute) );
INSERT INTO t SELECT 1,‘attr1‘,‘BMW‘; INSERT INTO t SELECT 1,‘attr2‘,‘100‘; INSERT INTO t SELECT 1,‘attr3‘,‘2010-01-01‘; INSERT INTO t SELECT 2,‘attr2‘,‘200‘; INSERT INTO t SELECT 2,‘attr3‘,‘2010-03-04‘; INSERT INTO t SELECT 2,‘attr4‘,‘M‘; INSERT INTO t SELECT 2,‘attr5‘,‘55.60‘; INSERT INTO t SELECT 3,‘attr1‘,‘SUV‘; INSERT INTO t SELECT 3,‘attr2‘,‘10‘; INSERT INTO t SELECT 3,‘attr3‘,‘2011-11-11‘;
SELECT id, MAX(CASE WHEN attribute=‘attr1‘ THEN value END) AS attr1, MAX(CASE WHEN attribute=‘attr2‘ THEN value END) AS attr2, MAX(CASE WHEN attribute=‘attr3‘ THEN value END) AS attr3, MAX(CASE WHEN attribute=‘attr4‘ THEN value END) AS attr4, MAX(CASE WHEN attribute=‘attr5‘ THEN value END) AS attr5 FROM t GROUP BY id;
Pivoting先根据id进行分组,确定行列互转后记录的行数。之后通过已知的5个属性来确定行列互转后有5列数据,并通过CASE得到每列的值。由于使用了分组技术,因此一定要使用分组函数来取得列的值,故这里使用MAX函数,当然也可以使用MIN函数。最后得到的结果如下图
CREATE TABLE t1 ( orderid VARCHAR(10) NOT NULL, productid INT NOT NULL, PRIMARY KEY(orderid,productid) );
INSERT INTO t1 SELECT ‘A‘,1; INSERT INTO t1 SELECT ‘A‘,2; INSERT INTO t1 SELECT ‘A‘,3; INSERT INTO t1 SELECT ‘A‘,4; INSERT INTO t1 SELECT ‘B‘,2; INSERT INTO t1 SELECT ‘B‘,3; INSERT INTO t1 SELECT ‘B‘,4; INSERT INTO t1 SELECT ‘C‘,3; INSERT INTO t1 SELECT ‘C‘,4; INSERT INTO t1 SELECT ‘D‘,
SELECT orderid FROM ( SELECT orderid, MAX(CASE WHEN productid=2 THEN 1 END) AS p2, MAX(CASE WHEN productid=3 THEN 1 END) AS P3, MAX(CASE WHEN productid=4 THEN 1 END) AS p4 FROM t1 GROUP BY orderid ) AS P WHERE p2=1 AND p3=1 AND p4=1;
上述语句返回“A”和“B”。如果单独运行子查询,将会得到每个订单对应的产品ID,得到的结果如下
SELECT orderid FROM ( SELECT orderid, COUNT(CASE WHEN productid=2 THEN 1 END) AS p2, COUNT(CASE WHEN productid=3 THEN 1 END) AS P3, COUNT(CASE WHEN productid=4 THEN 1 END) AS p4 FROM t1 GROUP BY orderid ) AS P WHERE p2=1 AND p3=1 AND p4=1;
CREATE TABLE t2 ( orderid INT NOT NULL, orderdate DATE NOT NULL, empid INT NOT NULL, custid VARCHAR(10) NOT NULL, qty INT NOT NULL, PRIMARY KEY (orderid,orderdate) );
INSERT INTO t2 SELECT 1,‘2010-01-02‘,‘3‘,‘A‘,10; INSERT INTO t2 SELECT 2,‘2010-04-02‘,‘2‘,‘B‘,20; INSERT INTO t2 SELECT 3,‘2010-05-02‘,‘1‘,‘A‘,30; INSERT INTO t2 SELECT 4,‘2010-07-02‘,‘3‘,‘D‘,40; INSERT INTO t2 SELECT 5,‘2011-01-02‘,‘4‘,‘A‘,20; INSERT INTO t2 SELECT 6,‘2011-01-02‘,‘3‘,‘B‘,30; INSERT INTO t2 SELECT 7,‘2011-01-02‘,‘1‘,‘C‘,40; INSERT INTO t2 SELECT 8,‘2009-01-02‘,‘2‘,‘A‘,10; INSERT INTO t2 SELECT 9,‘2009-01-02‘,‘3‘,‘B‘,20;
SELECT custid,YEAR(orderdate) AS year,SUM(qty) AS sum_qty FROM t2 GROUP BY custid,YEAR(orderdate)
SELECT custid, IFNULL(SUM(CASE WHEN orderyear=2009 THEN qty END),0) AS ‘2009‘, IFNULL(SUM(CASE WHEN orderyear=2010 THEN qty END),0) AS ‘2010‘, IFNULL(SUM(CASE WHEN orderyear=2011 THEN qty END),0) AS ‘2011‘ FROM (SELECT custid,YEAR(orderdate) AS orderyear,qty FROM t2) AS p GROUP BY custid;
CREATE TABLE Matrix ( orderyear INT PRIMARY KEY, y2009 INT NULL, y2010 INT NULL, y2011 INT NULL );
INSERT INTO Matrix SELECT 2009,1,0,0; INSERT INTO Matrix SELECT 2010,0,1,0; INSERT INTO Matrix SELECT 2011,0,0,1;
SELECT custid, SUM(qty*y2009) AS ‘2009‘, SUM(qty*y2010) AS ‘2010‘, SUM(qty*y2011) AS ‘2011‘ FROM (SELECT custid,YEAR(orderdate) AS orderyear,qty FROM t2) AS O INNER JOIN Matrix AS P ON O.orderyear=P.orderyear GROUP BY custid;
CREATE TABLE p ( custid VARCHAR(10) NOT NULL, y2009 INT NULL, y2010 INT NULL, y2011 INT NULL, PRIMARY KEY (custid) );
INSERT INTO p SELECT custid, IFNULL(SUM(CASE WHEN orderyear=2009 THEN qty END), 0) AS ‘2009‘, IFNULL(SUM(CASE WHEN orderyear=2010 THEN qty END), 0) AS ‘2010‘, IFNULL(SUM(CASE WHEN orderyear=2011 THEN qty END), 0) AS ‘2011‘ FROM (SELECT custid, YEAR(orderdate) AS orderyear, qty FROM t2 ) AS P GROUP BY custid;
这里把t2表返回后的内容导入到表p中,如果想得到t2表直接聚合得到的结果,这个问题就变成了Unpivoting问题。解决这个问题需要将列旋转为行。这里使用的技巧是对每行数据产生3个副本,每个副本产生一个需要旋转的列,这个过程可以通过如下的CROSS JOIN来完成。
SELECT * FROM p, (SELECT 2009 AS orderyear UNION ALL SELECT 2010 UNION ALL SELECT 2011) AS o
CASE orderyear WHEN 2009 THEN y2009 WHEN 2010 THEN y2010 WHEN 2011 THEN y2011 END AS qty
SELECT custid,orderyear, CASE orderyear WHEN 2009 THEN y2009 WHEN 2010 THEN y2010 WHEN 2011 THEN y2011 END AS qty FROM p, (SELECT 2009 AS orderyear UNION ALL SELECT 2010 UNION ALL SELECT 2011) AS o
SELECT custid,orderyear,qty FROM ( SELECT custid,orderyear, CASE orderyear WHEN 2009 THEN y2009 WHEN 2010 THEN y2010 WHEN 2011 THEN y2011 END AS qty FROM p, (SELECT 2009 AS orderyear UNION ALL SELECT 2010 UNION ALL SELECT 2011) AS o ) AS M WHERE qty <> 0
标签:
原文地址:http://www.cnblogs.com/chenqionghe/p/4679797.html