顺序事件计数和顺序SQL计数
我有一个查询,我用一个找到的答案构建了这个查询,它非常有用。我加了一些东西来满足我的需要。我添加了一个行数,以计算在任何时间段内30天内有多少人再次入院。我已经按照第一个答案和一个问题的建议,将cte结果插入临时表中。这并不能解决思想、序列长度和序列计数问题 以下是查询:顺序事件计数和顺序SQL计数,sql,sql-server-2008-r2,parent-child,common-table-expression,row-number,Sql,Sql Server 2008 R2,Parent Child,Common Table Expression,Row Number,我有一个查询,我用一个找到的答案构建了这个查询,它非常有用。我加了一些东西来满足我的需要。我添加了一个行数,以计算在任何时间段内30天内有多少人再次入院。我已经按照第一个答案和一个问题的建议,将cte结果插入临时表中。这并不能解决思想、序列长度和序列计数问题 以下是查询: -- CREATE TABLE TO STORE CTE RESULTS DECLARE @PPR TABLE( VISIT1 VARCHAR(20) -- THIS IS A UNIQUE VALUE
-- CREATE TABLE TO STORE CTE RESULTS
DECLARE @PPR TABLE(
VISIT1 VARCHAR(20) -- THIS IS A UNIQUE VALUE
, READMIT VARCHAR(20) -- THIS IS A UNIQUE VALUE
, MRN VARCHAR(10) -- THIS IS UNIQUE TO A PERSON
, INIT_DISC DATETIME
, RA_ADM DATETIME
, R1 INT
, R2 INT
, INTERIM1 VARCHAR(20)
, RA_COUNT INT
, FLAG VARCHAR(2)
);
-- THE CTE THAT WILL GET USED TO POPULATE THE ABOVE TABLE
WITH cte AS (
SELECT PTNO_NUM
, Med_Rec_No
, Dsch_Date
, Adm_Date
, ROW_NUMBER() OVER (
PARTITION BY MED_REC_NO
ORDER BY PtNo_Num
) AS r
FROM smsdss.BMH_PLM_PtAcct_V
WHERE Plm_Pt_Acct_Type = 'I'
AND PtNo_Num < '20000000'
)
-- INSERT CTE RESULTS INTO PPR TABLE
INSERT INTO @PPR
SELECT
c1.PtNo_Num AS [INDEX]
, c2.PtNo_Num AS [READMIT]
, c1.Med_Rec_No AS [MRN]
, c1.Dsch_Date AS [INITIAL DISCHARGE]
, c2.Adm_Date AS [READMIT DATE]
, C1.r
, C2.r
, DATEDIFF(DAY, c1.Dsch_Date, c2.Adm_Date) AS INTERIM1
, ROW_NUMBER() OVER (
PARTITION BY C1.MED_REC_NO
ORDER BY C1.PTNO_NUM ASC
) AS [RA COUNT]
, CASE
WHEN DATEDIFF(DAY, c1.Dsch_Date, c2.Adm_Date) <= 30
THEN 1
ELSE 0
END [FLAG]
FROM cte C1
INNER JOIN cte C2
ON C1.Med_Rec_No = C2.Med_Rec_No
WHERE C1.Adm_Date <> C2.Adm_Date
AND C1.r + 1 = C2.r
ORDER BY C1.Med_Rec_No, C1.Dsch_Date
-- MANIPULATE PPR TABLE
SELECT PPR.VISIT1
, PPR.READMIT
, PPR.MRN
, PPR.INIT_DISC
, PPR.RA_ADM
--, PPR.R1
--, PPR.R2
, PPR.INTERIM1
--, PPR.RA_COUNT
, PPR.FLAG
-- THE BELOW DOES NOT WORK AT ALL
, CASE
WHILE (SELECT PPR.INTERIM1 FROM @PPR PPR) <= 30
BEGIN
ROW_NUMBER() OVER (PARTITION BY PPR.MRN, PPR.VISIT1
ORDER BY PPR.VISIT1
)
IF (SELECT PPR.INTERIM1 FROM @PPR PPR) > 30
BREAK
END
END
FROM @PPR PPR
WHERE PPR.MRN = 'A NUMBER'
所以第三行显然不是30天内再入院,而是患者回到医院的时间点,所以RA_计数回到1,标志变为0,因为这不是30天内再入院
我应该创建一个表而不是使用cte吗
我想添加的是链长度和链数。以下是一些定义:
链长:在后续就诊的30天内,连续有多少次患者再次入院
比如说
INDEX | READMIT | MRN | INITIAL DISCHARGE | READMIT DATE | CHAIN LEN | Count
123 | 133 | 1236 | 2009-05-13 | 2009-06-12 | 1 | 1
133 | 145 | 1236 | 2009-06-16 | 2009-07-04 | 2 | 1
145 | 157 | 1236 | 2009-07-06 | 2009-07-15 | 3 | 1
165 | 189 | 1236 | 2011-01-01 | 2011-01-12 | 1 | 2
189 | 195 | 1236 | 2011-02-06 | 2011-03-01 | 2 | 2
链数就是有多少条链:在上表中有2条。我试图使用case语句来确定链的长度
下面是一个SQL脚本,其中包含一些示例数据,因为它将在执行CTE之前出现
谢谢,关于CTE有一个缺点需要记住 每次引用它们时,它都会重新执行查询
在查询中,两次引用CTE c1和c2。这意味着它将执行两次查询。最好将CTE结果存储在一个表变量或临时表中,然后执行联接。解决方案是从一个视图开始,该视图汇总每个VisitID的数据。我用一个视图来代替CTE,因为它看起来像是你在一次以上的时间里要告诉我们的东西
create view vReadmits
as
select t.VisitID,
t.UID,
min(r.AdmitDT) ReadmittedDT,
min(r.VisitID) NextVisitID,
sum(case when r.AdmitDT < dateadd(d, 30, isnull(t.DischargeDT, t.AdmitDT))
then 1 else 0 end) ReadmitNext30
from t
left join t as r
on t.UID = r.UID
and t.VisitID < r.VisitID
group by t.VisitID,
t.UID
这将获取每个VisitID并为该UID查找下一个VisitID。同时,它总结了未来不到30天的访问。它使用ISNULL来解释丢失的放电
然后,可以为CTE中的链添加逻辑。然后,您可以连接到视图和CTE以在视图中包含列
with Chains as
(
select v.UID,
sum(case when r.ReadmittedDT < dateadd(d, 30, v.ReadmittedDT)
then 0 else 1 end) as ChainCount
from vReadmits v
left join vReadmits r
on r.NextVisitID = v.VisitID
group by v.UID
)
select t.UID,
t.VisitId,
t.AdmitDT,
t.DischargeDT,
v.NextVisitID,
v.ReadmitNext30,
v.ReadmittedDT,
c.ChainCount
from t
join vReadmits v
on t.VisitID = v.VisitID
inner join Chains c
on v.UID = c.UID
order by t.UID, t.VisitID
这是你的电话号码
我的假设是,如果VisitID大于另一个,那么它的受欢迎程度也会更高。尤其对于相同的UID,情况应该是这样,但如果不是这样,您会将视图更改为使用admitts而不是VisitID。更新1:如果两个事件之间的最大差异为30天,则会链接两个事件。[计数]值是按每人生成的
您可以调整以下使用a的示例:
如你所见
对于最后一条语句,执行计划包含两个索引查找运算符,因为@EventsWithNum上定义的此约束主键EventNum,PersonID强制SQL Server创建一个具有复合键EventNum,PersonID的聚集索引
另外,我们可以看到INSERT@EventsWithNum的估计成本。。。大于具有CountingSequentiaEvents的的估计成本。。。选择从计数顺序事件中……这样做是一种放慢速度的方法。在cte上进行连接对我来说要快得多。我将更新我的代码,因为将结果存储在临时表中仍然无法回答获取链长或链数的问题。这是否运行?选择案件虽然…案件不它不是,我从来没有用过它,正在尝试不同的事情out@MCP_infiltrator你有没有可能用你的一些数据创建一个sql FIDLE?我现在会尝试这样做。更新后的sql FIDLE我要到周一才能测试,但到目前为止看起来还不错。这个生成的结果与我在原始问题中使用CTE得到的结果非常相似减去案例陈述。您的解决方案存在的问题是,它会计算某个人在单个帐户(而不是任何帐户)的30天内被重新接纳的次数。我需要一个与@bodan的答案类似的输出。你能更新你当前的解决方案吗?我会稍微尝试一下。请看有问题的评论,链长似乎计算不正确。@MCP_Insiderator:那么,告诉我我测试的正确[chain LEN]是什么?你显示的输出是使用一次遭遇30天重新提交,在我想要的输出中,它不需要是一次访问,但可以是一个菊花链访问。我将尝试使我想要的输出显示得更长一点。我将尝试一下
with Chains as
(
select v.UID,
sum(case when r.ReadmittedDT < dateadd(d, 30, v.ReadmittedDT)
then 0 else 1 end) as ChainCount
from vReadmits v
left join vReadmits r
on r.NextVisitID = v.VisitID
group by v.UID
)
select t.UID,
t.VisitId,
t.AdmitDT,
t.DischargeDT,
v.NextVisitID,
v.ReadmitNext30,
v.ReadmittedDT,
c.ChainCount
from t
join vReadmits v
on t.VisitID = v.VisitID
inner join Chains c
on v.UID = c.UID
order by t.UID, t.VisitID
CREATE TABLE dbo.Events (
EventID INT IDENTITY(1,1) PRIMARY KEY,
EventDate DATE NOT NULL,
PersonID INT NOT NULL
);
GO
INSERT dbo.Events (EventDate, PersonID)
VALUES
('2014-01-01', 1), ('2014-01-05', 1), ('2014-02-02', 1), ('2014-03-30', 1), ('2014-04-04', 1),
('2014-01-11', 2), ('2014-02-02', 2),
('2014-01-03', 3), ('2014-03-03', 3);
GO
DECLARE @EventsWithNum TABLE (
EventID INT NOT NULL,
EventDate DATE NOT NULL,
PersonID INT NOT NULL,
EventNum INT NOT NULL,
PRIMARY KEY (EventNum, PersonID)
);
INSERT @EventsWithNum
SELECT crt.EventID, crt.EventDate, crt.PersonID,
ROW_NUMBER() OVER(PARTITION BY crt.PersonID ORDER BY crt.EventDate, crt.EventID) AS EventNum
FROM dbo.Events crt;
WITH CountingSequentiaEvents
AS (
SELECT crt.EventID, crt.EventDate, crt.PersonID, crt.EventNum,
1 AS GroupNum,
1 AS GroupEventNum
FROM @EventsWithNum crt
WHERE crt.EventNum = 1
UNION ALL
SELECT crt.EventID, crt.EventDate, crt.PersonID, crt.EventNum,
CASE
WHEN DATEDIFF(DAY, prev.EventDate, crt.EventDate) <= 30 THEN prev.GroupNum
ELSE prev.GroupNum + 1
END AS GroupNum,
CASE
WHEN DATEDIFF(DAY, prev.EventDate, crt.EventDate) <= 30 THEN prev.GroupEventNum + 1
ELSE 1
END AS GroupEventNum
FROM @EventsWithNum crt JOIN CountingSequentiaEvents prev ON crt.PersonID = prev.PersonID
AND crt.EventNum = prev.EventNum + 1
)
SELECT x.EventID, x.EventDate, x.PersonID,
x.GroupEventNum AS [CHAIN LEN],
x.GroupNum AS [Count]
FROM CountingSequentiaEvents x
ORDER BY x.PersonID, x.EventDate
-- 1000 means 1000 + 1 = maximum 1001 events / person
OPTION (MAXRECURSION 1000); -- Please read http://msdn.microsoft.com/en-us/library/ms175972.aspx (section Guidelines for Defining and Using Recursive Common Table Expressions)
EventID EventDate PersonID CHAIN LEN Count
------- ---------- -------- --------- -----
1 2014-01-01 1 1 1
2 2014-01-05 1 2 1
3 2014-02-02 1 3 1
------- ---------- -------- --------- -----
4 2014-03-30 1 1 2
5 2014-04-04 1 2 2
------- ---------- -------- --------- -----
6 2014-01-11 2 1 1
7 2014-02-02 2 2 1
------- ---------- -------- --------- -----
8 2014-01-03 3 1 1
------- ---------- -------- --------- -----
9 2014-03-03 3 1 2
------- ---------- -------- --------- -----