Regex >>>>Self Configuring Tagged DIP<<<< file conversion

Tags: regex, tsql, vbscript, get

I have 100 data files that I need help formatting, so that they look like this:

"DocTypeName","DocDate","Loan Number","Property Name","Borrower Name","Address","City","State","Document Handle","DiskgroupNum","VolumeNum","NumOfPages","FileSize","DocRevNum","Rendition","PhysicalPageNum","ItemPageNum","FileTypeNum","ImageType","Compress","Xdpi","Ydpi","FileName"
"System Maintenance Forms","3/18/2013","893102103","sample","sample","sample","DECATUR","GA","599954","102","110","0","303909","0","1","0","0","16","0","2","0","0","\V110\1366\798567.pdf"
"Post Closing Contact Sheet","3/18/2013","893102103","sample","sample","sample","DECATUR","GA","2013 02 26 BWR CONTACT INFO UPDATE FOORM","599983","102","110","0","276653","0","1","0","0","16","0","2","0","0","\V110\1366\798596.pdf"

The files currently look like this:

>>>>Self Configuring Tagged DIP<<<<
        BEGIN:
        >>DocTypeName: System Maintenance Forms
        >>DocDate: 3/18/2013
        Loan Number: 893102103
        Property Name: sample
        Borrower Name: sample
        Address: sample
        City: DECATUR
        State: GA
        Document Handle: 599954
        >>DiskgroupNum: 102
        >>VolumeNum: 110
        >>NumOfPages: 0
        >>FileSize: 303909
        >>DocRevNum: 0
        >>Rendition: 1
        >>PhysicalPageNum: 0
        >>ItemPageNum: 0
        >>FileTypeNum: 16
        >>ImageType: 0
        >>Compress: 2
        >>Xdpi: 0
        >>Ydpi: 0
        >>FileName: \V110\1366\798567.pdf
        BEGIN:
        >>DocTypeName: Post Closing Contact Sheet
        >>DocDate: 3/18/2013
        Loan Number: 893102103
        Property Name: sample
        Borrower Name: sample
        Address: sample 
        City: DECATUR
        State: GA
        Ad Hoc: 2013 02 26 BWR CONTACT INFO UPDATE FOORM
        Document Handle: 599983
        >>DiskgroupNum: 102
        >>VolumeNum: 110
        >>NumOfPages: 0
        >>FileSize: 276653
        >>DocRevNum: 0
        >>Rendition: 1
        >>PhysicalPageNum: 0
        >>ItemPageNum: 0
        >>FileTypeNum: 16
        >>ImageType: 0
        >>Compress: 2
        >>Xdpi: 0
        >>Ydpi: 0
        >>FileName: \V110\1366\798596.pdf
        BEGIN:
        >>DocTypeName: Insurance ACORD
        >>DocDate: 1/11/2008
        Policy Number: 91QB94439
        Effective Date: 01/24/2008
        Expiration Date: 01/24/2009
        Loan Number: 893102103
        Property Name: sample
        Borrower Name: sample
        Address: 3642 sample
        City: DECATUR
        State: GA
        Document Handle: 98326
        >>DiskgroupNum: 102
        >>VolumeNum: 24
        >>NumOfPages: 1
        >>FileSize: 74839
        >>DocRevNum: 0
        >>Rendition: 0
        >>PhysicalPageNum: 0
        >>ItemPageNum: 0
        >>FileTypeNum: 2
        >>ImageType: 0
        >>Compress: 2
        >>Xdpi: 0
        >>Ydpi: 0
        >>FileName: \V24\202\133225.TIF
        >>DiskgroupNum: 102
        >>VolumeNum: 24
        >>NumOfPages: 1
        >>FileSize: 74839
        >>DocRevNum: 0
        >>Rendition: 0
        >>PhysicalPageNum: 1
        >>ItemPageNum: 1
        >>FileTypeNum: 2
        >>ImageType: 0
        >>Compress: 2
        >>Xdpi: 0
        >>Ydpi: 0
        >>FileName: \V24\202\133225.TIF
        BEGIN:
        >>DocTypeName: Insurance ACORD
        >>DocDate: 1/2/2008
        Policy Number: 91QB94439
        Effective Date: 01/24/2008
        Expiration Date: 01/24/2009
        Loan Number: 893102103
        Property Name: sample
        Borrower Name: sample
        Address: sample
        City: DECATUR
        State: GA
        Document Handle: 94202
        >>DiskgroupNum: 102
        >>VolumeNum: 23
        >>NumOfPages: 1
        >>FileSize: 78846
        >>DocRevNum: 0
        >>Rendition: 0
        >>PhysicalPageNum: 0
        >>ItemPageNum: 0
        >>FileTypeNum: 2
        >>ImageType: 0
        >>Compress: 2
        >>Xdpi: 0
        >>Ydpi: 0
        >>FileName: \V23\194\128851.TIF
        >>DiskgroupNum: 102
        >>VolumeNum: 23
        >>NumOfPages: 1
        >>FileSize: 78846
        >>DocRevNum: 0
        >>Rendition: 0
        >>PhysicalPageNum: 1
        >>ItemPageNum: 1
        >>FileTypeNum: 2
        >>ImageType: 0
        >>Compress: 2
        >>Xdpi: 0
        >>Ydpi: 0
        >>FileName: \V23\194\128851.TIF

There is a similar question, but I can't see an answer to it.

Any help would be greatly appreciated. VB, SQL, or anything else.


Thanks…

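Your sample file contains 5 records with at least 3 different structures. If you want to convert 100 files to (some kind of) .csv, you need a better, more detailed plan.

To get you started on the parsing sub-task: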
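
The point about differing structures can be checked mechanically before any conversion is planned. What follows is a minimal Python sketch, not part of the original answer (the asker allowed for "VB, SQL, or anything else"). It splits the text at `BEGIN:` and counts how often each sequence of field names occurs. `SAMPLE` is an abbreviated copy of the question's data, and `FIELD` and `record_structures` are hypothetical names chosen for illustration.

```python
import re
from collections import Counter

# One "key: value" line; an optional ">>" prefix is dropped from the key.
FIELD = re.compile(r"^\s*(?:>>)?([^:\n]+):[ \t]*(.*?)\s*$", re.MULTILINE)

# Abbreviated from the question's sample; most fields are left out for brevity.
SAMPLE = r"""
>>>>Self Configuring Tagged DIP<<<<
        BEGIN:
        >>DocTypeName: System Maintenance Forms
        Loan Number: 893102103
        Document Handle: 599954
        >>FileName: \V110\1366\798567.pdf
        BEGIN:
        >>DocTypeName: Post Closing Contact Sheet
        Loan Number: 893102103
        Ad Hoc: 2013 02 26 BWR CONTACT INFO UPDATE FOORM
        Document Handle: 599983
        >>FileName: \V110\1366\798596.pdf
        BEGIN:
        >>DocTypeName: Insurance ACORD
        Policy Number: 91QB94439
        Loan Number: 893102103
        Document Handle: 98326
        >>FileName: \V24\202\133225.TIF
        >>FileName: \V24\202\133225.TIF
        BEGIN:
        >>DocTypeName: Insurance ACORD
        Policy Number: 91QB94439
        Loan Number: 893102103
        Document Handle: 94202
        >>FileName: \V23\194\128851.TIF
        >>FileName: \V23\194\128851.TIF
"""


def record_structures(text):
    """Count how often each sequence of field names occurs in the records."""
    # Everything before the first "BEGIN:" is the banner line, so skip it.
    blocks = text.split("BEGIN:")[1:]
    return Counter(
        tuple(m.group(1) for m in FIELD.finditer(block)) for block in blocks
    )


for keys, count in record_structures(SAMPLE).items():
    print(count, "record(s):", ", ".join(keys))
```

Running something like this over all 100 files would show which field sets the target .csv has to accommodate, for example the extra `Ad Hoc` field or the repeated page-level fields.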

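Option Explicit

' Parses one "BEGIN:" block of "key: value" lines into a dictionary.
Class cParser
  Private r
  Private d
  Private Sub Class_Initialize()
    Set r       = New RegExp
    r.Global    = True
    r.MultiLine = True
    ' Optional ">>" prefix, key up to the first colon, then the value.
    ' The trailing \r means the input is expected to use CRLF line endings.
    r.Pattern   = "^\s+(?:>>)?([^:]+):\s*(.+)\r$"
    Set d       = CreateObject("Scripting.Dictionary")
  End Sub
  Public Function parse(s)
    Dim ms : Set ms = r.Execute(s)
    If 0 = ms.Count Then
       WScript.Echo "parse error"
       WScript.Echo s
       WScript.Quit 1
    End If
    ' The same dictionary is reused for every record, so clear it first.
    d.RemoveAll
    Dim i
    For i = 0 To ms.Count - 1
       ' A key that occurs twice in a record keeps only its last value.
       d(ms(i).SubMatches(0)) = ms(i).SubMatches(1)
    Next
    Set parse = d
  End Function
End Class

' Split the file at "BEGIN:"; element 0 is the banner before the first record.
Dim aAll : aAll  = Split(CreateObject("Scripting.FileSystemObject").OpenTextFile(".\28099507.txt").ReadAll(), "BEGIN:")
Dim p    : Set p = New cParser
Dim i, d
For i = 1 To UBound(aAll)
  Set d = p.parse(aAll(i))
  ' Print the field names, then the values, as comma-separated lines.
  WScript.Echo Join(d.Keys(), ",")
  WScript.Echo Join(d.Items(), ",")
  WScript.Echo
Next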
Output:

cscript 28099507.vbs
DocTypeName,DocDate,Loan Number,Property Name,Borrower Name,Address,City,State,Document Handle,DiskgroupNum,VolumeNum,NumOfPages,FileSize,DocRevNum,Rendition,PhysicalPageNum,ItemPageNum,FileTypeNum,ImageType,Compress,Xdpi,Ydpi,FileName
System Maintenance Forms,3/18/2013,893102103,sample,sample,sample,DECATUR,GA,599954,102,110,0,303909,0,1,0,0,16,0,2,0,0,\V110\1366\798567.pdf

DocTypeName,DocDate,Loan Number,Property Name,Borrower Name,Address,City,State,Ad Hoc,Document Handle,DiskgroupNum,VolumeNum,NumOfPages,FileSize,DocRevNum,Rendition,PhysicalPageNum,ItemPageNum,FileTypeNum,ImageType,Compress,Xdpi,Ydpi,FileName
Post Closing Contact Sheet,3/18/2013,893102103,sample,sample,sample,DECATUR,GA,2013 02 26 BWR CONTACT INFO UPDATE FOORM,599983,102,110,0,276653,0,1,0,0,16,0,2,0,0,\V110\1366\798596.pdf

DocTypeName,DocDate,Policy Number,Effective Date,Expiration Date,Loan Number,Property Name,Borrower Name,Address,City,State,Document Handle,DiskgroupNum,VolumeNum,NumOfPages,FileSize,DocRevNum,Rendition,PhysicalPageNum,ItemPageNum,FileTypeNum,ImageType,Compress,Xdpi,Ydpi,FileName
Insurance ACORD,1/11/2008,91QB94439,01/24/2008,01/24/2009,893102103,sample,sample,3642 sample,DECATUR,GA,98326,102,24,1,74839,0,0,1,1,2,0,2,0,0,\V24\202\133225.TIF

DocTypeName,DocDate,Policy Number,Effective Date,Expiration Date,Loan Number,Property Name,Borrower Name,Address,City,State,Document Handle,DiskgroupNum,VolumeNum,NumOfPages,FileSize,DocRevNum,Rendition,PhysicalPageNum,ItemPageNum,FileTypeNum,ImageType,Compress,Xdpi,Ydpi,FileName
Insurance ACORD,1/2/2008,91QB94439,01/24/2008,01/24/2009,893102103,sample,sample,sample,DECATUR,GA,94202,102,23,1,78846,0,0,1,1,2,0,2,0,0,\V23\194\128851.TIF
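
Two things in this output matter for a real conversion. First, each record prints its own header, so the records do not yet share one set of columns. Second, for the two Insurance ACORD records the second `>>DiskgroupNum` to `>>FileName` block has overwritten the first (PhysicalPageNum and ItemPageNum show 1, not 0), because a dictionary keeps one value per key. The Python sketch below is one possible way to handle both, under two loudly stated assumptions: a repeated key starts an additional row (for example another page) that inherits the record's earlier fields, and the common header is simply the union of all keys in first-seen order, which need not match the column order the target system expects. `parse_rows`, `to_csv`, and `SAMPLE` are hypothetical names, and the sample is abbreviated from the question's data.

```python
import csv
import io
import re

# One "key: value" line; an optional ">>" prefix is dropped from the key.
FIELD = re.compile(r"^\s*(?:>>)?([^:\n]+):[ \t]*(.*?)\s*$", re.MULTILINE)

# Abbreviated from the question's sample; most fields are left out for brevity.
SAMPLE = r"""
        BEGIN:
        >>DocTypeName: System Maintenance Forms
        Loan Number: 893102103
        Document Handle: 599954
        >>PhysicalPageNum: 0
        >>FileName: \V110\1366\798567.pdf
        BEGIN:
        >>DocTypeName: Insurance ACORD
        Policy Number: 91QB94439
        Loan Number: 893102103
        Document Handle: 98326
        >>PhysicalPageNum: 0
        >>FileName: \V24\202\133225.TIF
        >>PhysicalPageNum: 1
        >>FileName: \V24\202\133225.TIF
"""


def parse_rows(text):
    """Yield one dict per record, plus one extra dict per repeated block."""
    for block in text.split("BEGIN:")[1:]:
        row, seen = {}, set()
        for match in FIELD.finditer(block):
            key, value = match.group(1), match.group(2)
            if key in seen:
                # Assumption: a repeated key opens a new block (e.g. a page),
                # so emit what has been collected so far and keep going.
                yield dict(row)
                seen.clear()
            seen.add(key)
            row[key] = value
        if row:
            yield row


def to_csv(rows):
    """Render rows under one header: the union of keys in first-seen order."""
    rows = list(rows)
    header = list(dict.fromkeys(key for row in rows for key in row))
    out = io.StringIO()
    writer = csv.DictWriter(
        out,
        fieldnames=header,
        restval="",             # fields a record lacks become empty strings
        quoting=csv.QUOTE_ALL,  # quote every value, as in the target format
        lineterminator="\n",
    )
    writer.writeheader()
    writer.writerows(rows)
    return out.getvalue()


print(to_csv(parse_rows(SAMPLE)))
```

For the 100 files, the same two functions could be applied per file, for instance by iterating over `pathlib.Path(folder).glob("*.txt")` and writing each result next to its source. Whether one row per page is actually wanted is a decision for the plan mentioned at the top of this answer.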

I don't see any question in this post…

Thanks for your reply, Ekkehard.Horner. As a newbie, please excuse my next question: is this code Excel, VB, or C#?

@user1926883 - the same as your tag: VBScript.