Regex >>>>Self Configuring Tagged DIP<<<< file conversion
I have 100 data files that I need help formatting so they look like this:
"DocTypeName","DocDate","Loan Number","Property Name","Borrower Name","Address","City","State","Document Handle","DiskgroupNum","VolumeNum","NumOfPages","FileSize","DocRevNum","Rendition","PhysicalPageNum","ItemPageNum","FileTypeNum","ImageType","Compress","Xdpi","Ydpi","FileName"
"System Maintenance Forms","3/18/2013","893102103","sample","sample","sample","DECATUR","GA","599954","102","110","0","303909","0","1","0","0","16","0","2","0","0","\V110\1366\798567.pdf"
"Post Closing Contact Sheet","3/18/2013","893102103","sample","sample","sample","DECATUR","GA","2013 02 26 BWR CONTACT INFO UPDATE FOORM","599983","102","110","0","276653","0","1","0","0","16","0","2","0","0","\V110\1366\798596.pdf"
The current files look like this:
>>>>Self Configuring Tagged DIP<<<<
BEGIN:
>>DocTypeName: System Maintenance Forms
>>DocDate: 3/18/2013
Loan Number: 893102103
Property Name: sample
Borrower Name: sample
Address: sample
City: DECATUR
State: GA
Document Handle: 599954
>>DiskgroupNum: 102
>>VolumeNum: 110
>>NumOfPages: 0
>>FileSize: 303909
>>DocRevNum: 0
>>Rendition: 1
>>PhysicalPageNum: 0
>>ItemPageNum: 0
>>FileTypeNum: 16
>>ImageType: 0
>>Compress: 2
>>Xdpi: 0
>>Ydpi: 0
>>FileName: \V110\1366\798567.pdf
BEGIN:
>>DocTypeName: Post Closing Contact Sheet
>>DocDate: 3/18/2013
Loan Number: 893102103
Property Name: sample
Borrower Name: sample
Address: sample
City: DECATUR
State: GA
Ad Hoc: 2013 02 26 BWR CONTACT INFO UPDATE FOORM
Document Handle: 599983
>>DiskgroupNum: 102
>>VolumeNum: 110
>>NumOfPages: 0
>>FileSize: 276653
>>DocRevNum: 0
>>Rendition: 1
>>PhysicalPageNum: 0
>>ItemPageNum: 0
>>FileTypeNum: 16
>>ImageType: 0
>>Compress: 2
>>Xdpi: 0
>>Ydpi: 0
>>FileName: \V110\1366\798596.pdf
BEGIN:
>>DocTypeName: Insurance ACORD
>>DocDate: 1/11/2008
Policy Number: 91QB94439
Effective Date: 01/24/2008
Expiration Date: 01/24/2009
Loan Number: 893102103
Property Name: sample
Borrower Name: sample
Address: 3642 sample
City: DECATUR
State: GA
Document Handle: 98326
>>DiskgroupNum: 102
>>VolumeNum: 24
>>NumOfPages: 1
>>FileSize: 74839
>>DocRevNum: 0
>>Rendition: 0
>>PhysicalPageNum: 0
>>ItemPageNum: 0
>>FileTypeNum: 2
>>ImageType: 0
>>Compress: 2
>>Xdpi: 0
>>Ydpi: 0
>>FileName: \V24\202\133225.TIF
>>DiskgroupNum: 102
>>VolumeNum: 24
>>NumOfPages: 1
>>FileSize: 74839
>>DocRevNum: 0
>>Rendition: 0
>>PhysicalPageNum: 1
>>ItemPageNum: 1
>>FileTypeNum: 2
>>ImageType: 0
>>Compress: 2
>>Xdpi: 0
>>Ydpi: 0
>>FileName: \V24\202\133225.TIF
BEGIN:
>>DocTypeName: Insurance ACORD
>>DocDate: 1/2/2008
Policy Number: 91QB94439
Effective Date: 01/24/2008
Expiration Date: 01/24/2009
Loan Number: 893102103
Property Name: sample
Borrower Name: sample
Address: sample
City: DECATUR
State: GA
Document Handle: 94202
>>DiskgroupNum: 102
>>VolumeNum: 23
>>NumOfPages: 1
>>FileSize: 78846
>>DocRevNum: 0
>>Rendition: 0
>>PhysicalPageNum: 0
>>ItemPageNum: 0
>>FileTypeNum: 2
>>ImageType: 0
>>Compress: 2
>>Xdpi: 0
>>Ydpi: 0
>>FileName: \V23\194\128851.TIF
>>DiskgroupNum: 102
>>VolumeNum: 23
>>NumOfPages: 1
>>FileSize: 78846
>>DocRevNum: 0
>>Rendition: 0
>>PhysicalPageNum: 1
>>ItemPageNum: 1
>>FileTypeNum: 2
>>ImageType: 0
>>Compress: 2
>>Xdpi: 0
>>Ydpi: 0
>>FileName: \V23\194\128851.TIF
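There is a similar question, but I can't see the answer:
Any help would be greatly appreciated. VB, SQL, or anything else.
Thanks…

Your sample file contains 5 records with at least 3 different structures. If you want to convert 100 files to (some kind of) .csv, you need a better, more detailed plan.
To get started on the parsing subtask: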
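Option Explicit

' Parses the text of one "BEGIN:" record into a dictionary of key/value pairs.
Class cParser
  Private r ' RegExp that matches one "key: value" line
  Private d ' Scripting.Dictionary, reused for every record

  Private Sub Class_Initialize()
    Set r = New RegExp
    r.Global = True
    r.MultiLine = True
    ' Leading whitespace, optional ">>" prefix, key up to the first colon, value up to the trailing CR
    r.Pattern = "^\s+(?:>>)?([^:]+):\s*(.+)\r$"
    Set d = CreateObject("Scripting.Dictionary")
  End Sub

  Public Function parse(s)
    Dim ms : Set ms = r.Execute(s)
    If 0 = ms.Count Then
      WScript.Echo "parse error"
      WScript.Echo s
      WScript.Quit 1
    End If
    d.RemoveAll
    Dim i
    For i = 0 To ms.Count - 1
      ' A repeated key (e.g. the second page block of a document) overwrites the earlier value
      d(ms(i).SubMatches(0)) = ms(i).SubMatches(1)
    Next
    Set parse = d
  End Function
End Class

' Split the file on "BEGIN:"; element 0 holds the banner line and is skipped below
Dim aAll : aAll = Split(CreateObject("Scripting.FileSystemObject").OpenTextFile(".\28099507.txt").ReadAll(), "BEGIN:")
Dim p : Set p = New cParser
Dim i, d
For i = 1 To UBound(aAll)
  Set d = p.parse(aAll(i))
  WScript.Echo Join(d.Keys(), ",")
  WScript.Echo Join(d.Items(), ",")
  WScript.Echo
Next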
Output:
cscript 28099507.vbs
DocTypeName,DocDate,Loan Number,Property Name,Borrower Name,Address,City,State,Document Handle,DiskgroupNum,VolumeNum,NumOfPages,FileSize,DocRevNum,Rendition,PhysicalPageNum,ItemPageNum,FileTypeNum,ImageType,Compress,Xdpi,Ydpi,FileName
System Maintenance Forms,3/18/2013,893102103,sample,sample,sample,DECATUR,GA,599954,102,110,0,303909,0,1,0,0,16,0,2,0,0,\V110\1366\798567.pdf

DocTypeName,DocDate,Loan Number,Property Name,Borrower Name,Address,City,State,Ad Hoc,Document Handle,DiskgroupNum,VolumeNum,NumOfPages,FileSize,DocRevNum,Rendition,PhysicalPageNum,ItemPageNum,FileTypeNum,ImageType,Compress,Xdpi,Ydpi,FileName
Post Closing Contact Sheet,3/18/2013,893102103,sample,sample,sample,DECATUR,GA,2013 02 26 BWR CONTACT INFO UPDATE FOORM,599983,102,110,0,276653,0,1,0,0,16,0,2,0,0,\V110\1366\798596.pdf

DocTypeName,DocDate,Policy Number,Effective Date,Expiration Date,Loan Number,Property Name,Borrower Name,Address,City,State,Document Handle,DiskgroupNum,VolumeNum,NumOfPages,FileSize,DocRevNum,Rendition,PhysicalPageNum,ItemPageNum,FileTypeNum,ImageType,Compress,Xdpi,Ydpi,FileName
Insurance ACORD,1/11/2008,91QB94439,01/24/2008,01/24/2009,893102103,sample,sample,3642 sample,DECATUR,GA,98326,102,24,1,74839,0,0,1,1,2,0,2,0,0,\V24\202\133225.TIF

DocTypeName,DocDate,Policy Number,Effective Date,Expiration Date,Loan Number,Property Name,Borrower Name,Address,City,State,Document Handle,DiskgroupNum,VolumeNum,NumOfPages,FileSize,DocRevNum,Rendition,PhysicalPageNum,ItemPageNum,FileTypeNum,ImageType,Compress,Xdpi,Ydpi,FileName
Insurance ACORD,1/2/2008,91QB94439,01/24/2008,01/24/2009,893102103,sample,sample,sample,DECATUR,GA,94202,102,23,1,78846,0,0,1,1,2,0,2,0,0,\V23\194\128851.TIF
I don't see any question in this post…
Thanks for your reply, Ekkehard.Horner. As a newbie, please forgive my next question: is this code Excel, VB, or C#?
@user1926883 - Same as your tag: VBScript.