Google cloud platform Google Cloud Vision-方法:files.annotate-响应对象更改
根据BoundingPoly中提供的文档,块数组中的BoundingPoly对象应具有如下格式Google cloud platform Google Cloud Vision-方法:files.annotate-响应对象更改,google-cloud-platform,google-cloud-vision,Google Cloud Platform,Google Cloud Vision,根据BoundingPoly中提供的文档,块数组中的BoundingPoly对象应具有如下格式 { “顶点”:[ { 对象(顶点) } ], “规格化顶点”:[ { 对象(规格化顶点) } ] } 但是当我们尝试https://vision.googleapis.com/v1/files:annotate?key=xxxxxx要使用请求对PDF文件执行OCR,请执行以下操作: { "requests": [{ "inputConfig": { "
{
“顶点”:[
{
对象(顶点)
}
],
“规格化顶点”:[
{
对象(规格化顶点)
}
]
}
但是当我们尝试https://vision.googleapis.com/v1/files:annotate?key=xxxxxx
要使用请求对PDF文件执行OCR,请执行以下操作:
{
"requests": [{
"inputConfig": {
"content": "encoded content",
"mimeType": "application/pdf"
},
"features": [{
"type": "DOCUMENT_TEXT_DETECTION",
"maxResults": 50
}]
}]
}
来自服务器的响应为
{
"responses": [
{
"responses": [
{
"fullTextAnnotation": {
"pages": [
{
"property": {
"detectedLanguages": [
{
"languageCode": "en",
"confidence": 0.65
},
{
"languageCode": "fil",
"confidence": 0.01
}
]
},
"width": 841,
"height": 595,
"blocks": [
{
"property": {
"detectedLanguages": [
{
"languageCode": "en",
"confidence": 1
}
]
},
"boundingBox": {
"normalizedVertices": [
{
"x": 0.4351962,
"y": 0.057142857
},
{
"x": 0.6052319,
"y": 0.057142857
},
{
"x": 0.6052319,
"y": 0.08571429
},
{
"x": 0.4351962,
"y": 0.08571429
}
]
},
"paragraphs": [
{
"property": {
"detectedLanguages": [
{
"languageCode": "en",
"confidence": 1
}
]
},
"boundingBox": {
"normalizedVertices": [
{
"x": 0.4351962,
"y": 0.057142857
},
{
"x": 0.6052319,
"y": 0.057142857
},
{
"x": 0.6052319,
"y": 0.08571429
},
{
"x": 0.4351962,
"y": 0.08571429
}
]
},
"words": [
{
"property": {
"detectedLanguages": [
{
"languageCode": "en"
}
]
},
"boundingBox": {
"normalizedVertices": [
{
"x": 0.4351962,
"y": 0.057142857
},
{
"x": 0.49346018,
"y": 0.057142857
},
{
"x": 0.49346018,
"y": 0.08571429
},
{
"x": 0.4351962,
"y": 0.08571429
}
]
},
"symbols": [
{
"property": {
"detectedLanguages": [
{
"languageCode": "en"
}
]
},
"text": "F",
"confidence": 0.99
},
{
"property": {
"detectedLanguages": [
{
"languageCode": "en"
}
]
},
"text": "a",
"confidence": 1
},
{
"property": {
"detectedLanguages": [
{
"languageCode": "en"
}
]
},
"text": "c",
"confidence": 0.99
},
{
"property": {
"detectedLanguages": [
{
"languageCode": "en"
}
]
},
"text": "t",
"confidence": 0.99
},
{
"property": {
"detectedLanguages": [
{
"languageCode": "en"
}
]
},
"text": "o",
"confidence": 1
},
{
"property": {
"detectedLanguages": [
{
"languageCode": "en"
}
]
},
"text": "r",
"confidence": 1
},
{
"property": {
"detectedLanguages": [
{
"languageCode": "en"
}
],
"detectedBreak": {
"type": "SPACE"
}
},
"text": "y",
"confidence": 1
}
],
"confidence": 0.99
},
{
"property": {
"detectedLanguages": [
{
"languageCode": "en"
}
]
},
"text": "i",
"confidence": 0.99
},
{
"property": {
"detectedLanguages": [
{
"languageCode": "en"
}
]
},
"text": "n",
"confidence": 1
},
{
"property": {
"detectedLanguages": [
{
"languageCode": "en"
}
],
"detectedBreak": {
"type": "SPACE"
}
},
"text": "g",
"confidence": 1
}
],
"confidence": 0.99
},
{
"property": {
"detectedLanguages": [
{
"languageCode": "en"
}
]
},
"boundingBox": {
"normalizedVertices": [
{
"x": 0.57431626,
"y": 0.057142857
},
{
"x": 0.6052319,
"y": 0.057142857
},
{
"x": 0.6052319,
"y": 0.08571429
},
{
"x": 0.57431626,
"y": 0.08571429
}
]
},
"symbols": [
{
"property": {
"detectedLanguages": [
{
"languageCode": "en"
}
]
},
"text": "L",
"confidence": 0.99
},
{
"property": {
"detectedLanguages": [
{
"languageCode": "en"
}
]
},
"text": "i",
"confidence": 0.99
},
{
"property": {
"detectedLanguages": [
{
"languageCode": "en"
}
]
},
"text": "s",
"confidence": 0.99
},
{
"property": {
"detectedLanguages": [
{
"languageCode": "en"
}
],
"detectedBreak": {
"type": "LINE_BREAK"
}
},
"text": "t",
"confidence": 1
}
],
"confidence": 0.99
}
],
"confidence": 0.99
}
],
"blockType": "TEXT",
"confidence": 0.99
}
如果BoundingPoly对象(boundingBox)中缺少顶点对象,是否需要考虑什么
在拖放中尝试时,对图像执行的OCR的json响应是
"fullTextAnnotation": {
"pages": [
{
"blocks": [
{
"blockType": "TEXT",
"boundingBox": {
"vertices": [
{
"x": 31,
"y": 63
},
{
"x": 147,
"y": 63
},
{
"x": 147,
"y": 81
},
{
"x": 31,
"y": 81
}
]
},
"confidence": 0.99,
"paragraphs": [
{
"boundingBox": {
"vertices": [
{
"x": 31,
"y": 63
},
{
"x": 147,
"y": 63
},
{
"x": 147,
"y": 81
},
{
"x": 31,
"y": 81
}
]
},
这是预期的行为还是任何问题?我们应该遵循哪个字段规范化顶点或顶点 不同之处在于,在从代码发出的请求中,您发送的是PDF。在拖放演示中,您正在发送图像(演示不接受文件) 我复制了这一点,其行为似乎是不变的:PDF文件使用规范化顶点进行注释,而图像使用顶点进行注释。我的猜测是,这是旨在提高大型PDF文件注释请求性能的行为(由于页面数量较大)
我向谷歌文档发送了一个请求,以便他们可以在文档中添加此信息。不同之处在于,在根据代码发出的请求中,您发送的是PDF。在拖放演示中,您正在发送图像(演示不接受文件) 我复制了这一点,其行为似乎是不变的:PDF文件使用规范化顶点进行注释,而图像使用顶点进行注释。我的猜测是,这是旨在提高大型PDF文件注释请求性能的行为(由于页面数量较大) 我向谷歌文档发送了一个请求,以便他们可以在文档中添加这些信息