Google bigquery 在BigQuery？；中，使用什么算法来实现近似的TOP计数；_Google Bigquery_Approximation - Fatal编程技术网

Google bigquery 在BigQuery？；中，使用什么算法来实现近似的TOP计数；

google-bigquery

Google bigquery 在BigQuery？；中，使用什么算法来实现近似的TOP计数；,google-bigquery,approximation,Google Bigquery,Approximation,BigQuery表示，近似聚合函数在内存使用和时间方面是可伸缩的，但会产生近似结果，而不是精确结果我在钻孔机或蜂箱中找不到任何类似的功能。使用集群计算，我们可以很容易地得到准确的结果，为什么以及何时应该使用这个近似函数？我还希望有人能告诉我近似函数使用的算法是什么？近似函数有用的一个例子是分析Firebase事件日志（关于StackOverflow上的BigQuery/Firebase有很多问题）。例如，如果您只想知道访问量最大的前10个页面，您可以使用APPROX\u top\u COU

BigQuery表示，近似聚合函数在内存使用和时间方面是可伸缩的，但会产生近似结果，而不是精确结果

我在钻孔机或蜂箱中找不到任何类似的功能。使用集群计算，我们可以很容易地得到准确的结果，为什么以及何时应该使用这个近似函数？

我还希望有人能告诉我近似函数使用的算法是什么？

近似函数有用的一个例子是分析Firebase事件日志（关于StackOverflow上的BigQuery/Firebase有很多问题）。例如，如果您只想知道访问量最大的前10个页面，您可以使用

APPROX\u top\u COUNT

来执行分析，这通常比

COUNT（*）和groupby
和ORDER BY的。。。限制…

从实现的角度来看，您可以想象，如果您只对访问量最大的前10个页面感兴趣，那么可能没有必要在内存中为不经常访问的页面的长尾保留状态，因为它稍后将被丢弃
您可以在以下论文中阅读近似算法：



Thx对于您的答案，我已经使用bigquery测试了bigquery的publicdata（1108779463行）上的近似TOP计数。结果是近似TOP计数和计数（*），带有GROUP BY和ORDER BY。。。限制两个2秒的时间。你能告诉我一些关于这两个查询性能的例子吗？




[azure]相关文章推荐



                                                        
Microsoft Azure服务总线主题计费
azure 
数据传输在my Azure项目设置中的成本
azureazure-sql-database 
Azure父/子差异磁盘
azure 
Azure 用于Windows Server 2008 R2保险库的服务总线1.0
azure 
如何在Azure中使用相同的URL运行网站？
azure 
获取Azure worker角色运行（）上的应用程序路径
azure 
Azure 利息；“现有单点登录”；在检修面板上
azureazure-active-directory 
通过SQL检索服务器上所有数据库的SQL Azure服务级别
azureazure-sql-database 
Azure 如何在edX平台中修改激活电子邮件内容？
azure 
通过任何方式，我们都可以通过传递用户名和http请求从Azure ADAL获得访问令牌
azureactive-directorymicrosoft-graph-api 
Azure 创建虚拟机并将虚拟机与现有虚拟网络关联
azure 
Azure函数主机密钥限制？
azureazure-functions 
在Azure中创建开发/登台环境
azurecontinuous-integrationazure-devops 
我可以在Azure函数中缓存单个值而不产生任何负面影响吗？
azureazure-functions 
订阅Azure云合作伙伴门户
azureazure-functions 
使用Azure devops REST API获取github存储库
azurerestgithubazure-devopspostman 
Azure Devops权限服务与特定发布管道的连接
azureazure-devops 
将Azure BLOB容器访问权分配给用户
azure 
2021年，导出Azure Automation AzureRunAsConnection用于本地调试的证书的最快/最简单的方法是什么？
azurepowershell 
重定向\u URI \u与Azure AD B2C Web应用程序不匹配（基于Python/Flask）
azureazure-active-directory 
                                       





随机文章推荐



                                                        
Post 将数据发布到另一台服务器上的表单并弹出一个窗口
post 
哪个免费的嵌入式web服务器可以处理*非常大*的POST请求？
postfile-upload 
Javascript-内置类似于POST URL上的参数
postparameters 
从浏览器控件获取POST变量
postbrowserwindows-phone-8 
HTTP请求POST。通过JaspeReports服务器上载JRXML文件
post 
Post 使用jquery访问json数据（由php返回）
postjson 
playFramework路由不处理我的post请求，但它在路由文件中声明
postplayframework-2.0 
Post 从移动客户端到Meteor服务器的简单请求
postmobilemeteor 
Post 如何邮寄表格？
post 
Post 如何确定在Wordpress中设置摘录长度的位置？
postwordpress 
Post 从objective-c向web服务发布词典
postdictionary 
Post 当我在nginx中设置worker_进程=8时，为什么会有这么多CLOSE_WAIT stage？
postnginxredis 
Post WebApi客户端。邮差
post 
Post 为什么我的HTTP.call（）不工作？
postmeteor 
Post Google Analitics测量协议-如何分割电子商务负载？
postgoogle-analyticse-commerce 
在具有SuiteScript 2.0的NetSuite中，无法发送包含内容类型为multipart/form data的HTTP POST请求的文件
posthttpsnetsuite 
Laravel$请求->；all（）为空，但$\u POST将以正确的形式返回实际发布的数据
post


                                        

                                        
                                        


                                                
                                                        [google bigquery]相关推荐
                                                        
Google bigquery 在BigQuery中重命名数据集
									Google Bigquery
							 
Google bigquery BQ加载作业全部处于挂起状态
									Google Bigquery
							 
Google bigquery BQ shell加载数据存储时出错，write_处置为write append
									Google Bigquery
							 
Google bigquery Google BigQuery上的数百万个表
									Google Bigquery
							 
Google bigquery Google BigQuery：性能（详细信息）
									Google Bigquery
							 
Google bigquery （解析）插入到bigquery的行数输入不相同
									Google Bigquery
							 
Google bigquery 从谷歌云存储下载文件随机损坏
									Google Bigquery
							 									Google Cloud Storage
							 
Google bigquery BigQuery API-totalBytesProcessed返回0
									Google Bigquery
							 
Google bigquery BigQuery-将多行连接成一行
									Google Bigquery
							 
Google bigquery BigQuery—6年订单迁移、表/查询设计
									Google Bigquery
							 
Google bigquery 云数据流：如何使用谷歌为PubSub提供的模板进行BigQuery
									Google Bigquery
							 									Google Cloud Dataflow
							 
Google bigquery 用户在项目gdelt bq中没有bigquery.jobs.create权限
									Google Bigquery
							 
Google bigquery 最大重复值
									Google Bigquery
							 
Google bigquery “按表达式聚类必须是可分组的，但类型为STRUCT”错误
									Google Bigquery
							 
Google bigquery Google BigQuery相当于MySQL多表内部联接
									Google Bigquery
							 
Google bigquery 如何使用Google BigQuery导出到电子表格？
									Google Bigquery
							 
Google bigquery Google Bigquery连接速度非常慢
									Google Bigquery
							 
Google bigquery 如果通过保留期删除表，Bigquery是否收费？
									Google Bigquery
							 
Google bigquery 基于字段位置而非名称的BigQuery联合架构
									Google Bigquery
							 
Google bigquery BigQuery中json对象的最新字符串化数组
									Google Bigquery
							 
Google bigquery 组合两个BigQuery查询的最佳方式（处理方式）是什么？
									Google Bigquery
							 
Google bigquery 加载新数据时如何向bigquery表添加字段
									Google Bigquery
							 
Google bigquery 需要上一年特定日期范围的结果-bigquery
									Google Bigquery
							 
Google bigquery 银行家'；BigQuery的舍入
									Google Bigquery
							 
Google bigquery 如何比较Google Bigquery中的上一行和当前行？
									Google Bigquery
							 
Google bigquery BigQuery数据仓库/实时数据/配额
									Google Bigquery
							 
Google bigquery GCP的电力BI服务（BigQuery）
									Google Bigquery
							 									Powerbi
							 
Google bigquery 如何使用ApacheBeamAPI读取具有最新分区的BigQuery？
									Google Bigquery
							 									Google Cloud Dataflow
							 
Google bigquery ST_GEOGFROMTEXT函数中的问题
									Google Bigquery
							 
Google bigquery 递归地遍历文件夹，并将每个文件夹中的csv文件加载到BigQuery中
									Google Bigquery
							 
                                                        
                                                

                                                
                                                        Tags
                                                        
Menu
Sql Server 2008
Exchange Server
D
Jira
Terminal
Magento2
Iis 7
Single Sign On
Jquery Mobile
Flutter
Windows Mobile
Liferay
Map
Sharepoint 2010
Tridion
Webstorm
Asp.net
Keyboard
Ios5
Pycharm
X86
Google Apps Script
Stripe Payments
Entity Framework
Blockchain
Http
Sed
Actions On Google
Uitableview
Parsing
Ruby On Rails 4
Security
Wix
Ms Office
Xamarin.android
Parse Platform
R
Testng
Html5 Canvas
Biztalk
Asterisk
Cloud
Ide
Printing
Virtual Machine
Xpath
Mariadb
Ibm Mobilefirst
Karate
Nativescript
Recursion
Log4net
Vector
Javascript
Sparql
Web Crawler
Android Fragments
Vbscript
Magento
Cypress
Server
Object
Compression
Android Studio
Image Processing
Virtualbox
Cloud Foundry
Xamarin.ios
Deployment
Julia
Rxjs
Microservices
File Upload
Actionscript
Sbt
Jvm
Google Calendar Api
Mongoose
Jpa
Jupyter Notebook
Isabelle
Teradata
Nginx
Localization
Db2
Notepad++
Identityserver4
Iframe
Udp
Libgdx
Asp.net Mvc 4
Azure Service Fabric
Gruntjs
Shopify
Dependencies
Xslt
EmptyTag
Outlook
Wolfram Mathematica
Mapping
Git
Oracle Apex
Coq
Blazor
Processing
Drupal
Asp.net Mvc
Google Maps Api 3
Svg
Centos
Chart.js
Vue.js
Web Services
Wso2
Teamcity
Visual Studio 2008
Memory Management
Smtp
Domain Driven Design
Cobol
Activemq
Ssh
Unix
Push Notification
Content Management System
Documentation
Windbg
Doctrine Orm
Doxygen
Unicode
Pip
Openssl
Permissions
Apache Pig
Nuget
Prolog
Vb.net
Migration
Nhibernate
Sprite Kit
Architecture
Geolocation
Keycloak
Listview
Email
Testing
Acumatica
Jquery Plugins
Java 8
Reflection
Jekyll
Open Source
Selenium Webdriver
Netty
Kdb
Video Streaming
Elm
Vaadin
Google Colaboratory
Jsf 2
Database Design
Raspberry Pi
Intellij Idea
Amazon S3
Fluent Nhibernate
Neo4j
Visual Studio
Clojure
Tags
Three.js
Transactions
Lambda
Zend Framework
Racket
Discord.py
Material Ui
Workflow
Configuration
Frameworks
Ms Access
Caching
Command Line
Windows Phone 8.1
Mqtt
Batch File
Telegram
Logic
Ssl
Sdk
String
Authentication
Antlr
Encoding
Gitlab
Winforms
Akka
Silverlight
Parallel Processing
View
Hadoop


                

                        
						
                        
                                
                                        
                                                
                                                        
                                                                Copyright © 2024. All Rights Reserved by  - Fatal编程技术网