Python: How to speed up iterating over a large dictionary (Python, Loops, Dictionary) - Fatal编程技术网


python loops dictionary


I have a dictionary whose key-value pairs are sentence IDs and cluster IDs, respectively. It has this format:

{sentence_ID: cluster_ID}

For example:

my_id_dict:
    {0: 71, 
    1: 63, 
    2: 66, 
    3: 92, 
    4: 49, 
    5: 85
      .
      .}

In total, I have more than 200,000 sentence IDs and 100 cluster IDs.

I am trying to loop over my_id_dict to build a list of sentence IDs for each cluster.
Example of the output I want:

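    Cluster 0 [63, 71, 116, 168, 187, 231, 242, 290, 330, 343]
    Cluster 1 [53, 107, 281, 292, 294, 313, 353, 392, 405, 479]

This is the code I am using.
The logic is: for each cluster, create a list of sentences; then, for every cluster_id among the 200,000-plus dict values, if the value equals the current cluster index, append the corresponding sentence ID to the list.
This is repeated 100 times.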

    # Python 2 code, as posted in the question
    from collections import defaultdict

    cluster_dict = defaultdict(list)
    num_clusters = 100
    for cluster in xrange(0, num_clusters):
        print "\nCluster %d" % cluster
        sentences = []
        # Scans every dict entry again for each of the 100 clusters
        for i in xrange(0, len(my_id_dict.values())):
            if my_id_dict.values()[i] == cluster:
                sentences.append(my_id_dict.keys()[i])
        cluster_dict[cluster] = sentences
        print sentences[:10]

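This works, but it is extremely slow. Is there a faster way to do this?

You are checking every sentence once for every cluster. Instead, check each sentence only once and assign it to its cluster: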

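    from collections import defaultdict

    cluster_dict = defaultdict(list)
    # Single pass: each sentence is appended to the list of its own cluster
    for sentence, cluster in my_id_dict.items():
        cluster_dict[cluster].append(sentence)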
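For reference, here is a minimal Python 3 sketch of the same single-pass grouping that can be run on its own. The helper name `group_by_cluster` and the small sample dictionary are assumptions made for illustration; they are not part of the original post.

```python
from collections import defaultdict


def group_by_cluster(id_to_cluster):
    """Map each cluster ID to the list of sentence IDs assigned to it."""
    clusters = defaultdict(list)
    # One pass: each (sentence, cluster) pair is visited exactly once.
    for sentence, cluster in id_to_cluster.items():
        clusters[cluster].append(sentence)
    return clusters


# Hypothetical sample data in the {sentence_ID: cluster_ID} format.
my_id_dict = {0: 71, 1: 63, 2: 66, 3: 92, 4: 49, 5: 85, 6: 63, 7: 71}

cluster_dict = group_by_cluster(my_id_dict)
for cluster in sorted(cluster_dict):
    # Show at most the first 10 sentence IDs, as in the question.
    print("Cluster %d" % cluster, cluster_dict[cluster][:10])
```

With 200,000 sentences this loop does roughly 200,000 appends, instead of the 100 × 200,000 comparisons made by the nested loops. In Python 2, each call to `my_id_dict.values()` or `my_id_dict.keys()` inside the inner loop also builds a fresh list, which likely accounts for much of the slowdown. On Python 3.7+, where dictionaries preserve insertion order, each list holds its sentence IDs in the order they were inserted; sort the lists if ascending order is required regardless of insertion order.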

