Concurrency Clojure'；是否为URL获取操作生成pmap函数？_Concurrency_Clojure_Pmap

Concurrency Clojure'；是否为URL获取操作生成pmap函数？

concurrency clojure

Concurrency Clojure'；是否为URL获取操作生成pmap函数？,concurrency,clojure,pmap,Concurrency,Clojure,Pmap,关于pmap函数的文档让我想知道，对于通过web获取XML提要集合这样的东西，它的效率有多高。我不知道pmap会产生多少并发获取操作，最大值是多少。如果您检查源代码，您会看到： > (use 'clojure.repl) > (source pmap) (defn pmap "Like map, except f is applied in parallel. Semi-lazy in that the parallel computation stays ahead of

关于

pmap

函数的文档让我想知道，对于通过web获取XML提要集合这样的东西，它的效率有多高。我不知道pmap会产生多少并发获取操作，最大值是多少。

如果您检查源代码，您会看到：

> (use 'clojure.repl)
> (source pmap)
(defn pmap
  "Like map, except f is applied in parallel. Semi-lazy in that the
  parallel computation stays ahead of the consumption, but doesn't
  realize the entire result unless required. Only useful for
  computationally intensive functions where the time of f dominates
  the coordination overhead."
  {:added "1.0"}
  ([f coll]
   (let [n (+ 2 (.. Runtime getRuntime availableProcessors))
         rets (map #(future (f %)) coll)
         step (fn step [[x & xs :as vs] fs]
                (lazy-seq
                 (if-let [s (seq fs)]
                   (cons (deref x) (step xs (rest s)))
                   (map deref vs))))]
     (step rets (drop n rets))))
  ([f coll & colls]
   (let [step (fn step [cs]
                (lazy-seq
                 (let [ss (map seq cs)]
                   (when (every? identity ss)
                     (cons (map first ss) (step (map rest ss)))))))]
     (pmap #(apply f %) (step (cons coll colls))))))

（+2（…Runtime getRuntime availableProcessors））

是一条重要线索。pmap将获取第一批

（+2个处理器）

工件，并通过

未来

异步运行它们。因此，如果你有2个内核，它将一次启动4个工作，试图保持领先一点，但最大值应为2+n

future

最终使用代理I/O线程池，该线程池支持无限数量的线程。它将随着工作的进行而增长，如果线程未使用，它将收缩

基于Alex解释pmap工作原理的出色回答，以下是我对您的情况的建议：

(doall
  (map
    #(future (my-web-fetch-function %))
    list-of-xml-feeds-to-fetch))

理由：

您需要尽可能多的飞行中的工作，因为大多数都会阻塞网络IO
Future将为每个请求启动一个异步工作，在线程池中处理。你可以让Clojure聪明地处理这个问题
地图上的doall将强制评估整个序列（即启动所有请求）
您的主线程可以立即开始取消对未来的引用，因此可以在单个结果返回时继续取得进展

（defn samplef[n]
（打印项次“开始”n）
（线程/睡眠10000）
n）
（def结果（pmap样本（范围0-100）））

pmap

fn

deref