Python 对阵列应用多个掩码

Python 对阵列应用多个掩码,python,arrays,filter,numpy,Python,Arrays,Filter,Numpy,我在解释我实际上希望做什么,以防有更高层次的建议完全排除这个问题 我将科学数据存储在三个数组中:wave,flux,error。这些表示波长、通量和误差值。 阵列大约有4000个元素长(阵列的索引编号对应于探测器的像素编号) 我做了各种各样的测试,但是对于这个例子,让我们假设我做了两个测试,需要有效地屏蔽相关的数组 masks = [] masks.append(wave > 5500.35) masks.append(flux / wave > 8.5) 子问题: 我可以很容易地

我在解释我实际上希望做什么,以防有更高层次的建议完全排除这个问题

我将科学数据存储在三个数组中:
wave
flux
error
。这些表示波长、通量和误差值。 阵列大约有4000个元素长(阵列的索引编号对应于探测器的像素编号)

我做了各种各样的测试,但是对于这个例子,让我们假设我做了两个测试,需要有效地屏蔽相关的数组

masks = []
masks.append(wave > 5500.35)
masks.append(flux / wave > 8.5)
子问题: 我可以很容易地做两个面具的情况,如:

fullmask = [x[0] and x[1] for x in zip(masks[0], masks[1])]
但是,对于任意数量的面具,该怎么做呢

真正的问题: 是否有方法将所有掩码应用于每个阵列(
wave
flux
error
),并保留原始索引号?“保留原始索引数”的意思是,原则上,我可以取屏蔽波阵列的平均像素数(原始索引数)?也就是说:如果
wave[98:99]
是唯一未被遮罩的部分,则平均像素数为98.5

元问题:这是做这些事情的最佳方式吗


编辑

下面是一些示例数据

wave = array([5000, 5001, 5002, 5003, 5004, 5005, 5006, 5007, 5008, 5009, 5010,
   5011, 5012, 5013, 5014, 5015, 5016, 5017, 5018, 5019, 5020, 5021,
   5022, 5023, 5024, 5025, 5026, 5027, 5028, 5029, 5030, 5031, 5032,
   5033, 5034, 5035, 5036, 5037, 5038, 5039, 5040, 5041, 5042, 5043,
   5044, 5045, 5046, 5047, 5048, 5049, 5050, 5051, 5052, 5053, 5054,
   5055, 5056, 5057, 5058, 5059, 5060, 5061, 5062, 5063, 5064, 5065,
   5066, 5067, 5068, 5069, 5070, 5071, 5072, 5073, 5074, 5075, 5076,
   5077, 5078, 5079, 5080, 5081, 5082, 5083, 5084, 5085, 5086, 5087,
   5088, 5089, 5090, 5091, 5092, 5093, 5094, 5095, 5096, 5097, 5098,
   5099])

flux = array([ 112.65878609,  109.2008992 ,  113.30629929,  117.17002715,
   103.19663878,  110.42131523,  106.00841123,  100.27882741,
   103.89160905,  102.29402469,  105.58894696,  103.21314852,
    96.97242814,  106.70130478,  108.83891225,  110.60598803,
    95.10361887,  109.39734257,  103.08289878,  104.97258911,
    96.46606257,  106.75993458,   99.25386914,  105.91429417,
   105.83752232,  100.53312657,   99.74871394,  107.12735837,
   108.81187473,   96.51418895,   99.71311101,   94.08702553,
    98.81198643,   93.84567201,  103.21444519,   94.7027134 ,
    99.61842203,  103.71336458,  100.8697998 ,   92.1564786 ,
    96.56711985,   94.7728761 ,   82.65194671,   83.52280884,
    86.57960844,   73.6700194 ,   66.11794666,   61.01624627,
    63.19944529,   55.50283247,   62.09172307,   59.55436092,
    75.66399466,   70.69397378,   64.27899192,   73.80248662,
    89.17119606,   78.97024327,   82.3334254 ,  100.82581489,
   102.77937201,   99.37717696,   96.2215563 ,  104.52291339,
    93.7581944 ,   93.32154346,  103.57018896,  108.08682518,
   105.2711359 ,  100.00242988,  100.86934866,  103.20764384,
   104.19274473,  101.3314802 ,  102.75057114,   94.02347591,
    95.48758551,  106.0099397 ,   99.50733501,   97.88110415,
   107.54266965,  107.76126331,   98.14882302,  101.55654606,
   101.02418212,  106.82324958,   95.52086925,  102.65957133,
   104.93806492,  103.22762427,  108.02087993,  106.71911141,
    97.24396195,  103.3450277 ,  113.99870588,  106.4145751 ,
   110.08294674,  109.40908288,  118.61518086,  114.37341062])

error = array([ 11.72799338,  22.33423611,  16.89347382,  12.80063102,
   23.99242356,  25.15863754,  20.44765811,  14.84358628,
   19.16343785,  19.5703491 ,  18.44427035,  19.08648083,
   19.09116433,  12.22098884,  14.81280352,  11.35010222,
   18.59850136,  15.78855734,  21.85877638,  20.12179042,
   22.04894395,  21.986731  ,  13.26738352,  16.10987762,
   24.28528627,  30.11866128,  25.30220842,  25.02100014,
   29.38560916,  16.8192307 ,  29.15097205,  23.56805267,
   15.17285709,  18.27495747,  18.63750452,  18.61618504,
   11.45940025,  21.95805701,  24.22923951,  11.76824052,
   19.75465065,  14.72979889,  15.45936176,  14.73227474,
   28.91683627,  22.90534472,  16.82376093,  21.47830226,
   20.05012214,  16.74393817,  17.79456361,  20.80008233,
   19.32059989,  23.23471888,  13.77434964,  17.56121752,
   15.96716163,  18.5294016 ,  28.31005939,  13.66340359,
   10.38160267,  16.09621015,  18.25125683,  20.95954331,
   21.31996941,  24.51998489,  16.58831953,  15.25427142,
   23.93065281,  30.4552266 ,  16.94527367,  16.92730802,
   17.79659417,  18.85080572,  18.0839428 ,  23.93949481,
   26.60243553,  13.68320208,  16.74669921,  20.30238694,
   12.74773905,  19.20810456,  20.7189417 ,  20.73402554,
   17.12106905,  25.06475175,  13.0947528 ,  28.16437938,
   22.4803386 ,  13.71143627,   6.60617725,  20.41186825,
   23.54924934,  22.25930658,  20.09337438,  24.94705884,
   18.58056249,   5.58653271,  18.71242702,  17.83578444])


# How I created masks, or just jump to next comment if it's too painful to look at...
masks = []
masks.append(flux/error > 4.0) # high error
absorptionMask1 = (wave < 5060)
absorptionMask2 = (wave > 5040)
bob = [all(x) for x in zip(absorptionMask1, absorptionMask2)]
absorptionMask = ~np.array(bob)
masks.append(absorptionMask) 

# The resulting mask
masks = [array([ True,  True,  True,  True,  True,  True,  True,  True,  True,
       True,  True,  True,  True,  True,  True,  True,  True,  True,
       True,  True,  True,  True,  True,  True,  True, False, False,
       True, False,  True, False, False,  True,  True,  True,  True,
       True,  True,  True,  True,  True,  True,  True,  True, False,
      False, False, False, False, False, False, False, False, False,
       True,  True,  True,  True, False,  True,  True,  True,  True,
       True,  True, False,  True,  True,  True, False,  True,  True,
       True,  True,  True, False, False,  True,  True,  True,  True,
       True,  True,  True,  True,  True,  True, False,  True,  True,
       True,  True,  True,  True,  True,  True,  True,  True,  True,  True], dtype=bool),
array([ True,  True,  True,  True,  True,  True,  True,  True,  True,
       True,  True,  True,  True,  True,  True,  True,  True,  True,
       True,  True,  True,  True,  True,  True,  True,  True,  True,
       True,  True,  True,  True,  True,  True,  True,  True,  True,
       True,  True,  True,  True,  True, False, False, False, False,
      False, False, False, False, False, False, False, False, False,
      False, False, False, False, False, False,  True,  True,  True,
       True,  True,  True,  True,  True,  True,  True,  True,  True,
       True,  True,  True,  True,  True,  True,  True,  True,  True,
       True,  True,  True,  True,  True,  True,  True,  True,  True,
       True,  True,  True,  True,  True,  True,  True,  True,  True,  True], dtype=bool)]


# More in a bit, should get you a feel for what I'm looking at. 
wave=array([5000,5001,5002,5003,5004,5005,5006,5007,5008,5009,5010,
5011, 5012, 5013, 5014, 5015, 5016, 5017, 5018, 5019, 5020, 5021,
5022, 5023, 5024, 5025, 5026, 5027, 5028, 5029, 5030, 5031, 5032,
5033, 5034, 5035, 5036, 5037, 5038, 5039, 5040, 5041, 5042, 5043,
5044, 5045, 5046, 5047, 5048, 5049, 5050, 5051, 5052, 5053, 5054,
5055, 5056, 5057, 5058, 5059, 5060, 5061, 5062, 5063, 5064, 5065,
5066, 5067, 5068, 5069, 5070, 5071, 5072, 5073, 5074, 5075, 5076,
5077, 5078, 5079, 5080, 5081, 5082, 5083, 5084, 5085, 5086, 5087,
5088, 5089, 5090, 5091, 5092, 5093, 5094, 5095, 5096, 5097, 5098,
5099])
通量=阵列([112.65878609109.2008992113.3062992117.17002715,
103.19663878,  110.42131523,  106.00841123,  100.27882741,
103.89160905,  102.29402469,  105.58894696,  103.21314852,
96.97242814,  106.70130478,  108.83891225,  110.60598803,
95.10361887,  109.39734257,  103.08289878,  104.97258911,
96.46606257,  106.75993458,   99.25386914,  105.91429417,
105.83752232,  100.53312657,   99.74871394,  107.12735837,
108.81187473,   96.51418895,   99.71311101,   94.08702553,
98.81198643,   93.84567201,  103.21444519,   94.7027134 ,
99.61842203,  103.71336458,  100.8697998 ,   92.1564786 ,
96.56711985,   94.7728761 ,   82.65194671,   83.52280884,
86.57960844,   73.6700194 ,   66.11794666,   61.01624627,
63.19944529,   55.50283247,   62.09172307,   59.55436092,
75.66399466,   70.69397378,   64.27899192,   73.80248662,
89.17119606,   78.97024327,   82.3334254 ,  100.82581489,
102.77937201,   99.37717696,   96.2215563 ,  104.52291339,
93.7581944 ,   93.32154346,  103.57018896,  108.08682518,
105.2711359 ,  100.00242988,  100.86934866,  103.20764384,
104.19274473,  101.3314802 ,  102.75057114,   94.02347591,
95.48758551,  106.0099397 ,   99.50733501,   97.88110415,
107.54266965,  107.76126331,   98.14882302,  101.55654606,
101.02418212,  106.82324958,   95.52086925,  102.65957133,
104.93806492,  103.22762427,  108.02087993,  106.71911141,
97.24396195,  103.3450277 ,  113.99870588,  106.4145751 ,
110.08294674,  109.40908288,  118.61518086,  114.37341062])
错误=数组([11.72799338,22.33423611,16.89347382,12.80063102,
23.99242356,  25.15863754,  20.44765811,  14.84358628,
19.16343785,  19.5703491 ,  18.44427035,  19.08648083,
19.09116433,  12.22098884,  14.81280352,  11.35010222,
18.59850136,  15.78855734,  21.85877638,  20.12179042,
22.04894395,  21.986731  ,  13.26738352,  16.10987762,
24.28528627,  30.11866128,  25.30220842,  25.02100014,
29.38560916,  16.8192307 ,  29.15097205,  23.56805267,
15.17285709,  18.27495747,  18.63750452,  18.61618504,
11.45940025,  21.95805701,  24.22923951,  11.76824052,
19.75465065,  14.72979889,  15.45936176,  14.73227474,
28.91683627,  22.90534472,  16.82376093,  21.47830226,
20.05012214,  16.74393817,  17.79456361,  20.80008233,
19.32059989,  23.23471888,  13.77434964,  17.56121752,
15.96716163,  18.5294016 ,  28.31005939,  13.66340359,
10.38160267,  16.09621015,  18.25125683,  20.95954331,
21.31996941,  24.51998489,  16.58831953,  15.25427142,
23.93065281,  30.4552266 ,  16.94527367,  16.92730802,
17.79659417,  18.85080572,  18.0839428 ,  23.93949481,
26.60243553,  13.68320208,  16.74669921,  20.30238694,
12.74773905,  19.20810456,  20.7189417 ,  20.73402554,
17.12106905,  25.06475175,  13.0947528 ,  28.16437938,
22.4803386 ,  13.71143627,   6.60617725,  20.41186825,
23.54924934,  22.25930658,  20.09337438,  24.94705884,
18.58056249,   5.58653271,  18.71242702,  17.83578444])
#我是如何制作面具的,或者如果看起来太痛苦就跳到下一个评论。。。
面具=[]
掩码。追加(通量/误差>4.0)#高误差
吸收率MASK1=(波<5060)
吸收率MASK2=(波>5040)
bob=[拉链中x的全部(x)(吸收体MASK1,吸收体MASK2)]
吸收任务=~np.array(bob)
masks.append(吸收任务)
#生成的掩码
掩码=[数组([真,真,真,真,真,真,真,真,真,
真的,真的,真的,真的,真的,真的,真的,真的,真的,
真,真,真,真,真,真,真,真,假,假,
真,假,真,假,假,真,真,真,真,
真,真,真,真,真,真,真,真,真,假,
假,假,假,假,假,假,假,假,假,假,
真,真,真,真,假,真,真,真,真,真,
真,真,假,真,真,真,假,真,真,
真,真,真,假,假,真,真,
fullmask = [all(mask) for mask in zip(*masks)]
d=np.arange(10)
masks = [d>5, d % 2 == 0, d<8]
total_mask = reduce(np.logical_and, masks)
total_mask = masks[0] & masks[1] & masks[2]
import numpy as np

# create some data
pixel = np.arange(4000)
wave = pixel / 4000. + 5500
flux = pixel / 4000. + 9.5 * 5500
data = np.rec.fromarrays((pixel, wave, flux), names='pixel, wave, flux')

mask = data.wave > 5500.25
mask &= data.flux / data.wave > 8.5

print data[mask].pixel.mean()
your_array = [1, 5, 6000]    
filter(lambda elem: elem > 5000, your_array)
your_array = [1, 5, 6000]
map(lambda elem: elem > 5000, your_array)
# Given wave, error, and flux the way you defined

# If wave is [21.2, 34.1, 43.423], then this returns [(0, 21.2), (1, 34.1), (2, 43.423)]
# Each element is now a tuple of (index, elem)
enum_wave = enumerate(wave)

# Returns a list of the indexes that pass the condition
# For example, if only 98, and 99 aren't filtered out, this will return [98, 99]
masked_wave = [index for index, elem in enum_wave if elem > 5060]

# To find the average
sum(masked_wave) / float(len(masked_wave))
combined_mask = np.full(len(masks), True)
for mask in masks:
    combined_mask &= mask