[

1. Multithreading and the GIL

1.1. GIL

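The CPython interpreter is not itself thread-safe, so it has a Global Interpreter Lock (GIL) that allows only one thread at a time to execute Python bytecode. As a result, a single Python process usually cannot use multiple CPU cores at the same time.

You cannot control the GIL from Python code. However, a built-in function or an extension written in C can release the GIL while it runs a time-consuming task. In fact, a Python library written in C can manage the GIL, launch its own operating system threads, and take advantage of all available CPU cores. Doing so greatly complicates the library's code, so most library authors do not do it.

However, every standard library function that performs blocking I/O releases the GIL while it waits for a result from the operating system. This means that at the Python level you can use multiple threads to deal with blocking I/O, and I/O-bound Python programs benefit from it: while one Python thread waits for a network response, the blocking I/O function releases the GIL so that another thread can run.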
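
The effect on I/O-bound work can be seen with a small sketch. It assumes that time.sleep is an acceptable stand-in for a blocking I/O call (it also releases the GIL while waiting); the names fake_io and run_concurrently are illustrative and not part of any library.

```python
import threading
import time


def fake_io(seconds):
    """Stand-in for a blocking I/O call; time.sleep releases the GIL while waiting."""
    time.sleep(seconds)


def run_concurrently(n_threads=4, seconds=0.2):
    """Run n_threads waiting threads and return the total elapsed time."""
    threads = [threading.Thread(target=fake_io, args=(seconds,))
               for _ in range(n_threads)]
    t0 = time.perf_counter()
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return time.perf_counter() - t0


elapsed = run_concurrently()
print('4 threads, each waiting 0.2s, took {:.2f}s in total'.format(elapsed))
```

Because the waits overlap, the total should be close to a single wait rather than the sum of all four.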

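1.1.1. Why the GIL is needed

The GIL is a consequence of how CPython is designed: the interpreter is not thread-safe. To let threads access Python objects safely, a single global lock is enforced, so at any moment only one thread can touch Python objects or call the C API. A running thread gives the lock up periodically so that other threads get a turn: Python 2 did this every 100 bytecode instructions, while Python 3.2 and later use a time-based switch interval (5 ms by default). The lock is also released around potentially blocking I/O operations. Because of this, CPU-bound code gains no performance from the threading library.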
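
On current CPython 3 versions the interval at which a running thread is asked to give up the GIL can be inspected from Python. This is only a small sketch to make that setting visible; the variable name interval is illustrative.

```python
import sys

# Time-based switch interval in seconds (CPython 3.2 and later).
# A thread holding the GIL is asked to release it after roughly this long.
interval = sys.getswitchinterval()
print('switch interval: {:.3f}s'.format(interval))
```

sys.setswitchinterval() changes the value, but tuning it does not make CPU-bound threads run in parallel.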

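1.2. High-level multithreading with concurrent.futures

concurrent.futures supports two programming models:

Parallel task model: each task runs its own procedure on its own data, and the tasks are computed independently in parallel
MapReduce model: the same procedure is applied to data distributed across the threads

1.2.1. Parallel task model

In this model you call submit to hand a task to the executor (used here as a context manager), then call result() on the returned Future, which blocks until the task has finished.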

from concurrent.futures import ThreadPoolExecutor, as_completed
from random import randrange
from time import time

def arcfour(key, in_bytes, loops=20):
    """RC4 algorithm"""
    kbox = bytearray(256)  # create key box
    for i, car in enumerate(key):  # copy key and vector
        kbox[i] = car
    j = len(key)
    for i in range(j, 256):  # repeat until full
        kbox[i] = kbox[i-j]

    # [1] initialize sbox
    sbox = bytearray(range(256))

    # repeat sbox mixing loop, as recommended in CipherSaber-2
    # http://ciphersaber.gurus.com/faq.html#cs2
    j = 0
    for k in range(loops):
        for i in range(256):
            j = (j + sbox[i] + kbox[i]) % 256
            sbox[i], sbox[j] = sbox[j], sbox[i]

    # main loop
    i = 0
    j = 0
    out_bytes = bytearray()

    for car in in_bytes:
        i = (i + 1) % 256
        # [2] shuffle sbox
        j = (j + sbox[i]) % 256
        sbox[i], sbox[j] = sbox[j], sbox[i]
        # [3] compute t
        t = (sbox[i] + sbox[j]) % 256
        k = sbox[t]
        car = car ^ k
        out_bytes.append(car)

    return out_bytes

clear = bytearray(b'1234567890' * 100000)
t0 = time()
cipher = arcfour(b'key', clear)
print('elapsed time: %.2fs' % (time() - t0))
result = arcfour(b'key', cipher)
assert result == clear, '%r != %r' % (result, clear)
print('elapsed time: %.2fs' % (time() - t0))
print('OK')

elapsed time: 0.47s
elapsed time: 0.95s
OK

def crypto_process(size, key):
    in_text = bytearray(randrange(256) for i in range(size))
    cypher_text = arcfour(key, in_text)
    out_text = arcfour(key, cypher_text)
    assert in_text == out_text, 'Failed arcfour_test'
    return size


def main(workers=None):
    JOBS = 12
    SIZE = 2**18

    KEY = b"'Twas brillig, and the slithy toves\nDid gyre"
    STATUS = '{} workers, elapsed time: {:.2f}s'
    if workers:
        workers = int(workers)
    t0 = time()

    with ThreadPoolExecutor(workers) as executor:
        actual_workers = executor._max_workers
        to_do = []
        for i in range(JOBS, 0, -1):
            size = SIZE + int(SIZE / JOBS * (i - JOBS/2))
            job = executor.submit(crypto_process, size, KEY)
            to_do.append(job)

        for future in as_completed(to_do):
            res = future.result()
            print('{:.1f} KB'.format(res/2**10))

    print(STATUS.format(actual_workers, time() - t0))

main(1)

384.0 KB
362.7 KB
341.3 KB
320.0 KB
298.7 KB
277.3 KB
256.0 KB
234.7 KB
213.3 KB
192.0 KB
170.7 KB
149.3 KB
1 workers, elapsed time: 5.74s

main(2)

362.7 KB
384.0 KB
320.0 KB
341.3 KB
298.7 KB
277.3 KB
234.7 KB
256.0 KB
192.0 KB
213.3 KB
170.7 KB
149.3 KB
2 workers, elapsed time: 5.90s

main(4)


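1.2.2. MapReduce model

This pattern is probably more familiar: the same procedure is applied, item by item, to the data in a container by the worker threads, and the results are also collected into a container. The collected results can then be processed further in the main thread.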
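
A minimal sketch of this pattern with ThreadPoolExecutor.map follows. The function word_count and the sample data in lines are illustrative assumptions, chosen only to keep the example short.

```python
from concurrent.futures import ThreadPoolExecutor


def word_count(line):
    """The single procedure applied to every item (the "map" step)."""
    return len(line.split())


lines = ['the quick brown fox', 'jumps over', 'the lazy dog']

with ThreadPoolExecutor(max_workers=3) as executor:
    # map() yields results in the order of the inputs, not in completion order.
    counts = list(executor.map(word_count, lines))

total = sum(counts)  # the "reduce" step, done in the main thread
print(counts, total)
```

Unlike submit with as_completed, map keeps the input order, which makes it convenient when each result has to be matched with its input.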


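1.3. Relatively low-level multithreading with a thread pool

A thread pool is well suited to creating worker threads in bulk. The thread pool class, ThreadPool, is tucked away in the multiprocessing package, under multiprocessing.pool.

Calling join() on a ThreadPool object waits for all worker threads to finish. You must call close() before join(), and after close() no new tasks can be submitted.

Look at the output: tasks 0, 1, 2 and 3 start immediately, while task 4 has to wait for one of the earlier tasks to finish. This is because the default pool size on my computer is 4, so at most 4 tasks run at the same time. It is a deliberate limit of the pool, not a limit of the operating system. If you change it to p = Pool(5), 5 tasks can run at once.

Because the default pool size is the number of CPU cores, if you happen to have an 8-core CPU you need to submit at least 9 tasks to see the waiting effect described above.

Besides apply_async, the thread pool also offers apply, map and map_async. As with concurrent.futures, they fall into two programming models:

Parallel task model
- apply: submit a single task and block until it finishes
- apply_async: submit a single task without blocking
MapReduce model
- map: behaves like the built-in map
- map_async: non-blocking version of map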

apply_async
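
A minimal sketch of apply_async with a pool of 4 workers and 5 tasks, so that the fifth task has to wait for a free worker. The function task and its 0.1 second sleep are illustrative assumptions.

```python
from multiprocessing.pool import ThreadPool
import threading
import time


def task(n):
    """Hypothetical task: wait briefly, then report which thread ran it."""
    time.sleep(0.1)
    return n, threading.current_thread().name


pool = ThreadPool(4)  # at most 4 tasks run at the same time
async_results = [pool.apply_async(task, (i,)) for i in range(5)]
pool.close()  # no more tasks may be submitted after this
pool.join()   # wait for all worker threads to finish

finished = [r.get() for r in async_results]
for n, thread_name in finished:
    print('task {} ran on {}'.format(n, thread_name))
```

Each apply_async call returns at once with an AsyncResult; the actual return value is fetched later with get().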


map_async
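
A minimal sketch of map_async, assuming a trivial function square as the procedure applied to each item.

```python
from multiprocessing.pool import ThreadPool


def square(x):
    """Hypothetical procedure applied to every item."""
    return x * x


with ThreadPool(4) as pool:
    async_result = pool.map_async(square, range(10))  # returns immediately
    squares = async_result.get(timeout=10)            # blocks until all items are done

print(squares)
```

The results come back as a list in input order, exactly as with map; the only difference is that the call itself does not block.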


Getting results from the thread pool
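
The object returned by apply_async and map_async offers several ways to reach the result. The sketch below assumes a hypothetical task slow_double and a list named collected that is filled by a callback.

```python
from multiprocessing.pool import ThreadPool
import time


def slow_double(x):
    """Hypothetical task that takes a moment to finish."""
    time.sleep(0.1)
    return 2 * x


collected = []  # filled by the callback

with ThreadPool(2) as pool:
    # The callback is called with the return value once the task succeeds.
    res = pool.apply_async(slow_double, (21,), callback=collected.append)
    res.wait(timeout=5)           # block until done, or until the timeout expires
    is_ready = res.ready()        # True once the task has finished
    succeeded = res.successful()  # True if it finished without raising
    value = res.get()             # return value; re-raises the task's exception, if any

print(is_ready, succeeded, value, collected)
```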


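1.4. Lower-level multithreaded programming

The threading module provides a high-level API for thread-based concurrency. Threads run concurrently and share memory. Multithreading sounds wonderful, but for the sake of data safety a lock (the GIL) was added, so Python bytecode only ever runs on one core at a time. I will not go into detail; a simple usage example is enough.

Here is how the threading module is used: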
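
A minimal sketch of the basic pattern: create Thread objects with a target function and its arguments, start them, then join them. The function worker and the shared list results are illustrative assumptions.

```python
import threading


def worker(name, results):
    """Hypothetical task run in a child thread."""
    results.append('hello from {}'.format(name))


results = []
threads = [threading.Thread(target=worker, args=('thread-{}'.format(i), results))
           for i in range(3)]
for t in threads:
    t.start()   # begin running worker() in a new thread
for t in threads:
    t.join()    # block until that thread has finished

print(sorted(results))
```

The order in which the threads append is not guaranteed, which is why the list is sorted before printing.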


For comparison, the same work without multithreading:


1.4.1. A relatively complex example
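
One moderately more involved case is several threads updating the same variable. The following is a minimal sketch under that assumption, using a shared counter guarded by threading.Lock; the names counter, counter_lock and add_many are illustrative.

```python
import threading

counter = 0
counter_lock = threading.Lock()


def add_many(times):
    """Increment the shared counter `times` times."""
    global counter
    for _ in range(times):
        with counter_lock:   # only one thread at a time updates the counter
            counter += 1


threads = [threading.Thread(target=add_many, args=(10000,)) for _ in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()

print(counter)
```

The read-modify-write in counter += 1 is not atomic, so the lock is what guarantees that no increment is lost.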


1.4.2. Defining a custom thread by subclassing Thread

A subclass of Thread needs to override the run method.
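
A minimal sketch of such a subclass. The class SquareThread and its result attribute are illustrative assumptions; only run() and the call to the parent constructor are required by threading.Thread.

```python
import threading


class SquareThread(threading.Thread):
    """Hypothetical Thread subclass that squares a number in run()."""

    def __init__(self, number):
        super().__init__()   # the base class must be initialized first
        self.number = number
        self.result = None

    def run(self):
        # start() arranges for run() to be called in the new thread.
        self.result = self.number ** 2


threads = [SquareThread(n) for n in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()

squares = [t.result for t in threads]
print(squares)
```

Storing the result on the instance is one simple way to get a value back, since run() itself cannot return anything to the caller.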


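To create a child thread, you only need to pass a function and its arguments to create a Thread instance, then start it with start(). Creating a thread this way is simpler than using fork().

join() waits for the child thread to finish before execution continues; it is commonly used for synchronization between threads.

As you can see, the parent thread finished its work before the child thread did. In fact, by the time start() is called the thread has already been created and given its task. Although the thread has started, we cannot know when it will finish computing. join() is how we make sure it has finished (by blocking the main thread); in other words, it serves to wait for the result.

1.5. Managing threads with queues

Thread safety is the hardest part of multithreaded programming. Synchronization between threads and mutually exclusive access to shared data always have to be considered, and the most common solution is to coordinate threads through a queue.

1.5.1. Producer-consumer model

The most common use of a queue is as the data buffer in the producer-consumer pattern. The following is an example of that pattern.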
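
A minimal sketch of the pattern with one producer and one consumer sharing a bounded queue.Queue. The SENTINEL value used to stop the consumer, and the function names producer and consumer, are illustrative assumptions.

```python
import queue
import threading

SENTINEL = None  # hypothetical marker telling the consumer to stop


def producer(q, n_items):
    for i in range(n_items):
        q.put(i)          # blocks while the buffer is full
    q.put(SENTINEL)


def consumer(q, consumed):
    while True:
        item = q.get()    # blocks until an item is available
        if item is SENTINEL:
            q.task_done()
            break
        consumed.append(item * 10)
        q.task_done()


buffer = queue.Queue(maxsize=2)
consumed = []
threads = [threading.Thread(target=producer, args=(buffer, 5)),
           threading.Thread(target=consumer, args=(buffer, consumed))]
for t in threads:
    t.start()
for t in threads:
    t.join()

print(consumed)
```

With maxsize=2 the producer can never get more than two items ahead of the consumer, which is what makes the queue act as a buffer.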


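1.6. The queue module

Queue types

queue.Queue(maxsize): first-in, first-out queue. maxsize is the maximum queue length; if it is zero or negative, the queue size is unbounded
queue.LifoQueue(maxsize): last-in, first-out queue, i.e. a stack
queue.PriorityQueue(maxsize): priority queue; the lowest-valued entry is retrieved first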
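
The difference between the three types is easiest to see by putting the same items into each and reading them back. This is a single-threaded sketch; the helper drain is an illustrative assumption.

```python
import queue


def drain(q, items):
    """Put all items into q, then take them out again and return the order."""
    for item in items:
        q.put(item)
    out = []
    while not q.empty():   # reliable here because only one thread uses q
        out.append(q.get())
    return out


items = [3, 1, 2]
fifo_order = drain(queue.Queue(), items)
lifo_order = drain(queue.LifoQueue(), items)
priority_order = drain(queue.PriorityQueue(), items)

print(fifo_order)      # [3, 1, 2]
print(lifo_order)      # [2, 1, 3]
print(priority_order)  # [1, 2, 3]
```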

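Supported methods

qsize(): returns the approximate size of the queue. It is only approximate because, under concurrent access, qsize() > 0 does not guarantee that a subsequent get() will not block, and qsize() < maxsize does not guarantee that put() will not block
empty(): returns True if the queue is empty, False otherwise
full(): if a maximum size was set, returns True when the queue is full, False otherwise
put(item[, block[, timeout]]): adds an item to the queue
- if block is False and the queue is full, raises queue.Full immediately
- if block is True and timeout is None, waits until a free slot is available
- if block is True and timeout is not None, waits at most timeout seconds and raises queue.Full if no slot becomes free in that time
put_nowait(item): equivalent to put(item, False)
get([block[, timeout]]): removes and returns an item from the queue
- if block is False and the queue is empty, raises queue.Empty immediately
- if block is True and timeout is None, waits until an item is available
- if block is True and timeout is not None, waits at most timeout seconds and raises queue.Empty if no item arrives in that time
get_nowait(): equivalent to get(False)
task_done(): signals that a previously enqueued task is complete; typically called in consumer threads
join(): blocks until every item put into the queue has been retrieved and marked done with task_done()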
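
The blocking rules above can be checked with a queue that holds a single item. This is a single-threaded sketch; the variable names put_outcome and get_outcome are illustrative.

```python
import queue

q = queue.Queue(maxsize=1)
q.put('a')                      # fills the only slot

try:
    q.put_nowait('b')           # no free slot: raises queue.Full at once
except queue.Full:
    put_outcome = 'Full'
else:
    put_outcome = 'stored'

item = q.get()                  # returns 'a' without blocking
q.task_done()                   # mark 'a' as processed

try:
    q.get(timeout=0.1)          # nothing left: raises queue.Empty after 0.1s
except queue.Empty:
    get_outcome = 'Empty'
else:
    get_outcome = 'got an item'

q.join()                        # returns immediately: every item was marked done
print(put_outcome, item, get_outcome)
```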

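Queue is thread-safe, so you do not need to manage locks yourself when using it. Note that it does not support the in operator or iteration; items can only be reached through get().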

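1.7. Using Unix signals

The standard library's signal module provides ways to work with Unix signals. Note that the signal module is aimed mainly at Unix platforms (Linux, macOS); Python on Windows supports only a small part of its functionality.

Common signals are listed in the concluding part of this chapter.

Python signal handlers are always executed in the main thread. This means signals cannot be used as a means of communication between threads. Likewise, only the main thread is allowed to set a new signal handler.

1.7.1. Common signal handling functions

Setting a timer that sends SIGALRM

signal.alarm(time) sets a timer that sends SIGALRM to the process after time seconds. If the signal is not handled, the process exits.

Blocking with signal.pause

signal.pause() puts the main thread to sleep until a signal is received; once a signal arrives and its handler has run, pause() returns and the program continues.

signal.signal(sig, handler) registers the function to call when the signal is received.

Note that the handler function takes two arguments: the signal number and the current stack frame object.

The following example demonstrates listening for signals: either waiting 10 seconds or pressing Ctrl+C interrupts the blocking call and ends the program.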
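
A minimal sketch of the three functions working together. It is Unix-only, must run in the main thread, and uses a 1 second timer instead of 10 seconds to keep the run short; the names handler and received are illustrative assumptions.

```python
import signal

received = []


def handler(signum, frame):
    """Called in the main thread with the signal number and the current stack frame."""
    received.append(signum)


signal.signal(signal.SIGALRM, handler)   # register the handler
signal.alarm(1)                          # deliver SIGALRM after 1 second
signal.pause()                           # sleep until a signal arrives
print('got signal', received)
```

Pressing Ctrl+C during the pause sends SIGINT instead, which Python's default handler turns into a KeyboardInterrupt.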


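1.7.2. Using signals with multiple threads

signal.sigwait(sigset) is used in a child thread to wait for one of the signals in sigset; as soon as one is received, the call stops blocking and execution continues.
signal.pthread_kill(thread_id, signal.SIGCONT) is used in the main thread to send a signal to a child thread. thread_id can be obtained from the ident attribute of the running child thread.

The following example demonstrates the main thread sending a signal to a child thread.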
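
A minimal sketch of this, Unix-only. It assumes SIGUSR1 rather than SIGCONT as the signal, and it blocks that signal with signal.pthread_sigmask before starting the thread so that the signal stays pending for sigwait; the names SIG, waiter and received are illustrative.

```python
import signal
import threading

SIG = signal.SIGUSR1  # assumption: any otherwise unused signal works here
received = []


def waiter():
    # Block until one of the signals in the set is pending for this thread.
    signum = signal.sigwait({SIG})
    received.append(signum)


# Block the signal before starting the thread; the child inherits the mask,
# so the signal stays pending for sigwait() instead of triggering a handler.
signal.pthread_sigmask(signal.SIG_BLOCK, {SIG})
t = threading.Thread(target=waiter)
t.start()
signal.pthread_kill(t.ident, SIG)   # send the signal to the child thread
t.join()
signal.pthread_sigmask(signal.SIG_UNBLOCK, {SIG})  # restore the mask

print('child thread received', received)
```

If the signal arrives before the child reaches sigwait, it simply remains pending and sigwait returns at once.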


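1.8. From threads to coroutines

Before Python 3.4, Python had no native coroutines, and at the time there was a legendary coroutine library, gevent. It can use monkey patching to replace parts of the standard library and thereby turn threads into coroutines; the replaced modules are summarized in this document. gevent is still widely used today and remains one of the most recommended ways to use coroutines.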


]

周飒博客-ZhouSa.com