一行程式碼完成並行任務

colleen__chen發表於2014-02-12

眾所周知，Python的並行處理能力很不理想。我認為如果不考慮執行緒和GIL的標準引數（它們大多是合法的），其原因不是因為技術不到位，而是我們的使用方法不恰當。大多數關於Python執行緒和多程式的教材雖然都很出色，但是內容繁瑣冗長。它們的確在開篇鋪陳了許多有用資訊，但往往都不會涉及真正能提高日常工作的部分。

經典例子

DDG上以“Python threading tutorial （Python執行緒教程）”為關鍵字的熱門搜尋結果表明：幾乎每篇文章中給出的例子都是相同的類+佇列。

事實上，它們就是以下這段使用producer/Consumer來處理執行緒/多程式的程式碼示例：

#Example.py
'''
Standard Producer/Consumer Threading Pattern
'''

import time 
import threading 
import Queue 

class Consumer(threading.Thread): 
	def __init__(self, queue): 
		threading.Thread.__init__(self)
		self._queue = queue 

	def run(self):
		while True: 
			# queue.get() blocks the current thread until 
			# an item is retrieved. 
			msg = self._queue.get() 
			# Checks if the current message is 
			# the "Poison Pill"
			if isinstance(msg, str) and msg == 'quit':
				# if so, exists the loop
				break
			# "Processes" (or in our case, prints) the queue item	
			print "I'm a thread, and I received %s!!" % msg
		# Always be friendly! 
		print 'Bye byes!'

def Producer():
	# Queue is used to share items between
	# the threads.
	queue = Queue.Queue()

	# Create an instance of the worker
	worker = Consumer(queue)
	# start calls the internal run() method to 
	# kick off the thread
	worker.start() 

	# variable to keep track of when we started
	start_time = time.time() 
	# While under 5 seconds.. 
	while time.time() - start_time < 5: 
		# "Produce" a piece of work and stick it in 
		# the queue for the Consumer to process
		queue.put('something at %s' % time.time())
		# Sleep a bit just to avoid an absurd number of messages
		time.sleep(1)

	# This the "poison pill" method of killing a thread. 
	queue.put('quit')
	# wait for the thread to close down
	worker.join()

if __name__ == '__main__':
	Producer()

#Example.py

'''

Standard Producer/Consumer Threading Pattern

'''

import time

import threading

import Queue

class Consumer(threading.Thread):

def __init__(self, queue):

threading.Thread.__init__(self)

self._queue = queue

def run(self):

while True:

# queue.get() blocks the current thread until

# an item is retrieved.

msg = self._queue.get()

# Checks if the current message is

# the "Poison Pill"

if isinstance(msg, str) and msg == 'quit':

# if so, exists the loop

break

# "Processes" (or in our case, prints) the queue item

print "I'm a thread, and I received %s!!" % msg

# Always be friendly!

print 'Bye byes!'

def Producer():

# Queue is used to share items between

# the threads.

queue = Queue.Queue()

# Create an instance of the worker

worker = Consumer(queue)

# start calls the internal run() method to

# kick off the thread

worker.start()

# variable to keep track of when we started

start_time = time.time()

# While under 5 seconds..

while time.time() - start_time < 5:

# "Produce" a piece of work and stick it in

# the queue for the Consumer to process

queue.put('something at %s' % time.time())

# Sleep a bit just to avoid an absurd number of messages

time.sleep(1)

# This the "poison pill" method of killing a thread.

queue.put('quit')

# wait for the thread to close down

worker.join()

if __name__ == '__main__':

Producer()

唔…….感覺有點像Java。

我現在並不想說明使用Producer / Consume來解決執行緒/多程式的方法是錯誤的——因為它肯定正確，而且在很多情況下它是最佳方法。但我不認為這是平時寫程式碼的最佳選擇。

它的問題所在（個人觀點）

首先，你需要建立一個樣板式的鋪墊類。然後，你再建立一個佇列，通過其傳遞物件和監管佇列的兩端來完成任務。（如果你想實現資料的交換或儲存，通常還涉及另一個佇列的參與）。

Worker越多，問題越多。

接下來，你應該會建立一個worker類的pool來提高Python的速度。下面是IBM tutorial給出的較好的方法。這也是程式設計師們在利用多執行緒檢索web頁面時的常用方法。

#Example2.py
'''
A more realistic thread pool example 
'''

import time 
import threading 
import Queue 
import urllib2 

class Consumer(threading.Thread): 
	def __init__(self, queue): 
		threading.Thread.__init__(self)
		self._queue = queue 

	def run(self):
		while True: 
			content = self._queue.get() 
			if isinstance(content, str) and content == 'quit':
				break
			response = urllib2.urlopen(content)
		print 'Bye byes!'

def Producer():
	urls = [
		'http://www.python.org', 'http://www.yahoo.com'
		'http://www.scala.org', 'http://www.google.com'
		# etc.. 
	]
	queue = Queue.Queue()
	worker_threads = build_worker_pool(queue, 4)
	start_time = time.time()

	# Add the urls to process
	for url in urls: 
		queue.put(url)	
	# Add the poison pillv
	for worker in worker_threads:
		queue.put('quit')
	for worker in worker_threads:
		worker.join()

	print 'Done! Time taken: {}'.format(time.time() - start_time)

def build_worker_pool(queue, size):
	workers = []
	for _ in range(size):
		worker = Consumer(queue)
		worker.start() 
		workers.append(worker)
	return workers

if __name__ == '__main__':
	Producer()

#Example2.py

'''

A more realistic thread pool example

'''

import time

import threading

import Queue

import urllib2

class Consumer(threading.Thread):

def __init__(self, queue):

threading.Thread.__init__(self)

self._queue = queue

def run(self):

while True:

content = self._queue.get()

if isinstance(content, str) and content == 'quit':

break

response = urllib2.urlopen(content)

print 'Bye byes!'

def Producer():

urls = [

'http://www.python.org', 'http://www.yahoo.com'

'http://www.scala.org', 'http://www.google.com'

# etc..

]

queue = Queue.Queue()

worker_threads = build_worker_pool(queue, 4)

start_time = time.time()

# Add the urls to process

for url in urls:

queue.put(url)

# Add the poison pillv

for worker in worker_threads:

queue.put('quit')

for worker in worker_threads:

worker.join()

print 'Done! Time taken: {}'.format(time.time() - start_time)

def build_worker_pool(queue, size):

workers = []

for _ in range(size):

worker = Consumer(queue)

worker.start()

workers.append(worker)

return workers

if __name__ == '__main__':

Producer()

它的確能執行，但是這些程式碼多麼複雜阿！它包括了初始化方法、執行緒跟蹤列表以及和我一樣容易在死鎖問題上出錯的人的噩夢——大量的join語句。而這些還僅僅只是繁瑣的開始！

我們目前為止都完成了什麼？基本上什麼都沒有。上面的程式碼幾乎一直都只是在進行傳遞。這是很基礎的方法，很容易出錯（該死，我剛才忘了在佇列物件上還需要呼叫task_done()方法（但是我懶得修改了）），價效比很低。還好，我們還有更好的方法。

介紹：Map

Map是一個很棒的小功能，同時它也是Python並行程式碼快速執行的關鍵。給不熟悉的人講解一下吧，map是從函式語言Lisp來的。map函式能夠按序對映出另一個函式。例如

urls = ['http://www.yahoo.com', 'http://www.reddit.com']
results = map(urllib2.urlopen, urls)

1 2	urls = ['http://www.yahoo.com', 'http://www.reddit.com'] results = map(urllib2.urlopen, urls)

這裡呼叫urlopen方法來把呼叫結果全部按序返回並儲存到一個列表裡。就像：

results = []
for url in urls: 
    results.append(urllib2.urlopen(url))

results = []

for url in urls:

results.append(urllib2.urlopen(url))

Map按序處理這些迭代。呼叫這個函式，它就會返回給我們一個按序儲存著結果的簡易列表。

為什麼它這麼厲害呢？因為只要有了合適的庫，map能使並行執行得十分流暢！

MAP

有兩個能夠支援通過map函式來完成並行的庫：一個是multiprocessing，另一個是鮮為人知但功能強大的子檔案：multiprocessing.dummy。

題外話：這個是什麼？你從來沒聽說過dummy多程式庫？我也是最近才知道的。它在多程式的說明文件裡面僅僅只被提到了一句。而且那一句就是大概讓你知道有這麼個東西的存在。我敢說，這樣幾近拋售的做法造成的後果是不堪設想的！

Dummy就是多程式模組的克隆檔案。唯一不同的是，多程式模組使用的是程式，而dummy則使用執行緒（當然，它有所有Python常見的限制）。也就是說，資料由一個傳遞給另一個。這能夠使得資料輕鬆的在這兩個之間進行前進和回躍，特別是對於探索性程式來說十分有用，因為你不用確定框架呼叫到底是IO 還是CPU模式。

準備開始

要做到通過map函式來完成並行，你應該先匯入裝有它們的模組：

from multiprocessing import Pool
from multiprocessing.dummy import Pool as ThreadPool

1 2	from multiprocessing import Pool from multiprocessing.dummy import Pool as ThreadPool

再初始化:

pool = ThreadPool()

1	pool = ThreadPool()

這簡單的一句就能代替我們的build_worker_pool 函式在example2.py中的所有工作。換句話說，它建立了許多有效的worker，啟動它們來為接下來的工作做準備，以及把它們儲存在不同的位置，方便使用。

Pool物件需要一些引數，但最重要的是：程式。它決定pool中的worker數量。如果你不填的話，它就會預設為你電腦的核心數值。

如果你在CPU模式下使用多程式pool，通常核心數越大速度就越快（還有很多其它因素）。但是，當進行執行緒或者處理網路繫結之類的工作時，情況會比較複雜所以應該使用pool的準確大小。

pool = ThreadPool(4) # Sets the pool size to 4

1	pool = ThreadPool(4) # Sets the pool size to 4

如果你執行過多執行緒，多執行緒間的切換將會浪費許多時間，所以你最好耐心除錯出最適合的任務數。

我們現在已經建立了pool物件，馬上就能有簡單的並行程式了，所以讓我們重新寫example2.py中的url opener吧！

import urllib2 
from multiprocessing.dummy import Pool as ThreadPool 

urls = [
	'http://www.python.org', 
	'http://www.python.org/about/',
	'http://www.onlamp.com/pub/a/python/2003/04/17/metaclasses.html',
	'http://www.python.org/doc/',
	'http://www.python.org/download/',
	'http://www.python.org/getit/',
	'http://www.python.org/community/',
	'https://wiki.python.org/moin/',
	'http://planet.python.org/',
	'https://wiki.python.org/moin/LocalUserGroups',
	'http://www.python.org/psf/',
	'http://docs.python.org/devguide/',
	'http://www.python.org/community/awards/'
	# etc.. 
	]

# Make the Pool of workers
pool = ThreadPool(4) 
# Open the urls in their own threads
# and return the results
results = pool.map(urllib2.urlopen, urls)
#close the pool and wait for the work to finish 
pool.close() 
pool.join()

import urllib2

from multiprocessing.dummy import Pool as ThreadPool

urls = [

'http://www.python.org',

'http://www.python.org/about/',

'http://www.onlamp.com/pub/a/python/2003/04/17/metaclasses.html',

'http://www.python.org/doc/',

'http://www.python.org/download/',

'http://www.python.org/getit/',

'http://www.python.org/community/',

'https://wiki.python.org/moin/',

'http://planet.python.org/',

'https://wiki.python.org/moin/LocalUserGroups',

'http://www.python.org/psf/',

'http://docs.python.org/devguide/',

'http://www.python.org/community/awards/'

# etc..

]

# Make the Pool of workers

pool = ThreadPool(4)

# Open the urls in their own threads

# and return the results

results = pool.map(urllib2.urlopen, urls)

#close the pool and wait for the work to finish

pool.close()

pool.join()

看吧！這次的程式碼僅用了4行就完成了所有的工作。其中3句還是簡單的固定寫法。呼叫map就能完成我們前面例子中40行的內容！為了更形象地表明兩種方法的差異，我還分別給它們執行的時間計時。

# results = [] 
# for url in urls:
# 	result = urllib2.urlopen(url)
# 	results.append(result)

# # ------- VERSUS ------- # 

# # ------- 4 Pool ------- # 
# pool = ThreadPool(4) 
# results = pool.map(urllib2.urlopen, urls)

# # ------- 8 Pool ------- # 

# pool = ThreadPool(8) 
# results = pool.map(urllib2.urlopen, urls)

# # ------- 13 Pool ------- # 

# pool = ThreadPool(13) 
# results = pool.map(urllib2.urlopen, urls)

# results = []

# for url in urls:

# result = urllib2.urlopen(url)

# results.append(result)

# # ------- VERSUS ------- #

# # ------- 4 Pool ------- #

# pool = ThreadPool(4)

# results = pool.map(urllib2.urlopen, urls)

# # ------- 8 Pool ------- #

# pool = ThreadPool(8)

# results = pool.map(urllib2.urlopen, urls)

# # ------- 13 Pool ------- #

# pool = ThreadPool(13)

# results = pool.map(urllib2.urlopen, urls)

結果：

# 						Single thread:  14.4 Seconds 
# 						       4 Pool:   3.1 Seconds
# 						       8 Pool:   1.4 Seconds
# 						      13 Pool:   1.3 Seconds

# Single thread: 14.4 Seconds

# 4 Pool: 3.1 Seconds

# 8 Pool: 1.4 Seconds

# 13 Pool: 1.3 Seconds

相當出色！並且也表明了為什麼要細心除錯pool的大小。在這裡，只要大於9，就能使其執行速度加快。

例項2：

生成成千上萬的縮圖

我們在CPU模式下來完成吧！我工作中就經常需要處理大量的影象資料夾。其任務之一就是建立縮圖。這在並行任務中已經有很成熟的方法了。

基礎的單執行緒建立

import os 
import PIL 

from multiprocessing import Pool 
from PIL import Image

SIZE = (75,75)
SAVE_DIRECTORY = 'thumbs'

def get_image_paths(folder):
	return (os.path.join(folder, f) 
			for f in os.listdir(folder) 
			if 'jpeg' in f)

def create_thumbnail(filename): 
	im = Image.open(filename)
	im.thumbnail(SIZE, Image.ANTIALIAS)
	base, fname = os.path.split(filename) 
	save_path = os.path.join(base, SAVE_DIRECTORY, fname)
	im.save(save_path)

if __name__ == '__main__':
	folder = os.path.abspath(
		'11_18_2013_R000_IQM_Big_Sur_Mon__e10d1958e7b766c3e840')
	os.mkdir(os.path.join(folder, SAVE_DIRECTORY))

	images = get_image_paths(folder)

	for image in images:
             create_thumbnail(Image)

import os

import PIL

from multiprocessing import Pool

from PIL import Image

SIZE = (75,75)

SAVE_DIRECTORY = 'thumbs'

def get_image_paths(folder):

return (os.path.join(folder, f)

for f in os.listdir(folder)

if 'jpeg' in f)

def create_thumbnail(filename):

im = Image.open(filename)

im.thumbnail(SIZE, Image.ANTIALIAS)

base, fname = os.path.split(filename)

save_path = os.path.join(base, SAVE_DIRECTORY, fname)

im.save(save_path)

if __name__ == '__main__':

folder = os.path.abspath(

'11_18_2013_R000_IQM_Big_Sur_Mon__e10d1958e7b766c3e840')

os.mkdir(os.path.join(folder, SAVE_DIRECTORY))

images = get_image_paths(folder)

for image in images:

create_thumbnail(Image)

對於一個例子來說，這是有點難，但本質上，這就是向程式傳遞一個資料夾，然後將其中的所有圖片抓取出來，並最終在它們各自的目錄下建立和儲存縮圖。

我的電腦處理大約6000張圖片用了27.9秒。

如果我們用並行呼叫map來代替for迴圈的話：

import os 
import PIL 

from multiprocessing import Pool 
from PIL import Image

SIZE = (75,75)
SAVE_DIRECTORY = 'thumbs'

def get_image_paths(folder):
	return (os.path.join(folder, f) 
			for f in os.listdir(folder) 
			if 'jpeg' in f)

def create_thumbnail(filename): 
	im = Image.open(filename)
	im.thumbnail(SIZE, Image.ANTIALIAS)
	base, fname = os.path.split(filename) 
	save_path = os.path.join(base, SAVE_DIRECTORY, fname)
	im.save(save_path)

if __name__ == '__main__':
	folder = os.path.abspath(
		'11_18_2013_R000_IQM_Big_Sur_Mon__e10d1958e7b766c3e840')
	os.mkdir(os.path.join(folder, SAVE_DIRECTORY))

	images = get_image_paths(folder)

	pool = Pool()
        pool.map(create_thumbnail,images)
        pool.close()
        pool.join()

import os

import PIL

from multiprocessing import Pool

from PIL import Image

SIZE = (75,75)

SAVE_DIRECTORY = 'thumbs'

def get_image_paths(folder):

return (os.path.join(folder, f)

for f in os.listdir(folder)

if 'jpeg' in f)

def create_thumbnail(filename):

im = Image.open(filename)

im.thumbnail(SIZE, Image.ANTIALIAS)

base, fname = os.path.split(filename)

save_path = os.path.join(base, SAVE_DIRECTORY, fname)

im.save(save_path)

if __name__ == '__main__':

folder = os.path.abspath(

'11_18_2013_R000_IQM_Big_Sur_Mon__e10d1958e7b766c3e840')

os.mkdir(os.path.join(folder, SAVE_DIRECTORY))

images = get_image_paths(folder)

pool = Pool()

pool.map(create_thumbnail,images)

pool.close()

pool.join()

5.6秒！

對於只改變了幾行程式碼而言，這是大大地提升了執行速度。這個方法還能更快，只要你將cpu 和 io的任務分別用它們的程式和執行緒來執行——但也常造成死鎖。總之，綜合考慮到 map這個實用的功能，以及人為執行緒管理的缺失，我覺得這是一個美觀，可靠還容易debug的方法。

好了，文章結束了。一行完成並行任務。

一行 Python 程式碼實現並行
2019-03-16
Python並行
怎樣用一行 Python 程式碼實現並行
2020-07-10
Python並行
真▪一行程式碼完成從前端程式碼build到部署線上
2018-08-09
行程前端UI
一行程式碼完成定時任務排程,基於Quartz的UI視覺化操作元件 GZY.Quartz.MUI
2021-12-29
行程quartzUI視覺化元件
[原始碼解析] 並行分散式任務佇列 Celery 之多程式模型
2021-04-15
原始碼並行分散式佇列模型
TensorFlow Hub--用一行程式碼完成遷移學習
2018-07-04
行程遷移學習
.NET併發程式設計-任務函式並行
2021-03-14
程式設計函式並行
js使用waterfall橫向載入瀑布流,一行程式碼完成
2018-05-03
JS行程
溫故之.NET 任務並行
2019-02-12
並行
一行程式碼完成 JAVA 的 EXCEL 讀寫——EasyExcel 的方法封裝
2018-09-20
行程JavaExcel封裝
[原始碼分析]並行分散式任務佇列 Celery 之子程式處理訊息
2021-04-25
原始碼並行分散式佇列
golang runtime實現多核並行任務
2018-11-01
Golang並行
Linux執行shell指令碼完成任務計劃是以秒為單位
2019-12-11
Linux指令碼
一行程式碼建立cell
2018-08-20
行程
一行 CSS 程式碼的魅力
2021-01-06
CSS
詳解JavaScript的任務、微任務、佇列以及程式碼執行順序
2019-07-25
JavaScript佇列
[原始碼分析] 並行分散式任務佇列 Celery 之 Timer & Heartbeat
2021-05-06
原始碼並行分散式佇列
任務排程的並行演算法
2018-04-03
並行演算法
Python3一行程式碼搭建極簡web服務
2018-07-02
Python行程Web
php:多程式執行任務
2019-02-16
PHP
一行程式碼如何隱藏Linux程式？
2022-11-23
行程Linux
[原始碼解析] 並行分散式任務佇列 Celery 之負載均衡
2021-05-13
原始碼並行分散式佇列負載
目前待完成的任務們
2024-04-13
第一行程式碼——書的理解
2019-01-14
行程
一行程式碼實現微前端
2021-08-01
行程前端
[原始碼解析] 並行分散式任務佇列 Celery 之 Task是什麼
2021-04-08
原始碼並行分散式佇列
併發程式設計-7.任務並行庫（TPL）和資料流
2024-03-30
程式設計並行
Django 如何使用 Celery 完成非同步任務或定時任務
2023-04-25
Django非同步
任務系統 / 擊殺指定數量怪物後完成任務
2024-09-13
Java多執行緒並行處理任務的實現
2019-04-20
Java執行緒並行
C#多執行緒開發-任務並行庫04
2021-09-09
C#執行緒並行
Golang 入門 : 等待 goroutine 完成任務
2019-05-24
Golang
非同步任務：並行與序列的典型問題
2019-05-06
非同步並行
laravel建立定時任務並在windows下執行
2021-07-22
LaravelWindows
[20230508]能否一行完成(使用tee小技巧).txt
2023-05-08
Flowable啟動流程例項和查詢任務以及完成任務
2024-03-11
一行程式碼引發的恐懼
2019-04-08
行程
一行程式碼實現ViewPager卡片效果
2019-03-04
行程Viewpager
Shell 一行程式碼搞定大問題
2018-12-12
行程

一行程式碼完成並行任務

相關文章