http://software.intel.com/en-us/articles/Implementing-thread-safe-unordered-sets-without-reducers/

Implementing thread-safe unordered sets without reducers

Published On :	October 29, 2009 12:00 AM PDT
Rate Please login to rate! Current Score: 0 out of 0 users Please login to rate! Current Score: 0 out of 0 users Please login to rate! Current Score: 0 out of 0 users Please login to rate! Current Score: 0 out of 0 users Please login to rate! Current Score: 0 out of 0 users

by Alex Zatsman

Races and Reducers: the Risks and Rewards

One of the biggest challenges in building a parallel program is dealing with data races. Cilk++ offers several tools and techniques to find and eliminate races from your program. Reducers provide a a powerful mechanism for eliminating races, but as Spiderman said, "with great power comes great responsibility."

The Problem

I recently looked at a customer problem that at first looked like an obvious application for reducers. Briefly, the task was to sort a collection of objects into separate bins based on certain properties of the objects. The simplest parallel implementation, converting the for loop to a cilk_for loop, introduces data races on the bins if two or more objects can be put into the same bin.

At first look, this seemed like a good opportunity to use reducers. The contents of each bin is a list of elements, so why not just have each bin hold a list reducer? Reducers are extremely efficient when used sparingly, but unfortunately can become expensive if you create thousands or even millions of reducers, as would be needed in this case.

The other safe approach to handling races is to use locks to protect access to the shared memory locations. In this case, that would either mean using a single lock to protect the entire collection (safe, but bad because the high contention on this lock would eliminate all advantages of parallelism) or to create and acquire a lock for each bin (also safe, but bad because it requires thousands or millions of expensive locks.)

Luckily, there is another way.

The Approach

Mutex locks provided by the operating system are expensive to create and acquire, and are generally more heavy weight than is required for this problem. However, there is a much lower-cost alternative. The underlying hardware offers atomic instructions, which are effectively locks around single hardware instructions. These are the same instructions that are used to implement mutexes, but we can use them directly to save a significant amount of overhead for our problem at hand. It is important to note that there is no free lunch - atomic instructions require a memory barrier that can cost 40 or more instruction cycles. This is far more expensive than the few cycles used by an unlocked read/write, but still much less than the thousands required to create and acquire mutexes. You can think of these as small locks that lock just the read/write hardware operation to ensure that no other processor can race on the memory between the read and the write.

My approach is to use the atomic instruction to safely update the bins in a way that will not create data races. As with any use of locks, this approach prevents data races, but does not guarantee a deterministic result. In this case, each bin will hold the correct list of elements, but the order of the elements in the list is indeterminate.

The Solution

An atomic swap operation for pointers is available under different names in both Gnu and Windows C/C++ compilers:

GNU:	`__sync_lock_test_and_set`
MSVC:	`InterlockedExchangePointer`

An implementation of a thread-safe sets can be implemented as a template with the class C of the set element as the template parameter:

#ifdef __GNUC__
#include 
#define AtomicSwap(A,X) __sync_lock_test_and_set   (A, X)


































0
意見
(+add yours?)






張貼留言


















較新的文章


較舊的文章





訂閱：
張貼留言 (Atom)






匯率


匯率




Blog Archive





19/11/17 - 26/11/17 (1)
      

01/10/17 - 08/10/17 (2)
      

06/12/15 - 13/12/15 (1)
      

05/05/13 - 12/05/13 (16)
      

21/08/11 - 28/08/11 (2)
      

05/09/10 - 12/09/10 (1)
      

06/06/10 - 13/06/10 (5)
      

30/05/10 - 06/06/10 (1)
      

23/05/10 - 30/05/10 (1)
      

16/05/10 - 23/05/10 (5)
      

09/05/10 - 16/05/10 (3)
      

02/05/10 - 09/05/10 (2)
      

25/04/10 - 02/05/10 (5)
      

18/04/10 - 25/04/10 (22)
      

11/04/10 - 18/04/10 (11)
      

04/04/10 - 11/04/10 (1)
      

28/03/10 - 04/04/10 (7)
      

21/03/10 - 28/03/10 (15)
      

14/03/10 - 21/03/10 (20)
      

07/03/10 - 14/03/10 (11)
      

28/02/10 - 07/03/10 (1)
      

06/07/08 - 13/07/08 (24)
      

29/06/08 - 06/07/08 (1)
      

11/05/08 - 18/05/08 (1)
      

04/05/08 - 11/05/08 (1)
      

13/01/08 - 20/01/08 (1)
      

23/12/07 - 30/12/07 (1)
      

18/11/07 - 25/11/07 (2)
      

11/11/07 - 18/11/07 (1)
      

04/11/07 - 11/11/07 (4)
      

28/10/07 - 04/11/07 (16)
      

21/10/07 - 28/10/07 (10)
      

14/10/07 - 21/10/07 (3)
      

07/10/07 - 14/10/07 (8)
      

29/10/06 - 05/11/06 (1)
      






Labels



日月潭
(1)


木馬
(2)


北台灣
(1)


台中
(1)


生命
(1)


企業信息安全
(1)


安全
(3)


安全社區
(1)


安全信息
(1)


系統安全
(1)


系統漏洞
(1)


咖啡
(1)


東海
(1)


物理隔離
(1)


社會工程學
(1)


信息安全
(1)


信息安全威脅趨勢
(1)


旅遊
(1)


病毒
(1)


針對攻擊
(1)


軟件加密
(1)


惡意軟件
(1)


惡意網路
(1)


虛擬化
(1)


雲端安全
(2)


雲端運算
(4)


黑客
(1)


極光
(1)


電腦犯罪
(1)


網絡犯罪
(1)


網路
(1)


網路安全
(1)


銀行木馬
(1)


趣聞
(1)


殭屍網絡
(1)


藝術街
(1)


驅動加密
(1)


纜車
(1)


Adobe漏洞
(1)


GetLastError
(1)


MSDN
(1)


Web2.0
(1)





Followers











贊助商








 



(c) Just in thinking!

Icons & Wordpress Theme by N.Design
Blogger Template by Blogger FAQs.

Just in thinking!

20100511