Implementing a Spin Lock in C++

Background

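Mutex:
- A thread that fails to acquire the lock is blocked, and a blocked thread consumes no CPU time
- It causes mode switches: locking a mutex may enter kernel mode (typically when the lock is contended), blocking triggers the scheduler, and the thread returns to user mode when it runs again
Spin lock:
- It busy-waits: a thread that fails to acquire the lock keeps retrying, and burns CPU time while it does so
- It is implemented with machine instructions, involves no mode switch, and does not itself trigger the scheduler
When to use which:
- If the lock granularity is fine enough and the lock is held briefly enough, prefer a spin lock; otherwise use a mutex
- If the critical section contains I/O, use a mutex (I/O inside a critical section is discouraged; if it cannot be avoided, use a mutex)
- If there are many threads and lock contention is heavy, use a mutex
Code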

This spin lock is implemented with std::atomic_flag, which is guaranteed to be lock-free.

```cpp
// SpinLock.h
#ifndef SPINLOCK_H_
#define SPINLOCK_H_

#include <atomic>

class SpinLock final {
public:
    void lock();
    void unlock();

    SpinLock() = default;
    ~SpinLock() = default;

    // Non-copyable and non-movable
    SpinLock(const SpinLock& rhs) = delete;
    SpinLock(SpinLock&& rhs) = delete;
    SpinLock& operator=(const SpinLock& rhs) = delete;
    SpinLock& operator=(SpinLock&& rhs) = delete;

private:
    std::atomic_flag m_lock = ATOMIC_FLAG_INIT;  // clear means unlocked
};

#endif  // SPINLOCK_H_
```

```cpp
// SpinLock.cpp
#include "SpinLock.h"

void SpinLock::lock() {
    // test_and_set returns the previous value: spin while the flag was already set
    while (m_lock.test_and_set(std::memory_order_acquire)) {
    }
}

void SpinLock::unlock() {
    m_lock.clear(std::memory_order_release);
}
```
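- To run faster, the compiler may reorder instructions (without changing single-threaded semantics) and the CPU may execute them out of order. In multithreaded code this causes synchronization problems between threads, so test_and_set and clear are passed a memory order argument to deal with it:
  std::memory_order_acquire is described as "no reads or writes in the current thread can be reordered before this load. All writes in other threads that release the same atomic variable are visible in the current thread", which corresponds to lock
  That is, reads and writes after the acquire really happen after it, and they see the writes the releasing thread made before its release
  
  std::memory_order_release is described as "no reads or writes in the current thread can be reordered after this store. All writes in the current thread are visible in other threads that acquire the same atomic variable", which corresponds to unlock
  That is, reads and writes before the release really happen before it, and they become visible to the thread that next acquires the same atomic variable
In short, this pair of memory orders bounds how far a thread's reads and writes can be reordered or executed out of order: none may cross the boundary. The boundary is one-way: only the reads and writes between the acquire and the release are kept from moving out, while operations outside that range may still move in.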
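The acquire/release pairing can be illustrated without any lock at all. The following is a minimal sketch, not part of the original code: the names run_handoff, payload, ready and seen are made up for illustration. A producer writes plain data and then publishes it with a release store, and a consumer waits with an acquire load before reading the data.

```cpp
#include <atomic>
#include <cassert>
#include <thread>

// Hypothetical helper, not from the article: hands a value from one
// thread to another using only a release store and an acquire load.
int run_handoff() {
    int payload = 0;                 // plain, non-atomic data
    std::atomic<bool> ready{false};  // flag that publishes the payload
    int seen = -1;

    std::thread consumer([&] {
        // Acquire load: the read of payload below cannot move before it.
        while (!ready.load(std::memory_order_acquire)) {
            std::this_thread::yield();
        }
        // Never fires: the release/acquire pair orders the write before this read.
        assert(payload == 42);
        seen = payload;
    });

    std::thread producer([&] {
        payload = 42;  // ordinary write
        // Release store: the write above cannot move after it.
        ready.store(true, std::memory_order_release);
    });

    producer.join();
    consumer.join();
    return seen;  // safe to read: join() synchronizes with the consumer
}
```

With std::memory_order_relaxed on both sides, the consumer would be allowed to see ready == true and still read a stale payload, which is exactly the situation lock() and unlock() must rule out.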
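- SpinLock satisfies the BasicLockable requirements (it implements lock() and unlock()), so it can be used with std::lock_guard<> and std::unique_lock<> for RAII-style locking, which releases the lock automatically and is exception-safe
Optimization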
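As a usage sketch of the RAII style, the snippet below protects a plain counter with std::lock_guard. To keep it self-contained, it carries a condensed header-only copy of the SpinLock class. The helper count_with_spinlock and its parameters are assumptions made for illustration.

```cpp
#include <atomic>
#include <cassert>
#include <mutex>
#include <thread>
#include <vector>

// Condensed copy of the article's SpinLock so the sketch compiles on its own.
class SpinLock final {
public:
    void lock() { while (m_lock.test_and_set(std::memory_order_acquire)) {} }
    void unlock() { m_lock.clear(std::memory_order_release); }
private:
    std::atomic_flag m_lock = ATOMIC_FLAG_INIT;
};

// Hypothetical helper: several threads increment a plain long under the lock.
long count_with_spinlock(int threads, int iterations) {
    assert(threads > 0 && iterations >= 0);
    SpinLock lock;
    long counter = 0;  // not atomic: only touched while holding the lock
    std::vector<std::thread> workers;
    for (int t = 0; t < threads; ++t) {
        workers.emplace_back([&] {
            for (int i = 0; i < iterations; ++i) {
                std::lock_guard<SpinLock> guard(lock);  // calls lock.lock()
                ++counter;
            }  // guard's destructor calls lock.unlock(), even on an exception
        });
    }
    for (auto& w : workers) {
        w.join();
    }
    return counter;
}
```

The critical section here is a single increment, which matches the "held briefly" case where a spin lock is a reasonable choice.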
- Added the x86 pause instruction to improve the performance of the wait loop (taken from boost)
  
  Improves the performance of spin-wait loops. When executing a "spin-wait loop," a Pentium 4 or Intel Xeon processor suffers a severe performance penalty when exiting the loop because it detects a possible memory order violation. The PAUSE instruction provides a hint to the processor that the code sequence is a spin-wait loop. The processor uses this hint to avoid the memory order violation in most situations, which greatly improves processor performance. For this reason, it is recommended that a PAUSE instruction be placed in all spin-wait loops.
  An additional function of the PAUSE instruction is to reduce the power consumed by a Pentium 4 processor while executing a spin loop. The Pentium 4 processor can execute a spinwait loop extremely quickly, causing the processor to consume a lot of power while it waits for the resource it is spinning on to become available. Inserting a pause instruction in a spinwait loop greatly reduces the processor's power consumption.
  This instruction was introduced in the Pentium 4 processors, but is backward compatible with all IA-32 processors. In earlier IA-32 processors, the PAUSE instruction operates like a NOP instruction. The Pentium 4 and Intel Xeon processors implement the PAUSE instruction as a pre-defined delay. The delay is finite and can be zero for some processors. This instruction does not change the architectural state of the processor (that is, it performs essentially a delaying noop operation).
  Source: http://c9x.me/x86/html/file_module_x86_id_232.html
- Added try_lock() so that SpinLock satisfies the Lockable requirements
```cpp
// SpinLock.h
#ifndef SPINLOCK_H_
#define SPINLOCK_H_

#include <atomic>

// Spin-wait hint, adapted from boost
#if defined(_MSC_VER) && _MSC_VER >= 1310 && ( defined(_M_IX86) || defined(_M_X64) ) && !defined(__c2__)
#include <emmintrin.h>  // declares _mm_pause()
#define BOOST_SMT_PAUSE _mm_pause();
#elif defined(__GNUC__) && ( defined(__i386__) || defined(__x86_64__) )
#define BOOST_SMT_PAUSE __asm__ __volatile__( "rep; nop" : : : "memory" );
#else
// No pause hint on this platform: fall back to a plain busy-wait
#define BOOST_SMT_PAUSE
#endif

class SpinLock final {
public:
    void lock();
    bool try_lock();
    void unlock();

    SpinLock() = default;
    ~SpinLock() = default;

    // Non-copyable and non-movable
    SpinLock(const SpinLock& rhs) = delete;
    SpinLock(SpinLock&& rhs) = delete;
    SpinLock& operator=(const SpinLock& rhs) = delete;
    SpinLock& operator=(SpinLock&& rhs) = delete;

private:
    std::atomic_flag m_lock = ATOMIC_FLAG_INIT;  // clear means unlocked
};

#endif  // SPINLOCK_H_
```

```cpp
// SpinLock.cpp
#include "SpinLock.h"

void SpinLock::lock() {
    while (m_lock.test_and_set(std::memory_order_acquire)) {
        BOOST_SMT_PAUSE
    }
}

bool SpinLock::try_lock() {
    // test_and_set returns the previous value: false means the lock was free and is now ours
    return !m_lock.test_and_set(std::memory_order_acquire);
}

void SpinLock::unlock() {
    m_lock.clear(std::memory_order_release);
}
```
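Once try_lock() exists, the standard facilities that require Lockable become usable. The sketch below shows two of them, under the assumption of a condensed header-only copy of SpinLock without the pause hint. The helpers try_do_work and transfer are hypothetical names introduced only for this example.

```cpp
#include <atomic>
#include <cassert>
#include <mutex>

// Condensed copy of the article's SpinLock (pause hint omitted for brevity).
class SpinLock final {
public:
    void lock() { while (m_lock.test_and_set(std::memory_order_acquire)) {} }
    bool try_lock() { return !m_lock.test_and_set(std::memory_order_acquire); }
    void unlock() { m_lock.clear(std::memory_order_release); }
private:
    std::atomic_flag m_lock = ATOMIC_FLAG_INIT;
};

// Hypothetical helper: does the work only if the lock is free right now.
bool try_do_work(SpinLock& lock, int& value) {
    std::unique_lock<SpinLock> guard(lock, std::try_to_lock);  // calls try_lock()
    if (!guard.owns_lock()) {
        return false;  // another owner holds the lock; give up instead of spinning
    }
    ++value;
    return true;  // guard releases the lock on return
}

// Hypothetical helper: std::lock acquires both locks without deadlocking,
// and it needs try_lock() on each of them to do so.
void transfer(SpinLock& a, SpinLock& b, int& from, int& to, int amount) {
    assert(&a != &b);  // locking the same spin lock twice would never return
    std::lock(a, b);
    std::lock_guard<SpinLock> guard_a(a, std::adopt_lock);
    std::lock_guard<SpinLock> guard_b(b, std::adopt_lock);
    from -= amount;
    to += amount;
}
```

Note that the lock is not recursive: calling try_do_work while the same thread already holds the lock simply returns false.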
Further reading on memory ordering

A chat about atomic variables, locks, and memory barriers
Concurrency study: the CPU cache coherence protocol (MESI)

Original article: https://www.cnblogs.com/Keeping-Fit/p/14961258.html