zoukankan      html  css  js  c++  java
  • 对于Linux平台下C语言开发中__sync_函数的认识(转)

    reference:http://gcc.gnu.org/onlinedocs/gcc-4.1.0/gcc/Atomic-Builtins.html#Atomic-Builtins

    A built-in function is a coding extension to C and C++ that allows a programmer to use the syntax of C function calls and C variables to access the instruction set of the processor of the compiling machine.

    The following builtins are intended to be compatible with those described in the Intel Itanium Processor-specific Application Binary Interface, section 7.4. As such, they depart from the normal GCC practice of using the “__builtin_” prefix, and further that they are overloaded such that they work on multiple types.
    为了适用不同的处理器,buildin将GCC中"__builtin_"的前缀改为“__sync_"前缀的函数来扩展支持。

    The definition given in the Intel documentation allows only for the use of the types int, long, long long as well as their unsigned counterparts. GCC will allow any integral scalar or pointer type that is 1, 2, 4 or 8 bytes in length.


    Not all operations are supported by all target processors. If a particular operation cannot be implemented on the target processor, a warning will be generated and a call an external function will be generated. The external function will carry the same name as the builtin, with an additional suffix `_n' where n is the size of the data type.
    正是由于GCC对处理器所能处理的数据类型进行扩展,buildin需要将GCC进行重载。比较常见的就是数据类型如unit32_t。

    In most cases, these builtins are considered a full barrier. That is, no memory operand will be moved across the operation, either forward or backward. Further, instructions will be issued as necessary to prevent the processor from speculating loads across the operation and from queuing stores after the operation.
    buildin被视为全barrier,即没有内存在操作之间的移动。

    All of the routines are are described in the Intel documentation to take “an optional list of variables protected by the memory barrier”. It's not clear what is meant by that; it could mean that only the following variables are protected, or it could mean that these variables should in addition be protected. At present GCC ignores this list and protects all variables which are globally accessible. If in the future we make some use of this list, an empty list will continue to mean all globally accessible variables.
    这种原因是由于GCC和具体的处理器之间的约定不确定导致,对全局变量访问保护不一致。

    type __sync_fetch_and_add (type *ptr, type value, ...)
    type __sync_fetch_and_sub (type *ptr, type value, ...)
    type __sync_fetch_and_or (type *ptr, type value, ...)
    type __sync_fetch_and_and (type *ptr, type value, ...)
    type __sync_fetch_and_xor (type *ptr, type value, ...)
    type __sync_fetch_and_nand (type *ptr, type value, ...)
        These builtins perform the operation suggested by the name, and returns the value that had previously been in memory. That is,

                  { tmp = *ptr; *ptr op= value; return tmp; } 
        //not understand, maybe mistake of op.    
                  { tmp = *ptr; *ptr = ~tmp & value; return tmp; }   // nand
             
    __sys_fetch_and_XXX控制对内存的读写后的操作,保证原子性。
    下面的__sys_XXX_and_fetch同样控制对内存写操作后的读,保证原子性。

    type __sync_add_and_fetch (type *ptr, type value, ...)
    type __sync_sub_and_fetch (type *ptr, type value, ...)
    type __sync_or_and_fetch (type *ptr, type value, ...)
    type __sync_and_and_fetch (type *ptr, type value, ...)
    type __sync_xor_and_fetch (type *ptr, type value, ...)
    type __sync_nand_and_fetch (type *ptr, type value, ...)
        These builtins perform the operation suggested by the name, and return the new value. That is,

                  { *ptr op= value; return *ptr; }
                  { *ptr = ~*ptr & value; return *ptr; }   // nand    

    bool __sync_bool_compare_and_swap (type *ptr, type oldval type newval, ...)
    type __sync_val_compare_and_swap (type *ptr, type oldval type newval, ...)
        These builtins perform an atomic compare and swap. That is, if the current value of *ptr is oldval, then write newval into *ptr.

        The “bool” version returns true if the comparison is successful and newval was written. The “val” version returns the contents of *ptr before the operation.
    对于这种函数,从名字就可以看出其意义。

    __sync_synchronize (...)
        This builtin issues a full memory barrier.

    type __sync_lock_test_and_set (type *ptr, type value, ...)
        This builtin, as described by Intel, is not a traditional test-and-set operation, but rather an atomic exchange operation. It writes value into *ptr, and returns the previous contents of *ptr.


    对于锁的支持比较少,由些只存储当前为1的有效的常量值。lock和release对合局变量的进行加锁和解锁。

        Many targets have only minimal support for such locks, and do not support a full exchange operation. In this case, a target may support reduced functionality here by which the only valid value to store is the immediate constant 1. The exact value actually stored in *ptr is implementation defined.

        This builtin is not a full barrier, but rather an acquire barrier. This means that references after the builtin cannot move to (or be speculated to) before the builtin, but previous memory stores may not be globally visible yet, and previous memory loads may not yet be satisfied.
    void __sync_lock_release (type *ptr, ...)
        This builtin releases the lock acquired by __sync_lock_test_and_set. Normally this means writing the constant 0 to *ptr.

        This builtin is not a full barrier, but rather a release barrier. This means that all previous memory stores are globally visible, and all previous memory loads have been satisfied, but following memory reads are not prevented from being speculated to before the barrier. 


    除了sync and aotmic build-in functions, 还有其它的,如:
    Cache-related build-in functions, Block-related built-in functions等等。
    像Block-related built-in functions其中的一个,如_bzero,与bzero相比如下:
    /* We define this function always since `bzero' is sometimes needed when
       the namespace rules does not allow this. */
    extern void __bzero (void *__s, size_t __n) __THROW __nonnull ((1));

    /* Set N bytes of S to 0. */
    extern void bzero (void *__s, size_t __n) __THROW __nonnull ((1));

  • 相关阅读:
    LC 1383. Maximum Performance of a Team
    P3805 【模板】manacher算法
    P1226 【模板】快速幂||取余运算
    1119 Pre- and Post-order Traversals
    1117 Eddington Number (二分)
    1111 Online Map (Dij/spfa)
    1108 Finding Average
    IPC
    Json to JObject转换的使用方法
    JsonConvert序列化
  • 原文地址:https://www.cnblogs.com/zxc2man/p/7685231.html
Copyright © 2011-2022 走看看