std::hardware_destructive_interference_size, std::hardware_constructive_interference_size

From cppreference.net


Defined in header `<new>`
inline constexpr std::size_t hardware_destructive_interference_size = /*implementation-defined*/;	(1)	(since C++17)
inline constexpr std::size_t hardware_constructive_interference_size = /*implementation-defined*/;	(2)	(since C++17)

1) Minimum offset between two objects to avoid false sharing. Guaranteed to be at least alignof(std::max_align_t).

struct keep_apart
{
    alignas(std::hardware_destructive_interference_size) std::atomic<int> cat;
    alignas(std::hardware_destructive_interference_size) std::atomic<int> dog;
};
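The effect of this alignment can be checked by comparing member addresses. The following is a minimal sketch and not part of the original page: `interference_size` and `members_are_kept_apart` are hypothetical names, and the 64-byte fallback is an assumption used only when the library does not define the feature-test macro.

```cpp
#include <atomic>
#include <cassert>
#include <cstddef>
#include <new>

// Assumption: fall back to 64 bytes when the library does not provide the constant.
#ifdef __cpp_lib_hardware_interference_size
inline constexpr std::size_t interference_size = std::hardware_destructive_interference_size;
#else
inline constexpr std::size_t interference_size = 64;
#endif

struct keep_apart
{
    alignas(interference_size) std::atomic<int> cat;
    alignas(interference_size) std::atomic<int> dog;
};

// The alignment of the members propagates to the whole struct.
static_assert(alignof(keep_apart) >= interference_size);
static_assert(sizeof(keep_apart) >= 2 * interference_size);

// Hypothetical helper: returns true if cat and dog start at least
// interference_size bytes apart in a concrete object.
inline bool members_are_kept_apart()
{
    keep_apart k;
    const auto* a = reinterpret_cast<const unsigned char*>(&k.cat);
    const auto* b = reinterpret_cast<const unsigned char*>(&k.dog);
    assert(a < b); // members are laid out in declaration order
    return static_cast<std::size_t>(b - a) >= interference_size;
}
```

Calling `members_are_kept_apart()` is expected to return true, because each member starts on its own `interference_size` boundary.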

2) Maximum size of contiguous memory to promote true sharing. Guaranteed to be at least alignof(std::max_align_t).

struct together
{
    std::atomic<int> dog;
    int puppy;
};
struct kennel
{
    // Other data members...
    alignas(sizeof(together)) together pack;
    // Other data members...
};
static_assert(sizeof(together) <= std::hardware_constructive_interference_size);
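To make the intent of (2) concrete, the sketch below checks at run time that a `together` object lies entirely within one block of `line_size` bytes. It is an illustration only: `line_size`, `block_index`, and `shares_one_block` are hypothetical names, the 64-byte fallback is an assumption, and the conversion of a pointer to `std::uintptr_t` is implementation-defined (it yields the plain address on common platforms).

```cpp
#include <atomic>
#include <cstddef>
#include <cstdint>
#include <new>

// Assumption: fall back to 64 bytes when the library does not provide the constant.
#ifdef __cpp_lib_hardware_interference_size
inline constexpr std::size_t line_size = std::hardware_constructive_interference_size;
#else
inline constexpr std::size_t line_size = 64;
#endif

struct together
{
    std::atomic<int> dog;
    int puppy;
};

static_assert(sizeof(together) <= line_size);

// Index of the line_size-byte block that contains the byte at p.
inline std::uintptr_t block_index(const void* p)
{
    return reinterpret_cast<std::uintptr_t>(p) / line_size;
}

// True if the first and the last byte of obj lie in the same block.
inline bool shares_one_block(const together& obj)
{
    const auto* first = reinterpret_cast<const unsigned char*>(&obj);
    const auto* last = first + sizeof(together) - 1;
    return block_index(first) == block_index(last);
}
```

For an object declared as `alignas(sizeof(together)) together pack{};`, `shares_one_block(pack)` is expected to be true whenever `line_size` is a multiple of `sizeof(together)`, which holds for the common 64-byte case.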

Notes

These constants provide a portable way to access the L1 data cache line size.

Feature-test macro	Value	Std	Feature
`__cpp_lib_hardware_interference_size`	`201703L`	(C++17)	constexpr std::hardware_constructive_interference_size and constexpr std::hardware_destructive_interference_size
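The feature-test macro is commonly used to select a fallback value when the library does not provide the constants. The sketch below shows one possible arrangement and is not taken from the original page: the namespace `demo`, the names `destructive_size`, `padded`, and `total`, and the 64-byte fallback are all assumptions made for illustration.

```cpp
#include <array>
#include <atomic>
#include <cassert>
#include <cstddef>
#include <new>

namespace demo // hypothetical namespace
{
#ifdef __cpp_lib_hardware_interference_size
    inline constexpr std::size_t destructive_size = std::hardware_destructive_interference_size;
#else
    inline constexpr std::size_t destructive_size = 64; // assumed fallback
#endif

    // Wrapper that gives each T its own destructive_size-aligned slot,
    // so that neighbouring array elements do not share a cache line.
    template<class T>
    struct alignas(destructive_size) padded
    {
        T value{};
    };

    // Sums the counters stored in the padded slots.
    template<std::size_t N>
    unsigned long total(const std::array<padded<std::atomic<unsigned long>>, N>& slots)
    {
        unsigned long sum = 0;
        for (const auto& s : slots)
            sum += s.value.load(std::memory_order_relaxed);
        return sum;
    }
}
```

A typical use would be one slot per worker thread, for example `std::array<demo::padded<std::atomic<unsigned long>>, 4> slots;`, with each thread incrementing only its own `slots[i].value` and a reader calling `demo::total(slots)`.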

Example

The program uses two threads that atomically write to the data members of the given global objects. The first object fits in one cache line, which results in "hardware interference". The second object keeps its data members on separate cache lines, so the possible "cache synchronization" after thread writes is avoided.


#include <atomic>
#include <chrono>
#include <cstddef>
#include <cstdint>
#include <iomanip>
#include <iostream>
#include <mutex>
#include <new>
#include <thread>
#ifdef __cpp_lib_hardware_interference_size
    using std::hardware_constructive_interference_size;
    using std::hardware_destructive_interference_size;
#else
    // 64 bytes on x86-64 │ L1_CACHE_BYTES │ L1_CACHE_SHIFT │ __cacheline_aligned │ ...
    constexpr std::size_t hardware_constructive_interference_size = 64;
    constexpr std::size_t hardware_destructive_interference_size = 64;
#endif
std::mutex cout_mutex;
constexpr int max_write_iterations{10'000'000}; // the benchmark time tuning
struct alignas(hardware_constructive_interference_size)
OneCacheLiner // occupies one cache line
{
    std::atomic_uint64_t x{};
    std::atomic_uint64_t y{};
}
oneCacheLiner;
struct TwoCacheLiner // occupies two cache lines
{
    alignas(hardware_destructive_interference_size) std::atomic_uint64_t x{};
    alignas(hardware_destructive_interference_size) std::atomic_uint64_t y{};
}
twoCacheLiner;
inline auto now() noexcept { return std::chrono::high_resolution_clock::now(); }
template<bool xy>
void oneCacheLinerThread()
{
    const auto start{now()};
    for (uint64_t count{}; count != max_write_iterations; ++count)
        if constexpr (xy)
            oneCacheLiner.x.fetch_add(1, std::memory_order_relaxed);
        else
            oneCacheLiner.y.fetch_add(1, std::memory_order_relaxed);
    const std::chrono::duration<double, std::milli> elapsed{now() - start};
    std::lock_guard lk{cout_mutex};
    std::cout << "oneCacheLinerThread() spent " << elapsed.count() << " ms\n";
    if constexpr (xy)
        oneCacheLiner.x = elapsed.count();
    else
        oneCacheLiner.y = elapsed.count();
}
template<bool xy>
void twoCacheLinerThread()
{
    const auto start{now()};
    for (uint64_t count{}; count != max_write_iterations; ++count)
        if constexpr (xy)
            twoCacheLiner.x.fetch_add(1, std::memory_order_relaxed);
        else
            twoCacheLiner.y.fetch_add(1, std::memory_order_relaxed);
    const std::chrono::duration<double, std::milli> elapsed{now() - start};
    std::lock_guard lk{cout_mutex};
    std::cout << "twoCacheLinerThread() spent " << elapsed.count() << " ms\n";
    if constexpr (xy)
        twoCacheLiner.x = elapsed.count();
    else
        twoCacheLiner.y = elapsed.count();
}
int main()
{
    std::cout << "__cpp_lib_hardware_interference_size "
#   ifdef __cpp_lib_hardware_interference_size
        "= " << __cpp_lib_hardware_interference_size << '\n';
#   else
        "is not defined, use " << hardware_destructive_interference_size
                               << " as fallback\n";
#   endif
    std::cout << "hardware_destructive_interference_size == "
              << hardware_destructive_interference_size << '\n'
              << "hardware_constructive_interference_size == "
              << hardware_constructive_interference_size << "\n\n"
              << std::fixed << std::setprecision(2)
              << "sizeof( OneCacheLiner ) == " << sizeof(OneCacheLiner) << '\n'
              << "sizeof( TwoCacheLiner ) == " << sizeof(TwoCacheLiner) << "\n\n";
    constexpr int max_runs{4};
    int oneCacheLiner_average{0};
    for (auto i{0}; i != max_runs; ++i)
    {
        std::thread th1{oneCacheLinerThread<0>};
        std::thread th2{oneCacheLinerThread<1>};
        th1.join();
        th2.join();
        oneCacheLiner_average += oneCacheLiner.x + oneCacheLiner.y;
    }
    std::cout << "Average T1 time: "
              << (oneCacheLiner_average / max_runs / 2) << " ms\n\n";
    int twoCacheLiner_average{0};
    for (auto i{0}; i != max_runs; ++i)
    {
        std::thread th1{twoCacheLinerThread<0>};
        std::thread th2{twoCacheLinerThread<1>};
        th1.join();
        th2.join();
        twoCacheLiner_average += twoCacheLiner.x + twoCacheLiner.y;
    }
    std::cout << "Average T2 time: "
              << (twoCacheLiner_average / max_runs / 2) << " ms\n\n"
              << "Ratio T1/T2:~ "
              << 1.0 * oneCacheLiner_average / twoCacheLiner_average << '\n';
}

Possible output:

__cpp_lib_hardware_interference_size = 201703
hardware_destructive_interference_size == 64
hardware_constructive_interference_size == 64

sizeof( OneCacheLiner ) == 64
sizeof( TwoCacheLiner ) == 128

oneCacheLinerThread() spent 517.83 ms
oneCacheLinerThread() spent 533.43 ms
oneCacheLinerThread() spent 527.36 ms
oneCacheLinerThread() spent 555.69 ms
oneCacheLinerThread() spent 574.74 ms
oneCacheLinerThread() spent 591.66 ms
oneCacheLinerThread() spent 555.63 ms
oneCacheLinerThread() spent 555.76 ms
Average T1 time: 550 ms

twoCacheLinerThread() spent 89.79 ms
twoCacheLinerThread() spent 89.94 ms
twoCacheLinerThread() spent 89.46 ms
twoCacheLinerThread() spent 90.28 ms
twoCacheLinerThread() spent 89.73 ms
twoCacheLinerThread() spent 91.11 ms
twoCacheLinerThread() spent 89.17 ms
twoCacheLinerThread() spent 90.09 ms
Average T2 time: 89 ms

Ratio T1/T2:~ 6.16

See also

hardware_concurrency [static]	returns the number of concurrent threads supported by the implementation (public static member function of `std::thread`)
hardware_concurrency [static]	returns the number of concurrent threads supported by the implementation (public static member function of `std::jthread`)
