借 shared_ptr 實現 copy-on-write

發表於2016-11-06

場景：

一個多執行緒的C++程式，24h x 5.5d執行。有幾個工作執行緒ThreadW{0,1,2,3}，處理客戶發過來的交易請求，另外有一個背景執行緒ThreadB，不定期更新程式內部的參考資料。這些執行緒都跟一個hash表打交道，工作執行緒只讀，背景執行緒讀寫，必然要用到一些同步機制，防止資料損壞。

這裡的示例程式碼用std::map代替hash表，意思是一樣的：

typedef map<string, vector<pair<string, int> > > Map;

map 的 key 是使用者名稱，value 是一個vector，裡邊存的是不同stock的最小交易間隔，vector已經排好序，可以用二分查詢。

我們的系統要求工作執行緒的延遲儘可能小，可以容忍背景執行緒的延遲略大。一天之內，背景執行緒對資料更新的次數屈指可數，最多一小時一次，更新的資料來自於網路，所以對更新的及時性不敏感。Map的資料量也不大，大約一千多條資料。

最簡單的同步辦法，用讀寫鎖，工作執行緒加讀鎖，背景執行緒加寫鎖。但是讀寫鎖的開銷比普通mutex要大，如果工作執行緒能用最普通的非重入Mutex實現同步，就不必用讀寫鎖，效能較高。我們藉助shared_ptr實現了這一點：

class Mutex;
class MutexLockGuard;

class CustomerData
{
 public:
  CustomerData() : data_(new Map)
  { }

  ~CustomerData();

  int query(const string& customer, const string& stock) const
  {
    MapPtr data = getData();
    // data 一旦拿到，就不再需要鎖了。取資料的時候只有getData()內部有鎖，多執行緒併發讀的效能很好。

    // 假設使用者肯定存在
    const EntryList& entries = (*data)[customer];
    return findEntry(entries, stock);
  }

  void update(const string& customer, const EntryList& entries );

 private:
  typedef vector<string, int> EntryList;
  typedef map<string, EntryList> Map;
  typedef tr1::shared_ptr<Map> MapPtr;

  static int findEntry(const EntryList& entries, const string& key) const
  { /* 用 lower_bound 在 entries 裡找 key */ }

  MapPtr getData() const
  {
    MutexLockGuard lock(dataMutex_);
    return data_;
  }

  MapPtr data_;
  mutable Mutex dataMutex_;
};

class Mutex;

class MutexLockGuard;

class CustomerData

{

public:

CustomerData() : data_(new Map)

{ }

~CustomerData();

int query(const string& customer, const string& stock) const

{

MapPtr data = getData();

// data 一旦拿到，就不再需要鎖了。取資料的時候只有getData()內部有鎖，多執行緒併發讀的效能很好。

// 假設使用者肯定存在

const EntryList& entries = (*data)[customer];

return findEntry(entries, stock);

}

void update(const string& customer, const EntryList& entries );

private:

typedef vector<string, int> EntryList;

typedef map<string, EntryList> Map;

typedef tr1::shared_ptr<Map> MapPtr;

static int findEntry(const EntryList& entries, const string& key) const

{ /* 用 lower_bound 在 entries 裡找 key */ }

MapPtr getData() const

{

MutexLockGuard lock(dataMutex_);

return data_;

}

MapPtr data_;

mutable Mutex dataMutex_;

};

關鍵看CustomerData::update()怎麼寫。既然要更新資料，那肯定得加鎖，如果這時候其他執行緒正在讀，那麼不能在原來的資料上修改，得建立一個副本，在副本上修改，修改完了再替換。如果沒有使用者在讀，那麼就能直接修改，節約一次拷貝。

void CustomerData::update(const string& customer, const EntryList& entries )
{
  MutexLockGuard lock(dataMutex_);
  if (!data_.unique())
  {
    MapPtr newData(new Map(*data_));
    data_.swap(newData);
  }
  assert(data_.unique());
  (*data_)[customer] = entries;
}

void CustomerData::update(const string& customer, const EntryList& entries )

{

MutexLockGuard lock(dataMutex_);

if (!data_.unique())

{

MapPtr newData(new Map(*data_));

data_.swap(newData);

}

assert(data_.unique());

(*data_)[customer] = entries;

}

注意其中用了shared_ptr::unique()來判斷是不是有人在讀，如果有人在讀，那麼我們不能直接修改，因為query()並沒有全程加鎖，只在getData()內部有鎖。shared_ptr::swap()把data_替換為新副本，而且我們還在鎖裡，不會有別的執行緒來讀，可以放心地更新。

據我們測試，大多數情況下更新都是在原來資料上進行的，拷貝的比例還不到1%，很高效。更準確的說，這不是copy-on-write，而是copy-on-other-reading。

我們將來可能會採用無鎖資料結構，不過目前這個實現已經非常好，滿足我們的要求。

shared_ptr實現多執行緒讀寫copy-on-write
2016-09-26
執行緒
Copy-On-Write技術
2014-03-13
Proxy模式：copy-on-write的疑惑
2007-09-04
模式
借鑑redux，實現一個react狀態管理方案
2019-05-29
ReduxReact
shared_ptr原始碼分析
2017-01-21
原始碼
std::string 的 Copy-on-Write：不如想象中美好
2016-08-18
智慧指標思想實踐(std::unique_ptr, std::shared_ptr)
2022-07-09
指標
智者當借力而行, 藉助Autodesk應用程式商店實現名利雙收
2014-01-13
小程式實戰：線上借書平臺
2018-10-08
shared_ptr和動態陣列
2019-02-01
陣列
shared_ptr原始碼分析後續
2017-01-21
原始碼
C++中的std::shared_ptr
2024-11-08
C++
基於區塊鏈的金融借貸交易平臺開發流程與實現
2023-04-20
區塊鏈
直播16小時實現31萬交易額，小眾新茶飲品牌借短影片“出圈”
2022-12-20
shared_ptr的理解和注意事項
2017-01-22
智慧指標(auto_ptr 和 shared_ptr)
2013-05-31
指標
採用 SOA 最佳實踐，借鑑經驗教訓
2010-04-19
Linux--寫時複製（Copy-On-Write,COW）技術簡述
2024-06-28
Linux
shared_ptr的執行緒安全性分析
2017-01-22
執行緒
智慧指標之手撕共享指標shared_ptr
2024-09-22
指標
《開源框架那些事兒21》：巧借力與借巧力
2017-04-23
框架
借“古商貿文化”實現題材破局，益世界這款新品憑什麼紅海突圍？
2021-05-28
shared_ptr的概念和一些特性調查
2024-05-17
借貸寶遭舉報，戳一戳“熟人借貸”模式的軟肋
2016-01-17
模式
C++基礎回顧4——智慧指標shared_ptr
2018-06-04
C++指標
C++基礎::shared_ptr 程式設計細節（一）
2015-11-18
C++程式設計
【死磕 Java 基礎】 — 談談那個寫時拷貝技術(copy-on-write)
2021-08-15
Java
企業如何借實時湖倉贏在“資料制勝”時代？
2023-12-12
C++智慧指標之shared_ptr與右值引用(詳細)
2021-07-12
C++指標
C++14 智慧指標unique_ptr、shared_ptr、weak_ptr
2017-07-09
C++指標
為什麼多執行緒讀寫shared_ptr需要加鎖
2017-01-22
執行緒
BitPay：2016年全球P2P借貸行業現狀分析
2016-02-04
行業
OA系統之檔案借閱管理，對檔案的去向實時掌控
2020-02-05
商業研究(15)：網際網路金融の信用借貸，有信用就可以借錢
2016-05-08
element 學習借鑑 p1
2018-11-01
ERP系統借貸關係
2009-12-09
python演算法:借書方案
2024-05-16
Python演算法
週末要聞回顧：阿里全現金收購優土蘋果借腹生車
2015-11-09
阿里蘋果

借 shared_ptr 實現 copy-on-write

相關文章