nsq topic

Posted by 樑天 on 2021-10-08

Original URL: https://www.cnblogs.com/gwyy/p/15382876.html

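The code related to Topic lives mainly in nsqd/topic.go.

The previous article walked through nsq's startup flow and gave a rough picture of the overall architecture. This article narrows the focus and covers the topic component in detail.

A topic manages multiple channels: it receives messages published by clients and forwards them to its channels, which in turn deliver them to consumers. When nsqd initializes, it loads the previously persisted topics and calls topic.Start() on all of them at the end; a newly created topic starts running after it has been synced to lookupd. nsqd creates multiple topics to manage different categories of channels.

The Topic struct: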

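type Topic struct {
  // 64bit atomic vars need to be first for proper alignment on 32bit platforms
  // These two fields are statistics only; keeping them first lets atomic operations work on 32-bit platforms
  messageCount uint64  // total number of messages received
  messageBytes uint64  // total bytes of message bodies received

  sync.RWMutex  // guards the topic's state, including putMessage

  name              string // topic name; producers and consumers must specify it
  channelMap        map[string]*Channel  // maps each channel name to its *Channel
  backend           BackendQueue    // disk queue; messages go here when memoryMsgChan is full
  memoryMsgChan     chan *Message    // in-memory queue; messages are stored here first
  startChan         chan int    // receives the start signal; calling Start begins the topic's message loop

  exitChan          chan int    // signals that the topic is exiting

  // Every select should include exitChan,
  // unless it has a default case or is otherwise guaranteed not to block forever, so that the loop can exit
  // Notifies the message loop to refresh its slice of channels when the topic's channels change
  channelUpdateChan chan int
  // used to wait for all child goroutines
  waitGroup         util.WaitGroupWrapper
  exitFlag          int32     // set to 1 once the topic is exiting
  idFactory         *guidFactory    // factory that generates GUIDs

  ephemeral      bool  // whether this topic is ephemeral
  deleteCallback func(*Topic)   // callback invoked when the topic is deleted
  deleter        sync.Once   // ensures deleteCallback runs only once

  paused    int32   // whether the topic is paused
  pauseChan chan int   // signals a change of the paused/running state
  ctx *context  // the topic's context
}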

As shown, a topic tracks all of its channels in a map[string]*Channel, and it has two queues: memoryMsgChan and backend.
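The comment on the first two fields of the struct can be illustrated with a small sketch. The type topicStats and its record method are hypothetical names introduced only for this example; the point is that the uint64 counters sit at the start of the struct and are only touched through sync/atomic.

```go
package main

import (
	"fmt"
	"sync/atomic"
)

// topicStats is a hypothetical stand-in for the first two fields of Topic.
// The uint64 counters come first so they stay 64-bit aligned on 32-bit platforms.
type topicStats struct {
	messageCount uint64
	messageBytes uint64
	name         string
}

// record updates both counters atomically for one message body.
func (s *topicStats) record(body []byte) {
	atomic.AddUint64(&s.messageCount, 1)
	atomic.AddUint64(&s.messageBytes, uint64(len(body)))
}

func main() {
	s := &topicStats{name: "orders"}
	s.record([]byte("hello"))
	s.record([]byte("nsq"))
	fmt.Println(atomic.LoadUint64(&s.messageCount), atomic.LoadUint64(&s.messageBytes))
}
```

Because the counters are updated atomically, callers can bump them without taking the topic's RWMutex.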

Instantiating a Topic:

Below is the topic creation flow. The parameters are the topic name, the context, and the delete callback:

func NewTopic(topicName string, ctx *context, deleteCallback func(*Topic)) *Topic {
  t := &Topic{
    name:              topicName, // topic name
    channelMap:        make(map[string]*Channel),
    memoryMsgChan:     nil,
    startChan:         make(chan int, 1),
    exitChan:          make(chan int),
    channelUpdateChan: make(chan int),
    ctx:               ctx, // context pointer
    paused:            0,
    pauseChan:         make(chan int),
    deleteCallback:    deleteCallback, // delete callback
    // every topic's guidFactory is seeded with the same nsqd ID, ctx.nsqd.getOpts().ID
    idFactory:         NewGUIDFactory(ctx.nsqd.getOpts().ID),
  }
  // create mem-queue only if size > 0 (do not use unbuffered chan)
  // size the message chan from the configured memory queue size (default 10000)
  if ctx.nsqd.getOpts().MemQueueSize > 0 {
    // initialize the in-memory message queue
    t.memoryMsgChan = make(chan *Message, ctx.nsqd.getOpts().MemQueueSize)
  }
  // check whether this topic is ephemeral; an ephemeral topic keeps its messages in memory only
  // DummyBackendQueue and diskqueue both implement the backend interface
  if strings.HasSuffix(topicName, "#ephemeral") {
    // ephemeral topic: set the flag and initialize backend with newDummyBackendQueue
    t.ephemeral = true
    t.backend = newDummyBackendQueue()   // implements backend without any logic; every operation just returns nil
  } else {
    dqLogf := func(level diskqueue.LogLevel, f string, args ...interface{}) {
      opts := ctx.nsqd.getOpts()
      lg.Logf(opts.Logger, opts.LogLevel, lg.LogLevel(level), f, args...)
    }
    // initialize the backend queue with diskqueue
    t.backend = diskqueue.New(
      topicName,
      ctx.nsqd.getOpts().DataPath,
      ctx.nsqd.getOpts().MaxBytesPerFile,
      int32(minValidMsgLength),
      int32(ctx.nsqd.getOpts().MaxMsgSize)+minValidMsgLength,
      ctx.nsqd.getOpts().SyncEvery,
      ctx.nsqd.getOpts().SyncTimeout,
      dqLogf,
    )
  }
  // run messagePump in a new goroutine
  // messagePump is the receiver of startChan; it distributes every message the topic receives to the topic's channels
  t.waitGroup.Wrap(t.messagePump)
  // call Notify
  t.ctx.nsqd.Notify(t)

  return t
}

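As the code shows, NewTopic first builds a *Topic. It then initializes the memoryMsgChan queue, which holds 10000 messages by default. Next it checks whether topicName denotes an ephemeral topic: if so, backend is a dummy queue that implements the BackendQueue interface but stores nothing; otherwise backend is initialized with diskqueue.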
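The backend selection described above might be sketched as follows. This is a simplified illustration, not nsqd's code: the backendQueue interface, dummyQueue, sliceQueue (standing in for diskqueue), and newBackend are all hypothetical names, and the interface is reduced to two methods.

```go
package main

import (
	"fmt"
	"strings"
)

// backendQueue is a simplified, hypothetical version of the BackendQueue interface.
type backendQueue interface {
	Put([]byte) error
	Depth() int64
}

// dummyQueue accepts every write and stores nothing.
type dummyQueue struct{}

func (dummyQueue) Put([]byte) error { return nil }
func (dummyQueue) Depth() int64     { return 0 }

// sliceQueue stands in for diskqueue and keeps messages in a slice.
type sliceQueue struct{ items [][]byte }

func (q *sliceQueue) Put(b []byte) error { q.items = append(q.items, b); return nil }
func (q *sliceQueue) Depth() int64       { return int64(len(q.items)) }

// newBackend picks the backend from the topic name suffix.
func newBackend(topicName string) backendQueue {
	if strings.HasSuffix(topicName, "#ephemeral") {
		return dummyQueue{}
	}
	return &sliceQueue{}
}

func main() {
	for _, name := range []string{"orders", "metrics#ephemeral"} {
		b := newBackend(name)
		_ = b.Put([]byte("msg"))
		fmt.Printf("%s depth=%d\n", name, b.Depth())
	}
}
```

Hiding both variants behind one interface keeps the rest of the topic code free of ephemeral-specific branches.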

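NewTopic then starts a new goroutine running messagePump, which runs the message loop and delivers messages entering the topic to its channels.

Finally, NewTopic calls t.ctx.nsqd.Notify(t). Notify is called whenever a topic or channel is created or stopped, and it writes the topic and channel information to a file by calling PersistMetadata.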

func (n *NSQD) Notify(v interface{}) {
  persist := atomic.LoadInt32(&n.isLoading) == 0
  n.waitGroup.Wrap(func() {
    // by selecting on exitChan we guarantee that
    // we do not block exit, see issue #123
    select {
    // if exitChan is already closed at this point, take the exit path
    case <-n.exitChan:
      // otherwise take the normal path and send the value on notifyChan
    case n.notifyChan <- v:
      if !persist {
        return
      }
      n.Lock()
      err := n.PersistMetadata()
      if err != nil {
        n.logf(LOG_ERROR, "failed to persist metadata - %s", err)
      }
      n.Unlock()
    }
  })
}

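Notify first decides whether to persist at all: if nsqd is still initializing, nothing needs to be persisted immediately, because nsqd performs one unified persistence pass after initialization.

Notify persists asynchronously, so topics and channels can call it synchronously without blocking. Within the asynchronous goroutine, waitGroup together with the select on exitChan ensures the goroutine exits cleanly when the program shuts down.

Before persisting, the statement case n.notifyChan <- v: sends the value on notifyChan. This wakes the part of lookupLoop (in nsqd/lookup.go) that receives from notifyChan, which registers or unregisters the corresponding topic or channel with lookupd.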
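The non-blocking behaviour of Notify can be reproduced in isolation. The sketch below assumes a hypothetical notifier type that keeps only a WaitGroup, an exit channel, and a notify channel; persistence is left out entirely.

```go
package main

import (
	"fmt"
	"sync"
)

// notifier is a hypothetical reduction of the parts of NSQD that Notify uses.
type notifier struct {
	wg         sync.WaitGroup
	exitChan   chan struct{}
	notifyChan chan string
}

// notify returns immediately; the send happens in a goroutine that gives up
// as soon as exitChan is closed, so shutdown never waits on a missing receiver.
func (n *notifier) notify(v string) {
	n.wg.Add(1)
	go func() {
		defer n.wg.Done()
		select {
		case <-n.exitChan:
		case n.notifyChan <- v:
		}
	}()
}

func main() {
	n := &notifier{exitChan: make(chan struct{}), notifyChan: make(chan string)}

	n.notify("topic-a")
	fmt.Println("received:", <-n.notifyChan)

	n.notify("topic-b") // nobody will receive this one
	close(n.exitChan)   // unblocks the pending goroutine
	n.wg.Wait()
	fmt.Println("clean exit")
}
```

Without the exitChan case, the second goroutine would block forever on the send and wg.Wait() would never return.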

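Writing messages to a Topic

Clients publish messages to a specific topic through nsqd's HTTP API or TCP API, and the HTTP or TCP module delivers them by calling PutMessage or PutMessages on that topic. Both go through the topic's private put function; the only difference is that PutMessage calls put once, while PutMessages iterates over the messages and calls put for each one. By default the topic tries the memoryMsgChan queue first and writes to the disk queue only when the memory queue is full. (For an ephemeral topic the disk queue stores nothing, so such messages are simply dropped.)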

func (t *Topic) put(m *Message) error {
    select {
    case t.memoryMsgChan <- m:
    default:
        // write to the disk queue
    }
    return nil
}
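A runnable miniature of the memory-first strategy in put, under the assumption that a plain slice can stand in for the disk queue. The miniTopic type and its fields are hypothetical.

```go
package main

import "fmt"

// miniTopic is a hypothetical reduction of Topic: a bounded in-memory channel
// plus a slice standing in for the disk-backed queue.
type miniTopic struct {
	memory   chan string
	overflow []string
}

// put tries the memory channel first and falls back to the overflow store
// when the channel is full (or nil, i.e. the memory queue is disabled).
func (t *miniTopic) put(m string) {
	select {
	case t.memory <- m:
	default:
		t.overflow = append(t.overflow, m)
	}
}

func main() {
	t := &miniTopic{memory: make(chan string, 2)}
	for _, m := range []string{"a", "b", "c"} {
		t.put(m)
	}
	fmt.Println(len(t.memory), t.overflow)
}
```

With a buffer of two, the first two messages stay in memory and the third lands in the overflow store. A nil memory channel sends everything to the overflow store, which matches the MemQueueSize > 0 check in NewTopic.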

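Start and messagePump

The topic's Start method simply sends a signal on startChan. There is a small trick here: nsq wraps the send in a select, so that if Start is called concurrently, the second call falls through to default and does nothing.

Where is Start called from?

1. When nsqd starts, LoadMetadata loads the topics recorded in the metadata file into memory and then calls Start on them.

2. When a request needs a topic, getTopic fetches or creates the topic and starts it.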

func (t *Topic) Start() {
  select {
  case t.startChan <- 1:
  default:
  }
}
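The effect of the select/default pair in Start is easy to observe once the send reports its outcome. In this sketch the starter type is hypothetical, and start returns a bool purely for illustration; the real Start returns nothing.

```go
package main

import "fmt"

// starter is a hypothetical type holding only the start signal.
type starter struct {
	startChan chan int
}

// start performs a non-blocking send; it reports whether the signal was queued.
func (s *starter) start() bool {
	select {
	case s.startChan <- 1:
		return true
	default:
		return false
	}
}

func main() {
	s := &starter{startChan: make(chan int, 1)}
	fmt.Println(s.start(), s.start()) // second call finds the buffer full
	<-s.startChan                     // the pump goroutine would consume it here
}
```

The buffer of size one matters: it lets the first call succeed even if the receiving goroutine has not reached its select yet.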

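Next, messagePump. It is the receiver of the startChan signal mentioned above, and it runs in a new goroutine started through waitGroup when the topic is created. It does real work only after startChan fires; until then it blocks in the loop below, leaving only if the topic exits.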

for {
    select {
    case <-t.channelUpdateChan:
      continue
    case <-t.pauseChan:
      continue
    case <-t.exitChan:
      goto exit
    case <-t.startChan:
    }
    break
  }

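Once started, messagePump first collects the currently existing channels into a slice and sets memoryMsgChan and backendChan. It then enters the message loop, which handles four kinds of events:

Messages arriving on the two Go channels memoryMsgChan and backendChan, which are delivered to every channel in the current slice
Updates to the channels under the topic
Pausing and resuming the topic
Deletion of the topic

Message delivery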

case msg = <-memoryMsgChan:
case buf = <-backendChan:
    msg, err = decodeMessage(buf)
    if err != nil {
        t.ctx.nsqd.logf("ERROR: failed to decode message - %s", err)
        continue
    }

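These two case statements handle messages entering the topic; the difference between the two Go channels will be analyzed in a later post. A message read from memoryMsgChan is a *Message, whereas a message read from backendChan is a byte slice, so after a read from backendChan decodeMessage must be called to decode the bytes into a *Message. Either way the result ends up in the msg variable.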

for i, channel := range chans {
    chanMsg := msg
    if i > 0 {
        chanMsg = NewMessage(msg.ID, msg.Body)
        chanMsg.Timestamp = msg.Timestamp
        chanMsg.deferred = msg.deferred
    }
    if chanMsg.deferred != 0 {
        channel.StartDeferredTimeout(chanMsg, chanMsg.deferred)
        continue
    }
    err := channel.PutMessage(chanMsg)
    if err != nil {
        t.ctx.nsqd.logf(
            "TOPIC(%s) ERROR: failed to put msg(%s) to channel(%s) - %s",
            t.name, msg.ID, channel.name, err)
    }
}

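The message is then put into every channel. There is an optimization here: the first iteration sends the original message as is, which saves one object copy, and every later iteration sends a copy. An immediate message is delivered by calling the channel's PutMessage, and a deferred message by calling the channel's StartDeferredTimeout. The details of both functions will be covered in later posts.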
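The copy-on-fan-out optimization can be checked with pointer comparisons. The sketch below uses hypothetical message and channelQueue types and skips deferred delivery; only the "original for the first channel, copies for the rest" rule is modelled.

```go
package main

import "fmt"

// message and channelQueue are hypothetical, simplified stand-ins.
type message struct {
	ID   string
	Body []byte
}

type channelQueue struct {
	name string
	msgs []*message
}

// fanOut hands the original message to the first channel and a fresh copy
// to every other one, so channels never share a *message.
func fanOut(msg *message, chans []*channelQueue) {
	for i, c := range chans {
		m := msg
		if i > 0 {
			m = &message{ID: msg.ID, Body: msg.Body}
		}
		c.msgs = append(c.msgs, m)
	}
}

func main() {
	chans := []*channelQueue{{name: "a"}, {name: "b"}, {name: "c"}}
	orig := &message{ID: "1", Body: []byte("hi")}
	fanOut(orig, chans)
	fmt.Println(chans[0].msgs[0] == orig, chans[1].msgs[0] == orig)
}
```

Each channel tracks per-message state such as attempts and timeouts independently, which is a plausible reason the channels must not share one message object.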

Updating the channels under a Topic

case <-t.channelUpdateChan:
    chans = chans[:0]
    t.RLock()
    for _, c := range t.channelMap {
        chans = append(chans, c)
    }
    t.RUnlock()
    if len(chans) == 0 || t.IsPaused() {
        memoryMsgChan = nil
        backendChan = nil
    } else {
        memoryMsgChan = t.memoryMsgChan
        backendChan = t.backend.ReadChan()
    }
    continue

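The channel update is straightforward: every channel is taken from channelMap and collected into a slice for later delivery. Whether memoryMsgChan and backendChan are set to nil depends on whether any channel exists and whether the topic is paused.

Pausing and resuming a Topic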

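case <-t.pauseChan:
    if len(chans) == 0 || t.IsPaused() {
        memoryMsgChan = nil
        backendChan = nil
    } else {
        memoryMsgChan = t.memoryMsgChan
        backendChan = t.backend.ReadChan()
    }
    continue

This case handles both pausing and resuming; since pauseChan is a chan int that only carries a signal, t.IsPaused() tells which of the two it is. Pausing and resuming works much like the channel update: whether memoryMsgChan and backendChan are assigned depends on whether the topic is paused and whether it has any channels.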
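Setting memoryMsgChan and backendChan to nil works because a receive from a nil channel never becomes ready, so the corresponding select case is effectively switched off. A minimal sketch of that idiom, with a hypothetical receive helper and a default case added so the example does not block:

```go
package main

import "fmt"

// receive returns a message if one is ready on src, honouring the pause flag
// by swapping in a nil channel, which makes that select case never fire.
func receive(src chan string, paused bool) (string, bool) {
	active := src
	if paused {
		active = nil
	}
	select {
	case m := <-active:
		return m, true
	default:
		return "", false
	}
}

func main() {
	src := make(chan string, 1)
	src <- "hello"

	_, ok := receive(src, true)
	fmt.Println(ok) // false: message stays queued while paused

	m, ok := receive(src, false)
	fmt.Println(m, ok)
}
```

The queued message is not lost while paused; it is simply not read until the real channel is swapped back in.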

Exiting messagePump

case <-t.exitChan:
    goto exit

// ...
exit:
    t.ctx.nsqd.logf("TOPIC(%s): closing ... messagePump", t.name)
}
// End of messagePump

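messagePump learns that the topic is going away by listening on exitChan. When the topic is closed or deleted, it jumps to the end of the function, writes a log line, and leaves the message loop.

Closing and deleting a Topic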

// Delete empties the topic and all its channels and closes
func (t *Topic) Delete() error {
    return t.exit(true)
}

// Close persists all outstanding topic data and closes all its channels
func (t *Topic) Close() error {
    return t.exit(false)
}

func (t *Topic) exit(deleted bool) error {
    if !atomic.CompareAndSwapInt32(&t.exitFlag, 0, 1) {
        return errors.New("exiting")
    }

    if deleted {
        t.ctx.nsqd.logf("TOPIC(%s): deleting", t.name)

        // since we are explicitly deleting a topic (not just at system exit time)
        // de-register this from the lookupd
        t.ctx.nsqd.Notify(t)
    } else {
        t.ctx.nsqd.logf("TOPIC(%s): closing", t.name)
    }

    close(t.exitChan)

    // synchronize the close of messagePump()
    t.waitGroup.Wait()

    if deleted {
        t.Lock()
        for _, channel := range t.channelMap {
            delete(t.channelMap, channel.name)
            channel.Delete()
        }
        t.Unlock()

        // empty the queue (deletes the backend files, too)
        t.Empty()
        return t.backend.Delete()
    }

    // close all the channels
    for _, channel := range t.channelMap {
        err := channel.Close()
        if err != nil {
            // we need to continue regardless of error to close all the channels
            t.ctx.nsqd.logf("ERROR: channel(%s) close - %s", channel.name, err)
        }
    }

    // write anything leftover to disk
    t.flush()
    return t.backend.Close()
}
// Exiting returns a boolean indicating if this topic is closed/exiting
func (t *Topic) Exiting() bool {
    return atomic.LoadInt32(&t.exitFlag) == 1
}

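Close and Delete are both implemented by calling exit; only the argument differs: Delete calls exit(true) and Close calls exit(false). On entry, exit uses atomic.CompareAndSwapInt32 to check whether the topic is already exiting. If not, it sets the exit flag; if so, it returns an error rather than running the exit logic again. Then, for a delete, it calls Notify to inform lookupd so that the topic is deregistered there.

Next, exit calls close(t.exitChan) to tell the running goroutines that the topic has stopped, and t.waitGroup.Wait() to wait for the goroutines in waitGroup to finish.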
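The first half of exit (the compare-and-swap guard, the close broadcast, and the wait) can be exercised on its own. The worker type below is hypothetical and its background goroutine only stands in for messagePump.

```go
package main

import (
	"errors"
	"fmt"
	"sync"
	"sync/atomic"
)

// worker is a hypothetical type with the same shutdown fields as Topic.
type worker struct {
	exitFlag int32
	exitChan chan struct{}
	wg       sync.WaitGroup
}

func newWorker() *worker {
	w := &worker{exitChan: make(chan struct{})}
	w.wg.Add(1)
	go func() {
		defer w.wg.Done()
		<-w.exitChan // stands in for the messagePump loop
	}()
	return w
}

// exit runs the shutdown sequence once; later calls return an error.
func (w *worker) exit() error {
	if !atomic.CompareAndSwapInt32(&w.exitFlag, 0, 1) {
		return errors.New("exiting")
	}
	close(w.exitChan) // wakes every goroutine selecting on exitChan
	w.wg.Wait()       // returns once the background goroutine has finished
	return nil
}

func main() {
	w := newWorker()
	fmt.Println(w.exit())
	fmt.Println(w.exit())
}
```

The guard also protects close(w.exitChan): closing an already closed channel panics, so the flag must be checked before the close.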

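Finally, delete and close each run their own cleanup logic:

For a delete, channelMap is emptied and every channel is deleted, then all undelivered messages in memory and on disk are removed, and finally the disk files managed by backend are deleted.
For a close, channelMap is left intact and the channels are only closed. flush then uses writeMessageToBackend to save every undelivered message in memoryMsgChan to disk, and finally the disk files managed by backend are closed.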

func (t *Topic) flush() error {
    //...
    for {
        select {
        case msg := <-t.memoryMsgChan:
            err := writeMessageToBackend(&msgBuf, msg, t.backend)
            if err != nil {
                t.ctx.nsqd.logf(
                    "ERROR: failed to write message to backend - %s", err)
            }
        default:
            goto finish
        }
    }
    
finish:
    return nil
}

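flush also relies on the default branch to detect that all messages have been handled. No producer is feeding memoryMsgChan any more at this point, so a receive that would block means the queue has been drained.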

func (t *Topic) Empty() error {
    for {
        select {
        case <-t.memoryMsgChan:
        default:
            goto finish
        }
    }

finish:
    return t.backend.Empty()
}

Empty, used when deleting a topic, follows the same logic as flush, except that it discards the messages in memoryMsgChan instead of saving them.
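Both flush and Empty are instances of the same non-blocking drain loop. The sketch below folds them into one hypothetical helper, drain, whose keep callback plays the role of writeMessageToBackend; passing nil gives the discard behaviour of Empty.

```go
package main

import "fmt"

// drain empties a buffered channel without blocking and passes each message
// to keep; a nil keep discards the messages.
func drain(ch chan string, keep func(string)) int {
	n := 0
	for {
		select {
		case m := <-ch:
			if keep != nil {
				keep(m)
			}
			n++
		default:
			return n
		}
	}
}

func main() {
	ch := make(chan string, 3)
	ch <- "a"
	ch <- "b"

	var saved []string
	flushed := drain(ch, func(m string) { saved = append(saved, m) })
	fmt.Println(flushed, saved)

	ch <- "c"
	dropped := drain(ch, nil)
	fmt.Println(dropped, len(ch))
}
```

The loop is only correct once producers have stopped; with a live producer, hitting default would merely mean the channel was empty at that instant.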

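That covers most of the topic source code. It has not yet been tied together with the other parts, but the picture is already clear: each topic starts its message-loop goroutine when it is created and begins distributing messages once Start has been called. A regular topic has, besides the in-memory queue with its default size of 10000, a disk queue as well. The topic distributes the messages it receives to the channels it manages. Each topic runs only one goroutine, the message-distribution goroutine messagePump.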
