node叢集（cluster）

duan777發表於2021-01-29

原文網址 : https://www.cnblogs.com/helloxiaoduan/p/14345825.html

使用例子

為了讓node應用能夠在多核伺服器中提高效能，node提供cluster API，用於建立多個工作程式，然後由這些工作程式並行處理請求。

// master.js
const cluster = require('cluster');
const cpusLen = require('os').cpus().length;
const path = require('path');

console.log(`主程式：${process.pid}`);
cluster.setupMaster({
  exec: path.resolve(__dirname, './work.js'),
});

for (let i = 0; i < cpusLen; i++) {
  cluster.fork();
}

// work.js
const http = require('http');

console.log(`工作程式：${process.pid}`);
http.createServer((req, res) => {
  res.end('hello');
}).listen(8080);

上面例子中，使用cluster建立多個工作程式，這些工作程式能夠共用8080埠，我們請求localhost:8080，請求任務會交給其中一個工作程式進行處理，該工作程式處理完成後，自行響應請求。

埠占用問題

這裡有個問題，前面例子中，出現多個程式監聽相同的埠，為什麼程式沒有報埠占用問題，由於socket套接字監聽埠會有一個檔案描述符，而每個程式的檔案描述符都不相同，無法讓多個程式都監聽同一個埠，如下：

// master.js
const fork = require('child_process').fork;
const cpusLen = require('os').cpus().length;
const path = require('path');

console.log(`主程式：${process.pid}`);
for (let i = 0; i < cpusLen; i++) {
  fork(path.resolve(__dirname, './work.js'));
}

// work.js
const http = require('http');

console.log(`工作程式：${process.pid}`);
http.createServer((req, res) => {
  res.end('hello');
}).listen(8080);

當執行master.js檔案的時候，會報埠被佔用的問題（Error: listen EADDRINUSE: address already in use :::8080）。

我們修改下，只使用主程式監聽埠，主程式將請求套接字發放給工作程式，由工作程式來進行業務處理。

// master.js
const fork = require('child_process').fork;
const cpusLen = require('os').cpus().length;
const path = require('path');
const net = require('net');
const server = net.createServer();

console.log(`主程式：${process.pid}`);
const works = [];
let current = 0
for (let i = 0; i < cpusLen; i++) {
  works.push(fork(path.resolve(__dirname, './work.js')));
}

server.listen(8080, () => {
  if (current > works.length - 1) current = 0
  works[current++].send('server', server);
  server.close();
});

// work.js
const http = require('http');
const server = http.createServer((req, res) => {
  res.end('hello');
});

console.log(`工作程式：${process.pid}`);
process.on('message', (type, tcp) => {
  if (type === 'server') {
    tcp.on('connection', socket => {
      server.emit('connection', socket)
    });
  }
})

實際上，cluster新建的工作程式並沒有真正去監聽埠，在工作程式中的net server listen函式會被hack，工作程式呼叫listen，不會有任何效果。監聽埠工作交給了主程式，該埠對應的工作程式會被繫結到主程式中，當請求進來的時候，主程式會將請求的套接字下發給相應的工作程式，工作程式再對請求進行處理。

接下來我們看看cluster API中的實現，看下cluster內部是如何做到下面兩個功能：

主程式：對傳入的埠進行監聽
工作程式：
- 主程式註冊當前工作程式，如果主程式是第一次監聽此埠，就新建一個TCP伺服器，並將當前工作程式和TCP伺服器繫結。
- hack掉工作程式中的listen函式，讓該程式不能監聽埠

原始碼解讀

本文使用的是node@14.15.4。

// lib/cluster.js
'use strict';

const childOrPrimary = 'NODE_UNIQUE_ID' in process.env ? 'child' : 'primary';
module.exports = require(`internal/cluster/${childOrPrimary}`);

這個是cluster API入口，在引用cluster的時候，程式首先會判斷環境變數中是否存在NODE_UNIQUE_ID變數，來確定當前程式是在主程式執行還是工作程式中執行。NODE_UNIQUE_ID實際上就是一個自增的數字，是工作程式的ID，後面會在建立工作程式相關程式碼中看到，這裡就不多做解釋了。

通過前面程式碼我們知道，如果在主程式中引用cluster，程式匯出的是internal/cluster/primary.js這檔案，因此我們先看看這個檔案內部的一些實現。

// internal/cluster/primary.js
// ...
const EventEmitter = require('events');
const cluster = new EventEmitter();
// 下面這三個引數會在node內部功能實現的時候用到，之後我們看net原始碼的時候會用到這些引數
cluster.isWorker = false; // 是否是工作程式
cluster.isMaster = true; // 是否是主程式
cluster.isPrimary = true; // 是否是主程式

module.exports = cluster;

cluster.setupPrimary = function(options) {
  const settings = {
    args: ArrayPrototypeSlice(process.argv, 2),
    exec: process.argv[1],
    execArgv: process.execArgv,
    silent: false,
    ...cluster.settings,
    ...options
  };

  cluster.settings = settings;
  // ...
}

cluster.setupMaster = cluster.setupPrimary;

cluster.fork = function(env) {
  cluster.setupPrimary();
  const id = ++ids;
  const workerProcess = createWorkerProcess(id, env);
}

const { fork } = require('child_process');
function createWorkerProcess(id, env) {
  // 這裡的NODE_UNIQUE_ID就是入口檔案用來分辨當前程式型別用的
  const workerEnv = { ...process.env, ...env, NODE_UNIQUE_ID: `${id}` };
  // ...
  return fork(cluster.settings.exec, cluster.settings.args, {
    env: workerEnv,
    // ...
  });
}

cluster.fork用來新建一個工作程式，其內部使用child_process中的fork函式，來建立一個程式，建立的新程式預設會執行命令列中執行的入口檔案（process.argv[1]），當然我們也可以執行luster.setupPrimary或者cluster.setupMaster並傳入exec引數來修改工作程式執行的檔案。

我們再來簡單看下工作程式引用的internal/cluster/child.js檔案：

// internal/cluster/child.js
const EventEmitter = require('events');
const cluster = new EventEmitter();

module.exports = cluster;
// 這裡定義的就是一個工作程式，後續會用到這裡的引數
cluster.isWorker = true;
cluster.isMaster = false;
cluster.isPrimary = false;

cluster._getServer = function(obj, options, cb) {
  // ...
};
// ...

這裡我們主要記住工作程式中的cluster有個_getServer函式，後續流程走到這個函式的時候，會詳細看裡面的程式碼。

接下來進入正題，看下net server listen函式：

// lib/net.js
Server.prototype.listen = function(...args) {
  // ...
  if (typeof options.port === 'number' || typeof options.port === 'string') {
    // 如果是向最開始那種直接呼叫listen時直接傳入一個埠，就會直接進入else，我們也主要看else中的邏輯
    if (options.host) {
      // ...
    } else  {
      // listen(8080, () => {...})呼叫方式，將執行這條分支
      listenInCluster(this, null, options.port | 0, 4, backlog, undefined, options.exclusive);
    }
    return this;
  }
  // ...
}

function listenInCluster(server, address, port, addressType, backlog, fd, exclusive, flags) {
  // ...
  // 這裡就用到cluster初始時寫入的isPrimary引數，當前如果在主程式isPrimary就為true，反之為false。主程式會直接去執行server._listen2函式，工作程式之後也會執行這個函式，等下一起看server._listen2內部的功能。
  if (cluster.isPrimary || exclusive) {
    server._listen2(address, port, addressType, backlog, fd, flags);
    return;
  }

  // 後面的程式碼只有在工作程式中才會執行
  const serverQuery = {
    address: address,
    port: port,
    addressType: addressType,
    fd: fd,
    flags,
  };

  // 這裡執行的是internal/cluster/child.js中的cluster._getServer，同時會傳入listenOnPrimaryHandle這個回撥函式，這個回撥函式會在主程式新增埠監聽，同時將工作程式繫結到對應的TCP服務後才會執行，裡面工作就是對net server listen等函式進行hack。
  cluster._getServer(server, serverQuery, listenOnPrimaryHandle);

  function listenOnPrimaryHandle(err, handle) {
    // ...
    server._handle = handle;
    server._listen2(address, port, addressType, backlog, fd, flags);
  }
}

// 等工作程式執行這個函式的時候再一起講
Server.prototype._listen2 = setupListenHandle;
function setupListenHandle(...) {
  // ...
}

從上面程式碼中可以得知，主程式和工作程式中執行net server listen都會進入到一個setupListenHandle函式中。不過區別是，主程式是直接執行該函式，而工作程式需要先執行cluster._getServer函式，讓主程式監聽工作程式埠，同時對listen函式進行hack處理，然後再執行setupListenHandle函式。接下來我們看下cluster._getServer函式的內部實現。

// lib/internal/cluster/child.js
cluster._getServer = function(obj, options, cb) {
  // ...
  // 這個是工作程式第一次傳送內部訊息的內容。
  // 注意這裡act值為queryServer
  const message = {
    act: 'queryServer',
    index,
    data: null,
    ...options
  };
  // ...
  // send函式內部使用IPC通道向工作程式傳送內部訊息。主程式在使用cluster.fork新建工作程式的時候，會讓工作程式監聽內部訊息事件，下面會展示具體程式碼
  // send呼叫傳入的回撥函式會被寫入到lib/internal/cluster/utils.js檔案中的callbacks map中，等後面要用的時候，再提取出來。
  send(message, (reply, handle) => {
    if (typeof obj._setServerData === 'function')
      obj._setServerData(reply.data);

    if (handle)
      shared(reply, handle, indexesKey, index, cb);
    else
      // 這個函式內部會定義一個listen函式，用來hack net server listen函式
      rr(reply, indexesKey, index, cb);
  });
  // ...
}

function send(message, cb) {
  return sendHelper(process, message, null, cb);
}

// lib/internal/cluster/utils.js
// ...
const callbacks = new SafeMap();
let seq = 0;
function sendHelper(proc, message, handle, cb) {
  message = { cmd: 'NODE_CLUSTER', ...message, seq };

  if (typeof cb === 'function')
    // 這裡將傳入的回撥函式記錄下來。
    // 注意這裡的key是遞增數字
    callbacks.set(seq, cb);

  seq += 1;
  // 利用IPC通道，給當前工作程式傳送內部訊息
  return proc.send(message, handle);
}
// ...

工作程式中cluster._getServer函式執行，將生成一個回撥函式，將這個回撥函式存放起來，並且會使用IPC通道，向當前工作程式傳送內部訊息。主程式執行cluster.fork生成工作程式的時候，會在工作程式中註冊internalMessage事件。接下來我們看下cluster.fork中與工作程式註冊內部訊息事件的程式碼。

// internal/cluster/primary.js
cluster.fork = function(env) {
  // ...
  // internal函式執行會返回一個接收message物件的回撥函式。
  // 可以先看下lib/internal/cluster/utils.js中的internal函式，瞭解內部的工作
  worker.process.on('internalMessage', internal(worker, onmessage));
  // ...
}

const methodMessageMapping = {
  close,
  exitedAfterDisconnect,
  listening,
  online,
  queryServer,
};

// 第一次觸發internalMessage執行的回撥是這個函式。
// 此時message的act為queryServer
function onmessage(message, handle) {
  // internal內部在執行onmessage時會將這個函式執行上下文繫結到工作程式的work上
  const worker = this;

  // 工作程式傳入的
  const fn = methodMessageMapping[message.act];

  if (typeof fn === 'function')
    fn(worker, message);
}

function queryServer(worker, message) {
  // ...
}

// lib/internal/cluster/utils.js
// ...
const callbacks = new SafeMap();

function internal(worker, cb) {
  return function onInternalMessage(message, handle) {
    let fn = cb;

    // 工作程式第一次傳送內部訊息：ack為undefined，callback為undefined，直接執行internal呼叫傳入的onmessage函式，message函式只是用於解析訊息的，實際會執行queryServer函式
    // 工作程式第二次傳送內部訊息：主程式queryServer函式執行會用工作程式傳送內部訊息，並向message中新增ack引數，讓message.ack=message.seq
    if (message.ack !== undefined) {
      const callback = callbacks.get(message.ack);

      if (callback !== undefined) {
        fn = callback;
        callbacks.delete(message.ack);
      }
    }

    ReflectApply(fn, worker, arguments);
  };
}

工作程式第一次傳送內部訊息時，由於傳入的message.ack（這裡注意分清act和ack）為undefind，因此沒辦法直接拿到cluster._getServer中呼叫send寫入的回撥函式，因此只能先執行internal/cluster/primary.js中的queryServer函式。接下來看下queryServer函式內部邏輯。

// internal/cluster/primary.js
// hadles中存放的就是TCP伺服器。
// 主程式在代替工作程式監聽埠生成新的TCP伺服器前，
// 需要先判斷該伺服器是否有建立，如果有，就直接複用之前的伺服器，然後將工作程式繫結到相應的伺服器上；如果沒有，就新建一個TCP伺服器，然後將工作程式繫結到新建的伺服器上。
function queryServer(worker, message) {
  // 這裡key就是伺服器的唯一標識
  const key = `${message.address}:${message.port}:${message.addressType}:` +
              `${message.fd}:${message.index}`;
  // 從現存的伺服器中檢視是否有當前需要的伺服器
  let handle = handles.get(key);
  // 如果沒有需要的伺服器，就新建一個
  if (handle === undefined) {
    // ...
    // RoundRobinHandle構建函式中，會新建一個TCP伺服器
    let constructor = RoundRobinHandle;
    handle = new constructor(key, address, message);
    // 將這個伺服器存放起來
    handles.set(key, handle);
  }

  if (!handle.data)
    handle.data = message.data;

  // 可以先看下下面關於RoundRobinHandle構建函式的程式碼，瞭解內部機制
  handle.add(worker, (errno, reply, handle) => {
    const { data } = handles.get(key);

    if (errno)
      handles.delete(key);

    // 這裡會向工作程式中傳送第二次內部訊息。
    // 這裡只傳了worker和message，沒有傳入handle和cb
    send(worker, {
      errno,
      key,
      ack: message.seq, // 注意這裡增加了ack屬性
      data,
      ...reply
    }, handle);
  });
}
function send(worker, message, handle, cb) {
  return sendHelper(worker.process, message, handle, cb);
}

// internal/cluster/round_robin_handle.js
function RoundRobinHandle(key, address, { port, fd, flags }) {
  // ...
  this.server = net.createServer(assert.fail);
  if (fd >= 0)
    this.server.listen({ fd });
  else if (port >= 0) {
    this.server.listen({
      port,
      host: address,
      ipv6Only: Boolean(flags & constants.UV_TCP_IPV6ONLY),
    });
  } else
    this.server.listen(address);

  // 當服務處於監聽狀態，就會執行這個回撥。
  this.server.once('listening', () => {
    this.handle = this.server._handle;
    this.handle.onconnection = (err, handle) => this.distribute(err, handle);
    this.server._handle = null;
    // 注意：如果監聽成功，就會將server刪除
    this.server = null;
  });
}

RoundRobinHandle.prototype.add = function(worker, send) {
  const done = () => {
    if (this.handle.getsockname) {
      // ...
      send(null, { sockname: out }, null);
    } else {
      send(null, null, null);  // UNIX socket.
    }
    // ...
  };

  // 如果在add執行前server就已經處於listening狀態，this.server就會為null
  if (this.server === null)
    return done();
  // 如果add執行後，server才處於listening，就會走到這裡，始終都會執行add呼叫時傳入的回撥
  this.server.once('listening', done);
}

在這一步，主程式替工作程式生成或者是獲取了一個可用的TCP伺服器，並將工作程式與相應的伺服器繫結在一起（方便後續請求任務分配）。當工作程式繫結完成以後，就向工作程式中傳送了第二次內部訊息，接下來我們再次進入lib/internal/cluster/utils.js看看內部流程：

// lib/internal/cluster/utils.js
const callbacks = new SafeMap();

function internal(worker, cb) {
  // 注意這裡handle為undefined
  return function onInternalMessage(message, handle) {
    let fn = cb;

    // 第二次工作程式內部訊息執行的時候message.ack已經被賦值為message.seq
    // 因此這次能夠獲取到之前lib/cluster.child.js cluster._getServer函式執行是呼叫send寫入的回撥函式
    if (message.ack !== undefined) {
      const callback = callbacks.get(message.ack);

      if (callback !== undefined) {
        fn = callback;
        callbacks.delete(message.ack);
      }
    }

    ReflectApply(fn, worker, arguments);
  };
}

工作程式第二次接受到內部訊息時，cluster._getServer函式執行是呼叫send寫入的回撥函式會被執行，接下來看下send寫入的回撥函式內容：

// lib/internal/cluster/child.js
send(message, (reply, handle) => {
  // 此時handle為undefined，流程會直接執行rr函式
  if (handle)
    shared(reply, handle, indexesKey, index, cb); 
  else
    // 這裡的cb是lib/net.js在執行cluster._getServer時傳入listenOnPrimaryHandle函式，後面會介紹他的工作。
    rr(reply, indexesKey, index, cb);
});

function rr(message, indexesKey, index, cb) {
  let key = message.key;

  // 這裡定義的listen用於hack net server.listen，在工作程式中執行listen，工作程式並不會真正去監聽埠
  function listen(backlog) {
    return 0;
  }

  function close() {...}

  function getsockname(out) {...}

  const handle = { close, listen, ref: noop, unref: noop };
  handles.set(key, handle);
  // 執行傳入的listenOnPrimaryHandle函式
  cb(0, handle);
}

rr函式執行，會新建幾個與net server中同名的函式，並通過handle傳入listenOnPrimaryHandle函式。

// lib/net.js
function listenInCluster(...) {
  cluster._getServer(server, serverQuery, listenOnPrimaryHandle);

  // listenOnPrimaryHandle函式中將工作程式生成的server._handle物件替換成自定義的handle物件，後續server listen執行的就是server._handle中的listen函式，因此這裡就完成了對工作程式中的listen函式hack
  function listenOnPrimaryHandle(err, handle) {
    // ...
    // handle：{ listen: ..., close: ...., ... }
    server._handle = handle;
    server._listen2(address, port, addressType, backlog, fd, flags);
  }
}

下面看下server._listen2函式執行內容

Server.prototype._listen2 = setupListenHandle;

function setupListenHandle(address, port, addressType, backlog, fd, flags) {
  // 忽略，只要是從工作程式進來的，this._handle就是自己定義的物件內容
  if (this._handle) {
    debug('setupListenHandle: have a handle already');
  } else {
    // 主程式會進入這一層邏輯，會在這裡生成一個伺服器
    // ...
    rval = createServerHandle(address, port, addressType, fd, flags);
    // ...
    this._handle = rval;
  }
  const err = this._handle.listen(backlog || 511);
  // ...
}

至此，工作程式埠監聽相關的原始碼就看完了，現在差不多瞭解到工作程式中執行net server listen時，工作程式並不會真正去監聽埠，埠監聽工作始終會交給主程式來完成。主程式在接到工作程式發來的埠監聽的時候，首先會判斷是否有相同的伺服器，如果有，就直接將工作程式繫結到對應的伺服器上，這樣就不會出現埠被佔用的問題；如果沒有對應的伺服器，就生成一個新的服務。主程式接受到請求的時候，就會將請求任務分配給工作程式，如何分配，就需要看具體使用的哪種負載均衡了。

Node js 叢集（cluster）
2019-05-22
JS
Redis cluster 叢集
2019-04-12
Redis
Redis Cluster(叢集)
2018-07-08
Redis
redis偽叢集配置Cluster叢集模式
2019-04-09
Redis模式
搭建redis cluster叢集
2024-07-03
Redis
Redis Cluster叢集模式部署
2024-12-09
Redis模式
Docker Redis 5.0 叢集（cluster）搭建
2019-03-28
DockerRedis
認識Redis叢集——Redis Cluster
2020-11-19
Redis
玩轉Redis叢集之Cluster
2018-12-20
Redis
Redis Cluster叢集模式部署X
2024-12-09
Redis模式
dubbo原始碼分析之叢集Cluster
2020-11-05
原始碼
【Redis叢集實戰】Redis Cluster 部署
2024-11-22
Redis
Spring Boot整合Redis叢集（Cluster模式）
2021-08-30
Spring BootRedis模式
在wildfly 21中搭建cluster叢集
2020-12-29
Redis-cluster叢集搭建部署
2020-12-20
Redis
akka-typed(7) - cluster：sharding, 叢集分片
2020-06-12
【Redis】Redis Cluster-叢集故障轉移
2022-06-19
Redis
redis cluster 叢集故障恢復操作思路
2022-04-07
Redis
在滴滴雲上搭建 Redis-Cluster 叢集
2018-12-26
Redis
Akka-Cluster（3）- ClusterClient, 叢集客戶端
2018-12-10
client客戶端
MySQL Galera cluster叢集常用引數說明
2018-05-16
MySql
redis Cluster模式叢集多機器 docker 部署
2024-04-28
Redis模式Docker
三 GBase 8a MPP Cluster叢集擴容
2020-05-21
四 GBase 8a MPP Cluster叢集縮容
2020-05-21
Redis Cluster 叢集搭建與擴容、縮容
2022-03-27
Redis
ProxySQL Cluster 高可用叢集環境部署記錄
2019-02-21
SQL
mysql之 Percona XtraDB Cluster叢集執行緒模型
2018-10-09
MySql執行緒模型
搭建分散式 Redis Cluster 叢集與 Redis 入門
2020-10-18
分散式Redis
Redis的主從複製，哨兵和Cluster叢集
2024-11-07
Redis
Kubernetes(k8s)部署redis-cluster叢集
2021-11-05
K8SRedis
基於Docker搭建Percona XtraDB Cluster資料庫叢集
2018-08-07
Docker資料庫
ASM叢集檔案系統ACFS（ASM Cluster File System）
2020-01-22
ASM
Redis Cluster高可用叢集線上遷移操作記錄
2018-10-24
Redis
Redis 超詳細的手動搭建Cluster叢集步驟
2021-04-02
Redis
node Cluster 模組分析
2018-12-29
Redis Cluster 叢集部署
2024-04-19
Redis
部署Redis Cluster 6.0 叢集並開啟密碼認證和 Redis-cluster-proxy負載
2020-07-05
Redis密碼負載
oracle 12C RAC 12.1.0.2 叢集日誌（cluster log)目錄
2018-03-14
Oracle

node叢集（cluster）

使用例子

埠占用問題

原始碼解讀

相關文章