【推理引擎】從原始碼看ONNXRuntime的執行流程

虔誠的樹發表於2022-03-29

原文網址 : https://www.cnblogs.com/xxxxxxxxx/p/16072145.html

前言

在上一篇部落格中：【推理引擎】ONNXRuntime 的架構設計，主要從文件上對ONNXRuntime的執行流程進行了梳理，但是想要深入理解，還需從原始碼角度進行分析。

本文以目標檢測模型NanoDet作為分析的基礎，部分程式碼主要參考：超輕量級NanoDet MNN/TNN/NCNN/ONNXRuntime C++工程記錄 - DefTruth的文章 - 知乎，在此表示感謝！

準備工作

OrtHandlerBase 是用來操控 ONNXRuntime 的基類，各種網路模型都可以通過繼承該類進而擁有 ONNXRuntime 的使用許可權，比如 NanoDet；同時，NanoDet還可以擴充套件獨屬於自己的方法和成員變數，以方便推理前後的預處理和後處理工作。

構造NanoDet物件時，會自動呼叫OrtHandlerBase的構造方法，在構造方法內部會首先初始化一些必要的成員變數（Ort::Env、Ort::SessionOptions），這兩個變數主要用於初始化 Ort::Session:

ort_env = Ort::Env(ORT_LOGGING_LEVEL_ERROR, log_id);

Ort::SessionOptions session_options;
session_options.SetIntraOpNumThreads(num_threads);
session_options.SetGraphOptimizationLevel(GraphOptimizationLevel::ORT_ENABLE_ALL);
session_options.SetLogSeverityLevel(4);

ort_session = new Ort::Session(ort_env, onnx_model_path, session_options);

構造 InferenceSession 物件 & 初始化

在構造 Ort::Session 物件的過程中，會呼叫ONNXRuntime -> onnxruntime_cxx_inline.h 中的API：

// include/onnxruntime/core/session/onnxruntime_cxx_inline.h
inline Session::Session(Env& env, const ORTCHAR_T* model_path, const SessionOptions& options) {
  ThrowOnError(GetApi().CreateSession(env, model_path, options, &p_));
}

GetApi() 是在 onnxruntime_cxx_api.h 中定義的：

// include/onnxruntime/core/session/onnxruntime_cxx_api.h

// This returns a reference to the OrtApi interface in use
inline const OrtApi& GetApi() { return *Global<void>::api_; }

// 其中 Global 的定義如下：
template <typename T>
struct Global {
  static const OrtApi* api_;
};

這裡面主要定義了靜態常量指標OrtApi*，OrtApi是在 onnxruntime_c_api.h 中定義的：

// include/onnxruntime/core/session/onnxruntime_c_api.h

// All C API functions are defined inside this structure as pointers to functions.
// Call OrtApiBase::GetApi to get a pointer to it
struct OrtApi;
typedef struct OrtApi OrtApi;

struct OrtApi{
  ...
  // 以 CreateSession 為例：
  ORT_API2_STATUS(CreateSession, _In_ const OrtEnv* env, _In_ const ORTCHAR_T* model_path,
                  _In_ const OrtSessionOptions* options, _Outptr_ OrtSession** out);

  // 展開ORT_API2_STATUS巨集：
  // _Check_return_ _Ret_maybenull_ OrtStatusPtr(ORT_API_CALL* CreateSession)(const OrtEnv* env, 
  //                                                                          const char* model_path, 
  //                                                                          const OrtSessionOptions* options, 
  //                                                                          OrtSession** out) NO_EXCEPTION ORT_MUST_USE_RESULT;
  
  ...
}

相應地，在 onnxruntime_c_api.cc 檔案中定義了 CreateSesssion 的實現：

// onnxruntime/core/session/onnxruntime_c_api.cc

ORT_API_STATUS_IMPL(OrtApis::CreateSession, _In_ const OrtEnv* env, _In_ const ORTCHAR_T* model_path,
                    _In_ const OrtSessionOptions* options, _Outptr_ OrtSession** out) {
  API_IMPL_BEGIN
  std::unique_ptr<onnxruntime::InferenceSession> sess;
  OrtStatus* status = nullptr;
  *out = nullptr;

  ORT_TRY {
    ORT_API_RETURN_IF_ERROR(CreateSessionAndLoadModel(options, env, model_path, nullptr, 0, sess));
    ORT_API_RETURN_IF_ERROR(InitializeSession(options, sess));

    *out = reinterpret_cast<OrtSession*>(sess.release());
  }
  ORT_CATCH(const std::exception& e) {
    ORT_HANDLE_EXCEPTION([&]() {
      status = OrtApis::CreateStatus(ORT_FAIL, e.what());
    });
  }

  return status;
  API_IMPL_END
}

到此，我們已經定位到CreateSession的具體實現內容，可以發現它主要由兩個部分組成：CreateSessionAndLoadModel 和 InitializeSession，接下來分析這兩個函式。

從 CreateSessionAndLoadModel 的名字就可以看出，這個函式主要負責建立 Session，以及載入模型：

// onnxruntime/core/session/onnxruntime_c_api.cc

// provider either model_path, or modal_data + model_data_length.
// 也就是說，共有兩種方式用來讀取模型：一種是根據ONNX模型路徑；另一種時從模型資料緩衝（Model data buffer）中讀取，並且需要指定模型大小（Model data buffer size）
static ORT_STATUS_PTR CreateSessionAndLoadModel(_In_ const OrtSessionOptions* options,
                                                _In_ const OrtEnv* env,
                                                _In_opt_z_ const ORTCHAR_T* model_path,
                                                _In_opt_ const void* model_data,
                                                size_t model_data_length,
                                                std::unique_ptr<onnxruntime::InferenceSession>& sess) {
  // quick check here to decide load path. InferenceSession will provide error message for invalid values.
  const Env& os_env = Env::Default();  // OS environment (注意：OS environment != ORT environment)
  bool load_config_from_model =
      os_env.GetEnvironmentVar(inference_session_utils::kOrtLoadConfigFromModelEnvVar) == "1";
  
  // 建立 InferenceSession
  if (load_config_from_model) {
    if (model_path != nullptr) {
      sess = std::make_unique<onnxruntime::InferenceSession>(
          options == nullptr ? onnxruntime::SessionOptions() : options->value,
          env->GetEnvironment(),
          model_path);
    } else {
      sess = std::make_unique<onnxruntime::InferenceSession>(
          options == nullptr ? onnxruntime::SessionOptions() : options->value,
          env->GetEnvironment(),
          model_data, static_cast<int>(model_data_length));
    }
  } else {
    sess = std::make_unique<onnxruntime::InferenceSession>(
        options == nullptr ? onnxruntime::SessionOptions() : options->value,
        env->GetEnvironment());
  }

  // Add custom domains
  if (options && !options->custom_op_domains_.empty()) {
    ORT_API_RETURN_IF_STATUS_NOT_OK(sess->AddCustomOpDomains(options->custom_op_domains_));
  }

  // Finish load
  if (load_config_from_model) {
    ORT_API_RETURN_IF_STATUS_NOT_OK(sess->Load());
  } else {
    if (model_path != nullptr) {
      ORT_API_RETURN_IF_STATUS_NOT_OK(sess->Load(model_path));
    } else {
      ORT_API_RETURN_IF_STATUS_NOT_OK(sess->Load(model_data, static_cast<int>(model_data_length)));
    }
  }

  return nullptr;
}

在建立完 InferenceSession 後，需要進行初始化操作（InitializeSession）：

// onnxruntime/core/session/onnxruntime_c_api.cc

static ORT_STATUS_PTR InitializeSession(_In_ const OrtSessionOptions* options,
                                        _In_ std::unique_ptr<::onnxruntime::InferenceSession>& sess,
                                        _Inout_opt_ OrtPrepackedWeightsContainer* prepacked_weights_container = nullptr) {
  // 建立 Providers
  std::vector<std::unique_ptr<IExecutionProvider>> provider_list;
  if (options) {
    for (auto& factory : options->provider_factories) {
      auto provider = factory->CreateProvider();
      provider_list.push_back(std::move(provider));
    }
  }

  // 註冊 Providers 到 InferenceSession 中
  for (auto& provider : provider_list) {
    if (provider) {
      ORT_API_RETURN_IF_STATUS_NOT_OK(sess->RegisterExecutionProvider(std::move(provider)));
    }
  }

  if (prepacked_weights_container != nullptr) {
    ORT_API_RETURN_IF_STATUS_NOT_OK(sess->AddPrePackedWeightsContainer(
        reinterpret_cast<PrepackedWeightsContainer*>(prepacked_weights_container)));
  }
  
  // 初始化 InferenceSession
  ORT_API_RETURN_IF_STATUS_NOT_OK(sess->Initialize());

  return nullptr;
}

接下來，深入到 InferenceSession 的 Initialize() 函式中，這個函式水很深，需要分為幾個小的模組來分析。

// onnxruntime/core/session/inference_session.cc

common::Status InferenceSession::Initialize() {
  ...

  bool have_cpu_ep = false;
  
  // 這裡使用 {} 可以提前釋放 session_mutex_，不必等到退出Initialize函式才釋放，可提升效率
  {   
    std::lock_guard<onnxruntime::OrtMutex> initial_guard(session_mutex_);
    // 判斷模型是否已被載入
    if (!is_model_loaded_) {    
      LOGS(*session_logger_, ERROR) << "Model was not loaded";
      return common::Status(common::ONNXRUNTIME, common::FAIL, "Model was not loaded.");
    }

    if (is_inited_) {  // 判斷是否已經初始化，如果已經初始化就可以直接退出Initialize函式了
      LOGS(*session_logger_, INFO) << "Session has already been initialized.";
      return common::Status::OK();
    }
    
    // 判斷是否已經設定 CPU EP 來兜底，如果忘記設定，後面會自動新增
    have_cpu_ep = execution_providers_.Get(onnxruntime::kCpuExecutionProvider) != nullptr;
  }

  if (!have_cpu_ep) {
    LOGS(*session_logger_, INFO) << "Adding default CPU execution provider.";
    CPUExecutionProviderInfo epi{session_options_.enable_cpu_mem_arena};
    auto p_cpu_exec_provider = std::make_unique<CPUExecutionProvider>(epi);
    ORT_RETURN_IF_ERROR_SESSIONID_(RegisterExecutionProvider(std::move(p_cpu_exec_provider)));
  }
  ...
}

以上程式碼確保了 EPs（複數，多個EP，hhh）已被正常設定（主要是CPU已經被用作兜底），接下來從 Ort 環境中讀取共享的分配器（shared allocators），並更新 EPs：

// onnxruntime/core/session/inference_session.cc

common::Status InferenceSession::Initialize() {
  ...

  std::string use_env_allocators = session_options_.config_options.GetConfigOrDefault(kOrtSessionOptionsConfigUseEnvAllocators,
                                                                                      "0");
  if (use_env_allocators == "1") {
    LOGS(*session_logger_, INFO) << "This session will use the allocator registered with the environment.";
    UpdateProvidersWithSharedAllocators();    // 更新 EPs
  }

  ...

接下來需要設定 SessionState，需要注意：SessionState 只能被 InferenceSession 修改，

// onnxruntime/core/session/inference_session.cc

common::Status InferenceSession::Initialize() {
  ...

  session_state_ = std::make_unique<SessionState>(
      model_->MainGraph(),
      execution_providers_,
      session_options_.enable_mem_pattern && session_options_.execution_mode == ExecutionMode::ORT_SEQUENTIAL,
      GetIntraOpThreadPoolToUse(),
      GetInterOpThreadPoolToUse(),
      data_transfer_mgr_,
      *session_logger_,
      session_profiler_,
      session_options_.use_deterministic_compute,
      session_options_.enable_mem_reuse,
      prepacked_weights_container_);
  
  ...
}

接下來從EPs例項中收集核心登錄檔（kernel registries），核心登錄檔分為兩類：

Custom execution provider type specific kernel registries. 》》比如CUDA EP
Common execution provider type specific kernel registries. 》》比如CPU EP

這兩類登錄檔的優先順序並不相同，前者要高於後者。

// onnxruntime/core/session/inference_session.cc

common::Status InferenceSession::Initialize() {
  ...

  ORT_RETURN_IF_ERROR_SESSIONID_(kernel_registry_manager_.RegisterKernels(execution_providers_));
  
  ...
}

在 KernelRegistryManager 中註冊完登錄檔之後，開始執行非常重要的圖優化，以及分割子圖：

// onnxruntime/core/session/inference_session.cc

common::Status InferenceSession::Initialize() {
  ...

  // add predefined transformers
  // 新增預先定義的變換
  ORT_RETURN_IF_ERROR_SESSIONID_(AddPredefinedTransformers(graph_transformation_mgr_,
                                                            session_options_.graph_optimization_level,
                                                            saving_runtime_optimizations));

  // apply any transformations to the main graph and any subgraphs
  // 在主圖和子圖上執行所有的優化Pass
  ORT_RETURN_IF_ERROR_SESSIONID_(TransformGraph(graph, graph_transformation_mgr_,
                                                execution_providers_, kernel_registry_manager_,
                                                insert_cast_transformer_,
                                                *session_state_,
                                                saving_ort_format));

  // now that all the transforms are done, call Resolve on the main graph. this will recurse into the subgraphs.
  // 所有的圖變換都已經執行完畢，然後開始遞迴分割子圖
  ORT_RETURN_IF_ERROR_SESSIONID_(graph.Resolve());

  // Update temporary copies of metadata, input- and output definitions to the same state as the resolved graph
  ORT_RETURN_IF_ERROR_SESSIONID_(SaveModelMetadata(*model_));

  ...
}

分割子圖之後，還有一些結尾工作：

// onnxruntime/core/session/inference_session.cc

common::Status InferenceSession::Initialize() {
  ...

  ORT_RETURN_IF_ERROR_SESSIONID_(
      session_state_->FinalizeSessionState(model_location_, kernel_registry_manager_,
                                            session_options_,
                                            serialized_session_state,
                                            // need to keep the initializers if saving the optimized model
                                            !saving_model,
                                            saving_ort_format));
  
  // Resolve memory pattern flags of the main graph and subgraph session states
  ResolveMemoryPatternFlags(*session_state_);

  if (status.IsOK()) {
    for (auto& xp : execution_providers_) {
      auto end_status = xp->OnSessionInitializationEnd();
      if (status.IsOK()) {
        status = end_status;
      }
    }
  }
  ...
}

讓模型 Run

通過上一個階段，已經成功構造出 NanoDet 物件，接下來需要輸入影像，並由 NanoDet 來執行：

// 
std::vector<types::BoxF> detected_boxes;
cv::Mat img_bgr = cv::imread(test_img_path);
nanodet->detect(img_bgr, detected_boxes);

detect 函式內部：

void NanoDet::detect(const cv::Mat &mat, std::vector<types::BoxF> &detected_boxes,
                     float score_threshold, float iou_threshold,
                     unsigned int topk, unsigned int nms_type)
{
    if (mat.empty()) return;
    auto img_height = static_cast<float>(mat.rows);
    auto img_width = static_cast<float>(mat.cols);
    const int target_height = (int) input_node_dims.at(2);
    const int target_width = (int) input_node_dims.at(3);

    // 0. resize & unscale
    cv::Mat mat_rs;
    NanoScaleParams scale_params;
    this->resize_unscale(mat, mat_rs, target_height, target_width, scale_params);

    // 1. make input tensor
    Ort::Value input_tensor = this->transform(mat_rs);
    
    // 2. inference scores & boxes.
    auto output_tensors = ort_session->Run(
        Ort::RunOptions{nullptr}, input_node_names.data(),
        &input_tensor, 1, output_node_names.data(), num_outputs
    );
    // 3. rescale & exclude.
    std::vector<types::BoxF> bbox_collection;
    this->generate_bboxes(scale_params, bbox_collection, output_tensors, score_threshold, img_height, img_width);
    // 4. hard|blend|offset nms with topk.
    this->nms(bbox_collection, detected_boxes, iou_threshold, topk, nms_type);
}

其中，第 0 和 1 步是模型輸入的預處理，這裡不再深入介紹，想要了解可參考原始碼。接下來重點對第 2 步的 ort_seesion->Run() 進行深入剖析。

// include/onnxruntime/core/session/onnxruntime_cxx_inline.h

inline std::vector<Value> Session::Run(const RunOptions& run_options, const char* const* input_names, const Value* input_values, size_t input_count,
                                       const char* const* output_names, size_t output_names_count) {
  std::vector<Ort::Value> output_values;
  for (size_t i = 0; i < output_names_count; i++)
    output_values.emplace_back(nullptr);
  Run(run_options, input_names, input_values, input_count, output_names, output_values.data(), output_names_count);
  return output_values;
}

inline void Session::Run(const RunOptions& run_options, const char* const* input_names, const Value* input_values, size_t input_count,
                         const char* const* output_names, Value* output_values, size_t output_count) {
  static_assert(sizeof(Value) == sizeof(OrtValue*), "Value is really just an array of OrtValue* in memory, so we can reinterpret_cast safely");
  auto ort_input_values = reinterpret_cast<const OrtValue**>(const_cast<Value*>(input_values));
  auto ort_output_values = reinterpret_cast<OrtValue**>(output_values);
  ThrowOnError(GetApi().Run(p_, run_options, input_names, ort_input_values, input_count, output_names, output_count, ort_output_values));
}

又到了熟悉的環節，GetApi()可參考上一章節的內容，直接到 onnxruntime_c_api.cc 中檢視 Run 函式對應的實現：

// onnxruntime/core/session/onnxruntime_c_api.cc

ORT_API_STATUS_IMPL(OrtApis::Run, _Inout_ OrtSession* sess, _In_opt_ const OrtRunOptions* run_options,
                    _In_reads_(input_len) const char* const* input_names,
                    _In_reads_(input_len) const OrtValue* const* input, size_t input_len,
                    _In_reads_(output_names_len) const char* const* output_names1, size_t output_names_len,
                    _Inout_updates_all_(output_names_len) OrtValue** output) {
  API_IMPL_BEGIN
  // 獲取 inferencesession
  auto session = reinterpret_cast<::onnxruntime::InferenceSession*>(sess);
  const int queue_id = 0;
  
  // 模型輸入：feed_names & feeds
  std::vector<std::string> feed_names(input_len);
  std::vector<OrtValue> feeds(input_len);

  for (size_t i = 0; i != input_len; ++i) {
    if (input_names[i] == nullptr || input_names[i][0] == '\0') {
      return OrtApis::CreateStatus(ORT_INVALID_ARGUMENT, "input name cannot be empty");
    }

    feed_names[i] = input_names[i];
    auto& ort_value = feeds[i] = *reinterpret_cast<const ::OrtValue*>(input[i]);

    if (ort_value.Fence()) ort_value.Fence()->BeforeUsingAsInput(onnxruntime::kCpuExecutionProvider, queue_id);
  }

  // 模型輸出：output_names & fetches
  std::vector<std::string> output_names(output_names_len);
  for (size_t i = 0; i != output_names_len; ++i) {
    if (output_names1[i] == nullptr || output_names1[i][0] == '\0') {
      return OrtApis::CreateStatus(ORT_INVALID_ARGUMENT, "output name cannot be empty");
    }
    output_names[i] = output_names1[i];
  }

  std::vector<OrtValue> fetches(output_names_len);
  for (size_t i = 0; i != output_names_len; ++i) {
    if (output[i] != nullptr) {
      ::OrtValue& value = *(output[i]);
      if (value.Fence())
        value.Fence()->BeforeUsingAsOutput(onnxruntime::kCpuExecutionProvider, queue_id);
      fetches[i] = value;
    }
  }
  
  // 呼叫 InferenceSession 的 Run 函式，執行推理
  Status status;
  if (run_options == nullptr) {
    OrtRunOptions op;
    status = session->Run(op, feed_names, feeds, output_names, &fetches, nullptr);
  } else {
    status = session->Run(*run_options, feed_names, feeds, output_names, &fetches, nullptr);
  }
  
  // Run 結束後，將 fetches 中的內容取出放到 output 中
  if (!status.IsOK())
    return ToOrtStatus(status);
  for (size_t i = 0; i != output_names_len; ++i) {
    ::OrtValue& value = fetches[i];
    if (value.Fence())
      value.Fence()->BeforeUsingAsInput(onnxruntime::kCpuExecutionProvider, queue_id);
    if (output[i] == nullptr) {
      output[i] = new OrtValue(value);
    }
  }
  return nullptr;
  API_IMPL_END
}

進入到 InferenceSession::Run 的內部：

Status InferenceSession::Run(const RunOptions& run_options,
                             const std::vector<std::string>& feed_names, const std::vector<OrtValue>& feeds,
                             const std::vector<std::string>& output_names, std::vector<OrtValue>* p_fetches,
                             const std::vector<OrtDevice>* p_fetches_device_info) {

  std::vector<IExecutionProvider*> exec_providers_to_stop;
  exec_providers_to_stop.reserve(execution_providers_.NumProviders());

  std::vector<AllocatorPtr> arenas_to_shrink;

  // 驗證輸入輸出，並由 FeedsFetchesManager 進行管理
  ORT_RETURN_IF_ERROR_SESSIONID_(ValidateInputs(feed_names, feeds));
  ORT_RETURN_IF_ERROR_SESSIONID_(ValidateOutputs(output_names, p_fetches));
  FeedsFetchesInfo info(feed_names, output_names, session_state_->GetOrtValueNameIdxMap());
  FeedsFetchesManager feeds_fetches_manager{std::move(info)};
  
  // current_num_runs_ 的型別是：std::atomic<int>，表示並行執行 EP 的數量
  ++current_num_runs_; 
  
  // info all execution providers InferenceSession:Run started
  for (auto& xp : execution_providers_) {
    // call OnRunStart and add to exec_providers_to_stop if successful
    auto start_func = [&xp, &exec_providers_to_stop]() {
      auto status = xp->OnRunStart();
      if (status.IsOK())
        exec_providers_to_stop.push_back(xp.get());

      return status;
    };

    ORT_CHECK_AND_SET_RETVAL(start_func());
  }
  
  if (run_options.only_execute_path_to_fetches) {
    session_state_->UpdateToBeExecutedNodes(feeds_fetches_manager.GetFeedsFetchesInfo().fetches_mlvalue_idxs);
  }

  session_state_->IncrementGraphExecutionCounter();
  
  // execute the graph
  ORT_CHECK_AND_SET_RETVAL(utils::ExecuteGraph(*session_state_, feeds_fetches_manager, feeds, *p_fetches,
                                                session_options_.execution_mode, run_options.terminate, run_logger,
                                                run_options.only_execute_path_to_fetches));

  // info all execution providers InferenceSession:Run ended
  for (auto* xp : exec_providers_to_stop) {
    auto status = xp->OnRunEnd(/*sync_stream*/ true);
    ORT_CHECK_AND_SET_RETVAL(status);
  }
  
  --current_num_runs_;
}

至此，模型已經完成推理，接下來只需處理輸出內容即可，對應 nanodet->detect() 函式的 3、4 部分。

總結

本文主要介紹了InferenceSession的構造和初始化，以及模型的推理過程，可以發現其中還是蠻複雜的。由於對ONNXRuntime的原始碼仍然瞭解有限，有許多重要的部分被略過，打算接下來分別針對突破。

【推理引擎】ONNXRuntime 的架構設計
2022-03-29
架構
【推理引擎】如何在 ONNXRuntime 中新增新的運算元
2022-03-30
從ReentrantLock看AQS (AbstractQueuedSynchronizer) 執行流程
2021-07-03
ReentrantLockAQS
【推理引擎】在 VS Code 除錯 ONNXRuntime 的測試單元
2022-03-30
除錯
執行流程原始碼分析
2024-09-27
原始碼
深入Mybatis原始碼——執行流程
2020-07-07
MyBatis原始碼
Mybatis執行流程原始碼分析
2020-12-15
MyBatis原始碼
Mybatis原始碼系列執行流程（一）
2021-12-31
MyBatis原始碼
SpringMVC執行流程及原始碼分析
2021-03-06
SpringMVC原始碼
從原始碼的角度解析執行緒池執行原理
2019-04-25
原始碼執行緒
onnxruntime模型部署流程
2020-10-03
模型
從2張執行流程圖看vue和react區別
2018-10-08
流程圖VueReact
從template到DOM(Vue.js原始碼角度看內部執行機制)
2019-03-04
Vue.js原始碼
spark 原始碼分析之二十一 -- Task的執行流程
2019-07-29
Spark原始碼
PX4原始碼工作佇列執行流程
2024-07-05
原始碼佇列
從核心原始碼看 slab 記憶體池的建立初始化流程
2023-04-12
原始碼記憶體
【原始碼分析】XXL-JOB的執行器的註冊流程
2023-04-22
原始碼
從 Linux 原始碼看 socket 的 close
2018-08-16
Linux原始碼
從原始碼角度看ContentProvider
2019-03-01
原始碼IDE
從JDK原始碼看Reader
2019-02-25
JDK原始碼
從JDK原始碼看OutputStream
2019-02-25
JDK原始碼
從JDK原始碼看StringBuilder
2018-05-25
JDK原始碼UI
從JDK原始碼看StringBuffer
2018-06-03
JDK原始碼
從Chrome原始碼看WebSocket
2018-05-27
Chrome原始碼Web
從linux原始碼看epoll
2020-06-19
Linux原始碼
從Class原始碼看反射
2020-09-21
原始碼反射
MySQL 主從複製的執行流程
2024-04-19
MySql
Java servlet執行的完整流程（圖解含原始碼分析）
2019-05-23
JavaServlet圖解原始碼
DRF之請求執行流程和APIView原始碼分析
2024-04-23
APIView原始碼
從linux原始碼看socket(tcp)的timeout
2020-06-10
Linux原始碼TCP
從Linux原始碼看Socket(TCP)的bind
2020-10-16
Linux原始碼TCP
從Linux原始碼看Socket(TCP)的accept
2020-12-07
Linux原始碼TCP
利用 onnxruntime 庫同時推理多個模型的效率研究
2022-04-06
模型
從JDK原始碼角度看Long
2019-03-01
JDK原始碼
從原始碼看React.PureComponent
2018-11-11
原始碼React
從JDK原始碼角度看Integer
2019-03-04
JDK原始碼
從JDK原始碼角度看Float
2019-02-26
JDK原始碼
NEO從原始碼分析看NEOVM
2019-01-21
原始碼

【推理引擎】從原始碼看ONNXRuntime的執行流程

前言

準備工作

構造 InferenceSession 物件 & 初始化

讓模型 Run

總結

相關文章