Elasticsearch bulk data: batch-importing JSON data into ES

Posted by 後開啟撒打發了 on 2017-11-22

1. Bulk API
The official documentation: https://www.elastic.co/guide/en/elasticsearch/reference/6.0/docs-bulk.html

The REST API endpoint is /_bulk, and it expects the following newline delimited JSON (NDJSON) structure:

action_and_meta_data\n
optional_source\n
action_and_meta_data\n
optional_source\n
....
action_and_meta_data\n
optional_source\n

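In other words, each operation consists of two lines, and every line, including the last one, must end with a newline. The first line gives the action and its metadata; the second line is the optional source, that is, the document body. A delete has no second line. For example, the following inserts two documents and deletes one in a single request.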

{ "create" : { "_index" : "blog", "_type" : "article", "_id" : "3" }}
{ "title":"title1","posttime":"2016-07-02","content":"content 1" }
{ "create" : { "_index" : "blog", "_type" : "article", "_id" : "4" }}
{ "title":"title2","posttime":"2016-07-03","content":"content 2" }
{ "delete":{"_index" : "blog", "_type" : "article", "_id" : "1" }}
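
A body like this is easier to generate than to type by hand. Below is a minimal Python sketch, assuming the operations are held as (action, metadata, source) tuples. The helper name build_bulk_body and the tuple layout are illustrative assumptions and not part of Elasticsearch.

```python
import json


def build_bulk_body(operations):
    """Serialize (action, metadata, source) tuples into an NDJSON bulk body.

    source is None for actions that take no source line, such as delete.
    """
    lines = []
    for action, metadata, source in operations:
        # Action line, for example {"create": {"_index": ..., "_id": ...}}
        lines.append(json.dumps({action: metadata}, ensure_ascii=False))
        if source is not None:
            # Source line: the document itself, kept on a single line.
            lines.append(json.dumps(source, ensure_ascii=False))
    # Terminate every line, including the last one, with a newline.
    return "".join(line + "\n" for line in lines)


operations = [
    ("create", {"_index": "blog", "_type": "article", "_id": "3"},
     {"title": "title1", "posttime": "2016-07-02", "content": "content 1"}),
    ("create", {"_index": "blog", "_type": "article", "_id": "4"},
     {"title": "title2", "posttime": "2016-07-03", "content": "content 2"}),
    ("delete", {"_index": "blog", "_type": "article", "_id": "1"}, None),
]

body = build_bulk_body(operations)
print(body, end="")
```

The three operations produce five lines: two for each create and one for the delete.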

The official explanation and example:
Because this format uses literal \n's as delimiters, please be sure that the JSON actions and sources are not pretty printed. Here is an example of a correct sequence of bulk commands:

POST _bulk
{ "index" : { "_index" : "test", "_type" : "type1", "_id" : "1" } }
{ "field1" : "value1" }
{ "delete" : { "_index" : "test", "_type" : "type1", "_id" : "2" } }
{ "create" : { "_index" : "test", "_type" : "type1", "_id" : "3" } }
{ "field1" : "value3" }
{ "update" : {"_id" : "1", "_type" : "type1", "_index" : "test"} }
{ "doc" : {"field2" : "value2"} }
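
To illustrate the warning about pretty printing, here is a small Python sketch. It only uses the standard json module; the helper is_single_line is a hypothetical name introduced for this example.

```python
import json

doc = {"doc": {"field2": "value2"}}

compact = json.dumps(doc)            # one line, usable as a bulk line
pretty = json.dumps(doc, indent=2)   # spread over several lines, not usable

# A newline inside a string value is escaped by json.dumps,
# so it does not split the line either.
escaped = json.dumps({"content": "line1\nline2"})


def is_single_line(text):
    """Return True if text contains no raw line breaks."""
    return "\n" not in text and "\r" not in text


print(is_single_line(compact))   # True
print(is_single_line(pretty))    # False
print(is_single_line(escaped))   # True
```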

2. Submitting data stored in a file
The official documentation explains:
If you’re providing text file input to curl, you must use the --data-binary flag instead of plain -d. The latter doesn’t preserve newlines. Example:

$ cat requests
{ "index" : { "_index" : "test", "_type" : "type1", "_id" : "1" } }
{ "field1" : "value1" }
$ curl -s -H "Content-Type: application/x-ndjson" -XPOST localhost:9200/_bulk --data-binary "@requests"

A concrete example: create a file named request containing the six lines of data listed below the commands, then submit it with curl:

vim request
curl -H "Content-Type: application/x-ndjson" -XPOST '192.168.0.153:9200/_bulk' --data-binary "@request"

{ "index" : { "_index" : "test_index", "_type" : "chen", "_id" : "1" } }
{ "field1" : "value1" }
{ "index" : { "_index" : "test_index", "_type" : "chen", "_id" : "2" } }
{ "field1" : "value2" }
{ "index" : { "_index" : "test_index", "_type" : "chen", "_id" : "3" } }
{ "field1" : "value3" }
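
The same submission can be sketched in Python with the standard library. Reading the file in binary mode plays the role of --data-binary, because the bytes, newlines included, are sent unchanged. The function names build_bulk_request and send, and the default URL, are assumptions made for this sketch.

```python
import urllib.request


def build_bulk_request(path, url="http://localhost:9200/_bulk"):
    """Build a POST request whose body is the file's bytes, unmodified."""
    with open(path, "rb") as f:   # binary mode keeps the newlines intact
        payload = f.read()
    if not payload.endswith(b"\n"):
        raise ValueError("bulk body must end with a newline")
    return urllib.request.Request(
        url,
        data=payload,
        headers={"Content-Type": "application/x-ndjson"},
        method="POST",
    )


def send(request):
    """Send the request and return the response body as text."""
    with urllib.request.urlopen(request) as resp:
        return resp.read().decode("utf-8")
```

With a reachable node, the call would look like send(build_bulk_request("request", "http://192.168.0.153:9200/_bulk")).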

Check whether the submission succeeded:

curl -XGET 'http://192.168.0.153:9200/test_index/chen/1?pretty'
{
  "_index" : "test_index",
  "_type" : "chen",
  "_id" : "1",
  "_version" : 2,
  "found" : true,
  "_source" : {
    "field1" : "value1"
  }
}

OK, the submission succeeded.
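
Fetching one document confirms only that document. The _bulk response itself carries a top-level errors flag and one entry per operation under items, so a script can check every operation at once. Below is a minimal sketch; the function name failed_items is hypothetical, and the sample response is hand-written for illustration, not captured from a real cluster.

```python
def failed_items(bulk_response):
    """Return (action, _id, error) for every operation that failed."""
    failures = []
    if not bulk_response.get("errors"):
        return failures
    for item in bulk_response.get("items", []):
        # Each item has one key naming the action: index, create, update or delete.
        for action, result in item.items():
            if "error" in result:
                failures.append((action, result.get("_id"), result["error"]))
    return failures


# Hand-written sample shaped like a _bulk response (illustrative values only).
sample = {
    "took": 5,
    "errors": True,
    "items": [
        {"index": {"_index": "test_index", "_type": "chen", "_id": "1", "status": 200}},
        {"create": {"_index": "test_index", "_type": "chen", "_id": "2", "status": 409,
                    "error": {"type": "version_conflict_engine_exception",
                              "reason": "document already exists"}}},
    ],
}

for action, doc_id, error in failed_items(sample):
    print(action, doc_id, error["type"])
```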
