密碼學系列之:Argon2加密演算法詳解

flydean發表於2021-09-20

原文網址 : https://segmentfault.com/a/1190000040711422

簡介

Argon2是一個金鑰推導函式，在2015年7月被選為密碼雜湊大賽的冠軍，它由盧森堡大學的Alex Biryukov、Daniel Dinu和Dmitry Khovratovich設計，Argon2的實現通常是以Creative Commons CC0許可（即公共領域）或Apache License 2.0釋出，並提供了三個相關版本，分別是Argon2d，Argon2i和Argon2id。

本文將會討論一下Argon2的原理和使用。

金鑰推導函式key derivation function

在密碼學中，金鑰推導函式(KDF)是一種密碼學雜湊函式，它使用偽隨機函式從一個祕密值(如主金鑰、密碼或口令)中推匯出一個或多個金鑰。 KDF可用於將金鑰拉伸成更長的金鑰，或獲得所需格式的金鑰，例如將Diffie-Hellman金鑰交換的結果轉換為用於AES的對稱金鑰。

Password Hashing Competition

密碼學雖然是研究密碼的，但是其加密演算法是越公開越好，只有公開才能去檢視該演算法的好壞，只有經過大家的徹底研究，才能夠讓該演算法得以在業界使用和傳播。

最出名的密碼演算法大賽肯定是由NIST在2001年為了指定標準的AES演算法舉辦的大賽，該大賽的目的尋找最新的加密演算法來替代老的DES演算法。在這次大賽中，湧現了許多優秀的演算法，包括CAST-256, CRYPTON, DEAL, DFC, E2, FROG, HPC, LOKI97, MAGENTA, MARS, RC6, Rijndael, SAFER+, Serpent, 和 Twofish等。最終Rijndael演算法被選為最終的AES演算法實現。

同樣的PHC也是一個這樣的演算法比賽，和NIST舉辦的演算法比賽不同的是，這是一個非官方的，由密碼學家們組織的比賽。它是在由Jean-Philippe Aumasson於2012年秋季發起。

2013年第一季度，釋出了徵集意見書的通知，到2014年3月31日截止日期，共收到24份意見書。2014年12月，確定了9個入圍名單。2015年7月，宣佈Argon2為優勝者。

Argon2演算法

Argon2 的設計很簡單，旨在實現最高的記憶體填充率和對多個計算單元的有效利用，同時還能提供對 tradeoff attacks 的防禦（通過利用處理器的快取和記憶體）。

Argon2有三個變種。Argon2i、Argon2d和Argon2id。Argon2d速度更快，並且使用資料依賴的記憶體訪問方式，這使得它對GPU破解攻擊有很強的抵抗力，適合沒有side-channel timing attacks威脅的應用（例如加密貨幣）。

Argon2i則使用資料無關的記憶體訪問，這對於密碼雜湊和基於密碼的金鑰推導演算法來說是首選，其特點是速度較慢，因為它在記憶體上執行了更多的處理邏輯，以防止 tradeoff attacks 。

Argon2id是Argon2i和Argon2d的混合體，採用資料依賴型和資料獨立型記憶體訪問相結合的方式，從而可以同時抵禦side-channel timing attacks和GPU破解攻擊的能力。

Argon2的輸入引數

Argon2有兩類輸入引數，分別是primary inputs和secondary inputs。

primary inputs包括要加密的訊息P和nonce S，分別代表password和salt。

P的長度是0到2³²-1位元組，S的長度是8到2³²-1位元組（如果是做密碼hash，推薦16位元組）。

之所以叫做primary inputs，是因為這兩個引數是必須輸入的。

剩下的引數叫做secondary inputs，他們包括：

並行程度p，表示同時可以有多少獨立的計算鏈同時執行，取值是1到2²⁴-1。
Tag長度 τ，長度從4到2³²-1位元組。‘
記憶體大小 m, 單位是兆，值取 8p到2³²-1。
迭代器的個數t，提升執行速度。取值1到2³²-1。
版本號v，一個位元組，取值0x13。
安全值 K ，長度是0到2³²-1位元組。
附加資料 X，長度是0到2³²-1位元組。
Argon2的型別，0代表Argon2d，1代表Argon2i，2代表Argon2id。

這些輸入可以用下面的程式碼來表示：

   Inputs:
      password (P):       Bytes (0..232-1)    Password (or message) to be hashed
      salt (S):           Bytes (8..232-1)    Salt (16 bytes recommended for password hashing)
      parallelism (p):    Number (1..224-1)   Degree of parallelism (i.e. number of threads)
      tagLength (T):      Number (4..232-1)   Desired number of returned bytes
      memorySizeKB (m):   Number (8p..232-1)  Amount of memory (in kibibytes) to use
      iterations (t):     Number (1..232-1)   Number of iterations to perform
      version (v):        Number (0x13)       The current version is 0x13 (19 decimal)
      key (K):            Bytes (0..232-1)    Optional key (Errata: PDF says 0..32 bytes, RFC says 0..232 bytes)
      associatedData (X): Bytes (0..232-1)    Optional arbitrary extra data
      hashType (y):       Number (0=Argon2d, 1=Argon2i, 2=Argon2id)
   Output:
      tag:                Bytes (tagLength)   The resulting generated bytes, tagLength bytes long

處理流程

我們先來看一下非並行的Argon2的演算法流程：

非並行的Argon2是最簡單的。

上圖中G表示的是一個壓縮函式，接收兩個1024byte的輸入，輸出一個1024byte。

i表示的是執行的步數，上面的φ(i) 就是輸入，取自記憶體空間。

作為一個memory-hard的演算法，一個很重要的工作就是構建初始記憶體。接下來，我們看一下如何構建初始記憶體空間。

首先，我們需要構建 H0 ，這是一個 64-byte 的block值，通過H0，可以去構建更多的block。計算H0的公式如下：

H0 = H(p,τ,m,t,v,y,⟨P⟩,P,⟨S⟩,S,⟨K⟩,K,⟨X⟩,X)

它是前面我們提到的輸入引數的H函式。H0的大小是64byte。

看下H0的程式碼生成：

   Generate initial 64-byte block H0.
    All the input parameters are concatenated and input as a source of additional entropy.
    Errata: RFC says H0 is 64-bits; PDF says H0 is 64-bytes.
    Errata: RFC says the Hash is H^, the PDF says it's ℋ (but doesn't document what ℋ is). It's actually Blake2b.
    Variable length items are prepended with their length as 32-bit little-endian integers.
   buffer ← parallelism ∥ tagLength ∥ memorySizeKB ∥ iterations ∥ version ∥ hashType
         ∥ Length(password)       ∥ Password
         ∥ Length(salt)           ∥ salt
         ∥ Length(key)            ∥ key
         ∥ Length(associatedData) ∥ associatedData
   H0 ← Blake2b(buffer, 64) //default hash size of Blake2b is 64-bytes

對於輸入引數並行程度p來說，需要將記憶體分成一個記憶體矩陣B[i][j], 它是一個 p 行的矩陣。

計算矩陣B的值：

其中H′ 是一個基於H的變長hash演算法。

我們給一下這個演算法的實現：

Function Hash(message, digestSize)
   Inputs:
      message:         Bytes (0..232-1)     Message to be hashed
      digestSize:      Integer (1..232)     Desired number of bytes to be returned
   Output:
      digest:          Bytes (digestSize)   The resulting generated bytes, digestSize bytes long

   Hash is a variable-length hash function, built using Blake2b, capable of generating
   digests up to 232 bytes.

   If the requested digestSize is 64-bytes or lower, then we use Blake2b directly
   if (digestSize <= 64) then
      return Blake2b(digestSize ∥ message, digestSize) //concatenate 32-bit little endian digestSize with the message bytes

   For desired hashes over 64-bytes (e.g. 1024 bytes for Argon2 blocks),
   we use Blake2b to generate twice the number of needed 64-byte blocks,
   and then only use 32-bytes from each block

   Calculate the number of whole blocks (knowing we're only going to use 32-bytes from each)
   r ← Ceil(digestSize/32)-1;

   Generate r whole blocks.
   Initial block is generated from message
   V1 ← Blake2b(digestSize ∥ message, 64);
   Subsequent blocks are generated from previous blocks
   for i ← 2 to r do
      Vi ← Blake2b(Vi-1, 64)
   Generate the final (possibly partial) block
   partialBytesNeeded ← digestSize – 32*r;
   Vr+1 ← Blake2b(Vr, partialBytesNeeded)

   Concatenate the first 32-bytes of each block Vi
   (except the possibly partial last block, which we take the whole thing)
   Let Ai represent the lower 32-bytes of block Vi
   return A1 ∥ A2 ∥ ... ∥ Ar ∥ Vr+1

如果我們的迭代次數多於一次，也就是說t > 1, 我們這樣計算下一次迭代的 B ：

\(B^{t}[i][0]=G\left(B^{t-1}[i][q-1], B\left[i^{\prime}\right]\left[j^{\prime}\right]\right) \oplus B^{t-1}[i][0]\)

\(B^{t}[i][j]=G\left(B^{t}[i][j-1], B\left[i^{\prime}\right]\left[j^{\prime}\right]\right) \oplus B^{t-1}[i][j]\)

最終遍歷T次之後，我們得到最終的B ：

\(B_{\text {final }}=B^{T}[0][q-1] \oplus B^{T}[1][q-1] \oplus \cdots \oplus B^{T}[p-1][q-1]\)

最後得到輸出：

\(\mathrm{Tag} \leftarrow H^{\prime}\left(B_{\text {final }}\right)\)

這段邏輯也可以用程式碼來表示：

   Calculate number of 1 KB blocks by rounding down memorySizeKB to the nearest multiple of 4*parallelism kibibytes
   blockCount ← Floor(memorySizeKB, 4*parallelism)

   Allocate two-dimensional array of 1 KiB blocks (parallelism rows x columnCount columns)
   columnCount ← blockCount / parallelism;   //In the RFC, columnCount is referred to as q

   Compute the first and second block (i.e. column zero and one ) of each lane (i.e. row)
   for i ← 0 to parallelism-1 do for each row
      Bi[0] ← Hash(H0 ∥ 0 ∥ i, 1024) //Generate a 1024-byte digest
      Bi[1] ← Hash(H0 ∥ 1 ∥ i, 1024) //Generate a 1024-byte digest

   Compute remaining columns of each lane
   for i ← 0 to parallelism-1 do //for each row
      for j ← 2 to columnCount-1 do //for each subsequent column
         //i' and j' indexes depend if it's Argon2i, Argon2d, or Argon2id (See section 3.4)
         i′, j′ ← GetBlockIndexes(i, j)  //the GetBlockIndexes function is not defined
         Bi[j] = G(Bi[j-1], Bi′[j′]) //the G hash function is not defined

   Further passes when iterations > 1
   for nIteration ← 2 to iterations do
      for i ← 0 to parallelism-1 do for each row
        for j ← 0 to columnCount-1 do //for each subsequent column
           //i' and j' indexes depend if it's Argon2i, Argon2d, or Argon2id (See section 3.4)
           i′, j′ ← GetBlockIndexes(i, j)
           if j == 0 then 
             Bi[0] = Bi[0] xor G(Bi[columnCount-1], Bi′[j′])
           else
             Bi[j] = Bi[j] xor G(Bi[j-1], Bi′[j′])

   Compute final block C as the XOR of the last column of each row
   C ← B0[columnCount-1]
   for i ← 1 to parallelism-1 do
      C ← C xor Bi[columnCount-1]

   Compute output tag
   return Hash(C, tagLength)

本文已收錄於 http://www.flydean.com/40-argon2/

最通俗的解讀，最深刻的乾貨，最簡潔的教程，眾多你不知道的小技巧等你來發現！

歡迎關注我的公眾號:「程式那些事」,懂技術，更懂你！

密碼學系列之:bcrypt加密演算法詳解
2021-09-16
密碼學加密演算法
密碼學系列之:加密貨幣中的scrypt演算法
2021-10-14
密碼學加密演算法
【密碼學系列】|| 分組密碼的工作模式詳解
2020-04-07
密碼學模式
《密碼學系列》|| 詳解密碼學的多重DES演算法...
2020-04-07
密碼學解密演算法
密碼學系列之:NIST和SHA演算法
2021-05-10
密碼學演算法
密碼學之對稱加密
2019-09-05
密碼學加密
密碼學系列之:線上證書狀態協議OCSP詳解
2022-07-06
密碼學協議
密碼學系列之:feistel cipher
2021-06-16
密碼學
密碼學系列之:1Password的加密基礎PBKDF2
2021-10-04
密碼學加密
密碼學系列之:生日攻擊
2021-06-09
密碼學
密碼學系列之:twofish對稱金鑰分組演算法
2021-06-30
密碼學演算法
密碼學系列之:blowfish對稱金鑰分組演算法
2021-06-21
密碼學演算法
密碼學之DES/AES演算法
2019-02-16
密碼學演算法
密碼學系列之:內容嗅探
2021-03-10
密碼學
簡話密碼學3 - 常用加密演算法介紹
2020-10-20
密碼學加密演算法
密碼學系列——常見的加密方式（c#程式碼實操）
2020-08-05
密碼學加密C#
密碼學系列之:memory-bound函式
2021-07-14
密碼學函式
密碼學系列之:memory-hard函式
2021-05-26
密碼學函式
RSA加密原理&密碼學&HASH
2019-10-16
加密密碼學
密碼學基礎：編碼方式、訊息摘要演算法、加密演算法總結
2021-12-02
密碼學演算法加密
密碼學系列之:海綿函式sponge function
2021-09-14
密碼學函式Function
《密碼學系列》|| 密碼學中的流密碼是怎麼回事？
2020-04-07
密碼學
HTTPS詳解-加密演算法（一）
2018-07-01
HTTP加密演算法
HTTPS詳解-加密演算法（二）完
2018-07-23
HTTP加密演算法
商密學習-分組密碼加密模式筆記
2024-11-06
密碼加密模式筆記
golang 密碼學-1. 對稱加密
2020-05-22
Golang密碼學加密
現代密碼學之加密雜湊函式與訊息認證碼
2021-01-04
密碼學加密函式
密碼學系列之:csrf跨站點請求偽造
2021-03-18
密碼學
oracle 密碼詳解以及破解
2021-07-25
Oracle密碼
netty系列之:channelHandlerContext詳解
2022-03-02
NettyContext
netty系列之:channelPipeline詳解
2022-02-28
Netty
golang 密碼學-2. 非對稱加密
2020-05-25
Golang密碼學加密
Python學習——Caesar Cipher密碼破解（加密）方法
2020-12-15
Python密碼加密
密碼學系列之:碰撞抵禦和碰撞攻擊collision attack
2021-06-03
密碼學
密碼學系列之:使用openssl檢測網站是否支援ocsp
2022-07-11
密碼學網站
密碼學系列之:PKI的證書格式表示X.509
2022-06-22
密碼學
【密碼學原理】流密碼和RC4演算法
2020-10-03
密碼學演算法
明文顯示密碼效果詳解
2018-11-23
密碼