Linux記憶體管理：Malloc

發表於2015-09-24

對於核心的記憶體管理，像kmalloc，vmalloc，kmap，ioremap等比較熟悉。而對使用者層的管理機制不是很熟悉，下面就從malloc的實現入手.( 這裡不探討linux系統呼叫的實現機制. ) ,參考了《深入理解計算機系統》和一些網上的資料.
首先從http://ftp.gnu.org/gnu/glibc下載glibc庫2.21,

通常我們用的bsp或者sdk裡面的工具鏈都是編譯好的，而這個是原始碼，需要自己編譯(常用的有定製交叉編譯工具鏈).有時候我們需要新增自定義庫.

Linux中malloc的早期版本是由Doug Lea實現的，它有一個重要問題就是在並行處理時多個執行緒共享程式的記憶體空間，各執行緒可能併發請求記憶體，在這種情況下應該如何保證分配和回收的正確和有效。Wolfram Gloger在Doug Lea的基礎上改進使得glibc的malloc可以支援多執行緒——ptmalloc，在glibc-2.3.x.中已經整合了ptmalloc2，這就是我們平時使用的malloc.

其做法是，為了支援多執行緒並行處理時對於記憶體的併發請求操作，malloc的實現中把全域性使用者堆（heap）劃分成很多子堆（sub-heap）。這些子堆是按照迴圈單連結串列的形式組織起來的。每一個子堆利用互斥鎖（mutex）使執行緒對於該子堆的訪問互斥。當某一執行緒需要呼叫malloc分配記憶體空間時，該執行緒搜尋迴圈連結串列試圖獲得一個沒有加鎖的子堆。如果所有的子堆都已經加鎖，那麼malloc會開闢一塊新的子堆，對於新開闢的子堆預設情況下是不加鎖的，因此執行緒不需要阻塞就可以獲得一個新的子堆並進行分配操作。在回收free操作中，執行緒同樣試圖獲得待回收塊所在子堆的鎖，如果該子堆正在被別的執行緒使用，則需要等待直到其他執行緒釋放該子堆的互斥鎖之後才可以進行回收操作。

申請小塊記憶體時會產生很多記憶體碎片，ptmalloc在整理時需要對子堆做加鎖操作，每個加鎖操作大概需要5～10個cpu指令，而且程式執行緒數很高的情況下，鎖等待的時間就會延長，導致malloc效能下降。

因此很多大型的服務端應用會自己實現記憶體池，以降低向系統malloc的開銷。Hoard和TCmalloc是在glibc和應用程式之間實現的記憶體管理。Hoard的作者是美國麻省的Amherst College的一名老師，理論角度對hoard的研究和優化比較多，相關的文獻可以hoard主頁下載到到。從我自己專案中的系統使用來看，Hoard確實能夠很大程度的提高程式的效能和穩定性。TCMalloc（Thread-Caching Malloc）是google開發的開源工具──“google-perftools”中的成員。這裡有它的系統的介紹和安裝方法。這個只是對它歷史發展的一個簡單介紹，具體改動還需去官網檢視.

下面我們就看看malloc：

malloc的全稱是memory allocation，中文叫動態記憶體分配，當無法知道記憶體具體位置的時候，想要繫結真正的記憶體空間，就需要用到動態的分配記憶體。

原型為： extern void *malloc(unsigned int num_bytes)。

具體宣告在malloc.h中：

/* Allocate SIZE bytes of memory. */
extern void *malloc (size_t __size) __THROW __attribute_malloc__ __wur;

1 2	/* Allocate SIZE bytes of memory. / extern void malloc (size_t __size) __THROW __attribute_malloc__ __wur;

返回值：

如果分配成功則返回指向被分配記憶體的指標(此儲存區中的初始值不確定)，否則返回空指標NULL。當記憶體不再使用時，應使用free()函式將記憶體塊釋放。函式返回的指標一定要適當對齊，使其可以用於任何資料物件。
注意:
malloc(0) 返回不為空。 Free(p) 後p不為空。

那麼malloc到底是從哪裡獲取的記憶體呢？
答案是從堆裡面獲得空間；malloc的應用必然是某一個程式呼叫，而每一個程式在啟動的時候，系統預設給它分配了heap。下面我們就看看程式的記憶體空間佈局：
Anyway, here is the standard segment layout in a Linux process：(這個是x86 虛擬地址空間的預設佈局)

在glibc庫中找到malloc.c檔案：

strong_alias (__libc_malloc, __malloc) strong_alias (__libc_malloc, malloc)

1	strong_alias (__libc_malloc, __malloc) strong_alias (__libc_malloc, malloc)

即malloc別名為__libc_malloc,__malloc.並且在malloc.c中我們不能找到malloc的直接實現,而是有__libc_malloc：

/*------------------------ Public wrappers. --------------------------------*/

void *
__libc_malloc (size_t bytes)
{
  mstate ar_ptr;
  void *victim;

  void *(*hook) (size_t, const void *)
    = atomic_forced_read (__malloc_hook);
  if (__builtin_expect (hook != NULL, 0))
    return (*hook)(bytes, RETURN_ADDRESS (0));

  arena_lookup (ar_ptr);

  arena_lock (ar_ptr, bytes);
  if (!ar_ptr)
    return 0;

  victim = _int_malloc (ar_ptr, bytes);
  if (!victim)
    {
      LIBC_PROBE (memory_malloc_retry, 1, bytes);
      ar_ptr = arena_get_retry (ar_ptr, bytes);
      if (__builtin_expect (ar_ptr != NULL, 1))
        {
          victim = _int_malloc (ar_ptr, bytes);
          (void) mutex_unlock (&ar_ptr->mutex);
        }
    }
  else
    (void) mutex_unlock (&ar_ptr->mutex);
  assert (!victim || chunk_is_mmapped (mem2chunk (victim)) ||
          ar_ptr == arena_for_chunk (mem2chunk (victim)));
  return victim;
}

/*------------------------ Public wrappers. --------------------------------*/

void *

__libc_malloc (size_t bytes)

{

mstate ar_ptr;

void *victim;

void *(*hook) (size_t, const void *)

= atomic_forced_read (__malloc_hook);

if (__builtin_expect (hook != NULL, 0))

return (*hook)(bytes, RETURN_ADDRESS (0));

arena_lookup (ar_ptr);

arena_lock (ar_ptr, bytes);

if (!ar_ptr)

return 0;

victim = _int_malloc (ar_ptr, bytes);

if (!victim)

{

LIBC_PROBE (memory_malloc_retry, 1, bytes);

ar_ptr = arena_get_retry (ar_ptr, bytes);

if (__builtin_expect (ar_ptr != NULL, 1))

{

victim = _int_malloc (ar_ptr, bytes);

(void) mutex_unlock (&ar_ptr->mutex);

}

else

(void) mutex_unlock (&ar_ptr->mutex);

assert (!victim || chunk_is_mmapped (mem2chunk (victim)) ||

ar_ptr == arena_for_chunk (mem2chunk (victim)));

return victim;

}

在這個函式的第一行是關於hook的，我們先看一個定義：

void *weak_variable (*__malloc_hook)
  (size_t __size, const void *) = malloc_hook_ini;

1 2	void weak_variable (__malloc_hook) (size_t __size, const void *) = malloc_hook_ini;

它是gcc attribute weak的特性，可以查資料進一步瞭解.這裡說明一下由於是弱屬性，所以當有具體的實現的時候，就以外部實現為準.

static void *
malloc_hook_ini (size_t sz, const void *caller)
{
  __malloc_hook = NULL;
  ptmalloc_init ();
  return __libc_malloc (sz);
}

static void *

malloc_hook_ini (size_t sz, const void *caller)

{

__malloc_hook = NULL;

ptmalloc_init ();

return __libc_malloc (sz);

}

__libc_malloc中首先判斷hook函式指標是否為空，不為空則呼叫它，並返回。glibc2.21裡預設malloc_hook是初始化為malloc_hook_ini的。
但是我們發現在malloc_hook_ini中把__malloc_hook賦值為NULl，這樣就避免了遞迴呼叫.
同理在最後部分也有一個__malloc_initialize_hook的：預設為空.

void weak_variable (*__malloc_initialize_hook) (void) = NULL;

1	void weak_variable (*__malloc_initialize_hook) (void) = NULL;

那麼ptmalloc_init到底又做了什麼工作呢？

static void
ptmalloc_init (void)
{
  if (__malloc_initialized >= 0)
    return;

  __malloc_initialized = 0;

#ifdef SHARED
  /* In case this libc copy is in a non-default namespace, never use brk.
     Likewise if dlopened from statically linked program. */
  Dl_info di;
  struct link_map *l;

  if (_dl_open_hook != NULL
      || (_dl_addr (ptmalloc_init, &di, &l, NULL) != 0
          && l->l_ns != LM_ID_BASE))
    __morecore = __failing_morecore;
#endif

  tsd_key_create (&arena_key, NULL);
  tsd_setspecific (arena_key, (void *) &main_arena);
  thread_atfork (ptmalloc_lock_all, ptmalloc_unlock_all, ptmalloc_unlock_all2);
  const char *s = NULL;
  if (__glibc_likely (_environ != NULL))
    {
      char **runp = _environ;
      char *envline;

      while (__builtin_expect ((envline = next_env_entry (&runp)) != NULL,
                               0))
        {
          size_t len = strcspn (envline, "=");

          if (envline[len] != '=')
            /* This is a "MALLOC_" variable at the end of the string
               without a '=' character. Ignore it since otherwise we
               will access invalid memory below. */
            continue;

          switch (len)
            {
            case 6:
              if (memcmp (envline, "CHECK_", 6) == 0)
                s = &envline[7];
              break;
            case 8:
              if (!__builtin_expect (__libc_enable_secure, 0))
                {
                  if (memcmp (envline, "TOP_PAD_", 8) == 0)
                    __libc_mallopt (M_TOP_PAD, atoi (&envline[9]));
                  else if (memcmp (envline, "PERTURB_", 8) == 0)
                    __libc_mallopt (M_PERTURB, atoi (&envline[9]));
                }
              break;
            case 9:
              if (!__builtin_expect (__libc_enable_secure, 0))
                {
                  if (memcmp (envline, "MMAP_MAX_", 9) == 0)
                    __libc_mallopt (M_MMAP_MAX, atoi (&envline[10]));
                  else if (memcmp (envline, "ARENA_MAX", 9) == 0)
                    __libc_mallopt (M_ARENA_MAX, atoi (&envline[10]));
                }
              break;
            case 10:
              if (!__builtin_expect (__libc_enable_secure, 0))
                {
                  if (memcmp (envline, "ARENA_TEST", 10) == 0)
                    __libc_mallopt (M_ARENA_TEST, atoi (&envline[11]));
                }
              break;
            case 15:
              if (!__builtin_expect (__libc_enable_secure, 0))
                {
                  if (memcmp (envline, "TRIM_THRESHOLD_", 15) == 0)
                    __libc_mallopt (M_TRIM_THRESHOLD, atoi (&envline[16]));
                  else if (memcmp (envline, "MMAP_THRESHOLD_", 15) == 0)
                    __libc_mallopt (M_MMAP_THRESHOLD, atoi (&envline[16]));
                }
              break;
            default:
              break;
            }
        }
    }
  if (s && s[0])
    {
      __libc_mallopt (M_CHECK_ACTION, (int) (s[0] - '0'));
      if (check_action != 0)
        __malloc_check_init ();
    }
  void (*hook) (void) = atomic_forced_read (__malloc_initialize_hook);
  if (hook != NULL)
    (*hook)();
  __malloc_initialized = 1;
}

static void

ptmalloc_init (void)

{

if (__malloc_initialized >= 0)

return;

__malloc_initialized = 0;

#ifdef SHARED

/* In case this libc copy is in a non-default namespace, never use brk.

Likewise if dlopened from statically linked program. */

Dl_info di;

struct link_map *l;

if (_dl_open_hook != NULL

|| (_dl_addr (ptmalloc_init, &di, &l, NULL) != 0

&& l->l_ns != LM_ID_BASE))

__morecore = __failing_morecore;

#endif

tsd_key_create (&arena_key, NULL);

tsd_setspecific (arena_key, (void *) &main_arena);

thread_atfork (ptmalloc_lock_all, ptmalloc_unlock_all, ptmalloc_unlock_all2);

const char *s = NULL;

if (__glibc_likely (_environ != NULL))

{

char **runp = _environ;

char *envline;

while (__builtin_expect ((envline = next_env_entry (&runp)) != NULL,

0))

{

size_t len = strcspn (envline, "=");

if (envline[len] != '=')

/* This is a "MALLOC_" variable at the end of the string

without a '=' character. Ignore it since otherwise we

will access invalid memory below. */

continue;

switch (len)

{

case 6:

if (memcmp (envline, "CHECK_", 6) == 0)

s = &envline[7];

break;

case 8:

if (!__builtin_expect (__libc_enable_secure, 0))

{

if (memcmp (envline, "TOP_PAD_", 8) == 0)

__libc_mallopt (M_TOP_PAD, atoi (&envline[9]));

else if (memcmp (envline, "PERTURB_", 8) == 0)

__libc_mallopt (M_PERTURB, atoi (&envline[9]));

}

break;

case 9:

if (!__builtin_expect (__libc_enable_secure, 0))

{

if (memcmp (envline, "MMAP_MAX_", 9) == 0)

__libc_mallopt (M_MMAP_MAX, atoi (&envline[10]));

else if (memcmp (envline, "ARENA_MAX", 9) == 0)

__libc_mallopt (M_ARENA_MAX, atoi (&envline[10]));

}

break;

case 10:

if (!__builtin_expect (__libc_enable_secure, 0))

{

if (memcmp (envline, "ARENA_TEST", 10) == 0)

__libc_mallopt (M_ARENA_TEST, atoi (&envline[11]));

}

break;

case 15:

if (!__builtin_expect (__libc_enable_secure, 0))

{

if (memcmp (envline, "TRIM_THRESHOLD_", 15) == 0)

__libc_mallopt (M_TRIM_THRESHOLD, atoi (&envline[16]));

else if (memcmp (envline, "MMAP_THRESHOLD_", 15) == 0)

__libc_mallopt (M_MMAP_THRESHOLD, atoi (&envline[16]));

}

break;

default:

break;

}

if (s && s[0])

{

__libc_mallopt (M_CHECK_ACTION, (int) (s[0] - '0'));

if (check_action != 0)

__malloc_check_init ();

}

void (*hook) (void) = atomic_forced_read (__malloc_initialize_hook);

if (hook != NULL)

(*hook)();

__malloc_initialized = 1;

}

而__malloc_initialized在arena.c中預設初始化為：即開始的時候小於0.

/* Already initialized? */
int __malloc_initialized = -1;

1 2	/* Already initialized? */ int __malloc_initialized = -1;

函式開始把它賦值為0，最後初始化完成賦值為1. 所以這個函式完成了malloc的初始化工作.只有第一次呼叫的時候會用到.
接著是處理_environ即傳遞過來的環境變數，進行記憶體分配策略控制，你可以定製記憶體管理函式的行為，通過調整由mallopt()函式的引數。（預設環境變數為空。）

記憶體分配調整甚至可以不在你的程式中引入mallopt()呼叫和重新編譯它。在你想快速測試一些值或者你沒有原始碼時，這非常有用。你僅需要做的是在執行程式前，設定合適的環境變數。表1展示mallopt()引數和環境變數的對映關係以及一些額外的資訊。例如，如果你希望設定記憶體消減閾值為64k，你可以執行這個程式：
#MALLOC_TRIM_THRESHOLD=65536 my_prog

記憶體除錯：連續性檢查，可以設定變數MALLOC_CHECK_=1

#MALLOC_CHECK_=1 my_prog

還有一個mtrace使用的例子：

#include <stdio.h>
#include <stdlib.h>
#include <malloc.h>

void main(void)
{

 char *p;
 mtrace();
 p=(char *)malloc(100);
 if(p ==NULL)
     return 0;
  memcpy(p,"helllllllllll",20);
 printf("....1 %s.....\n",p);
 //free(p);

}

#include <stdio.h>

#include <stdlib.h>

#include <malloc.h>

void main(void)

{

char *p;

mtrace();

p=(char *)malloc(100);

if(p ==NULL)

return 0;

memcpy(p,"helllllllllll",20);

printf("....1 %s.....\n",p);

//free(p);

}

執行： #MALLOC_TRACE=”1.txt” ./a.out
然後用mtrace檢視結果：

mtrace 1.txt 

Memory not freed:
-----------------
   Address Size Caller
0x09849378 0x64 at 0x804849e

mtrace 1.txt

Memory not freed:

-----------------

Address Size Caller

0x09849378 0x64 at 0x804849e

一些GNU C庫提供的標準除錯工具可能並不適合你程式的特殊需求。在這種情況下，你可以藉助一個外部的記憶體除錯工具(見 Resource)或者在你的庫內部作修改。做這件事中只是簡單的寫三個函式以及將它們與預先定義的變數相關聯：

__malloc_hook points to a function to be called when the user calls malloc(). You can do your own checks and accounting here, and then call the real malloc() to get the memory that was requested.
__malloc_hook 指向一個函式，當使用者呼叫malloc()時，這個函式將被呼叫。你可以在這裡做你自己的檢查和計數，然後呼叫真實的malloc來得到被請求的記憶體。
__free_hook points to a function called instead of the standard free().
__free_hook 指向一個函式，用來替換標準的free()
__malloc_initialize_hook points to a function called when the memory management system is initialized. This allows you to perform some operations, say, setting the values of the previous hooks, before any memory-related operation takes place.

__malloc_initialize__hook 指向一個函式，當記憶體管理系統被初始化的時候，這個函式被呼叫。這允許你來實施一些操作，例如，在任何記憶體相關的操作生效前，設定前面的勾子值。

在其它的記憶體相關的呼叫中，Hooks()也有效，包括realloc()，calloc()等等。確保在呼叫malloc()或free()之前，儲存先前的勾子的值，把它們儲存起來。如果你不這麼做，你的程式將陷入無盡的遞迴。看看libc info page給的一個記憶體除錯的例子來看看相關細節，最後一點，勾子也被mcheck和mtrace系統使用。在使用所有它們的組合的時候，小心是沒錯的。

而下面的是關於多執行緒的：

建立執行緒私有例項 arena_key，該私有例項儲存的是分配區（ arena ）的 malloc_state 例項指標。 arena_key 指向的可能是主分配區的指標，也可能是非主分配區的指標，這裡將呼叫 ptmalloc_init() 的執行緒的 arena_key 繫結到主分配區上。意味著本執行緒首選從主分配區分配記憶體。

然後呼叫 thread_atfork() 設定當前程式在 fork 子執行緒（ linux 下執行緒是輕量級程式，使用類似 fork 程式的機制建立）時處理 mutex 的回撥函式，在本程式 fork 子執行緒時，呼叫 ptmalloc_lock_all() 獲得所有分配區的鎖，禁止所有分配區分配記憶體，當子執行緒建立完畢，父程式呼叫 ptmalloc_unlock_all() 重新 unlock 每個分配區的鎖 mutex ，子執行緒呼叫 ptmalloc_unlock_all2() 重新初始化每個分配區的鎖 mutex

tsd_key_create (&arena_key, NULL);
  tsd_setspecific (arena_key, (void *) &main_arena);
  thread_atfork (ptmalloc_lock_all, ptmalloc_unlock_all, ptmalloc_unlock_all2);

tsd_key_create (&arena_key, NULL);

tsd_setspecific (arena_key, (void *) &main_arena);

thread_atfork (ptmalloc_lock_all, ptmalloc_unlock_all, ptmalloc_unlock_all2);

當有多個執行緒同時申請訪問記憶體的時候，arena_key的main_arena處於保持互斥鎖狀態，那麼為了提高效率即上面的程式碼，保證了在獲取不到主分割槽的時候，呼叫arena_get2自動建立次分割槽state。見程式碼：

#define arena_lookup(ptr) do { \
      void *vptr = NULL;                         \
      ptr = (mstate) tsd_getspecific (arena_key, vptr);             \
  } while (0)

#define arena_lookup(ptr) do { \

void *vptr = NULL; \

ptr = (mstate) tsd_getspecific (arena_key, vptr); \

} while (0)

和

#define arena_lock(ptr, size) do {                     \
      if (ptr)                                 \
        (void) mutex_lock (&ptr->mutex);                 \
      else                                 \
        ptr = arena_get2 (ptr, (size), NULL);                 \
  } while (0)

#define arena_lock(ptr, size) do { \

if (ptr) \

(void) mutex_lock (&ptr->mutex); \

else \

ptr = arena_get2 (ptr, (size), NULL); \

} while (0)

在繼續之前我們補一下關鍵的資料結構：

struct malloc_state
{
  /* Serialize access. */
  mutex_t mutex;

  /* Flags (formerly in max_fast). */
  int flags;

  /* Fastbins */
  mfastbinptr fastbinsY[NFASTBINS];

  /* Base of the topmost chunk -- not otherwise kept in a bin */
  mchunkptr top;

  /* The remainder from the most recent split of a small request */
  mchunkptr last_remainder;

  /* Normal bins packed as described above */
  mchunkptr bins[NBINS * 2 - 2];

  /* Bitmap of bins */
  unsigned int binmap[BINMAPSIZE];

  /* Linked list */
  struct malloc_state *next;

  /* Linked list for free arenas. */
  struct malloc_state *next_free;

  /* Memory allocated from the system in this arena. */
  INTERNAL_SIZE_T system_mem;
  INTERNAL_SIZE_T max_system_mem;
}

struct malloc_state

{

/* Serialize access. */

mutex_t mutex;

/* Flags (formerly in max_fast). */

int flags;

/* Fastbins */

mfastbinptr fastbinsY[NFASTBINS];

/* Base of the topmost chunk -- not otherwise kept in a bin */

mchunkptr top;

/* The remainder from the most recent split of a small request */

mchunkptr last_remainder;

/* Normal bins packed as described above */

mchunkptr bins[NBINS * 2 - 2];

/* Bitmap of bins */

unsigned int binmap[BINMAPSIZE];

/* Linked list */

struct malloc_state *next;

/* Linked list for free arenas. */

struct malloc_state *next_free;

/* Memory allocated from the system in this arena. */

INTERNAL_SIZE_T system_mem;

INTERNAL_SIZE_T max_system_mem;

}

還有具體分配的chunk：關於它的註釋部分這麼就不翻譯了，但需要好好看看。

/*
  ----------------------- Chunk representations -----------------------
*/

/*
  This struct declaration is misleading (but accurate and necessary).
  It declares a "view" into memory allowing access to necessary
  fields at known offsets from a given base. See explanation below.
*/

struct malloc_chunk {

  INTERNAL_SIZE_T prev_size; /* Size of previous chunk (if free). */
  INTERNAL_SIZE_T size; /* Size in bytes, including overhead. */

  struct malloc_chunk* fd; /* double links -- used only if free. */
  struct malloc_chunk* bk;

  /* Only used for large blocks: pointer to next larger size. */
  struct malloc_chunk* fd_nextsize; /* double links -- used only if free. */
  struct malloc_chunk* bk_nextsize;
};

/*
   malloc_chunk details:

    (The following includes lightly edited explanations by Colin Plumb.)

    Chunks of memory are maintained using a `boundary tag' method as
    described in e.g., Knuth or Standish. (See the paper by Paul
    Wilson ftp://ftp.cs.utexas.edu/pub/garbage/allocsrv.ps for a
    survey of such techniques.) Sizes of free chunks are stored both
    in the front of each chunk and at the end. This makes
    consolidating fragmented chunks into bigger chunks very fast. The
    size fields also hold bits representing whether chunks are free or
    in use.

    An allocated chunk looks like this:

    chunk-> +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
     | Size of previous chunk, if allocated | |
     +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
     | Size of chunk, in bytes |M|P|
      mem-> +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
     | User data starts here... .
     . .
     . (malloc_usable_size() bytes) .
     . |
nextchunk-> +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
     | Size of chunk |
     +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

    Where "chunk" is the front of the chunk for the purpose of most of
    the malloc code, but "mem" is the pointer that is returned to the
    user. "Nextchunk" is the beginning of the next contiguous chunk.

    Chunks always begin on even word boundaries, so the mem portion
    (which is returned to the user) is also on an even word boundary, and
    thus at least double-word aligned.

    Free chunks are stored in circular doubly-linked lists, and look like this:

    chunk-> +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
     | Size of previous chunk |
     +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
    `head:' | Size of chunk, in bytes |P|
      mem-> +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
     | Forward pointer to next chunk in list |
     +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
     | Back pointer to previous chunk in list |
     +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
     | Unused space (may be 0 bytes long) .
     . .
     . |
nextchunk-> +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
    `foot:' | Size of chunk, in bytes |
     +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

    The P (PREV_INUSE) bit, stored in the unused low-order bit of the
    chunk size (which is always a multiple of two words), is an in-use
    bit for the *previous* chunk. If that bit is *clear*, then the
    word before the current chunk size contains the previous chunk
    size, and can be used to find the front of the previous chunk.
    The very first chunk allocated always has this bit set,
    preventing access to non-existent (or non-owned) memory. If
    prev_inuse is set for any given chunk, then you CANNOT determine
    the size of the previous chunk, and might even get a memory
    addressing fault when trying to do so.

    Note that the `foot' of the current chunk is actually represented
    as the prev_size of the NEXT chunk. This makes it easier to
    deal with alignments etc but can be very confusing when trying
    to extend or adapt this code.

    The two exceptions to all this are

     1. The special chunk `top' doesn't bother using the
    trailing size field since there is no next contiguous chunk
    that would have to index off it. After initialization, `top'
    is forced to always exist. If it would become less than
    MINSIZE bytes long, it is replenished.

     2. Chunks allocated via mmap, which have the second-lowest-order
    bit M (IS_MMAPPED) set in their size fields. Because they are
    allocated one-by-one, each must contain its own trailing size field.

*/

100

101

102

103

104

105

106

107

----------------------- Chunk representations -----------------------

This struct declaration is misleading (but accurate and necessary).

It declares a "view" into memory allowing access to necessary

fields at known offsets from a given base. See explanation below.

struct malloc_chunk {

INTERNAL_SIZE_T prev_size; /* Size of previous chunk (if free). */

INTERNAL_SIZE_T size; /* Size in bytes, including overhead. */

struct malloc_chunk* fd; /* double links -- used only if free. */

struct malloc_chunk* bk;

/* Only used for large blocks: pointer to next larger size. */

struct malloc_chunk* fd_nextsize; /* double links -- used only if free. */

struct malloc_chunk* bk_nextsize;

};

malloc_chunk details:

(The following includes lightly edited explanations by Colin Plumb.)

Chunks of memory are maintained using a `boundary tag' method as

described in e.g., Knuth or Standish. (See the paper by Paul

Wilson ftp://ftp.cs.utexas.edu/pub/garbage/allocsrv.ps for a

survey of such techniques.) Sizes of free chunks are stored both

in the front of each chunk and at the end. This makes

consolidating fragmented chunks into bigger chunks very fast. The

size fields also hold bits representing whether chunks are free or

in use.

An allocated chunk looks like this:

chunk-> +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

| Size of previous chunk, if allocated | |

+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

| Size of chunk, in bytes |M|P|

mem-> +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

| User data starts here... .

. .

. (malloc_usable_size() bytes) .

. |

nextchunk-> +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

| Size of chunk |

+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

Where "chunk" is the front of the chunk for the purpose of most of

the malloc code, but "mem" is the pointer that is returned to the

user. "Nextchunk" is the beginning of the next contiguous chunk.

Chunks always begin on even word boundaries, so the mem portion

(which is returned to the user) is also on an even word boundary, and

thus at least double-word aligned.

Free chunks are stored in circular doubly-linked lists, and look like this:

chunk-> +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

| Size of previous chunk |

+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

`head:' | Size of chunk, in bytes |P|

mem-> +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

| Forward pointer to next chunk in list |

+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

| Back pointer to previous chunk in list |

+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

| Unused space (may be 0 bytes long) .

. .

. |

nextchunk-> +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

`foot:' | Size of chunk, in bytes |

+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

The P (PREV_INUSE) bit, stored in the unused low-order bit of the

chunk size (which is always a multiple of two words), is an in-use

bit for the *previous* chunk. If that bit is *clear*, then the

word before the current chunk size contains the previous chunk

size, and can be used to find the front of the previous chunk.

The very first chunk allocated always has this bit set,

preventing access to non-existent (or non-owned) memory. If

prev_inuse is set for any given chunk, then you CANNOT determine

the size of the previous chunk, and might even get a memory

addressing fault when trying to do so.

Note that the `foot' of the current chunk is actually represented

as the prev_size of the NEXT chunk. This makes it easier to

deal with alignments etc but can be very confusing when trying

to extend or adapt this code.

The two exceptions to all this are

1. The special chunk `top' doesn't bother using the

trailing size field since there is no next contiguous chunk

that would have to index off it. After initialization, `top'

is forced to always exist. If it would become less than

MINSIZE bytes long, it is replenished.

2. Chunks allocated via mmap, which have the second-lowest-order

bit M (IS_MMAPPED) set in their size fields. Because they are

allocated one-by-one, each must contain its own trailing size field.

實際分配的chunk圖，空間裡如何佈局的：

而空的chunk結構如下圖：

因為後邊程式碼就是按照這個結構來操作的.

繼續回到__libc_malloc函式,hook處理完之後，Arena_lookup查詢arena_key，當然不為空，前面第一次呼叫hook時已經賦值。（如果多執行緒下，獲取不到main_arena，則分配次分割槽，這個前面也討論過）

那麼獲得互斥鎖。

進入記憶體分配的核心函式_int_malloc，它是malloc分配的核心程式碼和實現.程式碼挺多，自行分析.
1. 判斷申請的空間是否在fastbin，如果在則申請返回，否則繼續（<64B,一般小位元組chunk釋放後放在這裡）
2. 判斷申請的空間是否在smallbin（小於512B），如果是申請返回否則在largebin中
3. 前兩個都不在，那麼肯定在largebin，計算索引，繼續
4. 進入for(;;)後續處理.主要是垃圾回收工作。如果垃圾回收也不行，則進入use_top chunk
5. use_top chunk申請，如果沒有，則呼叫sysmalloc擴充套件heap，如下：

/*
         Otherwise, relay to handle system-dependent cases
       */
      else
        {
          void *p = sysmalloc (nb, av);
          if (p != NULL)
            alloc_perturb (p, bytes);
          return p;
        }

Otherwise, relay to handle system-dependent cases

else

{

void *p = sysmalloc (nb, av);

if (p != NULL)

alloc_perturb (p, bytes);

return p;

}

對於第一次呼叫malloc它直接到sysmalloc來擴充套件heap，自測的例子是先申請200位元組的空間.由於程式一開始fast_max為0，所以肯定在smallbins分類中，但是由於初始化，所以
會呼叫malloc_consolidate來初始化bins，見malloc_init_state(av);：

/*
   Initialize a malloc_state struct.

   This is called only from within malloc_consolidate, which needs
   be called in the same contexts anyway. It is never called directly
   outside of malloc_consolidate because some optimizing compilers try
   to inline it at all call points, which turns out not to be an
   optimization at all. (Inlining it in malloc_consolidate is fine though.)
 */

static void
malloc_init_state (mstate av)
{
  int i;
  mbinptr bin;

  /* Establish circular links for normal bins */
  for (i = 1; i < NBINS; ++i)
    {
      bin = bin_at (av, i);
      bin->fd = bin->bk = bin;   //  連結串列初始化指向自己
    }

#if MORECORE_CONTIGUOUS
  if (av != &main_arena)
#endif
  set_noncontiguous (av);
  if (av == &main_arena)
    set_max_fast (DEFAULT_MXFAST);            //  設定fastbin max 為 64B
  av->flags |= FASTCHUNKS_BIT;

  av->top = initial_top (av);
}

Initialize a malloc_state struct.

This is called only from within malloc_consolidate, which needs

be called in the same contexts anyway. It is never called directly

outside of malloc_consolidate because some optimizing compilers try

to inline it at all call points, which turns out not to be an

optimization at all. (Inlining it in malloc_consolidate is fine though.)

static void

malloc_init_state (mstate av)

{

int i;

mbinptr bin;

/* Establish circular links for normal bins */

for (i = 1; i < NBINS; ++i)

{

bin = bin_at (av, i);

bin->fd = bin->bk = bin; // 連結串列初始化指向自己

}

#if MORECORE_CONTIGUOUS

if (av != &main_arena)

#endif

set_noncontiguous (av);

if (av == &main_arena)

set_max_fast (DEFAULT_MXFAST); // 設定fastbin max 為 64B

av->flags |= FASTCHUNKS_BIT;

av->top = initial_top (av);

}

我們進入sysmalloc,一開始判斷申請的nb是否大於需要map的閥值，如果大於則進入mmap。一般大於128K，它可以動態調整

/*
  MMAP_THRESHOLD_MAX and _MIN are the bounds on the dynamically
  adjusted MMAP_THRESHOLD.
*/

#ifndef DEFAULT_MMAP_THRESHOLD_MIN
#define DEFAULT_MMAP_THRESHOLD_MIN (128 * 1024)
#endif

#ifndef DEFAULT_MMAP_THRESHOLD_MAX
  /* For 32-bit platforms we cannot increase the maximum mmap
     threshold much because it is also the minimum value for the
     maximum heap size and its alignment. Going above 512k (i.e., 1M
     for new heaps) wastes too much address space. */
# if __WORDSIZE == 32
# define DEFAULT_MMAP_THRESHOLD_MAX (512 * 1024)
# else
# define DEFAULT_MMAP_THRESHOLD_MAX (4 * 1024 * 1024 * sizeof(long))
# endif
#endi

MMAP_THRESHOLD_MAX and _MIN are the bounds on the dynamically

adjusted MMAP_THRESHOLD.

#ifndef DEFAULT_MMAP_THRESHOLD_MIN

#define DEFAULT_MMAP_THRESHOLD_MIN (128 * 1024)

#endif

#ifndef DEFAULT_MMAP_THRESHOLD_MAX

/* For 32-bit platforms we cannot increase the maximum mmap

threshold much because it is also the minimum value for the

maximum heap size and its alignment. Going above 512k (i.e., 1M

for new heaps) wastes too much address space. */

# if __WORDSIZE == 32

# define DEFAULT_MMAP_THRESHOLD_MAX (512 * 1024)

# else

# define DEFAULT_MMAP_THRESHOLD_MAX (4 * 1024 * 1024 * sizeof(long))

# endif

#endi

然後是判斷是主分配區或非主分配區，分別不同處理。
這裡進入主分配，我們看下部分核心分配程式碼：

else /* av == main_arena */

    { /* Request enough space for nb + pad + overhead */
      size = nb + mp_.top_pad + MINSIZE;

      /*
         If contiguous, we can subtract out existing space that we hope to
         combine with new space. We add it back later only if
         we don't actually get contiguous space.
       */

      if (contiguous (av))
        size -= old_size;

      /*
         Round to a multiple of page size.
         If MORECORE is not contiguous, this ensures that we only call it
         with whole-page arguments. And if MORECORE is contiguous and
         this is not first time through, this preserves page-alignment of
         previous calls. Otherwise, we correct to page-align below.
       */

      size = (size + pagemask) & ~pagemask;

      /*
         Don't try to call MORECORE if argument is so big as to appear
         negative. Note that since mmap takes size_t arg, it may succeed
         below even if we cannot call MORECORE.
       */

      if (size > 0)
        {
          brk = (char *) (MORECORE (size));
          LIBC_PROBE (memory_sbrk_more, 2, brk, size);
        }

      if (brk != (char *) (MORECORE_FAILURE))
        {
          /* Call the `morecore' hook if necessary. */
          void (*hook) (void) = atomic_forced_read (__after_morecore_hook);
          if (__builtin_expect (hook != NULL, 0))
            (*hook)();
        }
      else
        {

else /* av == main_arena */

{ /* Request enough space for nb + pad + overhead */

size = nb + mp_.top_pad + MINSIZE;

If contiguous, we can subtract out existing space that we hope to

combine with new space. We add it back later only if

we don't actually get contiguous space.

if (contiguous (av))

size -= old_size;

Round to a multiple of page size.

If MORECORE is not contiguous, this ensures that we only call it

with whole-page arguments. And if MORECORE is contiguous and

this is not first time through, this preserves page-alignment of

previous calls. Otherwise, we correct to page-align below.

size = (size + pagemask) & ~pagemask;

Don't try to call MORECORE if argument is so big as to appear

negative. Note that since mmap takes size_t arg, it may succeed

below even if we cannot call MORECORE.

if (size > 0)

{

brk = (char *) (MORECORE (size));

LIBC_PROBE (memory_sbrk_more, 2, brk, size);

}

if (brk != (char *) (MORECORE_FAILURE))

{

/* Call the `morecore' hook if necessary. */

void (*hook) (void) = atomic_forced_read (__after_morecore_hook);

if (__builtin_expect (hook != NULL, 0))

(*hook)();

}

else

{

由於申請的是200B，8位元組對齊為208B，而mp_.top_pad用的預設值為7個pages(0x20000)，MINSIZE為16B.後邊還需要page對齊，所以需要申請8個page。
下面我們看關鍵的程式碼：

brk = (char *) (MORECORE (size));

1	brk = (char *) (MORECORE (size));

這個是什麼？

/* Definition for getting more memory from the OS. */
#define MORECORE (*__morecore)
#define MORECORE_FAILURE 0
void * __default_morecore (ptrdiff_t);
void *(*__morecore)(ptrdiff_t) = __default_morecore;

#include <string.h>

/*
  MORECORE-related declarations. By default, rely on sbrk
*/

/*
  MORECORE is the name of the routine to call to obtain more memory
  from the system. See below for general guidance on writing
  alternative MORECORE functions, as well as a version for WIN32 and a
  sample version for pre-OSX macos.
*/

#ifndef MORECORE
#define MORECORE sbrk
#endif

/* Definition for getting more memory from the OS. */

#define MORECORE (*__morecore)

#define MORECORE_FAILURE 0

void * __default_morecore (ptrdiff_t);

void *(*__morecore)(ptrdiff_t) = __default_morecore;

#include <string.h>

MORECORE-related declarations. By default, rely on sbrk

MORECORE is the name of the routine to call to obtain more memory

from the system. See below for general guidance on writing

alternative MORECORE functions, as well as a version for WIN32 and a

sample version for pre-OSX macos.

#ifndef MORECORE

#define MORECORE sbrk

#endif

而__default_morecore是：

/* Allocate INCREMENT more bytes of data space,
   and return the start of data space, or NULL on errors.
   If INCREMENT is negative, shrink data space. */
void *
__default_morecore (ptrdiff_t increment)
{
  void *result = (void *) __sbrk (increment);
  if (result == (void *) -1)
    return NULL;

  return result;
}
libc_hidden_def (__default_morecore)

/* Allocate INCREMENT more bytes of data space,

and return the start of data space, or NULL on errors.

If INCREMENT is negative, shrink data space. */

void *

__default_morecore (ptrdiff_t increment)

{

void *result = (void *) __sbrk (increment);

if (result == (void *) -1)

return NULL;

return result;

}

libc_hidden_def (__default_morecore)

這裡解釋下sbrk：

sbrk不是系統呼叫，是C庫函式。系統呼叫通常提供一種最小功能，而庫函式通常提供比較複雜的功能。sbrk/brk是從堆中分配空間，本質是移動一個位置，向後移就是分配空間，向前移就是釋放空間，sbrk用相對的整數值確定位置，如果這個整數是正數，會從當前位置向後移若干位元組，如果為負數就向前若干位元組。在任何情況下，返回值永遠是移動之前的位置。sbrk是brk的封裝。
預設mp_.sbrk_base為空。所以需要：

if (mp_.sbrk_base == 0)
            mp_.sbrk_base = brk;

1 2	if (mp_.sbrk_base == 0) mp_.sbrk_base = brk;

av->system_mem預設也為0

av->system_mem += size;

1	av->system_mem += size;

然後需要做一些調整：

/*
             Otherwise, make adjustments:

           * If the first time through or noncontiguous, we need to call sbrk
              just to find out where the end of memory lies.

           * We need to ensure that all returned chunks from malloc will meet
              MALLOC_ALIGNMENT

           * If there was an intervening foreign sbrk, we need to adjust sbrk
              request size to account for fact that we will not be able to
              combine new space with existing space in old_top.

           * Almost all systems internally allocate whole pages at a time, in
              which case we might as well use the whole last page of request.
              So we allocate enough more memory to hit a page boundary now,
              which in turn causes future contiguous calls to page-align.
           */

          else
            {
              front_misalign = 0;
              end_misalign = 0;
              correction = 0;
              aligned_brk = brk;

              /* handle contiguous cases */
              if (contiguous (av))
                {
                  /* Count foreign sbrk as system_mem. */

Otherwise, make adjustments:

* If the first time through or noncontiguous, we need to call sbrk

just to find out where the end of memory lies.

* We need to ensure that all returned chunks from malloc will meet

MALLOC_ALIGNMENT

* If there was an intervening foreign sbrk, we need to adjust sbrk

request size to account for fact that we will not be able to

combine new space with existing space in old_top.

* Almost all systems internally allocate whole pages at a time, in

which case we might as well use the whole last page of request.

So we allocate enough more memory to hit a page boundary now,

which in turn causes future contiguous calls to page-align.

else

{

front_misalign = 0;

end_misalign = 0;

correction = 0;

aligned_brk = brk;

/* handle contiguous cases */

if (contiguous (av))

{

/* Count foreign sbrk as system_mem. */

後面有這麼一句：由於correction為0，所以返回當前的值.

snd_brk = (char *) (MORECORE (correction));

1	snd_brk = (char *) (MORECORE (correction));

最後來分配空間：

/* finally, do the allocation */
  p = av->top;    
  size = chunksize (p);

  /* check that one of the above allocation paths succeeded */
  if ((unsigned long) (size) >= (unsigned long) (nb + MINSIZE))    // size 為top->size  ,為剛申請空間的大小. 我們的例子是0x21000 （8pages）而nb為208B
    {
      remainder_size = size - nb;// 0x2100 - 208(0xd0)
      remainder = chunk_at_offset (p, nb);
      av->top = remainder;                                 // 改變top的指標                                              
      set_head (p, nb | PREV_INUSE | (av != &main_arena ? NON_MAIN_ARENA : 0));
      set_head (remainder, remainder_size | PREV_INUSE);
      check_malloced_chunk (av, p, nb);
      return chunk2mem (p);
    }

/* finally, do the allocation */

p = av->top;

size = chunksize (p);

/* check that one of the above allocation paths succeeded */

if ((unsigned long) (size) >= (unsigned long) (nb + MINSIZE)) // size 為top->size ,為剛申請空間的大小. 我們的例子是0x21000 （8pages）而nb為208B

{

remainder_size = size - nb;// 0x2100 - 208(0xd0)

remainder = chunk_at_offset (p, nb);

av->top = remainder; // 改變top的指標

set_head (p, nb | PREV_INUSE | (av != &main_arena ? NON_MAIN_ARENA : 0));

set_head (remainder, remainder_size | PREV_INUSE);

check_malloced_chunk (av, p, nb);

return chunk2mem (p);

}

av->top是什麼值呢？

/* Adjust top based on results of second sbrk */
              if (snd_brk != (char *) (MORECORE_FAILURE))
                {
                  av->top = (mchunkptr) aligned_brk;    //      aligned_brk = brk;  當然如果需要對齊，aligned_brk會偏移一些位元組

/* Adjust top based on results of second sbrk */

if (snd_brk != (char *) (MORECORE_FAILURE))

{

av->top = (mchunkptr) aligned_brk; // aligned_brk = brk; 當然如果需要對齊，aligned_brk會偏移一些位元組

也就是top就是一個指向heap開始的指標.並轉換為struct malloc_chunk 指標.和我們上面的圖就對應起來了。

然後重新設定top指標，和size的標誌位，偏移過pre_size和size，就是實際資料地址即return chunk2mem (p);

如果我們緊接著申請了200B後，馬上申請16B，由於fastbins雖然設定了max 為64B但是它裡面的chunk是free的時候放置進來的，目前為空。

所以繼續進入smallbin。同理由於沒有free的small chunk 。所以進入top chunk 分配成功：

use_top:
      /*
         If large enough, split off the chunk bordering the end of memory
         (held in av->top). Note that this is in accord with the best-fit
         search rule. In effect, av->top is treated as larger (and thus
         less well fitting) than any other available chunk since it can
         be extended to be as large as necessary (up to system
         limitations).

         We require that av->top always exists (i.e., has size >=
         MINSIZE) after initialization, so if it would otherwise be
         exhausted by current request, it is replenished. (The main
         reason for ensuring it exists is that we may need MINSIZE space
         to put in fenceposts in sysmalloc.)
       */

      victim = av->top;
      size = chunksize (victim);

      if ((unsigned long) (size) >= (unsigned long) (nb + MINSIZE))
        {
          remainder_size = size - nb;
          remainder = chunk_at_offset (victim, nb);
          av->top = remainder;
          set_head (victim, nb | PREV_INUSE |
                    (av != &main_arena ? NON_MAIN_ARENA : 0));
          set_head (remainder, remainder_size | PREV_INUSE);

          check_malloced_chunk (av, victim, nb);
          void *p = chunk2mem (victim);
          alloc_perturb (p, bytes);
          return p;
        }

use_top:

If large enough, split off the chunk bordering the end of memory

(held in av->top). Note that this is in accord with the best-fit

search rule. In effect, av->top is treated as larger (and thus

less well fitting) than any other available chunk since it can

be extended to be as large as necessary (up to system

limitations).

We require that av->top always exists (i.e., has size >=

MINSIZE) after initialization, so if it would otherwise be

exhausted by current request, it is replenished. (The main

reason for ensuring it exists is that we may need MINSIZE space

to put in fenceposts in sysmalloc.)

victim = av->top;

size = chunksize (victim);

if ((unsigned long) (size) >= (unsigned long) (nb + MINSIZE))

{

remainder_size = size - nb;

remainder = chunk_at_offset (victim, nb);

av->top = remainder;

set_head (victim, nb | PREV_INUSE |

(av != &main_arena ? NON_MAIN_ARENA : 0));

set_head (remainder, remainder_size | PREV_INUSE);

check_malloced_chunk (av, victim, nb);

void *p = chunk2mem (victim);

alloc_perturb (p, bytes);

return p;

}

當然如果我們釋放了16B後，有馬上申請16B，那麼它會直接進入fastbin並申請返回成功。，這裡我們知道當第一次使用的時候不論什麼bin都是空的，只有當多次使用多次釋放的時候才會體會出來它的優勢和效率來.

這裡附上自己測試的小程式：

#include <stdio.h>
#include <stdlib.h>
#include <string.h>

int main(void)
{

  char *p,*q;
  void * brk;
 brk=sbrk(0);
printf("brk is ....%p...\n",brk);
  p = (char *)malloc(200);
 if(p==NULL)
  return 1;

 strcpy(p,"hello");

  brk=sbrk(100);
printf("111....brk is ....%p...\n",brk);
 printf("200,p is %s...\n",p);

  q= (char *)malloc(16);
  if(q==NULL)
   return 1;

 strcpy(q,"nihao");

 printf("16,q is %s...\n",q);

 free(q);
 free(p);

  p= (char *)malloc(16);

 return 0;

}

#include <stdio.h>

#include <stdlib.h>

#include <string.h>

int main(void)

{

char *p,*q;

void * brk;

brk=sbrk(0);

printf("brk is ....%p...\n",brk);

p = (char *)malloc(200);

if(p==NULL)

return 1;

strcpy(p,"hello");

brk=sbrk(100);

printf("111....brk is ....%p...\n",brk);

printf("200,p is %s...\n",p);

q= (char *)malloc(16);

if(q==NULL)

return 1;

strcpy(q,"nihao");

printf("16,q is %s...\n",q);

free(q);

free(p);

p= (char *)malloc(16);

return 0;

}

關於不論fastbin還是smallbin的機制，或許我們記得在《深入理解計算機系統》中，講到垃圾回收的時候，腳註法。書和程式碼一起看效果會不錯.

記憶體的延遲分配，只有在真正訪問一個地址的時候才建立這個地址的物理對映，這是Linux記憶體管理的基本思想之一

還有就是：

核心預設配置下，程式的棧和mmap對映區域並不是從一個固定地址開始，並且每次啟動時的值都不一樣，這是程式在啟動時隨機改變這些值的設定，使得使用緩衝區溢位進行攻擊更加困難。當然也可以讓程式的棧和mmap對映區域從一個固定位置開始，只需要設定全域性變數randomize_va_space值為0，這個變數預設值為1。使用者可以通過設定/proc/sys/kernel/randomize_va_space來停用該特性，也可以用如下命令： sudo sysctl -w kernel.randomize_va_space=0

Linux實體記憶體管理
2024-11-28
Linux記憶體
struct和malloc記憶體互轉例子
2024-05-17
Struct記憶體
iOS探索記憶體對齊&malloc原始碼
2020-01-02
iOS記憶體原始碼
Linux共享記憶體的管理
2018-06-07
Linux記憶體
Linux 記憶體區管理 slab
2024-04-26
Linux記憶體
linux記憶體管理（二）- vmalloc
2024-06-11
Linux記憶體
linux記憶體管理（一）實體記憶體的組織和記憶體分配
2024-06-07
Linux記憶體
Linux記憶體洩露案例分析和記憶體管理分享
2024-10-24
Linux記憶體洩露
記憶體分配詳解 malloc, new, HeapAlloc, VirtualAlloc，GlobalAlloc
2018-05-03
記憶體
記憶體管理記憶體管理概述
2020-11-03
記憶體
Linux 的記憶體分頁管理
2018-08-08
Linux記憶體
Linux 記憶體管理 pt.3
2023-05-17
Linux記憶體
Linux 記憶體管理 pt.1
2023-04-27
Linux記憶體
Linux 記憶體管理 pt.2
2023-05-05
Linux記憶體
Linux的記憶體分頁管理
2020-03-26
Linux記憶體
Linux-記憶體和磁碟管理
2022-02-14
Linux記憶體
記憶體管理篇——實體記憶體的管理
2022-02-23
記憶體
linux記憶體管理學習總結
2024-11-04
Linux記憶體
【記憶體管理】記憶體佈局
2024-06-10
記憶體
Linux使用者空間記憶體管理
2018-09-26
Linux記憶體
linux 非連續記憶體區管理 vmalloc
2024-04-26
Linux記憶體
linux記憶體管理（八）- 反向對映RMAP
2024-06-15
Linux記憶體
linux記憶體管理（十）- 頁面回收（二）
2024-06-18
Linux記憶體
linux記憶體管理（十一）- 頁面遷移
2024-06-18
Linux記憶體
linux記憶體管理（六）- 核心新struct - folio
2024-06-11
Linux記憶體Struct
淺析Linux Kernel[5.11.0]記憶體管理（一）
2022-01-18
Linux記憶體
記憶體管理兩部曲之實體記憶體管理
2021-05-22
記憶體
Java的記憶體 -JVM 記憶體管理
2018-08-20
Java記憶體JVM
Go：記憶體管理與記憶體清理
2020-08-04
Go記憶體
【記憶體管理】Oracle AMM自動記憶體管理詳解
2020-08-27
記憶體Oracle
記憶體管理兩部曲之虛擬記憶體管理
2021-05-31
記憶體
Linux堆記憶體管理深入分析(下半部)
2020-08-19
Linux記憶體
[Linux]共享記憶體
2024-12-07
Linux記憶體
Linux核心筆記004 - 從記憶體管理開始，認識Linux核心
2020-05-28
Linux筆記記憶體
JavaScript 記憶體管理
2018-11-02
JavaScript記憶體
iOS 記憶體管理
2018-12-20
iOS記憶體
Android記憶體管理
2018-06-13
Android記憶體
OC記憶體管理
2018-08-29
記憶體
記憶體管理-swMemoryGlobal
2019-09-05
記憶體

Linux記憶體管理：Malloc

相關文章