搓一個Pythonic list

ChebyshevTST發表於2023-11-02

原文網址 : https://www.cnblogs.com/ChebyshevTST/p/17805547.html

總所周知，Python語言當中的list是可以儲存不同型別的元素的，對應到現代C++當中，可以用std::variant或者std::any實現類似的功能。而Python官方的實現當中用到了二級指標，不過拋開這些，我們也可以自己設計一個list的架構，實現多型別值的儲存容器。

下圖是自己實現的list的架構，按照這個架構，我們來逐步分析程式碼。不過為了節省篇幅，我僅僅只實現了一部分的方法，比如append，但是這裡我們著重的是容器的設計。

我們自頂向下分析。list這個結構體是最終要實現的容器，裡面包含了一個指向__list的指標，__list裡面存著一系列的Node節點。除了指標，還有offset偏移量，記錄當前__list指標ptr的偏移量，size是list的元素大小，而最後一個聯合體u則為了實現多值儲存而塞的一個成員。Node這邊，含有一個void型別的指標，它可以指向任意元素的地址，待會我們會將它轉換回對應的元素型別，從而獲取其指向的值。type記錄該指標指向的具體型別。

以下對應了這三個結構體的實現。

struct Node {
	void *data = nullptr;
	int type;
};

struct __list {
	Node node;
};

struct list {
	__list *ptr;
	int offset{};
	int size;

	U u;

	list(int size) : size(size) {
		ptr = static_cast<__list *>(malloc(sizeof(__list) * (size + 1)));
	}

	list(const list& other) = default;
	
	~list() {
		ptr -= offset;
		free(ptr);
	}
}

在分配記憶體的時候，要注意額外分配多一個空位，因為ptr是指向list最後元素的下一個位置。解構函式的時候也要記得將ptr回退到最開始的位置，不然會出現記憶體方面的問題。

在型別方面，這裡僅寫了幾種常用的型別，可以按照實際需要補充更多的型別上去。

enum {
	INT,
	UINT,
	CHAR,
	UCHAR,
	FLOAT,
	DOUBLE
};

append函式，這裡我沒有使用泛型實現，而是使用了函式過載，覺得比較好寫，以下是int型別的實現，其它型別同理，只需要稍微改改。

void append(uint& __data) {
		ptr->node.data = static_cast<void *>(&__data);
		ptr->node.type = UINT;

		if (offset + 1 <= size) {
			++ptr;
			++offset;
		}
		else 
			std::cout << "The list has achived it's capacity\n";
	}

另外，還過載了[]運算子，這裡就用到了前面所提到的union了，這裡設定了返回值為union，這樣可以比較巧妙的處理不同返回值的情況。

U operator[](int index) {
		auto it = ptr - offset + index;
		auto __data = it->node.data;
		int type = it->node.type;

		switch (type) {
			case INT: {
				u.intData = *(static_cast<int *>(__data));
				u.type = INT;
				break;
			}
			case UINT: {
				u.uintData = *static_cast<uint *>(__data);
				u.type = UINT;
				break;
			}
			case CHAR: {
				u.charData = *static_cast<char *>(__data);
				u.type = CHAR;
				break;
			}
			case UCHAR: {
				u.ucharData = *static_cast<u_char *>(__data);
				u.type = UCHAR;
				break;
			}
			case FLOAT: {
				u.floatData = *static_cast<float *>(__data);
				u.type = FLOAT;
				break;
			}
			case DOUBLE: {
				u.doubleData = *static_cast<double *>(__data);
				u.type = DOUBLE;
				break;
			}
			default: {
				assert(0);
			}
		}

		return u;
	}

為了最終可以遍歷元素並且輸出出來，還需要對union進行過載一下。

struct U {
	union {
		int intData;
		uint uintData;
		char charData;
		u_char ucharData;
		float floatData;
		double doubleData;
	};

	// To figure out which type we're using
	int type;

	friend std::ostream& operator<<(std::ostream& os, const U& u) {
		int type = u.type;

		switch (type) {
			case INT: {
				os << u.intData;
				break;
			}
			case UINT: {
				os << u.uintData;
				break;
			}
			case CHAR: {
				os << u.charData;
				break;
			}
			case UCHAR: {
				os << u.ucharData;
				break;
			}
			case FLOAT: {
				os << u.floatData;
				break;
			}
			case DOUBLE: {
				os << u.doubleData;
				break;
			}
			default: {
				assert(0);
			}
		}

		return os;
	}
};

（能用switch代替if else就儘量代替）

到這裡，所設計的list就差不多了，剩下的函式可以由讀者來擴充。不過還有侷限性，可以看看它怎麼使用。

int main() {
	list lst{3};

	std::vector v{1, 2, 3};

	for (int i{}; i < v.size(); ++i)
		lst.append(v[i]);
	
	for (int i{}; i < lst.size; ++i)
		std::cout << lst[i] << ' ';
}

由於沒有寫對右值資料的處理，所以只能先將想要存的資料存入另一個容器當中。我們再來測試一下。

int main() {
	list lst{3};

	int a = 1;
	double b = 1.1;
	char c = 'c';

	lst.append(a);
	lst.append(b);
	lst.append(c);

	for (int i{}; i < lst.size; ++i)
		std::cout << lst[i] << ' ';
}

執行結果是1, 1.1, c，符合預期。

以下是完整程式碼

#include <iostream>
#include <cstdlib>
#include <cassert>
#include <vector>
#include <type_traits>

enum {
	INT,
	UINT,
	CHAR,
	UCHAR,
	FLOAT,
	DOUBLE
};

struct U {
	union {
		int intData;
		uint uintData;
		char charData;
		u_char ucharData;
		float floatData;
		double doubleData;
	};

	// To figure out which type we're using
	int type;

	friend std::ostream& operator<<(std::ostream& os, const U& u) {
		int type = u.type;

		switch (type) {
			case INT: {
				os << u.intData;
				break;
			}
			case UINT: {
				os << u.uintData;
				break;
			}
			case CHAR: {
				os << u.charData;
				break;
			}
			case UCHAR: {
				os << u.ucharData;
				break;
			}
			case FLOAT: {
				os << u.floatData;
				break;
			}
			case DOUBLE: {
				os << u.doubleData;
				break;
			}
			default: {
				assert(0);
			}
		}

		return os;
	}
};

struct Node {
	void *data = nullptr;
	int type;
};

struct __list {
	Node node;
};

struct list {
	__list *ptr;
	int offset{};
	int size;

	U u;

	list(int size) : size(size) {
		ptr = static_cast<__list *>(malloc(sizeof(__list) * (size + 1)));
	}

	list(const list& other) = default;
	list& operator=(const list& other) = default;

	
	~list() {
		ptr -= offset;
		free(ptr);
	}

	void append(int& __data) {
		if (offset + 1 <= size) {
			ptr->node.data = static_cast<void *>(&__data);
			ptr->node.type = INT;
			++ptr;
			++offset;
		}
		else 
			std::cout << "The list has achived it's capacity\n";
	}



	void append(float& __data) {
		ptr->node.data = static_cast<void *>(&__data);
		ptr->node.type = FLOAT;

		if (offset + 1 <= size) {
			++ptr;
			++offset;
		}
		else 
			std::cout << "The list has achived it's capacity\n";
	}



	void append(double& __data) {
		ptr->node.data = static_cast<void *>(&__data);
		ptr->node.type = DOUBLE;

		if (offset + 1 <= size) {
			++ptr;
			++offset;
		}
		else 
			std::cout << "The list has achived it's capacity\n";
	}


	void append(char& __data) {
		ptr->node.data = static_cast<void *>(&__data);
		ptr->node.type = CHAR;

		if (offset + 1 <= size) {
			++ptr;
			++offset;
		}
		else 
			std::cout << "The list has achived it's capacity\n";
	}
	

	void append(u_char& __data) {
		ptr->node.data = static_cast<void *>(&__data);
		ptr->node.type = UCHAR;

		if (offset + 1 <= size) {
			++ptr;
			++offset;
		}
		else 
			std::cout << "The list has achived it's capacity\n";
	}

	void append(uint& __data) {
		ptr->node.data = static_cast<void *>(&__data);
		ptr->node.type = UINT;

		if (offset + 1 <= size) {
			++ptr;
			++offset;
		}
		else 
			std::cout << "The list has achived it's capacity\n";
	}

	U operator[](int index) {
		auto it = ptr - offset + index;
		auto __data = it->node.data;
		int type = it->node.type;

		switch (type) {
			case INT: {
				u.intData = *(static_cast<int *>(__data));
				u.type = INT;
				break;
			}
			case UINT: {
				u.uintData = *static_cast<uint *>(__data);
				u.type = UINT;
				break;
			}
			case CHAR: {
				u.charData = *static_cast<char *>(__data);
				u.type = CHAR;
				break;
			}
			case UCHAR: {
				u.ucharData = *static_cast<u_char *>(__data);
				u.type = UCHAR;
				break;
			}
			case FLOAT: {
				u.floatData = *static_cast<float *>(__data);
				u.type = FLOAT;
				break;
			}
			case DOUBLE: {
				u.doubleData = *static_cast<double *>(__data);
				u.type = DOUBLE;
				break;
			}
			default: {
				assert(0);
			}
		}

		return u;
	}

};

到這裡，一個Pythonic的list就成型了，剩下的其它函式實現方式也就大同小異。在設計list的時候，由於設計到指標，因此對於記憶體洩露方面需要比較謹慎。以上的實現僅僅涉及到了一級指標，Python官方實現是採用二級指標，感興趣的話可以去學習學習別人是怎麼實現的~

用Python搓一個太陽系
2022-02-25
Python
手搓大模型Task03：手搓一個最小的 Agent 系統
2024-09-27
大模型
學會Lambda，讓程式Pythonic一點
2019-08-15
Python
手搓一個兔子問題（分享一個C語言問題，持續更新…）
2018-10-17
C語言
集合第一個Array List理解
2020-11-04
用Vue全家桶純手工搓了一個開源版「抖音」
2024-04-16
Vue
嗯，手搓一個TinyPng壓縮圖片的WebpackPlugin也SoEasy啦
2020-08-10
WebPlugin
pythonic context manager知多少
2020-06-29
PythonContext
Pythonic AI generation of images and videos
2024-08-08
PythonAIIDE
一個 List.of 引發的“血案”
2023-11-13
通過Guava實現兩個包含不同物件的List合併成一個List
2019-03-04
Guava物件
[work] python list中數字與一個數相乘
2019-01-14
Python
python List，它不是一個簡單的陣列
2018-08-22
Python陣列
List 按照指定大小分割為多個list的幾種方式,list分片
2024-07-08
幾種實用的 pythonic 語法
2019-02-16
Python
如何在 SAPGUI 的同一個螢幕顯示兩個 ALV list
2022-04-03
GUI
vue+quasar+electron+springboot+mysql擼一個TODO LIST 看板
2021-04-13
VueSpring BootMySql
再探URLDNS鏈(手搓exp)
2024-05-10
DNS
替代 for 迴圈，讓 Python 程式碼更 pythonic !
2024-03-04
Python
瞧瞧，這樣的「函式」才叫 Pythonic
2020-11-05
函式Python
List＜實體類＞轉換成map 一個鍵對應多個值
2020-12-16
編寫一個非常精美的Flutter Todo-List專案
2019-07-29
Flutter
如何利用Python隨機從list中挑選一個元素
2022-06-11
Python隨機
輸入多個編碼並支援模糊搜尋，引數是一個list
2024-05-15
Scala——三個容器：List Set Map
2020-09-26
如何使用程式碼獲得一個function module的Where Used List
2018-03-14
Function
YII 初體驗 —— 搭建一個簡單的 Todo List 系統
2022-02-03
【手搓模型】親手實現 Vision Transformer
2023-03-17
模型ORM
List簡易筆記一
2020-09-28
筆記
Mybatis(五)--原始碼分析傳入單個list引數和多個list引數寫法
2018-03-24
MyBatis原始碼
從零開始手搓GPU，照著英偉達CUDA來，只用兩個星期
2024-05-13
GPU
使用Java線上編譯器手搓一款摸魚小遊戲
2022-11-23
Java編譯遊戲
“智慧體風”吹進體育圈粉絲手搓上百個智慧體為中國健兒應援太有AI了！粉絲手搓上百個智慧體為中國健兒打CALL
2024-07-27
智慧體AI
SAP WebClient UI drop down list(下拉選單)的一個故障和解決方法
2020-09-15
WebclientUI
C#怎麼從List集合中隨機取出其中一個值
2020-10-08
C#隨機
easyExcel匯出多個list列表的excel
2020-12-03
Excel
List分頁（SQL引數化2100個）
2020-12-07
SQL
Python List 列表list()方法
2021-09-14
Python

搓一個Pythonic list

相關文章