Speed comparison of iteration methods in pandas

Posted by Dapenson on 2021-02-03

Original URL: https://www.cnblogs.com/dapenson/p/14369952.html

Iteration test on an xlsx file with 20667 rows


import pandas as pd

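# Decorator that reports how long the wrapped function or method takes to run
def print_execute_time(func):
    from time import time

    # Inner wrapper that prints the execution time of the decorated function
    def wrapper(*args, **kwargs):
        # Record the start and end times around the call to func and keep its return value
        start = time()
        func_return = func(*args, **kwargs)
        end = time()
        # Print the function name and its execution time
        print(f'{func.__name__}() execute time: {end - start}s')
        # Pass through func's return value
        return func_return

    # Return the inner wrapper
    return wrapper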

file_path = r"D:\git\xxxx\dev\pd-xxx1.2\合併.xlsx"
data = pd.read_excel(file_path,sheet_name="xxxx",engine='openpyxl')
# Handle missing values: replace NaN with None
df = data.where(data.notnull(),None)


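@print_execute_time
def iterrows():
    # Yields (index, Series) pairs, one per row.
    # '機號' (machine number) is a column in the author's test file.
    for index, row in df.iterrows():
        # print(index," = ",row['機號'])
        pass


@print_execute_time
def itertuples():
    # Yields one namedtuple per row; fields are read by attribute, not by key
    for row in df.itertuples():
        # print(row.機號)
        pass


@print_execute_time
def iteritems():
    # Yields (column name, Series) pairs, one per column, not one per row.
    # df.items() is used here because DataFrame.iteritems() was removed in pandas 2.0.
    for name, column in df.items():
        # print(name," = ",column.tolist())
        pass

@print_execute_time
def index():
    # Loops over the row labels only; a value would be looked up with .at
    for i in df.index:
        # print(i," = ",df['機號'].at[i])
        pass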

if __name__ == '__main__':
    print('begining ...')
    # Call each test in turn; the functions return None, so their results are not printed
    iterrows()
    itertuples()
    iteritems()
    index()
    print('Done !')
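
As a quick check of what each of these iterators actually walks over, the following minimal sketch counts the iterations on a small stand-in DataFrame. The data and the column names `a`, `b`, `c` are made up for illustration, and `df.items()` is used in place of `df.iteritems()` so that it also runs on pandas 2.x.

```python
import pandas as pd

# Hypothetical stand-in data: 5 rows, 3 columns (not the article's xlsx file)
df = pd.DataFrame({"a": range(5), "b": list("vwxyz"), "c": [0.5] * 5})


def count_iterations(frame):
    """Count how many items each traversal method yields."""
    return {
        "iterrows": sum(1 for _ in frame.iterrows()),      # one per row
        "itertuples": sum(1 for _ in frame.itertuples()),  # one per row
        "items": sum(1 for _ in frame.items()),            # one per column
        "index": sum(1 for _ in frame.index),              # one per row label
    }


counts = count_iterations(df)
print(counts)
# {'iterrows': 5, 'itertuples': 5, 'items': 3, 'index': 5}
```

Only `items` (the successor of `iteritems`) yields one entry per column, so on a file with many rows and few columns it does far fewer iterations than the row-wise methods.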

Test results

begining ...
iterrows() execute time: 2.003657817840576s
itertuples() execute time: 0.04618692398071289s
iteritems() execute time: 0.0009987354278564453s
index() execute time: 0.0029909610748291016s    
Done !

iterrows() execute time: 2.2464449405670166s
itertuples() execute time: 0.08178043365478516s
iteritems() execute time: 0.000997781753540039s
index() execute time: 0.0059833526611328125s

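So, judging by speed alone, iteritems or index is the preferred way to traverse the data. Keep in mind, though, that iteritems walks the DataFrame column by column rather than row by row, so its time reflects the number of columns and not the 20667 rows, and that the index loop as timed here never reads a cell value.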
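
For a row-by-row comparison in which every loop actually reads a value, something like the following sketch could be used. It is not the author's benchmark: the DataFrame is synthetic, the column names `machine_id` and `reading` are hypothetical, and `time.perf_counter` is swapped in for `time.time` as the timer. No timings are quoted here, since they depend on the machine and the pandas version.

```python
import time

import pandas as pd

N_ROWS = 20667  # same row count as the article's test file

# Synthetic stand-in data; "machine_id" and "reading" are hypothetical column names
df = pd.DataFrame({
    "machine_id": range(N_ROWS),
    "reading": [i * 2 for i in range(N_ROWS)],
})


def via_iterrows(frame):
    # Builds a Series for every row, then reads one field from it
    return [row["machine_id"] for _, row in frame.iterrows()]


def via_itertuples(frame):
    # Builds a namedtuple for every row, then reads one attribute
    return [row.machine_id for row in frame.itertuples(index=False)]


def via_index_at(frame):
    # Loops over the row labels and looks up each value with .at
    column = frame["machine_id"]
    return [column.at[i] for i in frame.index]


def timed(func, frame):
    """Run func(frame) and return its result and the elapsed seconds."""
    start = time.perf_counter()
    values = func(frame)
    return values, time.perf_counter() - start


results = {}
for func in (via_iterrows, via_itertuples, via_index_at):
    values, seconds = timed(func, df)
    results[func.__name__] = values
    print(f"{func.__name__}: {seconds:.4f}s for {len(values)} rows")
```

All three functions return the same list of values, so any difference in the printed times comes from the traversal method itself rather than from the work done per row.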
