笨辦法學C 練習47：一個快速的URL路由

飛龍發表於2019-05-09

原文網址 : https://flycode.co/archives/74429

練習47：一個快速的URL路由

原文：Exercise 47: A Fast URL Router

譯者：飛龍

我現在打算向你展示使用TSTree來建立伺服器中的快速URL路由。它適用於應用中的簡單的URL匹配，而不是在許多Web應用框架中的更復雜（一些情況下也不必要）的路由發現功能。

我打算程式設計一個小型命令列工具和路由互動，他叫做urlor，讀取簡單的路由檔案，之後提示使用者輸入要檢索的URL。

#include <lcthw/tstree.h>
#include <lcthw/bstrlib.h>

TSTree *add_route_data(TSTree *routes, bstring line)
{
    struct bstrList *data = bsplit(line, ` `);
    check(data->qty == 2, "Line `%s` does not have 2 columns",
            bdata(line));

    routes = TSTree_insert(routes,
            bdata(data->entry[0]), blength(data->entry[0]),
            bstrcpy(data->entry[1]));

    bstrListDestroy(data);

    return routes;

error:
    return NULL;
}

TSTree *load_routes(const char *file)
{
    TSTree *routes = NULL;
    bstring line = NULL;
    FILE *routes_map = NULL;

    routes_map = fopen(file, "r");
    check(routes_map != NULL, "Failed to open routes: %s", file);

    while((line = bgets((bNgetc)fgetc, routes_map, `
`)) != NULL) {
        check(btrimws(line) == BSTR_OK, "Failed to trim line.");
        routes = add_route_data(routes, line);
        check(routes != NULL, "Failed to add route.");
        bdestroy(line);
    }

    fclose(routes_map);
    return routes;

error:
    if(routes_map) fclose(routes_map);
    if(line) bdestroy(line);

    return NULL;
}

bstring match_url(TSTree *routes, bstring url)
{
    bstring route = TSTree_search(routes, bdata(url), blength(url));

    if(route == NULL) {
        printf("No exact match found, trying prefix.
");
        route = TSTree_search_prefix(routes, bdata(url), blength(url));
    }

    return route;
}

bstring read_line(const char *prompt)
{
    printf("%s", prompt);

    bstring result = bgets((bNgetc)fgetc, stdin, `
`);
    check_debug(result != NULL, "stdin closed.");

    check(btrimws(result) == BSTR_OK, "Failed to trim.");

    return result;

error:
    return NULL;
}

void bdestroy_cb(void *value, void *ignored)
{
    (void)ignored;
    bdestroy((bstring)value);
}

void destroy_routes(TSTree *routes)
{
    TSTree_traverse(routes, bdestroy_cb, NULL);
    TSTree_destroy(routes);
}

int main(int argc, char *argv[])
{
    bstring url = NULL;
    bstring route = NULL;
    check(argc == 2, "USAGE: urlor <urlfile>");

    TSTree *routes = load_routes(argv[1]);
    check(routes != NULL, "Your route file has an error.");

    while(1) {
        url = read_line("URL> ");
        check_debug(url != NULL, "goodbye.");

        route = match_url(routes, url);

        if(route) {
            printf("MATCH: %s == %s
", bdata(url), bdata(route));
        } else {
            printf("FAIL: %s
", bdata(url));
        }

        bdestroy(url);
    }

    destroy_routes(routes);
    return 0;

error:
    destroy_routes(routes);
    return 1;
}

之後我建立了一個簡單的檔案，含有一些用於互動的偽造的路由：

/ MainApp /hello Hello /hello/ Hello /signup Signup /logout Logout /album/ Album

你會看到什麼

一旦你使urlor工作，並且建立了路由檔案，你可以嘗試這樣：

$ ./bin/urlor urls.txt
URL> /
MATCH: / == MainApp
URL> /hello
MATCH: /hello == Hello
URL> /hello/zed  
No exact match found, trying prefix.
MATCH: /hello/zed == Hello
URL> /album
No exact match found, trying prefix.
MATCH: /album == Album
URL> /album/12345
No exact match found, trying prefix.
MATCH: /album/12345 == Album
URL> asdfasfdasfd
No exact match found, trying prefix.
FAIL: asdfasfdasfd
URL> /asdfasdfasf
No exact match found, trying prefix.
MATCH: /asdfasdfasf == MainApp
URL>
$

你可以看到路由系統首先嚐試精確匹配，之後如果找不到的話則會嘗試字首匹配。這主要是嘗試這二者的不同。根據你的URL的語義，你可能想要之中精確匹配，始終字首匹配，或者執行二者並選出“最好”的那個。

如何改進

URL非常古怪。因為人們想讓它們神奇地處理它們的web應用所具有的，所有瘋狂的事情，即使不是很合邏輯。在這個對如何將TSTree用作路由的簡單演示中，它具有一些人們不想要的缺陷。比如，它會把/al匹配到Album，它是人們通常不想要的。它們想要/album/*匹配到Album以及/al匹配到404錯誤。

這並不難以實現，因為你可以修改字首演算法來以你想要的任何方式匹配。如果你修改了匹配演算法，來尋找所有匹配的字首，之後選出“最好”的那個，你就可以輕易做到它。這種情況下，/al回匹配MainApp或者Album。獲得這些結果之後，就可以執行一些邏輯來決定哪個“最好”。

另一件你能在真正的路由系統裡做的事情，就是使用TSTree來尋找所有可能的匹配，但是這些匹配是需要檢查的一些模式串。在許多web應用中，有一個正規表示式的列表，用於和每個請求的URL進行匹配。匹配所有這些正規表示式非常花時間，所以你可以使用TSTree來通過它們的字首尋找所有可能的結果。於是你就可以縮小模式串的範圍，更快速地做嘗試。

使用這種方式，你的URL會精確匹配，因為你實際上執行了正規表示式，它們匹配起來更快，因為你通過可能的字首來查詢它們。

這種演算法也可用於所有需要使用者視覺化的靈活路由機制。域名、IP地址、包註冊器和目錄，檔案或者URL。

附加題

建立一個實際的引擎，使用Handler結構儲存應用，而不是僅僅儲存應用的字串。這個結構儲存它所繫結的URL，名稱和任何需要構建實際路由系統的東西。
將URL對映到.so檔案而不是任意的名字，並且使用dlopen系統動態載入處理器，並執行它們所包含的回撥。將這些回撥放進你的Handler結構體中，之後你就用C編寫了動態回撥處理器系統的全部。

笨辦法學C 練習12：If，Else If，Else
2019-05-10
笨辦法學C 練習36：更安全的字串
2019-05-12
字串
笨辦法學C 練習28：Makefile 進階
2019-05-10
笨辦法學C 練習13：Switch語句
2019-05-10
笨辦法學C 練習45：一個簡單的TCP/IP客戶端
2019-05-10
TCP客戶端
笨辦法學C 練習18：函式指標
2019-05-12
函式指標
笨辦法學C 練習29：庫和連結
2019-05-12
笨辦法學C 練習34：動態陣列
2019-05-09
陣列
笨辦法學C 練習25：變參函式
2019-05-10
函式
笨辦法學C 練習42：棧和佇列
2019-05-14
佇列
笨辦法學C 練習1：啟用編譯器
2019-05-11
編譯
笨辦法學C 練習2：用Make來代替Python
2019-05-11
Python
笨辦法學C 練習44：環形緩衝區
2019-05-09
笨辦法學C 練習38：雜湊演算法
2019-05-14
演算法
笨辦法學C 練習23：認識達夫裝置
2019-05-11
笨辦法學C 練習46：三叉搜尋樹
2019-05-12
笨辦法學C 練習7：更多變數和一些算術
2019-05-10
變數
笨辦法學C 前言
2019-05-13
笨辦法學Python習題48
2020-10-09
Python
笨辦法學C 導言：C的笛卡爾之夢
2019-05-14
笨辦法學C 後記：“解構 K&R C” 已死
2019-05-10
笨辦法學 Golang 開山篇
2020-07-11
Golang
笨辦法學Python – 習題8-10: Printing & Printing, Printing
2019-01-17
Python
每週一書：162頁《笨辦法學 Python》分享！
2018-07-30
Python
RESTful在路由url有名義的寫法
2018-04-05
REST路由
一個練習專案，好玩的bbs-c#
2024-06-21
C#
每日一練（47）：找不同
2022-04-28
Angular專案下一個路由如何接收到上一個路由url中的引數id
2019-02-28
Angular路由
求推薦，有沒有類似《笨辦法學 python》的書用來學 Java 的？
2020-05-01
PythonJava
rust學習十二、一個I/O程式練習
2024-12-03
Rust
《“笨辦法”學Python（第3版）》點評贈書‏獲獎名單
2022-11-29
Python
一個小練習。
2018-03-31
Java學習之基礎語法練習
2020-12-30
Java
笨方法學C 筆記 (day1)
2019-05-11
筆記
簡單搭建一個小型拓撲，練習路由器的Esay IP的設定
2024-11-27
路由器
Excel資料快速匯入mysql的幾個辦法
2018-05-15
ExcelMySql
高收益的笨辦法：暴破在Windows提權中的應用
2018-06-06
Windows
C語言-for迴圈之窮舉法練習
2024-06-02
C語言

笨辦法學C 練習47：一個快速的URL路由

練習47：一個快速的URL路由

你會看到什麼

如何改進

附加題

相關文章